Yannic Kilcher
edd9268c9a
introduced api keys to inference
2023-02-11 10:50:51 +01:00
Yannic Kilcher
48212079f4
unified queueing
2023-02-11 01:31:25 +01:00
Yannic Kilcher
097c918cb9
bugfix
2023-02-10 23:30:42 +01:00
Yannic Kilcher
91e5b11315
touch
2023-02-10 23:01:34 +01:00
Yannic Kilcher
d1aea98ad5
added dockerfile for worker-full
2023-02-10 22:53:40 +01:00
Yannic Kilcher
90c3d5640e
Added database to inference server ( #1446 )
...
* added db for inference
* fixed dockerfiles for inference
2023-02-10 22:51:35 +01:00
Yannic Kilcher
4076afd0d8
added token buffer for catchiing stop sequences
2023-02-09 23:46:44 +01:00
Yannic Kilcher
27671e3220
HF inference fixes
2023-02-09 00:27:51 +01:00
Yannic Kilcher
bab056a73b
switched to HF text-generation-inference
2023-02-08 23:52:39 +01:00
Yannic Kilcher
ae5d16f394
added a tmux inference dev setup script
2023-01-27 15:40:21 +01:00
Yannic Kilcher
a0f4449e9f
added seed to parameters
2023-01-27 15:23:19 +01:00
Yannic Kilcher
040344a41f
made inference server a bit more robust
2023-01-26 21:01:52 +01:00
Yannic Kilcher
f1edcc8a28
added streaming worker
2023-01-26 16:41:57 +01:00
Yannic Kilcher
1709dc0324
Initial implementation of the inference system ( #869 )
...
* very primitive implementation of inference
* re-worked with security in mind
* removed polling from clients
* switched workers to websockets
* implemented back and forth chats
2023-01-21 22:38:18 +01:00