Yannic Kilcher
48212079f4
unified queueing
2023-02-11 01:31:25 +01:00
Yannic Kilcher
90c3d5640e
Added database to inference server ( #1446 )
...
* added db for inference
* fixed dockerfiles for inference
2023-02-10 22:51:35 +01:00
Andrew Maguire
b60eb1e1ae
minimal fastapi prom metrics ( #1426 )
...
* minimal fastapi prom metrics
2023-02-10 14:37:43 +01:00
Yannic Kilcher
ed7d920e5d
robustifying inference
2023-02-09 15:31:46 +01:00
Yannic Kilcher
c53d8e9bce
changed return type of GET chat
2023-02-09 08:52:45 +01:00
Yannic Kilcher
a85cc0a47d
endpoint to list chats
2023-02-09 08:49:01 +01:00
Yannic Kilcher
bab056a73b
switched to HF text-generation-inference
2023-02-08 23:52:39 +01:00
Yannic Kilcher
040344a41f
made inference server a bit more robust
2023-01-26 21:01:52 +01:00
Yannic Kilcher
1709dc0324
Initial implementation of the inference system ( #869 )
...
* very primitive implementation of inference
* re-worked with security in mind
* removed polling from clients
* switched workers to websockets
* implemented back and forth chats
2023-01-21 22:38:18 +01:00