9 Commits

Author SHA1 Message Date
Yannic Kilcher 48212079f4 unified queueing 2023-02-11 01:31:25 +01:00
Yannic Kilcher 90c3d5640e Added database to inference server (#1446)
* added db for inference

* fixed dockerfiles for inference
2023-02-10 22:51:35 +01:00
Andrew Maguire b60eb1e1ae minimal fastapi prom metrics (#1426)
* minimal fastapi prom metrics
2023-02-10 14:37:43 +01:00
Yannic Kilcher ed7d920e5d robustifying inference 2023-02-09 15:31:46 +01:00
Yannic Kilcher c53d8e9bce changed return type of GET chat 2023-02-09 08:52:45 +01:00
Yannic Kilcher a85cc0a47d endpoint to list chats 2023-02-09 08:49:01 +01:00
Yannic Kilcher bab056a73b switched to HF text-generation-inference 2023-02-08 23:52:39 +01:00
Yannic Kilcher 040344a41f made inference server a bit more robust 2023-01-26 21:01:52 +01:00
Yannic Kilcher 1709dc0324 Initial implementation of the inference system (#869)
* very primitive implementation of inference

* re-worked with security in mind

* removed polling from clients

* switched workers to websockets

* implemented back and forth chats
2023-01-21 22:38:18 +01:00