Yannic Kilcher 1709dc0324 Initial implementation of the inference system (#869)
* very primitive implementation of inference

* re-worked with security in mind

* removed polling from clients

* switched workers to websockets

* implemented back and forth chats
2023-01-21 22:38:18 +01:00


# OpenAssistant Inference
Preliminary implementation of the inference engine for OpenAssistant.
## Development (you'll need multiple terminals)
Run a Redis container (or use the one from the general Docker Compose file):
```bash
docker run --rm -it -p 6379:6379 redis
```
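Since the server and workers run in separate terminals, it can help to confirm that Redis is reachable before starting them. The following is a minimal sketch and not part of the repository: the helper name `port_open` is hypothetical, and it assumes Redis is published on `localhost:6379` as in the `docker run` command above. It only checks that the TCP port accepts connections, not that Redis itself is healthy.

```python
import socket


def port_open(host: str = "localhost", port: int = 6379, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        # create_connection raises OSError (refused, unreachable, timed out) on failure
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Assumption: Redis listens on the port mapped by `-p 6379:6379`
    status = "reachable" if port_open() else "not reachable"
    print(f"Redis port 6379 is {status}")
```

The same helper can be pointed at another port, for example `port_open(port=8000)` if the inference server is running on uvicorn's default port.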
Run the inference server:
```bash
cd server
pip install -r requirements.txt
uvicorn main:app --reload
```
Run one (or more) workers:
```bash
cd worker
pip install -r requirements.txt
python __main__.py
```
Run the client:
```bash
cd text-client
pip install -r requirements.txt
python __main__.py
```