Commit Graph

59 Commits

Author SHA1 Message Date
James Mete d15d8357a1 126 twitter data (#620)
* Added a script file to process json archive files into more unified parquet files focused on tweet reply rows for further processing.

* Added README file for Twitter data collection.

* Re did code for processing json into standardized parquet files.

* Added file to process parquet files into a conversation tree jsonl file.

* Added requirements and ran pre-commit.
2023-01-21 13:54:21 +01:00
MattAlexMiracle 530194de08 Fix to ranking algorithm (#826)
* fixed ranking

* fix index error

Co-authored-by: Alexander Mattick <alex.mattick@fau.de>
2023-01-18 21:27:16 +01:00
Yannic Kilcher 5d441b1570 prepared playbook for prod 2023-01-15 23:30:16 +01:00
Yannic Kilcher cc03376d86 added root tokens and endpoint for adding api keys (#742)
* added root tokens and endpoint for adding api keys

* Change down revision to current alembic head

* removed added_by_root_token

* refactored description

* fixed jinja errors

Co-authored-by: Andreas Köpf <andreas.koepf@provisio.com>
2023-01-15 21:24:25 +00:00
Tom Zehle 30d7a3d0f5 Added Codeinstructor, fixed file-reading issue, and allowing for csv files in data_augment.py (#679) 2023-01-15 11:55:40 +01:00
Andrew Maguire 0f607c7a28 export and auto-commit docs/docs/api/openapi.json as part of test-api-contract workflow (#719)
- copies that openapi.json file to docs/docs/api/openapi.json as part of start-mock-server.sh.
- adds a step to the end of the test-api-contract workflow that just auto-commits changes to the docs/docs/api/openapi.json file.
- excludes docs/docs/api/openapi.json from pre-commit since its already been formatted by jq.
2023-01-14 22:40:33 +00:00
Nil Andreu a902c600fa Store Message Toxicity in database (#553)
* [NEW] MessageToxicity table

* [NEW] Alembic message Toxicity

* [NEW] Model name enum

* [NEW] Refactor Enum HF

* [NEW] Settings: DEBUT_SKIP_TOXICITY_CALCULATION

* [NEW] Store toxicity values

* [FIX] Merge conflict

* [FIX] Documentation

* [NEW] save_toxicity: function

* [FIX] Formatted string

* [NEW] DEBUG_SKIP_TOXICITY_CALCULATION=True

* [FIX] HfClassificationModel

* [FIX] Alembic merge heads

* [NEW] Refactor save_toxicity

* [NEW] Separating score/label

* [NEW] Store score and label

* [FIX] Cleaning Alembic

* [NEW] Clean HF names

* [NEW] Not type hinting

* [NEW] Update alembic versions

* [NEW] Revert the changes

* [NEW] Type hinting label & score

* Updated down_revision in migration script

Co-authored-by: Andreas Köpf <andreas.koepf@xamla.com>
2023-01-14 12:22:55 +00:00
Tom Zehle 011c023e41 added data_augment.py (#570)
* added data_augment.py

* .
2023-01-11 16:31:52 +01:00
Andreas Köpf 14fa08e2e7 Message tree state machine (#555)
* add query_incomplete_rankings()

* Add SQL queries for TreeManager task selection

* first working version of TreeManager.next_task()

* remove old generate_task(), add mandatory_labels to text_labels task

* Add ConversationMessage list to Ranking tasks

* add more sophisticated sql queries to find extendible trees

* add TreeManager.query_extendible_parents()

* fix task validation, seed data insertion (reviewed)

* provide user for task selection in text-frontend

* enter 'growing' state

* enter 'aborted_low_grade' state

* enter 'ranking' state

* check tree 'growing' state upon relpy insertion

* exclude user from labeling their own messages (added DEBUG_ALLOW_SELF_LABELING setting)

* add DEBUG_ALLOW_SELF_LABELING to docker-compose.yaml

* fix ranking submission

* add query_tree_ranking_results()

* add ranked_message_ids to RankingReactionPayload

* fix reply_messages instead of prompt_messages

* incorment 'ranking_count' of ranked replies

* added logic to check_condition_for_scoring_state

* changes to msg_tree_state_machine

* pre-commit changes

* enter 'ready_for_scoring' state

* re-add HF embedding call (lost during merge)

* use prepare_conversation() helper for seed-data creation

* Partially add user specified task selection

Co-authored-by: Daniel Hug <danielpatrickhug@gmail.com>
2023-01-11 10:54:03 +01:00
jojopirker 11d55d572a message embeddings in Messages table 2023-01-08 12:28:38 +01:00
MattAlexMiracle 3dbe0ae1ba Implement task selection (#383)
* commented out legacy numerical solver

* added comments and task_scheduling for selecting which task to serve to users

* removed standalone task weighting

* pre-commit hook rerun

Co-authored-by: Alexander Mattick <alex.mattick@fau.de>
2023-01-05 16:14:39 +01:00
Jack Michaud 5ed4131720 ci: run contract tests through separate cypress command and add into CI 2023-01-03 20:51:11 -05:00
Yannic Kilcher 9dd6c748b8 Merge pull request #300 from jack-michaud/jm/oasst-api-client-handle-error
feat: handle OasstError in OasstApiClient
2023-01-03 08:55:26 +01:00
Jack Michaud 789593bab7 fix: contract test script should fail when pytest fails 2023-01-02 20:37:34 -05:00
croumegous e3603e8f7d minor: add documentation on discord env variable to set up and fix shell script to run discord bot 2023-01-03 01:42:37 +01:00
Jack Michaud ed115ea17c refactor: move OasstApiClient into oasst-shared (#287)
* refactor: move api_client into oasst-shared

* refactor: move contract tests into oasst-shared

* fix: use new OasstApiClient imports in discord bot
2023-01-02 22:13:20 +01:00
Yannic Kilcher bcc73d6e02 fixed encoding header 2023-01-02 16:52:31 +01:00
Yannic Kilcher 07c62c925b Merge branch 'main' of github.com:LAION-AI/Open-Chat-GPT 2023-01-02 16:38:42 +01:00
Yannic Kilcher 9fd201ded5 added verify-lobby script 2023-01-02 16:38:38 +01:00
Gareth Davidson 7000e10bc0 apply pre-commit rules 2023-01-02 00:01:45 +00:00
Gareth Davidson c3c7a1701a run prettier with new params 2023-01-01 20:57:35 +00:00
Jack Michaud 857eaf26b1 ci: add Test API Contract workflow 2023-01-01 13:06:18 -05:00
Jack Michaud 29b08875d4 refactor: move print_openapi_schema into main 2023-01-01 18:58:01 +01:00
Jack Michaud 604ad2f8e7 style: run pre-commit hooks 2023-01-01 18:58:01 +01:00
Jack Michaud 953d243a6f chore: start-mock-server waits for health 2023-01-01 18:58:01 +01:00
Jack Michaud 036c4b6c6c test: add contract test for discord bot 2023-01-01 18:58:01 +01:00
Jack Michaud 5c9b9e3c9f fix: use prism instead of wiremock
the previous iteration of this script which used wiremock threw a null pointer exception.
2023-01-01 18:58:01 +01:00
Jack Michaud b5ebab8839 chore: add print_openapi_schema and start-mock-server.sh script 2023-01-01 18:58:01 +01:00
Andreas Köpf 71eb09b28a adapt backend-development/run-local.sh 2022-12-29 15:58:35 +01:00
Andreas Köpf fb516e91fc add backend-dev start-docker script for convenience 2022-12-29 00:58:52 +01:00
Yannic Kilcher 0ee3196064 added sleep instead of true containers in compose 2022-12-29 00:38:12 +01:00
croumegous 64206895ff Unified multiple Docker Compose files into one for easier usage 2022-12-28 13:26:30 +01:00
Keith Stevens 5ad18f61f9 Merge branch 'main' into 24-web-deploy-aws 2022-12-28 08:04:23 +09:00
Semen Zhydenko ebdbc0e3c0 Fixed typo: strenght -> strength 2022-12-27 14:14:48 +01:00
Semen Zhydenko 5c2bc24294 Fixed typo: existance -> existence 2022-12-27 14:13:58 +01:00
Yannic Kilcher 5125e84055 created debug credentials provider 2022-12-26 16:13:43 +01:00
Andreas Köpf f8c3008003 Merge pull request #57 from MattAlexMiracle/main
user scoring and question selection
2022-12-26 09:47:12 +01:00
Keith Stevens 8ffaf71aa7 Tweaking the docker files and prisma to prepare for reliable aws deployments 2022-12-26 12:07:40 +09:00
Yannic Kilcher 21f44d2d4c removed ALLOW_ANY_API_KEY and allow skipping API key check in debug 2022-12-25 17:25:30 +01:00
Alexander Mattick a62db13414 added fixed point definition for prompt 2022-12-25 13:56:15 +01:00
Alexander Mattick d198eaf045 added fixed 'good prompt' definition for ranking 2022-12-25 13:49:02 +01:00
Alexander Mattick a1c2580027 added utility functions to dataclass 2022-12-25 13:42:59 +01:00
Alexander Mattick 58885f1e9b simple scoring system for prompts and ranks 2022-12-25 13:39:21 +01:00
Alexander Mattick 76f7ed813b simple scoring system for scoreboard 2022-12-25 12:40:27 +01:00
Alexander Mattick 81666fc68c Merge remote-tracking branch 'refs/remotes/origin/main' 2022-12-25 11:55:09 +01:00
Yannic Kilcher ce92f184c5 used more extend configs for compose files 2022-12-24 20:46:03 +01:00
Keith Stevens 4ad46a45e4 Updating the README files to be a bit more comprehensive 2022-12-21 13:15:44 +09:00
Keith Stevens c6b00365fd Simplifying the end to end docker setup to ensure it prepares the web database properly before the website starts. 2022-12-21 12:17:08 +09:00
Keith Stevens c056a31d2f Ensuring the website can be built and deployed fully in docker. This includes an end to end docker-compose configuration as a simple demonstration. 2022-12-20 20:28:53 +09:00
Keith Stevens 2dbdd92791 Merge pull request #36 from LAION-AI/web-link-newui-to-backend
Link new UI to auth and backend
2022-12-20 17:52:15 +09:00