Commit Graph

40 Commits

Author SHA1 Message Date
Fabricio Braz bfccb51142 Writing prompt (#1274)
* Create writing_prompt.ipynb
2023-02-10 22:16:04 +00:00
Shahul ES 6dc884864b Cornell movies dialog dataset (#1319)
* cornell movie dialogs
2023-02-10 08:08:19 +00:00
Kian-Meng Ang 1e321a6fca Fix typos (#1143)
Found via `codespell -S .mypy_cache,yarn.lock,*.json,*.ipynb -L
rouge,nam,vie`
2023-02-05 20:18:03 +01:00
Valentino 091e93392b added custom functions for each dataset to conver QA pais to Instruct-Reponse conversations (#922) 2023-01-30 11:26:11 +00:00
Taylor 356058ed93 Add Hippocorpus dataset script (#750)
* Add Hippocorpus dataset script
2023-01-30 11:23:12 +00:00
kayjay 1eb3f05c44 PR: Create notebook to convert r/changemyview data (#839)
* (#737) Create notebook to convert r/changemyview data into cleaner format
2023-01-26 09:49:03 +00:00
Oliver Stanley 993ed803cf Merge pull request #807 from olliestanley/745-augment-codet
745: Add notebooks for CodeT datasets augmentation
2023-01-18 17:10:08 +00:00
Oliver b60d5ecb08 Add Colab links 2023-01-18 17:02:16 +00:00
Thierry Deruyttere 0eb9942388 Notebook to convert the DIVERSE dataset (#784)
* Notebook to convert the DIVERSE dataset to Open-Assistant format
2023-01-18 09:09:52 +00:00
Oliver 8adc1da6f9 Add notebooks for CodeT datasets augmentation 2023-01-17 20:22:19 +00:00
Andrew Maguire 86cadc6e9f Cleanup notebooks folder (#716)
- standardize file naming to be a little more pythonic.
- one folder per notebook plus a README for easy GH viewing.
- adjust colab badges for new structure.
2023-01-14 22:51:09 +00:00
Andrew Maguire 0f607c7a28 export and auto-commit docs/docs/api/openapi.json as part of test-api-contract workflow (#719)
- copies that openapi.json file to docs/docs/api/openapi.json as part of start-mock-server.sh.
- adds a step to the end of the test-api-contract workflow that just auto-commits changes to the docs/docs/api/openapi.json file.
- excludes docs/docs/api/openapi.json from pre-commit since its already been formatted by jq.
2023-01-14 22:40:33 +00:00
rallio a203900070 A notebook for question and answer generation using one of the most powerful opensource NLU models, FLAN-T5-11B. (#215)
* Create T5_closed_book_QA_generators.py

This is code that can be run in a notebook or by itself to generate a dictionary for use in creating synthetic dialogue that can be verified for factual accuracy. To use this notebook your need your trusted source material to be in the format of a list of strings (they will be truncated to under 1100 characters). Requires transformers and accelerate. Make sure to use T5 with bfloat16 or full precision.

* Create T5_closed_book_QA_generators.py

* Create T5_closed_book_QA_generators.md

* Delete T5_closed_book_QA_generators.py

Moved to proper folder structure

* fixed pre-commit issues

Co-authored-by: Yannic Kilcher <yk@users.noreply.github.com>
2023-01-14 22:16:54 +00:00
Valentino c36a7ae9da Add UnifiedQA datasets that are not in xP3 (#666)
* Add UnifiedQA datasets that are not in xP3
2023-01-14 09:57:22 +00:00
Andrew Maguire defd453639 replace andrewm4894 with LAION-AI now that example notebook is merged (#573) 2023-01-09 23:59:00 +01:00
Andrew Maguire a206a9a9ab add example notebook (#512)
adds an example notebook and some docs
2023-01-09 14:11:35 +00:00
Andrew Maguire eaefa68dea add colab buttons to notebooks (#468) 2023-01-07 13:54:16 +00:00
Yannic Kilcher 6a64af9782 Merge pull request #418 from furlat/main
Create openbugger_example.ipynb
2023-01-06 22:38:48 +01:00
furlat f7ee8fd74b pre_commit openbugger md 2023-01-05 23:10:34 +01:00
furlat bb66f5d750 Added md for openbugger_example 2023-01-05 23:09:01 +01:00
b-mc2 ee370e85a4 Changes from pre-commit 2023-01-05 14:55:26 -05:00
furlat 51739fa5b7 Update openbugger_example.ipynb 2023-01-05 20:33:04 +01:00
furlat 5b9cdcec48 Create openbugger_example.ipynb
added an example for open_bugger
2023-01-05 20:19:31 +01:00
b-mc2 65524a2140 StackExchange Builder update
Included markdown file and the initial notebook for ingesting and processing StackExchange data dumps
2023-01-03 21:36:05 -05:00
Janosh Riebesell d717c48a92 black-jupyter again after merge-conflict fix 2023-01-03 06:43:31 -08:00
Janosh Riebesell 731477e2e3 Merge branch 'main' into pre-commit-jupyter-black 2023-01-03 06:42:40 -08:00
Janosh Riebesell 3740e38342 pre-commit unexclude notebooks and pre-commit run --all-files 2023-01-03 06:40:42 -08:00
Yannic Kilcher 1ae880ddab Merge pull request #316 from Jac-Zac/main
Fix table in detoxify eval readme
2023-01-03 10:24:33 +01:00
Yannic Kilcher d7583efff1 Merge pull request #278 from chandru1003/main
added clear list for more understanding of result
2023-01-03 10:12:06 +01:00
Jac-Zac 556c4f2d91 Fix table in detoxify eval readme 2023-01-03 08:48:18 +01:00
Yannic Kilcher 194ee6632a added QA notebook from rallio 2023-01-02 21:51:36 +01:00
chandru 19502cf8bf added clear list for more understanding of result 2023-01-02 19:34:37 +05:30
James Melvin 2ff6108047 fix: typo in EssayInstructions.md and EssayRevision.md 2023-01-02 12:43:37 +05:30
Yannic Kilcher 4841550cd4 Merge pull request #212 from bitplane/prettier-markdown
Format markdown with prettier --prose-wrap=always
2023-01-01 22:11:11 +01:00
Gareth Davidson c3c7a1701a run prettier with new params 2023-01-01 20:57:35 +00:00
Alexander Goryunov 0f7aa6e02a Removed cell outputs 2023-01-01 12:57:01 +02:00
Alexander Goryunov f15f2c29ed Fixed text formatting and made code more Pythonic 2023-01-01 12:49:05 +02:00
GravermanDev a1ba434c31 added all the changes again but this time they are compatible 2022-12-31 17:18:02 +01:00
Szymon Ożóg 11241c9895 Fixed readme file for style checks 2022-12-31 15:00:28 +01:00
Szymon Ożóg 5bd4cc84f1 Created a detoxify evaluation notebook and a readme file containing a summary 2022-12-30 15:12:54 +01:00