34 Commits

Author SHA1 Message Date
Loubna Ben Allal ae3f44fc7d Add Smollm2 pipeline (#205)
* add smollm2 pipeline

* update readme
2024-11-21 13:46:39 +01:00
Loubna Ben Allal 73dce0c35d Add Smollm (#194)
* add smollm

* add to news
2024-08-19 08:47:20 +02:00
Kashif Rasul 444e0f8414 Update README.md (#184)
fix formatting
2024-07-30 11:05:50 +02:00
Kashif Rasul 98563353d7 CITATION.cff and fix F401 warning (#183)
* fix F401 warning

* add CITATION.cff

* update version in CITATION

* update title

* fix label

* Update src/alignment/__init__.py

Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>

* make style

* add Alvaro Bartolome

* update version in readme

---------

Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
2024-07-30 10:56:17 +02:00
Kashif Rasul 95dc47218c update API to use latest TRL (#182)
* update API

* update deepspeed

* update black

* remove unused import

* fix typos

* fix typos in readmes

* fix grammer

* removed as it exists in superclass

* fixes in readme

* Update README.md

Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>

* Update src/alignment/configs.py

Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>

* Update src/alignment/configs.py

Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>

* Update src/alignment/configs.py

Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>

* Update src/alignment/configs.py

Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>

* add back dataset_kwargs

* use hub_model_revision in sft and dpo

* fix duplicate

---------

Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
2024-07-30 09:16:25 +02:00
Zizheng Yang 84f8c92820 Update README.md (#152)
If use 2.3.6, there will be an error

ImportError: /root/miniconda3/envs/handbook/lib/python3.10/site-packages/flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops9_pad_enum4callERKNS_6TensorEN3c108ArrayRefINS5_6SymIntEEElNS5_8optionalIdEE

If we use the newest flash_attn version, there will be no trouble!
2024-04-25 10:36:22 +02:00
Alvaro Bartolome cf1975a7cb Add ORPO within README.md files (#154)
* Add `ORPO` within `scripts/README.md`

* Fix typo in `ModelArguments.base_model_revision`

* Add `ORPO` within `README.md`

* Add Zephyr 141B in "News" section
2024-04-25 10:35:45 +02:00
Bram Vanroy 595023faa4 Adding continued_pretraining task (#131)
* add continued pretraining script

* simplify config; add dataset_config option

* add ds configs in data mixer creator

* use extended sftconfig

* add option to avoid setting chat template

* fix data_configs bug

* add continued pretraining info

* add gpt2-nl recipe for continued pretraining example

* add final newline

* make style

* Update README.md

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* Update README.md

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* Update recipes/gpt2-nl/README.md

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>

* rename continued pretraining to cpt

* improve README

---------

Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
2024-03-14 15:15:23 +01:00
lewtun a9b8a50a27 🌟 (#135)
* Add StarChat2

* Add DPO

* Fix unit test

* Typos

* Typo
2024-03-12 17:22:21 +01:00
lewtun ff618a4d13 🪁 (#129)
* Add Gemma 7B recipe

* Use Gemma template

* Make it work for dolly lol

* Enable cahce

* Clean up

* DPO to the max

* DPO, DPO, DPO

* Add openhermes

* Add custom configs

* Add kwargs

* Fix config

* Bump deps

* Move old recipes

* Add doc

* Add norte

* Renable cache

* Nuke

* Clean

* Apply suggestions from code review

Co-authored-by: Alvaro Bartolome <alvaro@argilla.io>

* Fix isort

* Update README.md

* Update config_full.yaml

---------

Co-authored-by: Alvaro Bartolome <alvaro@argilla.io>
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
2024-03-01 17:29:42 +01:00
Ikko Eltociear Ashimine d00e6f043e Update README.md (#113)
evalutions -> evaluations
2024-02-02 09:20:20 +01:00
Kosti b4bd3a4984 Blog post url: "constitutional-ai" -> "constitutional_ai" (#112) 2024-02-01 09:21:53 -08:00
lewtun 995d50912b Update README.md (#111)
* Update README.md

* Update README.md
2024-02-01 17:02:43 +01:00
Edward Beeching cbcb3f60fb DPO/IPO/KTO ablations (#104)
* adds configs and readme

* cleaning config files

* fix typos and removes things from config

* updates text to use comparisons rather ablations

* fix readme and adds launch script

* fix launch script, adds blogpost link

* bump release version, added missing dep, fixes configs

* updates main readme file
2024-01-18 14:55:00 +01:00
lewtun f0ffa0d7a6 Update Zephyr configs to account for UltraFeedback & TRL fixes (#88)
* Add files

* Add checkpointing

* Add checkpointing to SFT

* Add loss type

* Fix setup|

* Clean SFT

* Add lora config

* Rename config

* Remove max eval samples

* Add kwargs tp push to hub

* Add DPO configs

* Fix dpo configs

* Extend chat template test to multi-turn

* Add warmup

* Refactor

* Fix LoRA -> QLoRA

* Fix configs

* Specify chat template

* Add sample logging

* Fix push to hub hanging

* Add reentrant

* Fix quality

* Add transformer logging

* Tweak grad acc

* Add null type

* Add doc
2024-01-10 17:42:24 +11:00
Girraj Jangid 7d6fe765ec Update README.md (#35)
update installation instruction. Added git cmd
2023-11-20 08:52:16 +01:00
Alvaro Bartolome 0e09b0c6ec Fix note syntax highlighting in README.md (#20) 2023-11-15 08:45:27 +01:00
lewtun a1afb2bbd4 Fix image alignment (#19) 2023-11-12 15:47:10 +01:00
lewtun 4c6226bc42 Add moar explanations (#18) 2023-11-12 15:43:39 +01:00
Kashif Rasul 4b0c1fe170 fix typos (#17) 2023-11-12 13:44:50 +01:00
lewtun 363e29ff95 Apply suggestions from code review
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
2023-11-10 15:49:00 +01:00
Lewis Tunstall edf67d1d93 Tweaks 2023-11-10 11:15:45 +00:00
Lewis Tunstall 610a1a2de4 Add unit tests for data mixer 2023-11-10 08:37:53 +00:00
Lewis Tunstall 89f58a043c Add project structure 2023-11-09 14:40:23 +00:00
Lewis Tunstall 33a0ce3afd Add more doc 2023-11-09 13:39:03 +00:00
Lewis Tunstall d2900adc83 Make it work! 2023-11-08 16:31:57 +00:00
lewtun 3d8570af1e Update README.md 2023-10-26 23:21:53 +02:00
Lewis Tunstall 3a8003c782 Fix Makefile 2023-10-26 09:30:24 +00:00
Stefano Fiorucci 73c5c136c3 fix typos 2023-10-10 22:28:22 +02:00
Lewis Tunstall 8248414cf6 Rename repo 2023-10-09 19:12:47 +02:00
Lewis Tunstall 8197fe1b1e Update readme 2023-10-09 16:46:56 +02:00
Nathan Lambert c3fb784998 fixes 2023-08-29 16:30:18 -07:00
Lewis Tunstall 8903d4aff8 Add skeleton structure 2023-08-29 09:33:26 +02:00
lewtun 1aa5a69b17 Initial commit 2023-08-25 13:35:35 +02:00