Commit Graph

18 Commits

Author SHA1 Message Date
lewtun ff618a4d13 🪁 (#129)
* Add Gemma 7B recipe

* Use Gemma template

* Make it work for dolly lol

* Enable cahce

* Clean up

* DPO to the max

* DPO, DPO, DPO

* Add openhermes

* Add custom configs

* Add kwargs

* Fix config

* Bump deps

* Move old recipes

* Add doc

* Add norte

* Renable cache

* Nuke

* Clean

* Apply suggestions from code review

Co-authored-by: Alvaro Bartolome <alvaro@argilla.io>

* Fix isort

* Update README.md

* Update config_full.yaml

---------

Co-authored-by: Alvaro Bartolome <alvaro@argilla.io>
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
2024-03-01 17:29:42 +01:00
lewtun 87cc800498 Apply quantization during DPO QLoRA (#115)
* Add QLoRA fix

* Update script
2024-02-05 16:50:17 +01:00
Edward Beeching cbcb3f60fb DPO/IPO/KTO ablations (#104)
* adds configs and readme

* cleaning config files

* fix typos and removes things from config

* updates text to use comparisons rather ablations

* fix readme and adds launch script

* fix launch script, adds blogpost link

* bump release version, added missing dep, fixes configs

* updates main readme file
2024-01-18 14:55:00 +01:00
lewtun c74ed11171 Bump lower version of huggingface_hub (#95)
* Bump lower version of huggingface_hub

* Fix dep
2024-01-11 23:09:48 +11:00
lewtun f0ffa0d7a6 Update Zephyr configs to account for UltraFeedback & TRL fixes (#88)
* Add files

* Add checkpointing

* Add checkpointing to SFT

* Add loss type

* Fix setup|

* Clean SFT

* Add lora config

* Rename config

* Remove max eval samples

* Add kwargs tp push to hub

* Add DPO configs

* Fix dpo configs

* Extend chat template test to multi-turn

* Add warmup

* Refactor

* Fix LoRA -> QLoRA

* Fix configs

* Specify chat template

* Add sample logging

* Fix push to hub hanging

* Add reentrant

* Fix quality

* Add transformer logging

* Tweak grad acc

* Add null type

* Add doc
2024-01-10 17:42:24 +11:00
Lewis Tunstall 8f6e5b666b Bump dev version v0.3.0.dev 2024-01-04 01:39:11 +00:00
Lewis Tunstall 551f901f95 Fix dep 2023-11-10 11:02:44 +00:00
Lewis Tunstall 8699f47bf3 Add jinja2 to req deps 2023-11-10 09:45:22 +00:00
lewtun b1b0c1c8c0 Update setup.py
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
2023-11-10 10:44:06 +01:00
Lewis Tunstall 0af8011993 Bump deps 2023-11-10 08:41:17 +00:00
Lewis Tunstall 610a1a2de4 Add unit tests for data mixer 2023-11-10 08:37:53 +00:00
Lewis Tunstall 33a0ce3afd Add more doc 2023-11-09 13:39:03 +00:00
Lewis Tunstall d2900adc83 Make it work! 2023-11-08 16:31:57 +00:00
Lewis Tunstall 967eab4cfb Add skeleton 2023-11-08 13:21:57 +00:00
Lewis Tunstall b9d9aa0a29 Fix style 2023-10-30 10:00:43 +01:00
Lewis Tunstall a28b4cfc6e Bump dev version 2023-10-26 10:17:28 +00:00
Lewis Tunstall 1bde6a7931 Add doc builder 2023-10-26 09:41:04 +00:00
Lewis Tunstall 14ffdca030 Add skeleton deps 2023-10-26 09:01:07 +00:00