lewtun
87cc800498
Apply quantization during DPO QLoRA (#115)
...
* Add QLoRA fix
* Update script
2024-02-05 16:50:17 +01:00
Edward Beeching
cbcb3f60fb
DPO/IPO/KTO ablations (#104)
...
* adds configs and readme
* cleaning config files
* fix typos and removes things from config
* updates text to use comparisons rather than ablations
* fix readme and adds launch script
* fix launch script, adds blogpost link
* bump release version, added missing dep, fixes configs
* updates main readme file
2024-01-18 14:55:00 +01:00
lewtun
c74ed11171
Bump lower version of huggingface_hub (#95)
...
* Bump lower version of huggingface_hub
* Fix dep
2024-01-11 23:09:48 +11:00
lewtun
f0ffa0d7a6
Update Zephyr configs to account for UltraFeedback & TRL fixes (#88)
...
* Add files
* Add checkpointing
* Add checkpointing to SFT
* Add loss type
* Fix setup
* Clean SFT
* Add lora config
* Rename config
* Remove max eval samples
* Add kwargs to push to hub
* Add DPO configs
* Fix dpo configs
* Extend chat template test to multi-turn
* Add warmup
* Refactor
* Fix LoRA -> QLoRA
* Fix configs
* Specify chat template
* Add sample logging
* Fix push to hub hanging
* Add reentrant
* Fix quality
* Add transformer logging
* Tweak grad acc
* Add null type
* Add doc
2024-01-10 17:42:24 +11:00
Lewis Tunstall
8f6e5b666b
Bump dev version v0.3.0.dev
2024-01-04 01:39:11 +00:00
Lewis Tunstall
551f901f95
Fix dep
2023-11-10 11:02:44 +00:00
Lewis Tunstall
8699f47bf3
Add jinja2 to req deps
2023-11-10 09:45:22 +00:00
lewtun
b1b0c1c8c0
Update setup.py
...
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
2023-11-10 10:44:06 +01:00
Lewis Tunstall
0af8011993
Bump deps
2023-11-10 08:41:17 +00:00
Lewis Tunstall
610a1a2de4
Add unit tests for data mixer
2023-11-10 08:37:53 +00:00
Lewis Tunstall
33a0ce3afd
Add more doc
2023-11-09 13:39:03 +00:00
Lewis Tunstall
d2900adc83
Make it work!
2023-11-08 16:31:57 +00:00
Lewis Tunstall
967eab4cfb
Add skeleton
2023-11-08 13:21:57 +00:00
Lewis Tunstall
b9d9aa0a29
Fix style
2023-10-30 10:00:43 +01:00
Lewis Tunstall
a28b4cfc6e
Bump dev version
2023-10-26 10:17:28 +00:00
Lewis Tunstall
1bde6a7931
Add doc builder
2023-10-26 09:41:04 +00:00
Lewis Tunstall
14ffdca030
Add skeleton deps
2023-10-26 09:01:07 +00:00