Traun Leyden
5ad6db0c79
Fixes #96 by handling RepositoryNotFoundError ( #97 )
...
* Fixes #96 by handling RepositoryNotFoundError
* Update src/alignment/model_utils.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
* Remove redundant code
* Add unit test
* Reformat file
* make style
---------
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
2024-02-01 15:47:14 +01:00
Nathan Azrak
ad3d43aeea
Make peft bnb_4bit_compute_dtype consistent with torch_dtype ( #107 )
...
Co-authored-by: Nathan Azrak <nazrak@atlassian.com >
2024-01-29 11:59:15 +01:00
Nathan Azrak
de7d8883cd
Add check before inserting system message ( #106 )
...
* add check before inserting system message
* change in-place for consistency
* fix unit test
---------
Co-authored-by: Nathan Azrak <nazrak@atlassian.com >
2024-01-29 11:56:24 +01:00
Edward Beeching
cbcb3f60fb
DPO/IPO/KTO ablations ( #104 )
...
* adds configs and readme
* cleaning config files
* fix typos and removes things from config
* updates text to use comparisons rather ablations
* fix readme and adds launch script
* fix launch script, adds blogpost link
* bump release version, added missing dep, fixes configs
* updates main readme file
2024-01-18 14:55:00 +01:00
lewtun
c74ed11171
Bump lower version of huggingface_hub ( #95 )
...
* Bump lower version of huggingface_hub
* Fix dep
2024-01-11 23:09:48 +11:00
lewtun
f0ffa0d7a6
Update Zephyr configs to account for UltraFeedback & TRL fixes ( #88 )
...
* Add files
* Add checkpointing
* Add checkpointing to SFT
* Add loss type
* Fix setup|
* Clean SFT
* Add lora config
* Rename config
* Remove max eval samples
* Add kwargs tp push to hub
* Add DPO configs
* Fix dpo configs
* Extend chat template test to multi-turn
* Add warmup
* Refactor
* Fix LoRA -> QLoRA
* Fix configs
* Specify chat template
* Add sample logging
* Fix push to hub hanging
* Add reentrant
* Fix quality
* Add transformer logging
* Tweak grad acc
* Add null type
* Add doc
2024-01-10 17:42:24 +11:00
Nathan Azrak
c69ae4b8a5
Check that default_chat_template is also None ( #83 )
...
* Check that `default_chat_template` is also None before overwriting chat template
* add unit test to `get_tokenizer` to ensure default behaviour of chat template is not changed
---------
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
2024-01-08 17:54:23 +11:00
Kirill
98fe28fb14
Clean deprecated max_samples arguments ( #89 )
2024-01-05 09:06:47 +11:00
Evgenii Zheltonozhskii
e316174e1c
Add warmup to config ( #71 )
...
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
2024-01-04 16:04:46 +11:00
NielsRogge
57508b5c2d
Make SFT script consistent with DPO script ( #86 )
...
* Add argument
* Make scripts consistent
* Fix style
---------
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
2024-01-04 15:55:58 +11:00
Lewis Tunstall
8f6e5b666b
Bump dev version v0.3.0.dev
2024-01-04 01:39:11 +00:00
Scott Fleming
61a11a5c7d
Update docstring for data.py to reflect true behavior of shuffle parameter ( #60 )
...
* Update data.py
The docs state that the `shuffle` parameter in `mix_datasets` from `data.py` controls `Whether to shuffle the training data`, but then in the code if `shuffle` is set to `True` it also shuffles the test data. This small change makes the functionality consistent with the docstring. (If you instead want to keep the functionality the same, then we should update the docstring).
* Update data.py
Reverted to the original code structure but updated the docstring.
* Update docstring in `get_dataset` and `mix_datasets`
Updated docstrings to reflect the fact that `shuffle` being set to `True` leads to shuffling of both the training and testing/validation data.
2023-12-06 10:44:17 +01:00
lewtun
1c06e4e5e1
Update doc CI ( #64 )
2023-12-05 12:31:30 +01:00
Nathan Azrak
3f368a0748
Add check that parameters are not intended to be offloaded ( #51 )
...
* Add check that parameters are not intended to be offloaded
* Only push model to device if quantization config is set.
2023-12-04 09:10:41 +01:00
Dragan Milchevski
15279e7157
Allow loading datasets from disk using load_from_disk method. ( #53 )
...
* feat: Allow loading datasets from disk using `load_from_disk` method.
* Fixing the type of error being catched.
2023-12-01 11:05:35 +01:00
Dragan Milchevski
80e952ec47
Allow running DPO from a local model ( #49 )
...
* Update model_utils.py
Check if a model is adapter model when a local path is supplied instead of HF model
* Cleaner solution, thanks to lewtun
2023-11-27 11:31:09 +01:00
Thomas Capelle
f025057ce4
Missing config params on SFT ( #31 )
...
* fix warmup with total number of steps
* Explicitely tell to use 80GB Gpus
* Revert "fix warmup with total number of steps"
This reverts commit 760e477efdbf7f67be766a0d43b0c3b2ac26947a.
2023-11-21 12:00:09 +01:00
Alvaro Bartolome
c9d9035f95
Fix apply_chat_template function for dpo and unknown task ( #30 )
...
* Fix `apply_chat_template` function for `dpo` and unknown `task`
* Remove duplicated `# coding=utf-8`
* Manually run `black --line-length 119`
2023-11-21 11:47:21 +01:00
Girraj Jangid
7d6fe765ec
Update README.md ( #35 )
...
update installation instruction. Added git cmd
2023-11-20 08:52:16 +01:00
Alvaro Bartolome
0e09b0c6ec
Fix note syntax highlighting in README.md ( #20 )
2023-11-15 08:45:27 +01:00
lewtun
a1afb2bbd4
Fix image alignment ( #19 )
2023-11-12 15:47:10 +01:00
lewtun
4c6226bc42
Add moar explanations ( #18 )
2023-11-12 15:43:39 +01:00
Kashif Rasul
4b0c1fe170
fix typos ( #17 )
2023-11-12 13:44:50 +01:00
lewtun
43f52224db
Merge pull request #14 from sebastianschramm/ses/fix_typos_zephyr_recipe
...
Resolves #13 fix typo in zephyr recipe readme
2023-11-10 17:06:35 +01:00
Sebastian Schramm
d48a4a477b
Resolves #13 fix typo in zephyr recipe readme
2023-11-10 16:57:39 +01:00
lewtun
e4f98e7d8f
Merge pull request #11 from huggingface/zephyr-recipe
...
Code release
2023-11-10 15:54:13 +01:00
lewtun
363e29ff95
Apply suggestions from code review
...
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com >
2023-11-10 15:49:00 +01:00
Lewis Tunstall
f5e70fbf9e
Add licenses
2023-11-10 14:47:54 +00:00
Lewis Tunstall
5a630a1989
Add QLoRA command
2023-11-10 13:57:52 +00:00
Lewis Tunstall
e2e8ab945d
Refactor imports
2023-11-10 13:38:45 +00:00
edbeeching
7f1a14e0d4
adds auto adapter merge to dpo script
2023-11-10 14:15:44 +01:00
Lewis Tunstall
54185783e0
Remove QLoRa for now
2023-11-10 11:20:39 +00:00
Lewis Tunstall
edf67d1d93
Tweaks
2023-11-10 11:15:45 +00:00
Lewis Tunstall
551f901f95
Fix dep
2023-11-10 11:02:44 +00:00
Lewis Tunstall
a0b8d49424
Rename recipe
2023-11-10 10:49:13 +00:00
Lewis Tunstall
64f1834e01
Add config tests
2023-11-10 10:00:05 +00:00
Lewis Tunstall
8699f47bf3
Add jinja2 to req deps
2023-11-10 09:45:22 +00:00
lewtun
b1b0c1c8c0
Update setup.py
...
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com >
2023-11-10 10:44:06 +01:00
Lewis Tunstall
2ed5a45d25
Add model utils tests
2023-11-10 09:42:15 +00:00
Lewis Tunstall
0af8011993
Bump deps
2023-11-10 08:41:17 +00:00
Lewis Tunstall
610a1a2de4
Add unit tests for data mixer
2023-11-10 08:37:53 +00:00
edbeeching
0f0b61c096
ups lora bs x grad_acc to 64
2023-11-10 09:30:54 +01:00
edbeeching
13141a4b0b
adds updated model paths, adds eval to sft scripts
2023-11-10 09:26:39 +01:00
Lewis Tunstall
4b0769d137
Fix links
2023-11-09 14:42:57 +00:00
Lewis Tunstall
89f58a043c
Add project structure
2023-11-09 14:40:23 +00:00
Lewis Tunstall
44b324487d
Bump bs
2023-11-09 14:20:43 +00:00
Lewis Tunstall
756bb76d22
Fix Slurm opts
2023-11-09 14:09:52 +00:00
Lewis Tunstall
33a0ce3afd
Add more doc
2023-11-09 13:39:03 +00:00
edbeeching
3a5430222e
removes need for yq dep
2023-11-09 13:04:34 +01:00
edbeeching
49da3ef739
adds configs and instructions for lora training
2023-11-09 10:56:25 +01:00