Chansung Park
27f7dbf006
Fix BitsAndBytes JSON Serializable ( #191 )
...
* Update run_sft.py
* fix BitsAndBytes JSON serializable
* get_quantization_config to return dict
* to_dict() for load_in_8bit too
* convert quant test to use dict subscriptions instead of dot syntax
* Remove torch
---------
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
2024-08-20 15:02:18 +02:00
lewtun
a8dcde2cd3
Use ungated models for unit tests ( #196 )
...
* Fix unit test
* Fix chat template tests
* Remove deprecated test
* up
2024-08-19 12:33:11 +02:00
Kashif Rasul
95dc47218c
update API to use latest TRL ( #182 )
...
* update API
* update deepspeed
* update black
* remove unused import
* fix typos
* fix typos in readmes
* fix grammer
* removed as it exists in superclass
* fixes in readme
* Update README.md
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com >
* Update src/alignment/configs.py
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com >
* Update src/alignment/configs.py
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com >
* Update src/alignment/configs.py
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com >
* Update src/alignment/configs.py
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com >
* add back dataset_kwargs
* use hub_model_revision in sft and dpo
* fix duplicate
---------
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com >
2024-07-30 09:16:25 +02:00
Bram Vanroy
ba7e0e4fca
Fix dataloading for cpt ( #137 )
...
* avpid mutable parameter
* do not remove text_column for cpt
* fix typo
* add
* remove constant KEEPCOLS
* update tests with columns_to_keep
2024-03-21 20:05:53 +01:00
lewtun
a9b8a50a27
🌟 ( #135 )
...
* Add StarChat2
* Add DPO
* Fix unit test
* Typos
* Typo
2024-03-12 17:22:21 +01:00
Traun Leyden
5ad6db0c79
Fixes #96 by handling RepositoryNotFoundError ( #97 )
...
* Fixes #96 by handling RepositoryNotFoundError
* Update src/alignment/model_utils.py
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
* Remove redundant code
* Add unit test
* Reformat file
* make style
---------
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
2024-02-01 15:47:14 +01:00
Nathan Azrak
de7d8883cd
Add check before inserting system message ( #106 )
...
* add check before inserting system message
* change in-place for consistency
* fix unit test
---------
Co-authored-by: Nathan Azrak <nazrak@atlassian.com >
2024-01-29 11:56:24 +01:00
lewtun
f0ffa0d7a6
Update Zephyr configs to account for UltraFeedback & TRL fixes ( #88 )
...
* Add files
* Add checkpointing
* Add checkpointing to SFT
* Add loss type
* Fix setup|
* Clean SFT
* Add lora config
* Rename config
* Remove max eval samples
* Add kwargs tp push to hub
* Add DPO configs
* Fix dpo configs
* Extend chat template test to multi-turn
* Add warmup
* Refactor
* Fix LoRA -> QLoRA
* Fix configs
* Specify chat template
* Add sample logging
* Fix push to hub hanging
* Add reentrant
* Fix quality
* Add transformer logging
* Tweak grad acc
* Add null type
* Add doc
2024-01-10 17:42:24 +11:00
Nathan Azrak
c69ae4b8a5
Check that default_chat_template is also None ( #83 )
...
* Check that `default_chat_template` is also None before overwriting chat template
* add unit test to `get_tokenizer` to ensure default behaviour of chat template is not changed
---------
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com >
2024-01-08 17:54:23 +11:00
Lewis Tunstall
edf67d1d93
Tweaks
2023-11-10 11:15:45 +00:00
Lewis Tunstall
64f1834e01
Add config tests
2023-11-10 10:00:05 +00:00
Lewis Tunstall
2ed5a45d25
Add model utils tests
2023-11-10 09:42:15 +00:00
Lewis Tunstall
610a1a2de4
Add unit tests for data mixer
2023-11-10 08:37:53 +00:00
Lewis Tunstall
87033c09b3
Add tests folder
2023-10-26 09:54:20 +00:00