wassname
6d128ea986
wip
2025-06-04 05:37:07 +00:00
wassname
a264efaa4c
better formatting
2025-06-03 22:21:18 +00:00
wassname
097e4e0b01
wip
2025-06-02 22:31:52 +00:00
wassname
880d4eda1e
chat template fix
2025-06-02 07:27:46 +00:00
wassname
2819dd46d0
fmt
2025-06-02 07:13:52 +00:00
wassname
8708597941
wip
2025-06-02 05:51:13 +00:00
Sergio Paniego Blanco
205b881c80
Typo fixed (#206)
2025-04-30 16:19:36 +02:00
Loubna Ben Allal
ae3f44fc7d
Add Smollm2 pipeline (#205)
...
* add smollm2 pipeline
* update readme
2024-11-21 13:46:39 +01:00
Loubna Ben Allal
73dce0c35d
Add Smollm (#194)
...
* add smollm
* add to news
2024-08-19 08:47:20 +02:00
Kashif Rasul
444e0f8414
Update README.md (#184)
...
fix formatting
2024-07-30 11:05:50 +02:00
Kashif Rasul
98563353d7
CITATION.cff and fix F401 warning (#183)
...
* fix F401 warning
* add CITATION.cff
* update version in CITATION
* update title
* fix label
* Update src/alignment/__init__.py
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
* make style
* add Alvaro Bartolome
* update version in readme
---------
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
2024-07-30 10:56:17 +02:00
Kashif Rasul
95dc47218c
update API to use latest TRL (#182)
...
* update API
* update deepspeed
* update black
* remove unused import
* fix typos
* fix typos in readmes
* fix grammar
* removed as it exists in superclass
* fixes in readme
* Update README.md
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
* Update src/alignment/configs.py
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
* Update src/alignment/configs.py
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
* Update src/alignment/configs.py
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
* Update src/alignment/configs.py
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
* add back dataset_kwargs
* use hub_model_revision in sft and dpo
* fix duplicate
---------
Co-authored-by: Alvaro Bartolome <36760800+alvarobartt@users.noreply.github.com>
2024-07-30 09:16:25 +02:00
Zizheng Yang
84f8c92820
Update README.md (#152)
...
With 2.3.6, there is an error:
ImportError: /root/miniconda3/envs/handbook/lib/python3.10/site-packages/flash_attn_2_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops9_pad_enum4callERKNS_6TensorEN3c108ArrayRefINS5_6SymIntEEElNS5_8optionalIdEE
With the newest flash_attn version, there is no problem.
2024-04-25 10:36:22 +02:00
Alvaro Bartolome
cf1975a7cb
Add ORPO within README.md files (#154)
...
* Add `ORPO` within `scripts/README.md`
* Fix typo in `ModelArguments.base_model_revision`
* Add `ORPO` within `README.md`
* Add Zephyr 141B in "News" section
2024-04-25 10:35:45 +02:00
Bram Vanroy
595023faa4
Adding continued_pretraining task (#131)
...
* add continued pretraining script
* simplify config; add dataset_config option
* add ds configs in data mixer creator
* use extended sftconfig
* add option to avoid setting chat template
* fix data_configs bug
* add continued pretraining info
* add gpt2-nl recipe for continued pretraining example
* add final newline
* make style
* Update README.md
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
* Update README.md
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
* Update recipes/gpt2-nl/README.md
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
* rename continued pretraining to cpt
* improve README
---------
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
2024-03-14 15:15:23 +01:00
lewtun
a9b8a50a27
🌟 (#135)
...
* Add StarChat2
* Add DPO
* Fix unit test
* Typos
* Typo
2024-03-12 17:22:21 +01:00
lewtun
ff618a4d13
🪁 (#129)
...
* Add Gemma 7B recipe
* Use Gemma template
* Make it work for dolly lol
* Enable cache
* Clean up
* DPO to the max
* DPO, DPO, DPO
* Add openhermes
* Add custom configs
* Add kwargs
* Fix config
* Bump deps
* Move old recipes
* Add doc
* Add norte
* Re-enable cache
* Nuke
* Clean
* Apply suggestions from code review
Co-authored-by: Alvaro Bartolome <alvaro@argilla.io>
* Fix isort
* Update README.md
* Update config_full.yaml
---------
Co-authored-by: Alvaro Bartolome <alvaro@argilla.io>
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
2024-03-01 17:29:42 +01:00
Ikko Eltociear Ashimine
d00e6f043e
Update README.md (#113)
...
evalutions -> evaluations
2024-02-02 09:20:20 +01:00
Kosti
b4bd3a4984
Blog post url: "constitutional-ai" -> "constitutional_ai" (#112)
2024-02-01 09:21:53 -08:00
lewtun
995d50912b
Update README.md (#111)
...
* Update README.md
* Update README.md
2024-02-01 17:02:43 +01:00
Edward Beeching
cbcb3f60fb
DPO/IPO/KTO ablations (#104)
...
* adds configs and readme
* cleaning config files
* fix typos and removes things from config
* updates text to use comparisons rather than ablations
* fix readme and adds launch script
* fix launch script, adds blogpost link
* bump release version, added missing dep, fixes configs
* updates main readme file
2024-01-18 14:55:00 +01:00
lewtun
f0ffa0d7a6
Update Zephyr configs to account for UltraFeedback & TRL fixes (#88)
...
* Add files
* Add checkpointing
* Add checkpointing to SFT
* Add loss type
* Fix setup
* Clean SFT
* Add lora config
* Rename config
* Remove max eval samples
* Add kwargs to push to hub
* Add DPO configs
* Fix dpo configs
* Extend chat template test to multi-turn
* Add warmup
* Refactor
* Fix LoRA -> QLoRA
* Fix configs
* Specify chat template
* Add sample logging
* Fix push to hub hanging
* Add reentrant
* Fix quality
* Add transformer logging
* Tweak grad acc
* Add null type
* Add doc
2024-01-10 17:42:24 +11:00
Girraj Jangid
7d6fe765ec
Update README.md (#35)
...
update installation instruction. Added git cmd
2023-11-20 08:52:16 +01:00
Alvaro Bartolome
0e09b0c6ec
Fix note syntax highlighting in README.md (#20)
2023-11-15 08:45:27 +01:00
lewtun
a1afb2bbd4
Fix image alignment (#19)
2023-11-12 15:47:10 +01:00
lewtun
4c6226bc42
Add moar explanations (#18)
2023-11-12 15:43:39 +01:00
Kashif Rasul
4b0c1fe170
fix typos (#17)
2023-11-12 13:44:50 +01:00
lewtun
363e29ff95
Apply suggestions from code review
...
Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>
2023-11-10 15:49:00 +01:00
Lewis Tunstall
edf67d1d93
Tweaks
2023-11-10 11:15:45 +00:00
Lewis Tunstall
610a1a2de4
Add unit tests for data mixer
2023-11-10 08:37:53 +00:00
Lewis Tunstall
89f58a043c
Add project structure
2023-11-09 14:40:23 +00:00
Lewis Tunstall
33a0ce3afd
Add more doc
2023-11-09 13:39:03 +00:00
Lewis Tunstall
d2900adc83
Make it work!
2023-11-08 16:31:57 +00:00
lewtun
3d8570af1e
Update README.md
2023-10-26 23:21:53 +02:00
Lewis Tunstall
3a8003c782
Fix Makefile
2023-10-26 09:30:24 +00:00
Stefano Fiorucci
73c5c136c3
fix typos
2023-10-10 22:28:22 +02:00
Lewis Tunstall
8248414cf6
Rename repo
2023-10-09 19:12:47 +02:00
Lewis Tunstall
8197fe1b1e
Update readme
2023-10-09 16:46:56 +02:00
Nathan Lambert
c3fb784998
fixes
2023-08-29 16:30:18 -07:00
Lewis Tunstall
8903d4aff8
Add skeleton structure
2023-08-29 09:33:26 +02:00
lewtun
1aa5a69b17
Initial commit
2023-08-25 13:35:35 +02:00