Kirill
|
98fe28fb14
|
Clean deprecated max_samples arguments (#89)
|
2024-01-05 09:06:47 +11:00 |
|
NielsRogge
|
57508b5c2d
|
Make SFT script consistent with DPO script (#86)
* Add argument
* Make scripts consistent
* Fix style
---------
Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>
|
2024-01-04 15:55:58 +11:00 |
|
Nathan Azrak
|
3f368a0748
|
Add check that parameters are not intended to be offloaded (#51)
* Add check that parameters are not intended to be offloaded
* Only push model to device if quantization config is set.
|
2023-12-04 09:10:41 +01:00 |
|
lewtun
|
4c6226bc42
|
Add moar explanations (#18)
|
2023-11-12 15:43:39 +01:00 |
|
Kashif Rasul
|
4b0c1fe170
|
fix typos (#17)
|
2023-11-12 13:44:50 +01:00 |
|
Lewis Tunstall
|
5a630a1989
|
Add QLoRA command
|
2023-11-10 13:57:52 +00:00 |
|
Lewis Tunstall
|
e2e8ab945d
|
Refactor imports
|
2023-11-10 13:38:45 +00:00 |
|
edbeeching
|
7f1a14e0d4
|
adds auto adapter merge to dpo script
|
2023-11-10 14:15:44 +01:00 |
|
Lewis Tunstall
|
54185783e0
|
Remove QLoRa for now
|
2023-11-10 11:20:39 +00:00 |
|
Lewis Tunstall
|
edf67d1d93
|
Tweaks
|
2023-11-10 11:15:45 +00:00 |
|
Lewis Tunstall
|
a0b8d49424
|
Rename recipe
|
2023-11-10 10:49:13 +00:00 |
|
Lewis Tunstall
|
756bb76d22
|
Fix Slurm opts
|
2023-11-09 14:09:52 +00:00 |
|
Lewis Tunstall
|
33a0ce3afd
|
Add more doc
|
2023-11-09 13:39:03 +00:00 |
|
Lewis Tunstall
|
2de17f5ba1
|
Add doc
|
2023-11-09 07:32:24 +00:00 |
|
Lewis Tunstall
|
ee10c4efd9
|
Make DPO work!
|
2023-11-08 22:58:34 +00:00 |
|
Lewis Tunstall
|
e54e095978
|
Make it work for realz
|
2023-11-08 22:20:17 +00:00 |
|
Lewis Tunstall
|
d2900adc83
|
Make it work!
|
2023-11-08 16:31:57 +00:00 |
|
Lewis Tunstall
|
967eab4cfb
|
Add skeleton
|
2023-11-08 13:21:57 +00:00 |
|