theblackcat102
|
ba336fb087
|
[fix] fix freeze top N layers
|
2022-12-31 17:43:27 +00:00 |
|
theblackcat102
|
918b7b7ec0
|
[feature] Add galactica training config
|
2023-01-01 01:25:53 +08:00 |
|
theblackcat102
|
24e06626f4
|
[fix] Fix missing configs
|
2022-12-31 17:04:44 +00:00 |
|
theblackcat102
|
f3c299757d
|
[feature] added configs argument for parameters training and recording
|
2022-12-31 17:02:46 +00:00 |
|
theblackcat102
|
d2572d0323
|
[fix] Add drop_token_type to use galactica
|
2022-12-31 09:42:49 +00:00 |
|
theblackcat102
|
3a10f1024a
|
[fix] Fix truncation in collate fn
|
2022-12-31 09:27:09 +00:00 |
|
theblackcat102
|
b2ef4695a0
|
[fix] Fix missing accuracy and eval loss
|
2022-12-31 03:47:54 +00:00 |
|
theblackcat102
|
bcd5c52b3b
|
[feature] working trainer code
|
2022-12-31 03:02:10 +00:00 |
|
theblackcat102
|
ad98a28241
|
[feature] add rank dataset for webgpt and human feedback summary
|
2022-12-30 17:25:50 +00:00 |
|