mirror of
https://github.com/wassname/alpaca_convert.git
synced 2026-06-27 17:47:04 +08:00
Update README.md
This commit is contained in:
@@ -3,6 +3,8 @@ Made some adjust for the code in peft and gptq for llama, and make it possible f
|
||||
<br>
|
||||
~Still numerically unstable.~ Resolved.
|
||||
<br>
|
||||
Reconstruct fp16 matrix from 4bit data and call torch.matmul drastically increased the inference speed.
|
||||
<br>
|
||||
# Requirements
|
||||
gptq-for-llama: https://github.com/qwopqwop200/GPTQ-for-LLaMa<br>
|
||||
peft: https://github.com/huggingface/peft.git<br>
|
||||
|
||||
Reference in New Issue
Block a user