Update README.md

2026-06-27 17:47:04 +08:00 · 2023-03-21 16:49:08 +08:00
parent 8d198e0171
commit 3be75bb3db
1 changed files with 1 additions and 1 deletions
@@ -3,7 +3,7 @@ Made some adjust for the code in peft and gptq for llama, and make it possible f
 <br>
 ~Still numerically unstable.~ Resolved.
 <br>
-Reconstruct fp16 matrix from 4bit data and call torch.matmul drastically increased the inference speed.
+Reconstruct fp16 matrix from 4bit data and call torch.matmul largely increased the inference speed.
 <br>
 # Requirements
 gptq-for-llama: https://github.com/qwopqwop200/GPTQ-for-LLaMa<br>