wassname/alpaca_convert

mirror of https://github.com/wassname/alpaca_convert.git synced 2026-06-27 16:14:08 +08:00

Latest commit: 039af1a0db "readme" by wassname, 2023-04-10 16:21:01 +08:00

My personal repo for converting LoRA models to Hugging Face, GPTQ 4-bit, and ggml formats, so I can run them in the regular text-generation-webui and llama.cpp.

How do we do this?

lora -> hf
- tloen/alpaca-lora/export_hf_checkpoint.py
hf -> 4bit
- GPTQ-for-LLaMa/llama.py, run as:
  CUDA_VISIBLE_DEVICES=0 python llama.py ./llama-hf/llama-7b c4 --wbits 4 --true-sequential --act-order --groupsize 128 --save llama7b-4bit-128g.pt
4bit -> ggml
- llama.cpp/convert-pth-to-ggml.py
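
To make the hf -> 4bit step easier to repeat for other model sizes, the GPTQ command above can be assembled from parameters. This is only a sketch: `build_gptq_command` and `as_shell_line` are hypothetical helper names, not part of GPTQ-for-LLaMa, and the flags are copied from the command line shown above rather than checked against the script.

```python
import shlex


def build_gptq_command(model_dir, out_file, wbits=4, groupsize=128,
                       dataset="c4", gpu="0"):
    """Return (env, argv) for the quantization step.

    Hypothetical helper: the flag names mirror the llama.py command in this
    README; nothing here is imported from GPTQ-for-LLaMa itself.
    """
    env = {"CUDA_VISIBLE_DEVICES": gpu}
    argv = [
        "python", "llama.py", model_dir, dataset,
        "--wbits", str(wbits),
        "--true-sequential",
        "--act-order",
        "--groupsize", str(groupsize),
        "--save", out_file,
    ]
    return env, argv


def as_shell_line(env, argv):
    """Render env + argv as a single copy-pasteable shell line."""
    prefix = " ".join(f"{k}={shlex.quote(v)}" for k, v in env.items())
    return f"{prefix} {shlex.join(argv)}"


if __name__ == "__main__":
    env, argv = build_gptq_command("./llama-hf/llama-7b", "llama7b-4bit-128g.pt")
    print(as_shell_line(env, argv))
```

To actually run it, pass `argv` to `subprocess.run` with `env={**os.environ, **env}` from inside the GPTQ-for-LLaMa checkout.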

TODO

lora -> hf
- test this
hf -> 4bit
4bit -> ggml

setup env


conda create -n textgen3 python=3.10.9
conda activate textgen3
mamba install pytorch torchvision torchaudio pytorch-cuda=11.7 cudatoolkit-dev==11.7 cudatoolkit=11.7 -c pytorch -c nvidia -c conda-forge
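
A quick sanity check after creating the environment can save a failed conversion run later. The sketch below is an assumption about what is worth checking (Python 3.10 and a CUDA 11.7 build of PyTorch, taken from the commands above); `check_environment` is a hypothetical helper, not a script in this repo.

```python
import importlib.util
import sys

EXPECTED_PYTHON = (3, 10)  # from `python=3.10.9` in the conda command
EXPECTED_CUDA = "11.7"     # from `pytorch-cuda=11.7` in the mamba command


def check_environment():
    """Report whether the active environment looks like `textgen3`."""
    report = {
        "python_ok": sys.version_info[:2] == EXPECTED_PYTHON,
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "cuda_available": None,  # stays None when torch is missing
        "cuda_version": None,
    }
    if report["torch_installed"]:
        import torch  # imported lazily so the check works without torch

        report["cuda_available"] = torch.cuda.is_available()
        report["cuda_version"] = torch.version.cuda
    return report


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
    print(f"expected CUDA build: {EXPECTED_CUDA}")
```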

download models

# base models.... FIXME


# download loras
python scripts/download-model.py chansung/alpaca-lora-30b
python scripts/download-model.py chansung/alpaca-lora-13b
python scripts/download-model.py tloen/alpaca-lora-7b
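
The convert step further down refers to the downloaded LoRA as `loras/tloen_alpaca-lora-7b`, so the local folder name appears to be the Hugging Face repo id with `/` replaced by `_`. The helper below only encodes that inferred convention; `local_lora_dir` is a hypothetical name, and I have not checked it against what `scripts/download-model.py` actually does.

```python
from pathlib import Path


def local_lora_dir(repo_id, root="loras"):
    """Map a Hugging Face repo id to the local folder used in this README.

    Assumption: `user/name` is stored as `<root>/user_name`, inferred from
    the path passed to export_hf_checkpoint.py in the convert step.
    """
    user, name = repo_id.split("/", 1)
    return Path(root) / f"{user}_{name}"


if __name__ == "__main__":
    for repo in ("chansung/alpaca-lora-30b",
                 "chansung/alpaca-lora-13b",
                 "tloen/alpaca-lora-7b"):
        print(repo, "->", local_lora_dir(repo).as_posix())
```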

convert models

python scripts/export_hf_checkpoint.py ./models/llama-7b-hf -l loras/tloen_alpaca-lora-7b
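
For intuition about what a lora -> hf export has to do: in the standard LoRA formulation, each adapted weight becomes W' = W + (alpha / r) * B @ A, after which the adapter is no longer needed. The toy example below illustrates that merge on plain Python lists. It is a sketch of the general technique, not a description of how `scripts/export_hf_checkpoint.py` is implemented; `matmul` and `merge_lora` are names made up for this example.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def merge_lora(w, lora_a, lora_b, alpha, r):
    """Fold a low-rank update into a base weight matrix.

    w:      base weight, shape (out, in)
    lora_a: shape (r, in)
    lora_b: shape (out, r)
    Returns w + (alpha / r) * lora_b @ lora_a.
    """
    scale = alpha / r
    delta = matmul(lora_b, lora_a)  # (out, r) @ (r, in) -> (out, in)
    return [[w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


if __name__ == "__main__":
    base = [[1.0, 0.0], [0.0, 1.0]]
    a = [[1.0, 2.0]]      # rank r = 1
    b = [[0.5], [1.0]]
    print(merge_lora(base, a, b, alpha=2, r=1))
```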
