vllm/cacheflow/models at 7a7929abe8e2fd6a4688487c471a1ee1fde0edd2 - vllm - Gitea: Git with a cup of tea

wassname/vllm

mirror of https://github.com/wassname/vllm.git synced 2026-07-01 01:28:13 +08:00

Files

T

History

Woosuk Kwon 88c0268a18 Implement custom kernel for LLaMA rotary embedding (#14 )

2023-03-30 11:04:21 -07:00

..

__init__.py

Support tensor parallel (#2 )

2023-03-21 13:45:42 -07:00

attention.py

Implement custom kernel for LLaMA rotary embedding (#14 )

2023-03-30 11:04:21 -07:00

input_metadata.py

Support tensor parallel (#2 )

2023-03-21 13:45:42 -07:00

llama.py

Implement custom kernel for LLaMA rotary embedding (#14 )

2023-03-30 11:04:21 -07:00

memory_analyzer.py

Implement custom kernel for LLaMA rotary embedding (#14 )

2023-03-30 11:04:21 -07:00

model_utils.py

Implement LLaMA (#9 )

2023-03-30 12:25:32 +08:00

opt.py

Implement custom kernel for LLaMA rotary embedding (#14 )

2023-03-30 11:04:21 -07:00

sample.py

Implement custom kernel for LLaMA rotary embedding (#14 )

2023-03-30 11:04:21 -07:00

utils.py

FastAPI-based working frontend (#10 )

2023-03-29 14:48:56 +08:00