vllm/vllm/model_executor at 08ccee1e830d39ecdb3c6cf382c843dbf5ae830e - vllm - Gitea: Git with a cup of tea

wassname/vllm

mirror of https://github.com/wassname/vllm.git synced 2026-07-02 14:33:29 +08:00

Files

T

History

Roger Wang c1dc547129 [Kernel] Fused MoE Config for Mixtral 8x22 (#4002 )

2024-04-11 07:50:00 -07:00

..

[Kernel] Fused MoE Config for Mixtral 8x22 (#4002 )

2024-04-11 07:50:00 -07:00

[Core][Model] torch.compile for layernorm in commandr (#3985 )

2024-04-11 01:48:26 +00:00

__init__.py

[Core] Refactor Attention Take 2 (#3462 )

2024-03-25 04:39:33 +00:00

guided_decoding.py

[Bugfix] Remove key sorting for guided_json parameter in OpenAi compatible Server (#3945 )

2024-04-10 10:15:51 -07:00

guided_logits_processors.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

model_loader.py

Usage Stats Collection (#2852 )

2024-03-28 22:16:12 -07:00

neuron_model_loader.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

sampling_metadata.py

[CI] Try introducing isort. (#3495 )

2024-03-25 07:59:47 -07:00

utils.py

[Hardware][Neuron] Refactor neuron support (#3471 )

2024-03-22 01:22:17 +00:00

weight_utils.py

[Core] Enable hf_transfer by default if available (#3817 )

2024-04-04 04:02:43 +00:00