vllm/tests at fd58b73a40d937ea6d2c55e5a8147cc0a605efe2

wassname/vllm

mirror of https://github.com/wassname/vllm.git synced 2026-07-02 10:47:39 +08:00

Yanming W 8efe23f150 Fix input_metadata.selected_token_indices in worker prepare_inputs (#1546)

2023-11-08 14:19:12 -08:00

Implement prompt logprobs & Batched topk for computing logprobs (#1328)

2023-10-16 10:56:50 -07:00

TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)

2023-10-02 15:36:09 -07:00

TP/quantization/weight loading refactor part 1 - Simplify parallel linear logic (#1181)

2023-10-02 15:36:09 -07:00

Fix integer overflows in attention & cache ops (#1514)

2023-10-31 15:19:30 -07:00

Add Mistral 7B to test_models (#1366)

2023-10-16 17:49:54 -07:00

Added logits processor API to sampling params (#1469)

2023-11-03 14:12:15 -07:00

Fix input_metadata.selected_token_indices in worker prepare_inputs (#1546)

2023-11-08 14:19:12 -08:00

__init__.py

[Small] Formatter only checks lints in changed files (#1528)

2023-10-31 15:39:38 -07:00

conftest.py

Implement prompt logprobs & Batched topk for computing logprobs (#1328)

2023-10-16 10:56:50 -07:00