vllm/models at bcf43ab1f380208ea33769c49d116ea83f915080 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-17 20:27:35 +08:00

History

Qiu 46cbbca05c

[CI][DCP][Perf] reduce DCP CI execution time (#29858 )

Signed-off-by: QiuChunshuo <qiuchunshuo@huawei.com>

2025-12-04 17:28:21 +00:00

..

[Mistral-Small 3.1] Update docs and tests (#14977 )

2025-03-18 03:29:42 -07:00

[Model][6/N] Improve all pooling task | Support chunked prefill with ALL pooling (#27145 )

2025-12-04 13:44:15 +00:00

[Chore] Deprecate merge_by_field_config arg (#30035 )

2025-12-04 17:21:24 +00:00

[Bugfix][Quantization] Support BF16 tensors on GGUF (#29948 )

2025-12-03 10:33:46 +00:00

__init__.py

…

registry.py

[CI][DCP][Perf] reduce DCP CI execution time (#29858 )

2025-12-04 17:28:21 +00:00

test_gguf_download.py

[Chore]: Reorganize gguf utils funtions under transformers_utils (#29891 )

2025-12-02 17:33:23 +00:00

test_initialization.py

[Attention] Refactor CUDA attention backend selection logic (#24794 )

2025-11-11 07:40:44 -05:00

test_oot_registration.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_registry.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_terratorch.py

[Frontend] Require flag for loading text and image embeds (#27204 )

2025-10-22 15:52:02 +00:00

test_transformers.py

[Hardware][AMD] Remove ROCm skip conditions for transformers backend tests (#29782 )

2025-12-02 02:03:13 +08:00

test_utils.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_vision.py

[Chore] Separate out system utilities from vllm.utils (#27201 )

2025-10-22 20:25:25 +00:00

utils.py

[Chore] Move tokenizer initialization methods (#29793 )

2025-12-02 13:33:37 +08:00