vllm/models at 444f0e3f339caba85f84c6628e1df50605b241a0 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-10 06:20:10 +08:00

History

Isotr0py b952f4d3c3

[v1] Add PrefixLM support to FlexAttention backend (#27938 )

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

2025-12-07 15:51:36 +00:00

..

[Mistral-Small 3.1] Update docs and tests (#14977 )

2025-03-18 03:29:42 -07:00

Revert "[Renderer] Separate out RendererConfig from ModelConfig (#30145 )" (#30199 )

2025-12-07 00:00:22 -08:00

[v1] Add PrefixLM support to FlexAttention backend (#27938 )

2025-12-07 15:51:36 +00:00

[Bugfix][Quantization] Support BF16 tensors on GGUF (#29948 )

2025-12-03 10:33:46 +00:00

__init__.py

…

registry.py

Revert "[Renderer] Separate out RendererConfig from ModelConfig (#30145 )" (#30199 )

2025-12-07 00:00:22 -08:00

test_gguf_download.py

[Chore]: Reorganize gguf utils funtions under transformers_utils (#29891 )

2025-12-02 17:33:23 +00:00

test_initialization.py

[Attention] Refactor CUDA attention backend selection logic (#24794 )

2025-11-11 07:40:44 -05:00

test_oot_registration.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_registry.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_terratorch.py

[Frontend] Require flag for loading text and image embeds (#27204 )

2025-10-22 15:52:02 +00:00

test_transformers.py

[Hardware][AMD] Remove ROCm skip conditions for transformers backend tests (#29782 )

2025-12-02 02:03:13 +08:00

test_utils.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_vision.py

[Chore] Separate out system utilities from vllm.utils (#27201 )

2025-10-22 20:25:25 +00:00

utils.py

Revert "[Renderer] Separate out RendererConfig from ModelConfig (#30145 )" (#30199 )

2025-12-07 00:00:22 -08:00