xinyun/vllm (mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2025-12-25 02:45:02 +08:00)
vllm/tests/models
Latest commit: b30dfa03c5 [Attention] Refactor CUDA attention backend selection logic (#24794)
Matthew Bonanni, 2025-11-11 07:40:44 -05:00
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Matthew Bonanni <mbonanni001@gmail.com>
Co-authored-by: Luka Govedič <ProExpertProg@users.noreply.github.com>
Name                        Last commit                                                                 Date
fixtures/                   …
language/                   [V1] [Hybrid] Mamba1 Automatic Prefix Caching (#26377)                      2025-11-02 04:16:23 -08:00
multimodal/                 Fix failing test for CRadio (#27738)                                        2025-11-06 15:32:25 -08:00
quantization/               [CI/Build][Bugfix]Fix Quantized Models Test on AMD (#27712)                 2025-10-29 06:27:30 +00:00
__init__.py                 …
registry.py                 Restore PlaMo2 unit test as pfnet/plamo-2-1b now supports transformers >=4.56 (#28019)   2025-11-10 06:50:02 +00:00
test_initialization.py      [Attention] Refactor CUDA attention backend selection logic (#24794)        2025-11-11 07:40:44 -05:00
test_oot_registration.py    …
test_registry.py            …
test_terratorch.py          …
test_transformers.py        [Bugfix] Fix encoder-only model support for transformers backend (#28021)   2025-11-04 22:24:41 -08:00
test_utils.py               …
test_vision.py              [Chore] Separate out system utilities from vllm.utils (#27201)              2025-10-22 20:25:25 +00:00
utils.py                    [CI/Build] Remove unnecessary flags from test registry (#27353)             2025-10-23 14:42:40 +00:00
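
The entries above are pytest suites. A minimal sketch of invoking one of them programmatically, assuming a vllm source checkout with its test dependencies installed and the repository root as the working directory (the path mirrors the listing above; running `pytest tests/models/test_registry.py` from a shell is the equivalent command line):

    import pytest

    # Run the model-registry tests quietly; pytest.main returns an exit code
    # (0 on success). The relative path assumes the repository root as cwd.
    exit_code = pytest.main(["tests/models/test_registry.py", "-q"])
    print(f"pytest exited with {exit_code}")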