vllm/models at 6e25b1cddfd78eab307acdb5e3ec14475e465d90 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-21 23:47:19 +08:00

History

Roger Wang d3387750f1

[Misc] Turn off encoder torch compile by default (#28634 )

Signed-off-by: Roger Wang <hey@rogerw.io>

2025-11-13 08:38:08 -08:00

..

[Mistral-Small 3.1] Update docs and tests (#14977 )

2025-03-18 03:29:42 -07:00

VLLM_USE_TRITON_FLASH_ATTN V0 variable deprecation (#27611 )

2025-11-11 18:34:36 -08:00

[Misc] Turn off encoder torch compile by default (#28634 )

2025-11-13 08:38:08 -08:00

[Quantization] fix attention quantization of gpt_oss model (#27334 )

2025-11-11 12:06:00 -05:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

registry.py

[CPU] Refactor CPU attention backend (#27954 )

2025-11-12 09:43:06 +08:00

test_initialization.py

[Attention] Refactor CUDA attention backend selection logic (#24794 )

2025-11-11 07:40:44 -05:00

test_oot_registration.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_registry.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_terratorch.py

[Frontend] Require flag for loading text and image embeds (#27204 )

2025-10-22 15:52:02 +00:00

test_transformers.py

[Bugfix] Fix encoder-only model support for transformers backend (#28021 )

2025-11-04 22:24:41 -08:00

test_utils.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_vision.py

[Chore] Separate out system utilities from vllm.utils (#27201 )

2025-10-22 20:25:25 +00:00

utils.py

[CI/Build] Remove unnecessary flags from test registry (#27353 )

2025-10-23 14:42:40 +00:00