vllm/models at d8874c61a55e40db4ada047f1736c38c86439fff - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-23 08:57:15 +08:00

History

wang.yuqi a55b64635c

[Model] Allow users to control skip reading cache per request. (#28194 )

Signed-off-by: wang.yuqi <yuqi.wang@daocloud.io>
Signed-off-by: wang.yuqi <noooop@126.com>

2025-11-16 00:04:50 -08:00

..

[Mistral-Small 3.1] Update docs and tests (#14977 )

2025-03-18 03:29:42 -07:00

[Model] Allow users to control skip reading cache per request. (#28194 )

2025-11-16 00:04:50 -08:00

Avoid bytecode hook and simplify TorchCompileWrapperWithCustomDipatch (#25110 )

2025-11-14 14:11:10 -08:00

[Quantization] fix attention quantization of gpt_oss model (#27334 )

2025-11-11 12:06:00 -05:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

registry.py

[CPU] Refactor CPU attention backend (#27954 )

2025-11-12 09:43:06 +08:00

test_initialization.py

[Attention] Refactor CUDA attention backend selection logic (#24794 )

2025-11-11 07:40:44 -05:00

test_oot_registration.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_registry.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_terratorch.py

[Frontend] Require flag for loading text and image embeds (#27204 )

2025-10-22 15:52:02 +00:00

test_transformers.py

[Docs] Update the name of Transformers backend -> Transformers modeling backend (#28725 )

2025-11-14 16:34:14 +00:00

test_utils.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_vision.py

[Chore] Separate out system utilities from vllm.utils (#27201 )

2025-10-22 20:25:25 +00:00

utils.py

[CI/Build] Remove unnecessary flags from test registry (#27353 )

2025-10-23 14:42:40 +00:00