vllm/models at c7a29d2c8d07ce6188d0c4bb19df6fd1d0e9bc74 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-08-02 12:22:13 +08:00

History

Shinichi Hemmi c9e093116c

[MODEL] Implement plamo3 (#28834 )

Signed-off-by: Shinichi Hemmi <50256998+Alnusjaponica@users.noreply.github.com>

2025-11-20 03:00:19 -08:00

..

[Mistral-Small 3.1] Update docs and tests (#14977 )

2025-03-18 03:29:42 -07:00

Update rope_scaling to rope_parameters in preparation for Transformers v5 (#28542 )

2025-11-19 09:06:36 -08:00

[Model][QwenVL] Replace torch.repeat_interleave with faster np.repeat (#28964 )

2025-11-19 22:04:23 -08:00

Enable bitsandbytes quantization on AMD GPUs that use warp size 32 (#27307 )

2025-11-19 03:12:31 +00:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

registry.py

[MODEL] Implement plamo3 (#28834 )

2025-11-20 03:00:19 -08:00

test_initialization.py

[Attention] Refactor CUDA attention backend selection logic (#24794 )

2025-11-11 07:40:44 -05:00

test_oot_registration.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_registry.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_terratorch.py

[Frontend] Require flag for loading text and image embeds (#27204 )

2025-10-22 15:52:02 +00:00

test_transformers.py

[Docs] Update the name of Transformers backend -> Transformers modeling backend (#28725 )

2025-11-14 16:34:14 +00:00

test_utils.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_vision.py

[Chore] Separate out system utilities from vllm.utils (#27201 )

2025-10-22 20:25:25 +00:00

utils.py

[CI/Build] Remove unnecessary flags from test registry (#27353 )

2025-10-23 14:42:40 +00:00