vllm/models at c0c2dd1e0b75c70706f4d8dbcd1d75f1c1750e14 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-24 11:27:15 +08:00

History

Lukas Geiger a9705a290a

[Model][QwenVL] Replace torch.repeat_interleave with faster np.repeat (#28964 )

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

2025-11-19 22:04:23 -08:00

..

[Mistral-Small 3.1] Update docs and tests (#14977 )

2025-03-18 03:29:42 -07:00

Update rope_scaling to rope_parameters in preparation for Transformers v5 (#28542 )

2025-11-19 09:06:36 -08:00

[Model][QwenVL] Replace torch.repeat_interleave with faster np.repeat (#28964 )

2025-11-19 22:04:23 -08:00

Enable bitsandbytes quantization on AMD GPUs that use warp size 32 (#27307 )

2025-11-19 03:12:31 +00:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

registry.py

[Model] Add Afmoe architecture implementation (#28332 )

2025-11-17 15:11:20 -08:00

test_initialization.py

[Attention] Refactor CUDA attention backend selection logic (#24794 )

2025-11-11 07:40:44 -05:00

test_oot_registration.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_registry.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_terratorch.py

[Frontend] Require flag for loading text and image embeds (#27204 )

2025-10-22 15:52:02 +00:00

test_transformers.py

[Docs] Update the name of Transformers backend -> Transformers modeling backend (#28725 )

2025-11-14 16:34:14 +00:00

test_utils.py

Convert formatting to use ruff instead of yapf + isort (#26247 )

2025-10-05 07:06:22 -07:00

test_vision.py

[Chore] Separate out system utilities from vllm.utils (#27201 )

2025-10-22 20:25:25 +00:00

utils.py

[CI/Build] Remove unnecessary flags from test registry (#27353 )

2025-10-23 14:42:40 +00:00