vllm/models at 14006840eacf74f83e0d486eca6a24e75cafa6d3 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-22 05:27:11 +08:00

History

Woosuk Kwon 14006840ea

[V0 Deprecation] Remove V0 FlashInfer attention backend (#22776 )

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

2025-08-18 19:54:16 -07:00

..

[Mistral-Small 3.1] Update docs and tests (#14977 )

2025-03-18 03:29:42 -07:00

[New Model]mBART model (#22883 )

2025-08-16 12:16:58 +00:00

chore: remove unnecessary patch_padding_side for the chatglm model (#23090 )

2025-08-18 12:32:13 +00:00

[V0 Deprecation] Remove V0 FlashInfer attention backend (#22776 )

2025-08-18 19:54:16 -07:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

registry.py

[New Model]mBART model (#22883 )

2025-08-16 12:16:58 +00:00

test_initialization.py

[Bugfix] Fix failing GPT-OSS initialization test (#22557 )

2025-08-09 00:03:26 -07:00

test_oot_registration.py

[CI/Build] Fix plugin tests (#21758 )

2025-07-28 15:08:05 +00:00

test_registry.py

[Deprecation][2/N] Replace --task with --runner and --convert (#21470 )

2025-07-27 19:42:40 -07:00

test_transformers.py

Enable headless models for pooling in the Transformers backend (#21767 )

2025-08-01 10:31:29 -07:00

test_utils.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_vision.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

utils.py

[Model] Pooling models default to using chunked prefill & prefix caching if supported. (#20930 )

2025-08-11 09:41:37 -07:00