vllm/models at 69244e67e6822f1c15816f887659e1ccc18c2632 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-05 19:47:12 +08:00

History

Cyrus Leung 69244e67e6

[Core] Use key-only cache for BaseMultiModalProcessor (#23018 )

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

2025-08-27 14:19:13 +08:00

..

[Mistral-Small 3.1] Update docs and tests (#14977 )

2025-03-18 03:29:42 -07:00

Support FlashAttention Backend for Hybrid SSM Models (#23299 )

2025-08-26 12:41:52 +00:00

[Core] Use key-only cache for BaseMultiModalProcessor (#23018 )

2025-08-27 14:19:13 +08:00

[V0 Deprecation] Remove V0 FlashInfer attention backend (#22776 )

2025-08-18 19:54:16 -07:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

registry.py

[Model] Add Ernie4.5 VL Model Support (#22514 )

2025-08-26 21:02:55 -07:00

test_initialization.py

[Model] Add LFM2 architecture (#22845 )

2025-08-21 09:35:07 +02:00

test_oot_registration.py

[CI/Build] Fix plugin tests (#21758 )

2025-07-28 15:08:05 +00:00

test_registry.py

[Deprecation][2/N] Replace --task with --runner and --convert (#21470 )

2025-07-27 19:42:40 -07:00

test_transformers.py

Enable headless models for pooling in the Transformers backend (#21767 )

2025-08-01 10:31:29 -07:00

test_utils.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

test_vision.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

utils.py

[Model] Pooling models default to using chunked prefill & prefix caching if supported. (#20930 )

2025-08-11 09:41:37 -07:00