vllm/models at b41fb9d3b10dcf187ac0501ca80ede96d387617f - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-04 14:44:31 +08:00

History

sroy745 b41fb9d3b1

[Encoder Decoder] Update Mllama to run with both FlashAttention and XFormers (#9982 )

Signed-off-by: Sourashis Roy <sroy@roblox.com>

2024-11-12 10:53:57 -08:00

..

[CI/Build] Split up models tests (#10069 )

2024-11-09 11:39:14 -08:00

[Hardware][CPU] Add embedding models support for CPU backend (#10193 )

2024-11-11 08:54:28 +00:00

encoder_decoder

[Encoder Decoder] Update Mllama to run with both FlashAttention and XFormers (#9982 )

2024-11-12 10:53:57 -08:00

[CI/Build] Update pixtral tests to use JSON (#8436 )

2024-09-13 03:47:52 +00:00

__init__.py

[CI/Build] Move test_utils.py to tests/utils.py (#4425 )

2024-05-13 23:50:09 +09:00

test_oot_registration.py

[Model] Explicit interface for vLLM models and support OOT embedding models (#9108 )

2024-10-07 06:10:35 +00:00

test_registry.py

[Model] Explicit interface for vLLM models and support OOT embedding models (#9108 )

2024-10-07 06:10:35 +00:00

utils.py

[CI/Build] Update CPU tests to include all "standard" tests (#5481 )

2024-11-08 23:30:04 +08:00