xinyun / vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2026-03-19 19:07:19 +08:00
vllm / tests / models
Latest commit: 4b0da7b60e - Enable hybrid attention models for Transformers backend (#18494)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>, 2025-05-23 10:12:08 +08:00
fixtures                  …
language                  [CI/Build] Update bamba test model location (#18544)                                          2025-05-22 06:01:07 -07:00
multimodal                Re-submit: Fix: Proper RGBA -> RGB conversion for PIL images. (#18569)                        2025-05-23 01:59:18 +00:00
quantization              [Minor] Rename quantization nvfp4 to modelopt_fp4 (#18356)                                    2025-05-20 09:08:37 -07:00
__init__.py               …
registry.py               [CI/Build] Update bamba test model location (#18544)                                          2025-05-22 06:01:07 -07:00
test_initialization.py    …
test_oot_registration.py  Re-submit: Fix: Proper RGBA -> RGB conversion for PIL images. (#18569)                        2025-05-23 01:59:18 +00:00
test_registry.py          …
test_transformers.py      Enable hybrid attention models for Transformers backend (#18494)                              2025-05-23 10:12:08 +08:00
test_utils.py             [Misc] Allow AutoWeightsLoader to skip loading weights with specific substr in name (#18358)  2025-05-19 20:20:12 -07:00
test_vision.py            …
utils.py                  [New Model]: nomic-embed-text-v2-moe (#17785)                                                 2025-05-11 00:59:43 -07:00