vllm/language at dbfa8d31d5e7627a84671c6068ecc8fa58acd1d1 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-08-03 12:47:13 +08:00

History

Mor Zusman fb60ae9b91

[Kernel][Model] Improve continuous batching for Jamba and Mamba (#9189 )

2024-10-16 12:12:43 -04:00

..

__init__.py

[CI/Build] Reorganize models tests (#7820 )

2024-09-13 10:20:06 -07:00

test_aqlm.py

[CI/Build] Reorganize models tests (#7820 )

2024-09-13 10:20:06 -07:00

test_big_models.py

[Model] support minicpm3 (#8297 )

2024-09-14 14:50:26 +00:00

test_danube3_4b.py

[CI/Build] Reorganize models tests (#7820 )

2024-09-13 10:20:06 -07:00

test_fp8.py

[CI/Build] Reorganize models tests (#7820 )

2024-09-13 10:20:06 -07:00

test_gguf.py

[Core] Refactor GGUF parameters packing and forwarding (#8859 )

2024-10-07 10:01:46 +00:00

test_gptq_marlin_24.py

[CI/Build] Reorganize models tests (#7820 )

2024-09-13 10:20:06 -07:00

test_gptq_marlin.py

[CI/Build] Reorganize models tests (#7820 )

2024-09-13 10:20:06 -07:00

test_granite.py

[BugFix] Fix test breakages from transformers 4.45 upgrade (#8829 )

2024-09-26 16:46:43 -07:00

test_granitemoe.py

[Model] Adding Granite MoE. (#8206 )

2024-10-03 09:33:57 +08:00

test_jamba.py

[Kernel][Model] Improve continuous batching for Jamba and Mamba (#9189 )

2024-10-16 12:12:43 -04:00

test_mamba.py

[Model] Support Mamba (#6484 )

2024-10-11 15:40:06 +00:00

test_marlin.py

[CI/Build] Reorganize models tests (#7820 )

2024-09-13 10:20:06 -07:00

test_mistral.py

[Bugfix][Core] Fix tekken edge case for mistral tokenizer (#8640 )

2024-09-20 14:33:03 -07:00

test_modelopt.py

[CI/Build] Reorganize models tests (#7820 )

2024-09-13 10:20:06 -07:00

test_models.py

[CI/Build] Reorganize models tests (#7820 )

2024-09-13 10:20:06 -07:00

test_phimoe.py

[CI/Build] Add test decorator for minimum GPU memory (#8925 )

2024-09-29 02:50:51 +00:00