This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-03-18 12:07:12 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
models
/
decoder_only
/
language
History
Mor Zusman
fb60ae9b91
[Kernel][Model] Improve continuous batching for Jamba and Mamba (
#9189
)
2024-10-16 12:12:43 -04:00
..
__init__.py
…
test_aqlm.py
…
test_big_models.py
[Model] support minicpm3 (
#8297
)
2024-09-14 14:50:26 +00:00
test_danube3_4b.py
…
test_fp8.py
…
test_gguf.py
[Core] Refactor GGUF parameters packing and forwarding (
#8859
)
2024-10-07 10:01:46 +00:00
test_gptq_marlin_24.py
…
test_gptq_marlin.py
…
test_granite.py
[BugFix] Fix test breakages from transformers 4.45 upgrade (
#8829
)
2024-09-26 16:46:43 -07:00
test_granitemoe.py
[Model] Adding Granite MoE. (
#8206
)
2024-10-03 09:33:57 +08:00
test_jamba.py
[Kernel][Model] Improve continuous batching for Jamba and Mamba (
#9189
)
2024-10-16 12:12:43 -04:00
test_mamba.py
[Model] Support Mamba (
#6484
)
2024-10-11 15:40:06 +00:00
test_marlin.py
…
test_mistral.py
[Bugfix][Core] Fix tekken edge case for mistral tokenizer (
#8640
)
2024-09-20 14:33:03 -07:00
test_modelopt.py
…
test_models.py
…
test_phimoe.py
[CI/Build] Add test decorator for minimum GPU memory (
#8925
)
2024-09-29 02:50:51 +00:00