This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-03-20 11:39:12 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
models
/
decoder_only
/
language
History
Mor Zusman
f13a07b1f8
[Kernel][Model] Varlen prefill + Prefill chunking support for mamba kernels and Jamba model (
#8533
)
2024-09-29 17:35:58 -04:00
..
__init__.py
…
test_aqlm.py
…
test_big_models.py
[Model] support minicpm3 (
#8297
)
2024-09-14 14:50:26 +00:00
test_danube3_4b.py
…
test_fp8.py
…
test_gguf.py
…
test_gptq_marlin_24.py
…
test_gptq_marlin.py
…
test_granite.py
[BugFix] Fix test breakages from transformers 4.45 upgrade (
#8829
)
2024-09-26 16:46:43 -07:00
test_jamba.py
[Kernel][Model] Varlen prefill + Prefill chunking support for mamba kernels and Jamba model (
#8533
)
2024-09-29 17:35:58 -04:00
test_marlin.py
…
test_mistral.py
[Bugfix][Core] Fix tekken edge case for mistral tokenizer (
#8640
)
2024-09-20 14:33:03 -07:00
test_modelopt.py
…
test_models.py
…
test_phimoe.py
[CI/Build] Add test decorator for minimum GPU memory (
#8925
)
2024-09-29 02:50:51 +00:00