This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-10 03:35:17 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
v1
/
e2e
History
Yong Hoon Shin
cb293f6a79
[V1] Enable prefill optimization for Gemma3n (
#22628
)
...
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
2025-08-28 14:54:30 -07:00
..
__init__.py
[V1] Implement Cascade Attention (
#11635
)
2025-01-01 21:56:46 +09:00
test_cascade_attention.py
[XPU] Use spawn with XPU multiprocessing (
#20649
)
2025-07-09 00:34:28 -07:00
test_correctness_sliding_window.py
[KVCache] Make KVCacheSpec hashable (
#21791
)
2025-07-29 19:58:29 +08:00
test_kv_sharing_fast_prefill.py
[V1] Enable prefill optimization for Gemma3n (
#22628
)
2025-08-28 14:54:30 -07:00
test_min_tokens.py
[CI] Add end-to-end V1 min_tokens test coverage (
#22495
)
2025-08-21 22:04:07 -06:00
test_spec_decode.py
[Model] Support deepseek with eagle (
#21086
)
2025-08-20 19:01:31 +08:00