xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-05-16 09:22:18 +08:00)
vllm/tests/v1/e2e
Latest commit: 3de2e4bff0 by Nick Hill, 2025-12-12 10:28:57 -08:00
update e2e test
Signed-off-by: Nick Hill <nhill@redhat.com>
File                                Last modified               Last commit
__init__.py                         2025-01-01 21:56:46 +09:00  [V1] Implement Cascade Attention (#11635)
test_async_scheduling.py            2025-12-12 10:28:57 -08:00  update e2e test
test_async_spec_decode.py           2025-12-09 17:18:10 -08:00  [Attention] Make seq_lens_cpu optional in CommonAttentionMetadata to enable true async spec-decode (#29624)
test_cascade_attention.py           2025-10-07 15:42:31 +00:00  [V0 Deprecation] Remove VLLM_USE_V1 from tests (#26341)
test_context_length.py              2025-11-13 10:18:47 -08:00  [Bugfix] Fix validate model input for decoder models (#27099)
test_correctness_sliding_window.py  2025-12-02 04:53:27 +00:00  [CI][ROCm] Fix test_correctness_sliding_window (#29243)
test_kv_sharing_fast_prefill.py     2025-12-02 20:46:10 +00:00  [CI][ROCm][tests/v1/e2e] Fix multiprocessing launch for the test (#29123)
test_lora_with_spec_decode.py       2025-11-21 01:41:20 -08:00  [Misc] remove useless v1 env (#29164)
test_min_tokens.py                  2025-10-12 09:51:31 -07:00  Update Optional[x] -> x | None and Union[x, y] to x | y (#26633)
test_pooling_chunked_prefill.py     2025-10-14 07:45:04 +08:00  Add tests for chunked prefill and prefix cache with causal pooling models (#26526)
test_spec_decode.py                 2025-12-11 17:02:10 +00:00  Add Eagle and Eagle3 support to Transformers modeling backend (#30340)