xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2025-12-26 08:09:37 +08:00)
vllm/tests/v1/e2e
Nick Hill  938a81692e  2025-11-04 17:06:28 +00:00
[AsyncScheduling] Don't schedule past request max_tokens (#27922)
Signed-off-by: Nick Hill <nhill@redhat.com>
..
__init__.py                          …
test_async_scheduling.py             [Core] Async scheduling + structured outputs compatibility (#26866)  2025-11-01 00:35:04 +00:00
test_cascade_attention.py            …
test_correctness_sliding_window.py   …
test_kv_sharing_fast_prefill.py      [Frontend][torch.compile] CompilationConfig Overhaul (#20283): name change compilation level to compilation mode, deprecation compilation level (#26355)  2025-10-15 02:51:16 +00:00
test_min_tokens.py                   Update Optional[x] -> x | None and Union[x, y] to x | y (#26633)  2025-10-12 09:51:31 -07:00
test_pooling_chunked_prefill.py      Add tests for chunked prefill and prefix cache with causal pooling models (#26526)  2025-10-14 07:45:04 +08:00
test_spec_decode.py                  [AsyncScheduling] Don't schedule past request max_tokens (#27922)  2025-11-04 17:06:28 +00:00