This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-10 03:54:56 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
compile
/
piecewise
History
Yong Hoon Shin
4ac7713e32
Add test case for compiling multiple graphs (
#21044
)
...
Signed-off-by: Yong Hoon Shin <yhshin@meta.com>
2025-07-23 11:00:47 -07:00
..
__init__.py
[torch.compile] rework compile control with piecewise cudagraph (
#9715
)
2024-10-29 23:03:49 -07:00
test_full_cudagraph.py
[Feature][ROCm] Add full graph capture support for TritonAttentionBackend (
#19158
)
2025-06-17 17:03:06 -04:00
test_multiple_graphs.py
Add test case for compiling multiple graphs (
#21044
)
2025-07-23 11:00:47 -07:00
test_simple.py
[CUDA] Enable full cudagraph for FlashMLA (
#18581
)
2025-06-13 18:12:26 +00:00
test_toy_llama.py
[CUDA] Enable full cudagraph for FlashMLA (
#18581
)
2025-06-13 18:12:26 +00:00