This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-10 07:34:57 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
tests
/
compile
History
bnellnm
f192aeba74
[Bugfix] Enable some fp8 and quantized fullgraph tests (
#10171
)
...
Signed-off-by: Bill Nell <bill@neuralmagic.com>
2024-11-09 08:01:27 +00:00
..
piecewise
[CI/Build] drop support for Python 3.8 EOL (
#8464
)
2024-11-06 07:11:55 +00:00
__init__.py
[torch.compile] register allreduce operations as custom ops (
#8526
)
2024-09-16 22:57:57 -07:00
backend.py
[torch.compile] Fuse RMSNorm with quant (
#9138
)
2024-11-08 21:20:08 +00:00
test_basic_correctness.py
[torch.compile] rework test plans (
#9866
)
2024-10-31 22:20:17 -07:00
test_full_graph.py
[torch.compile] rework compile control with piecewise cudagraph (
#9715
)
2024-10-29 23:03:49 -07:00
test_fusion.py
[torch.compile] Fuse RMSNorm with quant (
#9138
)
2024-11-08 21:20:08 +00:00
test_wrapper.py
[tpu][misc] fix typo (
#8260
)
2024-09-06 22:40:46 -07:00
utils.py
[Bugfix] Enable some fp8 and quantized fullgraph tests (
#10171
)
2024-11-09 08:01:27 +00:00