This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-03-27 05:31:19 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
benchmarks
/
kernels
History
Lucas Wilkinson
5288c06aa0
[Kernel] (1/N) Machete - Hopper Optimized Mixed Precision Linear Kernel (
#7174
)
2024-08-20 07:09:33 -06:00
..
benchmark_aqlm.py
…
benchmark_machete.py
[Kernel] (1/N) Machete - Hopper Optimized Mixed Precision Linear Kernel (
#7174
)
2024-08-20 07:09:33 -06:00
benchmark_marlin.py
[Misc] Disambiguate quantized types via a new ScalarType (
#6396
)
2024-08-02 13:51:58 -07:00
benchmark_moe.py
[Kernel] W8A16 Int8 inside FusedMoE (
#7415
)
2024-08-16 10:06:51 -07:00
benchmark_paged_attention.py
[Model] H2O Danube3-4b (
#6451
)
2024-07-26 20:47:50 -07:00
benchmark_rope.py
[Model] H2O Danube3-4b (
#6451
)
2024-07-26 20:47:50 -07:00
benchmark_shapes.py
…
graph_machete_bench.py
[Kernel] (1/N) Machete - Hopper Optimized Mixed Precision Linear Kernel (
#7174
)
2024-08-20 07:09:33 -06:00
weight_shapes.py
[Kernel] (1/N) Machete - Hopper Optimized Mixed Precision Linear Kernel (
#7174
)
2024-08-20 07:09:33 -06:00