xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-03-26 01:57:01 +08:00)
vllm/benchmarks/kernels
Jee Jee Li  a73122de96  [Bugfix] fix benchmark moe (#14653)  2025-03-13 16:12:42 +08:00
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>
Name                          Last commit                                                               Updated
deepgemm                      Add benchmark for DeepGEMM and vLLM Block FP8 Dense GEMM (#13917)         2025-03-05 17:08:51 -08:00
benchmark_aqlm.py             …
benchmark_layernorm.py        [Bugfix] Correctly call `cudaProfilerStop` in benchmarks script (#14183)  2025-03-07 00:42:49 +00:00
benchmark_lora.py             [V1] LoRA - Add triton kernels for V1 (#13096)                            2025-03-10 17:27:53 -04:00
benchmark_machete.py          [Bugfix] Correctly call `cudaProfilerStop` in benchmarks script (#14183)  2025-03-07 00:42:49 +00:00
benchmark_marlin.py           …
benchmark_moe.py              [Bugfix] fix benchmark moe (#14653)                                       2025-03-13 16:12:42 +08:00
benchmark_paged_attention.py  [Bugfix] Correctly call `cudaProfilerStop` in benchmarks script (#14183)  2025-03-07 00:42:49 +00:00
benchmark_quant.py            [Bugfix] Correctly call `cudaProfilerStop` in benchmarks script (#14183)  2025-03-07 00:42:49 +00:00
benchmark_rmsnorm.py          Correct capitalisation: `VLLM` -> `vLLM` (#14562)                         2025-03-10 16:36:21 +00:00
benchmark_rope.py             …
benchmark_shapes.py           …
graph_machete_bench.py        …
requirements.txt              …
utils.py                      …
weight_shapes.py              …