vllm/benchmark at 4858f3bb45ec62fab1fc32dc26eb1e2a8e1df14b - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-23 07:57:11 +08:00

Zhuohan Li 4858f3bb45

Add an option to launch cacheflow without ray (#51)

2023-04-30 15:42:17 +08:00

benchmark_attention.py

Add query stride to multi_query_cached_kv_attention & Add kernel benchmark script (#27)

2023-04-08 13:36:09 -07:00

benchmark_cache.py

Memcpy kernel for flash attention (#29)

2023-04-10 18:22:49 -07:00

benchmark_latency.py

Add an option to launch cacheflow without ray (#51)

2023-04-30 15:42:17 +08:00

benchmark_text_completion.py

Add an option to launch cacheflow without ray (#51)

2023-04-30 15:42:17 +08:00

trace.py

Collect system stats in scheduler & Add scripts for experiments (#30)

2023-04-12 15:03:49 -07:00