xinyun/vllm
Mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2026-04-23 23:17:07 +08:00
vllm/.buildkite
Simon Mo
f09edd8a25
Add JSON output support for benchmark_latency and benchmark_throughput (#4848)
2024-05-16 10:02:56 -07:00
check-wheel-size.py
[Kernel] Refactor FP8 kv-cache with NVIDIA float8_e4m3 support (#4535)
2024-05-09 18:04:17 -06:00
download-images.sh
[Feature] Add vision language model support. (#3042)
2024-03-25 14:16:30 -07:00
run-amd-test.sh
[Build/CI] Fixing 'docker run' to re-enable AMD CI tests. (#4642)
2024-05-07 09:23:17 -07:00
run-benchmarks.sh
Add JSON output support for benchmark_latency and benchmark_throughput (#4848)
2024-05-16 10:02:56 -07:00
run-cpu-test.sh
[HotFix] [CI/Build] Minor fix for CPU backend CI (#3787)
2024-04-01 22:50:53 -07:00
run-neuron-test.sh
[CI] clean docker cache for neuron (#4441)
2024-04-28 23:32:07 +00:00
test-pipeline.yaml
[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840)
2024-05-16 00:53:51 -07:00
test-template.j2
[Build/CI] Fixing 'docker run' to re-enable AMD CI tests. (#4642)
2024-05-07 09:23:17 -07:00