vllm/worker at 2db9044ab6c04a38dbcc9df6756b70dccacc157a - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-28 03:37:12 +08:00

History

Isotr0py 5f1ac1e1d1

Revert "[v1] Add fp32 support to v1 engine through flex attn" (#19404 )

2025-06-10 01:30:20 -07:00

..

__init__.py

[V1] Adding min tokens/repetition/presence/frequence penalties to V1 sampler (#10681 )

2024-12-26 19:02:58 +09:00

test_gpu_input_batch.py

[Core] Use tuple for kv cache group block ids (#19175 )

2025-06-10 07:01:17 +02:00

test_gpu_model_runner.py

Revert "[v1] Add fp32 support to v1 engine through flex attn" (#19404 )

2025-06-10 01:30:20 -07:00