vllm/config at 9874169d07d003fbf0460e6574b0aa83fda23a5d - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-07 03:47:09 +08:00

Michael Goin 9874169d07 Simplify max model length auto selection

2025-08-29 05:03:25 -04:00

__init__.py

Simplify max model length auto selection

2025-08-29 05:03:25 -04:00

cache.py

[V1] Enable prefill optimization for Gemma3n (#22628)

2025-08-28 14:54:30 -07:00

compilation.py

[V1] [Hybrid] Enable compile and piecewise CUDA graph for MiniMax-Text models (#22589)

2025-08-27 10:05:16 -07:00

parallel.py

[Misc] update dict parse to EPLBConfig from json dumps to dict unpacking (#23305)

2025-08-24 08:06:34 +00:00

scheduler.py

[V0 Deprecation] Remove args for multi-step scheduling (#22779)

2025-08-12 20:38:18 -07:00

utils.py

Extract CompilationConfig from config.py (#22524)

2025-08-08 16:34:25 -07:00