vllm/engine at 9f3bc0f58c431404f02372e22b4050460e2be448 - vllm - 丝路新云-代码仓

xinyun/vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-18 22:07:30 +08:00

History

Cody Yu 9f3bc0f58c

[MISC][V1] Register process killing handler only in the main thread (#14380 )

Signed-off-by: Cody Yu <hao.yu.cody@gmail.com>

2025-03-07 22:40:06 -08:00

..

__init__.py

Update deprecated Python 3.8 typing (#13971 )

2025-03-02 17:34:51 -08:00

async_llm.py

[V1] Eagerly remove finished requests from the batch (#14388 )

2025-03-07 10:56:00 -08:00

core_client.py

[MISC][V1] Register process killing handler only in the main thread (#14380 )

2025-03-07 22:40:06 -08:00

core.py

[V1] Eagerly remove finished requests from the batch (#14388 )

2025-03-07 10:56:00 -08:00

detokenizer.py

[V1] Do not detokenize if sampling param detokenize is False (#14224 )

2025-03-06 10:40:24 -08:00

llm_engine.py

[V1][Core] Support for Structured Outputs (#12388 )

2025-03-07 07:19:11 -08:00

logprobs.py

[V1] Do not detokenize if sampling param detokenize is False (#14224 )

2025-03-06 10:40:24 -08:00

mm_input_cache.py

Update deprecated Python 3.8 typing (#13971 )

2025-03-02 17:34:51 -08:00

output_processor.py

[V1] Do not detokenize if sampling param detokenize is False (#14224 )

2025-03-06 10:40:24 -08:00

parallel_sampling.py

[WIP][[V1][Metrics] Implement max_num_generation_tokens, request_params_n, and request_params_max_tokens metrics (#14055 )

2025-03-03 19:04:45 +00:00

processor.py

[V1] Prompt logprobs + APC compatibility; prompt logprobs reqs cannot fill APC (#13949 )

2025-03-08 01:48:12 +00:00