vllm/vllm/entrypoints
Mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-01-26 21:54:36 +08:00)

Latest commit: 975676d174 [Feat] Drop-in Torch CUDA Profiler (#27841)
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
2025-11-08 14:07:37 -08:00

anthropic/            [Chore] eliminate duplicated and unconditional object serialization in anthropic messages api (#27792)  2025-11-06 19:09:19 +00:00
cli/                  [CLI] add --max-tokens to vllm complete (#28109)  2025-11-07 12:21:40 +00:00
openai/               [Feat] Drop-in Torch CUDA Profiler (#27841)  2025-11-08 14:07:37 -08:00
__init__.py           …
api_server.py         [Misc] Clean up more utils (#27567)  2025-10-27 15:30:38 +00:00
chat_utils.py         [Multimodal] Make MediaConnector extensible. (#27759)  2025-11-04 18:28:01 +00:00
constants.py          …
context.py            [Frontend] [gpt-oss] Tool json call parsing error retry (#27675)  2025-10-29 09:42:44 +00:00
harmony_utils.py      reasoning_content -> reasoning (#27752)  2025-11-08 12:15:08 +00:00
launcher.py           …
llm.py                Fix(llm): Abort orphaned requests when llm.chat() batch fails (Fixes #26081) (#27420)  2025-11-02 16:24:01 +00:00
logger.py             …
renderer.py           …
responses_utils.py    [Frontend] OpenAI Responses API supports Tool/Function calling - non-harmony (#26874)  2025-11-06 10:40:03 +00:00
score_utils.py        [Model] Add num_cached_tokens for PoolingRequestOutput (#27378)  2025-10-23 14:03:42 +08:00
ssl.py                …
tool_server.py        …
tool.py               …
utils.py              [Chore]: Extract math and argparse utilities to separate modules (#27188)  2025-10-26 04:03:32 -07:00
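
Of the files above, llm.py defines the offline LLM entrypoint whose batch llm.chat() path the fix for #26081 touches. A minimal usage sketch, assuming a recent vLLM install; the model id is illustrative, and output fields can vary between versions:

    # Minimal sketch of the offline entrypoint provided by llm.py.
    # The model id below is illustrative, not taken from this listing.
    from vllm import LLM, SamplingParams

    llm = LLM(model="Qwen/Qwen2.5-0.5B-Instruct")
    params = SamplingParams(temperature=0.8, max_tokens=64)

    # llm.chat() is the batch chat API referenced by the fix for #26081:
    # if one request in a batch fails, the orphaned requests are aborted.
    conversations = [[{"role": "user", "content": "What does vllm/entrypoints do?"}]]
    outputs = llm.chat(conversations, params)
    for out in outputs:
        print(out.outputs[0].text)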