Mirror of https://git.datalinker.icu/vllm-project/vllm.git
vllm/examples/offline_inference
Latest commit: e5e9067e61 [Misc] fix typo and add detailed log (#28178) by Ning Xie (Signed-off-by: Andy Xie <andy.xning@gmail.com>), 2025-11-09 05:33:46 +00:00
Directories:
  basic
  disaggregated-prefill-v1
  kv_load_failure_recovery
  logits_processor
  openai_batch
  pooling
  profiling_tpu
  qwen2_5_omni

Files:
  async_llm_streaming.py
  audio_language.py
  automatic_prefix_caching.py
  batch_llm_inference.py
  chat_with_tools.py
  context_extension.py
  data_parallel.py
  disaggregated_prefill.py
  encoder_decoder_multimodal.py
  llm_engine_example.py
  load_sharded_state.py
  lora_with_quantization_inference.py
  metrics.py
  mistral-small.py
  mlpspeculator.py
  multilora_inference.py
  prefix_caching.py
  prompt_embed_inference.py
  qwen_1m.py
  reproducibility.py
  rlhf_colocate.py
  rlhf_utils.py
  rlhf.py
  save_sharded_state.py
  simple_profiling.py
  skip_loading_weights_in_engine_init.py
  spec_decode.py
  structured_outputs.py
  torchrun_dp_example.py
  torchrun_example.py
  tpu.py
  vision_language_multi_image.py
  vision_language_pooling.py
  vision_language.py
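For orientation, the scripts listed above all exercise vLLM's offline (non-server) generation API rather than the OpenAI-compatible server. A minimal sketch of that pattern, in the spirit of the basic examples, is shown below; the prompts and model name are arbitrary placeholders, not taken from any script in this directory.

  # Minimal offline-inference sketch using vLLM's LLM entry point.
  # The model name is a placeholder; any Hugging Face-compatible model works.
  from vllm import LLM, SamplingParams

  prompts = ["Hello, my name is", "The capital of France is"]
  sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

  llm = LLM(model="facebook/opt-125m")              # load the model once
  outputs = llm.generate(prompts, sampling_params)  # batched offline generation

  for output in outputs:
      print(f"Prompt: {output.prompt!r}")
      print(f"Generated: {output.outputs[0].text!r}")

The more specialized scripts (e.g. speculative decoding, LoRA, RLHF colocation, TPU profiling) build on this same LLM interface with additional engine arguments.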