vllm/offline_inference at 08405609cc14d81f20e9faf61b2cd87e7909b797 - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-08-01 12:47:54 +08:00

[Docs] Reduce custom syntax used in docs (#27009 )

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2025-10-16 20:05:34 -07:00

basic

fix: print outputt offline_inference/base/chat.py example (#25744 )

2025-09-26 01:18:24 -07:00

disaggregated-prefill-v1

[Docs] Switch to better markdown linting pre-commit hook (#21851 )

2025-07-29 19:45:08 -07:00

kv_load_failure_recovery

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

logits_processor

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

openai_batch

[Doc] ruff format remaining Python examples (#26795 )

2025-10-15 01:25:49 -07:00

pooling

[Docs] Reduce custom syntax used in docs (#27009 )

2025-10-16 20:05:34 -07:00

profiling_tpu

[Misc] small update (#20462 )

2025-07-03 20:33:44 -07:00

qwen2_5_omni

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

async_llm_streaming.py

[Example] Add async_llm_streaming.py example for AsyncLLM streaming in python (#21763 )

2025-07-30 18:39:46 -06:00

audio_language.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

automatic_prefix_caching.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

batch_llm_inference.py

[Docs] Improve docstring for ray data llm example (#20597 )

2025-07-07 20:06:26 -07:00

chat_with_tools.py

[Doc]: fix typos in Python comments (#24417 )

2025-09-08 00:22:16 -07:00

context_extension.py

[Misc] refactor context extension (#19246 )

2025-06-07 05:13:21 +00:00

data_parallel.py

[Frontend][torch.compile] CompilationConfig Overhaul (#20283 ): name change compilation level to compilation mode, deprecation compilation level (#26355 )

2025-10-15 02:51:16 +00:00

disaggregated_prefill.py

Remove deprecated PyNcclConnector (#24151 )

2025-09-03 22:49:16 +00:00

encoder_decoder_multimodal.py

Remove V0 Encoder-Decoder Support (#24907 )

2025-09-15 21:17:14 -07:00

llm_engine_example.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

load_sharded_state.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

lora_with_quantization_inference.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

metrics.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

mistral-small.py

[Frontend] Use engine argument to control MM cache size (#22441 )

2025-08-07 09:47:10 -07:00

mlpspeculator.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

multilora_inference.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

prefix_caching.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

prithvi_geospatial_mae_io_processor.py

[Model][2/N] Improve all pooling task | Support multi-vector retrieval (#25370 )

2025-10-15 11:14:41 +00:00

prithvi_geospatial_mae.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

prompt_embed_inference.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

qwen_1m.py

Remove V0 attention backends (#25351 )

2025-09-21 16:03:28 -07:00

reproducibility.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

rlhf_colocate.py

[RL] fast weight update with zmq + ipc handles (#24295 )

2025-09-09 16:57:46 +08:00

rlhf_utils.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

rlhf.py

[RLHF] Fix torch.dtype not serializable in example (#22158 )

2025-08-04 02:43:33 +00:00

save_sharded_state.py

[Bugfix] fix max-file-size type from str to int (#21675 )

2025-07-28 00:06:52 -07:00

simple_profiling.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

skip_loading_weights_in_engine_init.py

[Doc] Add inplace weights loading example (#19640 )

2025-07-17 21:12:23 -07:00

spec_decode.py

[spec decode] Consolidate speculative decode method name for MTP (#25232 )

2025-09-26 22:27:05 +00:00

structured_outputs.py

[Chore] Cleanup guided namespace, move to structured outputs config (#22772 )

2025-09-18 09:20:27 +00:00

torchrun_dp_example.py

[Doc] Polish example for torchrun dp (#25899 )

2025-09-29 21:31:34 +00:00

torchrun_example.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

tpu.py

[Doc]: fix typos in various files (#24798 )

2025-09-13 00:43:33 -07:00

vision_language_multi_image.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

vision_language_pooling.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00

vision_language.py

Update Optional[x] -> x | None and Union[x, y] to x | y (#26633 )

2025-10-12 09:51:31 -07:00