vllm/offline_inference at 02a3ce2230c74f9615b979464550d0f300c0f8ca - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-06-24 04:17:11 +08:00

History

Fanli Lin 2ab27b70f5 [XPU] Fix MOE DP accuracy issue on XPU (#25465 )

Signed-off-by: yewentao256 <zhyanwentao@126.com>

2025-10-03 13:35:54 -07:00

basic

[Kernel/Quant] Remove AQLM (#22943 )

2025-08-16 19:38:21 +00:00

disaggregated-prefill-v1

[Docs] Switch to better markdown linting pre-commit hook (#21851 )

2025-07-29 19:45:08 -07:00

logits_processor

[V1] Logits processor docs (#22919 )

2025-09-17 11:53:12 -07:00

openai_batch

[Docs] Switch to better markdown linting pre-commit hook (#21851 )

2025-07-29 19:45:08 -07:00

pooling

[New Model] Support BertForTokenClassification / Named Entity Recognition (NER) task (#24872 )

2025-09-18 23:22:01 +08:00

profiling_tpu

[Misc] small update (#20462 )

2025-07-03 20:33:44 -07:00

qwen2_5_omni

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

async_llm_streaming.py

[Example] Add async_llm_streaming.py example for AsyncLLM streaming in python (#21763 )

2025-07-30 18:39:46 -06:00

audio_language.py

[Doc]: fix typos in Python comments (#24173 )

2025-09-04 08:52:17 -07:00

automatic_prefix_caching.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

batch_llm_inference.py

[Docs] Improve docstring for ray data llm example (#20597 )

2025-07-07 20:06:26 -07:00

chat_with_tools.py

[Doc]: fix typos in Python comments (#24417 )

2025-09-08 00:22:16 -07:00

context_extension.py

[Misc] refactor context extension (#19246 )

2025-06-07 05:13:21 +00:00

data_parallel.py

[XPU] Fix MOE DP accuracy issue on XPU (#25465 )

2025-10-03 13:35:54 -07:00

disaggregated_prefill.py

Remove deprecated PyNcclConnector (#24151 )

2025-09-03 22:49:16 +00:00

encoder_decoder_multimodal.py

Remove V0 Encoder-Decoder Support (#24907 )

2025-09-15 21:17:14 -07:00

llm_engine_example.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

load_sharded_state.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

lora_with_quantization_inference.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

metrics.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

mistral-small.py

[Frontend] Use engine argument to control MM cache size (#22441 )

2025-08-07 09:47:10 -07:00

mlpspeculator.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

multilora_inference.py

[Doc]: fix typos in Python comments (#24026 )

2025-09-01 09:38:20 +00:00

prefix_caching.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

prithvi_geospatial_mae_io_processor.py

[Misc] Terratorch related fixes (#24337 )

2025-09-08 06:40:26 -07:00

prithvi_geospatial_mae.py

[Core][Model] Terratorch backend integration (#23513 )

2025-09-04 00:22:41 -07:00

prompt_embed_inference.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

qwen_1m.py

Remove V0 attention backends (#25351 )

2025-10-03 13:35:53 -07:00

reproducibility.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

rlhf_colocate.py

[RL] fast weight update with zmq + ipc handles (#24295 )

2025-09-09 16:57:46 +08:00

rlhf_utils.py

[RL] fast weight update with zmq + ipc handles (#24295 )

2025-09-09 16:57:46 +08:00

rlhf.py

[RLHF] Fix torch.dtype not serializable in example (#22158 )

2025-08-04 02:43:33 +00:00

save_sharded_state.py

[Bugfix] fix max-file-size type from str to int (#21675 )

2025-07-28 00:06:52 -07:00

simple_profiling.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

skip_loading_weights_in_engine_init.py

[Doc] Add inplace weights loading example (#19640 )

2025-07-17 21:12:23 -07:00

spec_decode.py

[spec decode] Fix MTP inference path for MiMo-7B model (#25136 )

2025-09-18 09:12:19 -07:00

structured_outputs.py

[Chore] Cleanup guided namespace, move to structured outputs config (#22772 )

2025-09-18 09:20:27 +00:00

torchrun_dp_example.py

[DP] support torchrun external launcher with Data Parallelism (#24899 )

2025-10-03 13:35:53 -07:00

torchrun_example.py

[Misc] Add SPDX-FileCopyrightText (#19100 )

2025-06-03 11:20:17 -07:00

tpu.py

[Doc]: fix typos in various files (#24798 )

2025-09-13 00:43:33 -07:00

vision_language_multi_image.py

Remove V0 Encoder-Decoder Support (#24907 )

2025-09-15 21:17:14 -07:00

vision_language_pooling.py

[Deprecation][2/N] Replace --task with --runner and --convert (#21470 )

2025-07-27 19:42:40 -07:00

vision_language.py

[Model] Support Dots OCR (#24645 )

2025-10-03 13:35:53 -07:00