vllm / docs / design
Latest commit: 304419576a by Benjamin Chislett, 2025-11-13 01:56:40 +09:00
[Perf] Refactor cudagraph_support to enable full CUDA graphs for spec decoding with FlashInfer (#28479)
Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
| File | Last commit | Last commit date |
| --- | --- | --- |
| arch_overview.md | … | |
| cuda_graphs.md | [Perf] Refactor cudagraph_support to enable full CUDA graphs for spec decoding with FlashInfer (#28479) | 2025-11-13 01:56:40 +09:00 |
| dbo.md | … | |
| debug_vllm_compile.md | [Docs] Add guide to debugging vLLM-torch.compile integration (#28094) | 2025-11-05 21:31:46 +00:00 |
| fused_moe_modular_kernel.md | … | |
| huggingface_integration.md | … | |
| hybrid_kv_cache_manager.md | … | |
| io_processor_plugins.md | [Frontend][Doc][5/N] Improve all pooling task \| Polish encode (pooling) api & Document. (#25524) | 2025-10-30 12:13:05 +00:00 |
| logits_processors.md | [Bugfix] Validate custom logits processor xargs for online serving (#27560) | 2025-11-05 16:53:33 +00:00 |
| metrics.md | [Doc] Fix minor issues in docs/design/metrics.md (#27436) | 2025-10-24 05:40:54 -07:00 |
| mm_processing.md | … | |
| moe_kernel_features.md | [RFC][ROCm][AITER] Keep all AITER kernels in `_aiter_ops` class like `_custom_ops` and `_ipex_ops` (#24490) | 2025-11-10 08:20:53 -08:00 |
| multiprocessing.md | … | |
| p2p_nccl_connector.md | … | |
| paged_attention.md | … | |
| plugin_system.md | … | |
| prefix_caching.md | [Doc] Fix numbering sequence in prefix caching (#27357) | 2025-10-22 17:35:47 +00:00 |
| torch_compile.md | [BUG] Make 'binary' default option for saving torch compile artifacts when using standalone_compile (#27616) | 2025-11-03 11:13:51 -05:00 |