vllm/design at 3a557ea3da2c248983a8e7bf2e3abc7218797807 - vllm

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-07-08 20:07:17 +08:00

History

Steve Westerhouse 9d701e90d8

[Doc] Clarify FP8 KV cache computation workflow (#31071 )

Signed-off-by: westers <steve.westerhouse@origami-analytics.com>

2025-12-22 08:41:37 +08:00

arch_overview.md

[Docs] Replace all explicit anchors with real links (#27087 )

2025-10-17 02:22:06 -07:00

cuda_graphs.md

[Doc]: fixing typos in various files (#30540 )

2025-12-14 02:14:37 -08:00

dbo.md

[UX] Replace VLLM_ALL2ALL_BACKEND with --all2all-backend (#26732 )

2025-10-13 18:12:52 -07:00

debug_vllm_compile.md

[Frontend] Remove deprecated -O.xx flag (#29991 )

2025-12-05 00:47:22 -08:00

fused_moe_modular_kernel.md

[Docs] Enable some more markdown lint rules for the docs (#28731 )

2025-11-14 18:39:19 +00:00

huggingface_integration.md

[Misc] Update TokenizerLike interface and move get_cached_tokenizer (#29730 )

2025-11-30 14:59:47 +08:00

hybrid_kv_cache_manager.md

[doc] Hybrid KV Cache Manager design doc (#22688 )

2025-08-26 20:19:05 +00:00

io_processor_plugins.md

[examples] Resettle pooling examples. (#29365 )

2025-12-02 15:54:28 +00:00

logits_processors.md

[Doc]: fix typos in various files (#28863 )

2025-11-17 20:32:14 -08:00

lora_resolver_plugins.md

docs(lora_resolvers): clarify multi-resolver order and storage path requirement (#28153 )

2025-11-14 18:08:30 +00:00

metrics.md

[Docs] Generate full list of metrics in user docs (#30388 )

2025-12-10 16:09:34 +00:00

mm_processing.md

[Docs] Replace all explicit anchors with real links (#27087 )

2025-10-17 02:22:06 -07:00

moe_kernel_features.md

Remove all2all backend envvar (#30363 )

2025-12-18 19:46:28 +00:00

multiprocessing.md

[Docs] Replace all explicit anchors with real links (#27087 )

2025-10-17 02:22:06 -07:00

optimization_levels.md

[Doc]: fixing typos in various files (#30540 )

2025-12-14 02:14:37 -08:00

p2p_nccl_connector.md

[V0 Deprecation] Remove VLLM_USE_V1 from docs and scripts (#26336 )

2025-10-07 16:46:44 +08:00

paged_attention.md

[Doc] Clarify FP8 KV cache computation workflow (#31071 )

2025-12-22 08:41:37 +08:00

plugin_system.md

[Docs] fix function name (#30748 )

2025-12-17 12:14:45 +00:00

prefix_caching.md

prefix caching design doc sha256 now default (#29261 )

2025-12-06 07:39:56 +00:00

torch_compile.md

[Frontend] Remap -O to -cc commandline flag (#29557 )

2025-11-28 21:51:12 +00:00