xinyun/vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git (synced 2026-03-29 02:24:47 +08:00)
vllm/docs/design
redwrasse 6476382384 prefix caching design doc sha256 now default (#29261)
Signed-off-by: redwrasse <mail@redwrasse.io>
2025-12-06 07:39:56 +00:00
..
arch_overview.md
…
cuda_graphs.md
…
dbo.md
…
debug_vllm_compile.md
[Frontend] Remove deprecated -O.xx flag (#29991)
2025-12-05 00:47:22 -08:00
fused_moe_modular_kernel.md
…
huggingface_integration.md
[Misc] Update TokenizerLike interface and move get_cached_tokenizer (#29730)
2025-11-30 14:59:47 +08:00
hybrid_kv_cache_manager.md
…
io_processor_plugins.md
[examples] Resettle pooling examples. (#29365)
2025-12-02 15:54:28 +00:00
logits_processors.md
…
lora_resolver_plugins.md
…
metrics.md
docs: update metrics design doc to use new vllm:kv_cache_usage_perc (#30041)
2025-12-04 23:37:14 +00:00
mm_processing.md
…
moe_kernel_features.md
[Kernels] Remove BatchedTritonOrDeepGemmExperts and default fallback to Triton (#29929)
2025-12-03 20:49:00 +00:00
multiprocessing.md
…
optimization_levels.md
…
p2p_nccl_connector.md
…
paged_attention.md
…
plugin_system.md
…
prefix_caching.md
prefix caching design doc sha256 now default (#29261)
2025-12-06 07:39:56 +00:00
torch_compile.md
…