From 9e67c4ce985b0b8852603cfe3fcaf8f37de137ed Mon Sep 17 00:00:00 2001 From: "rongfu.leng" Date: Wed, 17 Dec 2025 20:14:45 +0800 Subject: [PATCH] [Docs] fix function name (#30748) Signed-off-by: rongfu.leng --- docs/design/plugin_system.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/design/plugin_system.md b/docs/design/plugin_system.md index b0ca2dad23d5b..0fd448c2153c3 100644 --- a/docs/design/plugin_system.md +++ b/docs/design/plugin_system.md @@ -109,7 +109,7 @@ Every plugin has three parts: - `init_device`: This function is called to set up the device for the worker. - `initialize_cache`: This function is called to set cache config for the worker. - `load_model`: This function is called to load the model weights to device. - - `get_kv_cache_spaces`: This function is called to generate the kv cache spaces for the model. + - `get_kv_cache_spec`: This function is called to generate the kv cache spec for the model. - `determine_available_memory`: This function is called to profiles the peak memory usage of the model to determine how much memory can be used for KV cache without OOMs. - `initialize_from_config`: This function is called to allocate device KV cache with the specified kv_cache_config - `execute_model`: This function is called every step to inference the model.