[Misc] DeepEPHighThroughput - Enable Inductor pass (#21311)

Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
2026-01-08 21:56:29 +08:00 · 2025-07-22 12:05:45 +05:30 · 8425f785ad
commit 8425f785ad
parent c17231e827
1 changed file with 0 additions and 3 deletions
--- a/vllm/platforms/cuda.py
+++ b/vllm/platforms/cuda.py
@@ -182,9 +182,6 @@ class CudaPlatformBase(Platform):
            compilation_config.use_cudagraph = False
            if model_config is not None:
                model_config.enforce_eager = True
-            # TODO (varun): Turning this ON gives incorrect results for the
-            # Deepseek-V2-lite model.
-            vllm_config.compilation_config.use_inductor = False

    @classmethod
    def get_current_memory_usage(cls,