diff --git a/docs/source/features/quantization/fp8.md b/docs/source/features/quantization/fp8.md index f87b2a02cd44..95e105357bd3 100644 --- a/docs/source/features/quantization/fp8.md +++ b/docs/source/features/quantization/fp8.md @@ -106,7 +106,7 @@ Load and run the model in `vllm`: ```python from vllm import LLM model = LLM("./Meta-Llama-3-8B-Instruct-FP8-Dynamic") -model.generate("Hello my name is") +result = model.generate("Hello my name is") print(result[0].outputs[0].text) ```