mirror of https://git.datalinker.icu/vllm-project/vllm.git, synced 2025-12-10 16:05:42 +08:00
parent b9bcdc7158
commit c0ce15dfb2
@@ -55,7 +55,7 @@ Start the serving the LLaMA-13B model on an A100 GPU:
 $ sky launch serving.yaml
 
-Check the output of the command. There will be a sharable gradio link (like the last line of the following). Open it in your browser to use the LLaMA model to do the text completion.
+Check the output of the command. There will be a shareable gradio link (like the last line of the following). Open it in your browser to use the LLaMA model to do the text completion.
 
 .. code-block:: console