This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-26 14:49:41 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
examples
/
online_serving
History
Rui Qiao
217937221b
Elastic Expert Parallel Initial Support (
#20775
)
...
Signed-off-by: Rui Qiao <ruisearch42@gmail.com>
2025-07-18 17:46:09 -07:00
..
chart-helm
[V0 deprecation] Remove V0 CPU/XPU/TPU backends (
#20412
)
2025-07-06 08:48:13 -07:00
disaggregated_serving
…
disaggregated_serving_p2p_nccl_xpyd
[V1][P/D]Enhance Performance and code readability for P2pNcclConnector (
#20906
)
2025-07-16 22:13:00 -07:00
elastic_ep
Elastic Expert Parallel Initial Support (
#20775
)
2025-07-18 17:46:09 -07:00
opentelemetry
…
prometheus_grafana
…
structured_outputs
[Misc] small update (
#20462
)
2025-07-03 20:33:44 -07:00
api_client.py
…
cohere_rerank_client.py
…
disaggregated_prefill.sh
…
gradio_openai_chatbot_webserver.py
…
gradio_webserver.py
…
jinaai_rerank_client.py
…
kv_events_subscriber.py
…
multi_instance_data_parallel.py
[Misc] Add SPDX-FileCopyrightText (
#20428
)
2025-07-04 07:40:42 +00:00
multi-node-serving.sh
[Docs] Improve documentation for multi-node service helper script (
#20600
)
2025-07-08 19:44:26 -07:00
openai_chat_completion_client_for_multimodal.py
…
openai_chat_completion_client_with_tools_required.py
…
openai_chat_completion_client_with_tools_xlam_streaming.py
[Misc] Add SPDX-FileCopyrightText (
#20428
)
2025-07-04 07:40:42 +00:00
openai_chat_completion_client_with_tools_xlam.py
[Misc] Add SPDX-FileCopyrightText (
#20428
)
2025-07-04 07:40:42 +00:00
openai_chat_completion_client_with_tools.py
…
openai_chat_completion_client.py
…
openai_chat_completion_tool_calls_with_reasoning.py
…
openai_chat_completion_with_reasoning_streaming.py
…
openai_chat_completion_with_reasoning.py
…
openai_chat_embedding_client_for_multimodal.py
…
openai_classification_client.py
…
openai_completion_client.py
…
openai_cross_encoder_score_for_multimodal.py
[Model][VLM] Support JinaVL Reranker (
#20260
)
2025-07-10 10:43:43 -07:00
openai_cross_encoder_score.py
…
openai_embedding_client.py
…
openai_embedding_matryoshka_fy.py
…
openai_pooling_client.py
…
openai_transcription_client.py
…
openai_translation_client.py
…
prompt_embed_inference_with_openai_client.py
…
ray_serve_deepseek.py
[Docs] Improve documentation for Deepseek R1 on Ray Serve LLM (
#20601
)
2025-07-08 02:09:06 -07:00
retrieval_augmented_generation_with_langchain.py
…
retrieval_augmented_generation_with_llamaindex.py
…
run_cluster.sh
[Docs] Improve documentation for ray cluster launcher helper script (
#20602
)
2025-07-15 03:55:45 -07:00
sagemaker-entrypoint.sh
…
streamlit_openai_chatbot_webserver.py
…
utils.py
…