This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2025-12-26 00:09:40 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
examples
History
Cyrus Leung
9edca6bf8f
[Frontend] Online Pooling API (
#11457
)
...
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
2024-12-24 17:54:30 +08:00
..
chart-helm
…
fp8
…
production_monitoring
…
api_client.py
…
aqlm_example.py
…
cpu_offload.py
…
disaggregated_prefill.sh
…
florence2_inference.py
…
gguf_inference.py
…
gradio_openai_chatbot_webserver.py
…
gradio_webserver.py
…
llm_engine_example.py
…
logging_configuration.md
…
lora_with_quantization_inference.py
…
multilora_inference.py
…
offline_chat_with_tools.py
…
offline_inference_arctic.py
…
offline_inference_audio_language.py
…
offline_inference_chat.py
…
offline_inference_classification.py
…
offline_inference_cli.py
…
offline_inference_distributed.py
…
offline_inference_embedding.py
…
offline_inference_encoder_decoder.py
…
offline_inference_mlpspeculator.py
…
offline_inference_neuron_int8_quantization.py
…
offline_inference_neuron.py
…
offline_inference_openai.md
…
offline_inference_pixtral.py
…
offline_inference_scoring.py
…
offline_inference_structured_outputs.py
…
offline_inference_tpu.py
…
offline_inference_vision_language_embedding.py
…
offline_inference_vision_language_multi_image.py
…
offline_inference_vision_language.py
…
offline_inference_with_default_generation_config.py
…
offline_inference_with_prefix.py
…
offline_inference_with_profiler.py
…
offline_inference.py
…
offline_profile.py
…
openai_chat_completion_client_for_multimodal.py
…
openai_chat_completion_client_with_tools.py
…
openai_chat_completion_client.py
…
openai_chat_completion_structured_outputs.py
…
openai_chat_embedding_client_for_multimodal.py
…
openai_completion_client.py
…
openai_cross_encoder_score.py
[Frontend] Online Pooling API (
#11457
)
2024-12-24 17:54:30 +08:00
openai_embedding_client.py
…
openai_example_batch.jsonl
…
openai_pooling_client.py
[Frontend] Online Pooling API (
#11457
)
2024-12-24 17:54:30 +08:00
run_cluster.sh
…
save_sharded_state.py
…
template_alpaca.jinja
…
template_baichuan.jinja
…
template_blip2.jinja
…
template_chatglm2.jinja
…
template_chatglm.jinja
…
template_chatml.jinja
…
template_dse_qwen2_vl.jinja
…
template_falcon_180b.jinja
…
template_falcon.jinja
…
template_inkbot.jinja
…
template_llava.jinja
…
template_vlm2vec.jinja
…
tensorize_vllm_model.py
…
tool_chat_template_granite_20b_fc.jinja
…
tool_chat_template_granite.jinja
…
tool_chat_template_hermes.jinja
…
tool_chat_template_internlm2_tool.jinja
…
tool_chat_template_llama3.1_json.jinja
…
tool_chat_template_llama3.2_json.jinja
…
tool_chat_template_llama3.2_pythonic.jinja
…
tool_chat_template_mistral_parallel.jinja
…
tool_chat_template_mistral.jinja
…
tool_chat_template_toolace.jinja
…