This website requires JavaScript.
Explore
Help
Sign In
xinyun
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
mirror of
https://git.datalinker.icu/vllm-project/vllm.git
synced
2026-03-28 11:08:10 +08:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
vllm
/
examples
History
Alex Wu
dbc0754ddf
[docs] Fix typo in examples filename openi -> openai (
#4864
)
2024-05-17 00:42:17 +09:00
..
fp8
…
production_monitoring
[Bugfix] Update grafana.json (
#4711
)
2024-05-09 10:10:13 -07:00
api_client.py
…
aqlm_example.py
…
gradio_openai_chatbot_webserver.py
…
gradio_webserver.py
…
llava_example.py
…
llm_engine_example.py
…
logging_configuration.md
…
multilora_inference.py
…
offline_inference_arctic.py
[Model] Snowflake arctic model implementation (
#4652
)
2024-05-09 22:37:14 +00:00
offline_inference_distributed.py
…
offline_inference_embedding.py
[Model][Misc] Add e5-mistral-7b-instruct and Embedding API (
#3734
)
2024-05-11 11:30:37 -07:00
offline_inference_neuron.py
…
offline_inference_openai.md
[Frontend] Support OpenAI batch file format (
#4794
)
2024-05-15 19:13:36 -04:00
offline_inference_with_prefix.py
…
offline_inference.py
…
openai_chat_completion_client.py
…
openai_completion_client.py
…
openai_embedding_client.py
[Model][Misc] Add e5-mistral-7b-instruct and Embedding API (
#3734
)
2024-05-11 11:30:37 -07:00
openai_example_batch.jsonl
[docs] Fix typo in examples filename openi -> openai (
#4864
)
2024-05-17 00:42:17 +09:00
save_sharded_state.py
[Core] Implement sharded state loader (
#4690
)
2024-05-15 22:11:54 -07:00
template_alpaca.jinja
…
template_baichuan.jinja
…
template_chatglm2.jinja
…
template_chatglm.jinja
…
template_chatml.jinja
…
template_falcon_180b.jinja
…
template_falcon.jinja
…
template_inkbot.jinja
…
tensorize_vllm_model.py
[Frontend] [Core] perf: Automatically detect vLLM-tensorized model, update
tensorizer
to version 2.9.0 (
#4208
)
2024-05-13 14:57:07 -07:00