Logo
Explore Help
Sign In
xinyun/vllm
1
0
Fork 0
You've already forked vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-05-11 12:41:27 +08:00
Code Issues Packages Projects Releases Wiki Activity
vllm/vllm/v1
History
Robert Shaw 9597a095f2
[V1][Core][1/n] Logging and Metrics (#11962)
Signed-off-by: rshaw@neuralmagic.com <rshaw@neuralmagic.com>
2025-01-12 21:02:02 +00:00
..
attention
[Kernel] Move attn_type to Attention.__init__() (#11690)
2025-01-07 00:11:28 +08:00
core
[V1][Core][1/n] Logging and Metrics (#11962)
2025-01-12 21:02:02 +00:00
engine
[V1][Core][1/n] Logging and Metrics (#11962)
2025-01-12 21:02:02 +00:00
executor
[V1] Refactor get_executor_cls (#11754)
2025-01-06 07:59:16 +00:00
metrics
[V1][Core][1/n] Logging and Metrics (#11962)
2025-01-12 21:02:02 +00:00
sample
[Doc] Fix typo (#11666)
2025-01-01 08:10:10 +00:00
worker
[torch.compile] Hide KV cache behind torch.compile boundary (#11677)
2025-01-10 13:14:42 +08:00
__init__.py
[V1] AsyncLLM Implementation (#9826)
2024-11-11 23:05:38 +00:00
outputs.py
[V1] Multiprocessing Tensor Parallel Support for v1 (#9856)
2024-12-10 06:28:14 +00:00
request.py
[V1] Extend beyond image modality and support mixed-modality inference with Llava-OneVision (#11685)
2025-01-06 19:58:16 +00:00
serial_utils.py
[V1] Use pickle for serializing EngineCoreRequest & Add multimodal inputs to EngineCoreRequest (#10245)
2024-11-12 08:57:14 -08:00
utils.py
[V1] Simplify Shutdown (#11659)
2025-01-03 17:25:38 +00:00
Powered by Gitea Version: 1.23.1 Page: 538ms Template: 4ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API