xinyun/vllm - vllm - 丝路新云-代码仓

mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-04-15 02:27:03 +08:00

Author	SHA1	Message	Date
Russell Bryant	c320ca8edd	[Core] Don't do platform detection at import time (#12933) Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-02-11 07:25:25 +00:00
youkaichao	bc1bdecebf	[core][distributed] exact ray placement control (#12732) Signed-off-by: youkaichao <youkaichao@gmail.com>	2025-02-06 02:03:19 +08:00
youkaichao	ad4a9dc817	[cuda] manually import the correct pynvml module (#12679) fixes problems like https://github.com/vllm-project/vllm/pull/12635 and https://github.com/vllm-project/vllm/pull/12636 and https://github.com/vllm-project/vllm/pull/12565 --------- Signed-off-by: youkaichao <youkaichao@gmail.com>	2025-02-03 15:58:21 +08:00
Russell Bryant	e489ad7a21	[Misc] Add SPDX-License-Identifier headers to python source files (#12628) - Add SPDX license headers to python source files - Check for SPDX headers using pre-commit commit 9d7ef44c3cfb72ca4c32e1c677d99259d10d4745 Author: Russell Bryant <rbryant@redhat.com> Date: Fri Jan 31 14:18:24 2025 -0500 Add SPDX license headers to python source files This commit adds SPDX license headers to python source files as recommended to the project by the Linux Foundation. These headers provide a concise way that is both human and machine readable for communicating license information for each source file. It helps avoid any ambiguity about the license of the code and can also be easily used by tools to help manage license compliance. The Linux Foundation runs license scans against the codebase to help ensure we are in compliance with the licenses of the code we use, including dependencies. Having these headers in place helps that tool do its job. More information can be found on the SPDX site: - https://spdx.dev/learn/handling-license-info/ Signed-off-by: Russell Bryant <rbryant@redhat.com> commit 5a1cf1cb3b80759131c73f6a9dddebccac039dea Author: Russell Bryant <rbryant@redhat.com> Date: Fri Jan 31 14:36:32 2025 -0500 Check for SPDX headers using pre-commit Signed-off-by: Russell Bryant <rbryant@redhat.com> --------- Signed-off-by: Russell Bryant <rbryant@redhat.com>	2025-02-02 11:58:18 -08:00
Lucas Wilkinson	cabaf4eff3	[Attention] MLA decode optimizations (#12528) Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com> Signed-off-by: simon-mo <xmo@berkeley.edu> Co-authored-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Co-authored-by: simon-mo <simon.mo@hey.com> Co-authored-by: Michael Goin <mgoin64@gmail.com> Co-authored-by: Zhuohan Li <zhuohan123@gmail.com> Co-authored-by: Tyler Michael Smith <tysmith@redhat.com> Co-authored-by: Alexander Matveev <59768536+alexm-neuralmagic@users.noreply.github.com> Co-authored-by: simon-mo <xmo@berkeley.edu>	2025-01-30 23:49:37 -08:00
Robert Shaw	5f671cb4c3	[V1] Improve Error Message for Unsupported Config (#12535) Co-authored-by: Michael Goin <michael@neuralmagic.com>	2025-01-29 04:56:56 +00:00
youkaichao	2b83503227	[misc] fix cross-node TP (#12166) Signed-off-by: youkaichao <youkaichao@gmail.com>	2025-01-18 10:53:27 +08:00
youkaichao	ad34c0df0f	[core] platform agnostic executor via collective_rpc (#11256) Signed-off-by: youkaichao <youkaichao@gmail.com>	2025-01-15 13:45:21 +08:00
Shanshan Shen	9ddac56311	[Platform] move current_memory_usage() into platform (#11369) Signed-off-by: Shanshan Shen <467638484@qq.com>	2025-01-15 03:38:25 +00:00
Shanshan Shen	a7d59688fb	[Platform] Move get_punica_wrapper() function to Platform (#11516) Signed-off-by: Shanshan Shen <467638484@qq.com>	2025-01-13 13:12:10 +00:00
youkaichao	458e63a2c6	[platform] add device_control env var (#12009) Signed-off-by: youkaichao <youkaichao@gmail.com>	2025-01-13 20:59:09 +08:00
youkaichao	89ce62a316	[platform] add ray_device_key (#11948) Signed-off-by: youkaichao <youkaichao@gmail.com>	2025-01-13 16:20:52 +08:00
wangxiyuan	405eb8e396	[platform] Allow platform specify attention backend (#11609) Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com> Signed-off-by: Mengqing Cao <cmq0113@163.com> Co-authored-by: Mengqing Cao <cmq0113@163.com>	2025-01-09 21:46:50 +08:00
wangxiyuan	e88db68cf5	[Platform] platform agnostic for EngineArgs initialization (#11225) Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2024-12-16 22:11:06 -08:00
Gene Der Su	82c73fd510	[Bugfix] cuda error running llama 3.2 (#11047)	2024-12-10 07:41:11 +00:00
Tyler Michael Smith	28b3a1c7e5	[V1] Multiprocessing Tensor Parallel Support for v1 (#9856) Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>	2024-12-10 06:28:14 +00:00
wangxiyuan	aea2fc38c3	[Platform] Move `async output` check to platform (#10768) Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2024-12-09 17:24:46 +00:00
wangxiyuan	661175bc82	[platform] Add verify_quantization in platform. (#10757) Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>	2024-11-29 15:22:21 +00:00
Chendi.Xue	0a71900bc9	Remove hard-dependencies of Speculative decode to CUDA workers (#10587) Signed-off-by: Chendi Xue <chendi.xue@intel.com>	2024-11-26 17:57:11 -08:00
Conroy Cheers	f5792c7c4a	[Hardware][NVIDIA] Add non-NVML CUDA mode for Jetson (#9735) Signed-off-by: Conroy Cheers <conroy@corncheese.org>	2024-11-26 10:26:28 -08:00
youkaichao	eebad39f26	[torch.compile] support all attention backends (#10558) Signed-off-by: youkaichao <youkaichao@gmail.com>	2024-11-22 14:04:42 -08:00
youkaichao	a111d0151f	[platforms] absorb worker cls difference into platforms folder (#10555) Signed-off-by: youkaichao <youkaichao@gmail.com> Co-authored-by: Nick Hill <nhill@redhat.com>	2024-11-21 21:00:32 -08:00
youkaichao	cf656f5a02	[misc] improve error message (#10553) Signed-off-by: youkaichao <youkaichao@gmail.com>	2024-11-21 13:13:17 -08:00
Mengqing Cao	9d827170a3	[Platforms] Add `device_type` in `Platform` (#10508) Signed-off-by: MengqingCao <cmq0113@163.com>	2024-11-21 04:44:20 +00:00
youkaichao	388ee3de66	[torch.compile] limit inductor threads and lazy import quant (#10482) Signed-off-by: youkaichao <youkaichao@gmail.com>	2024-11-20 18:36:33 -08:00
bnellnm	3cb07a36a2	[Misc] Upgrade to pytorch 2.5 (#9588) Signed-off-by: Bill Nell <bill@neuralmagic.com> Signed-off-by: youkaichao <youkaichao@gmail.com> Co-authored-by: youkaichao <youkaichao@gmail.com>	2024-10-27 09:44:24 +00:00
Cyrus Leung	390be74649	[Misc] Print stack trace using `logger.exception` (#9461)	2024-10-17 13:55:48 +00:00
Cyrus Leung	26a68d5d7e	[CI/Build] Add test decorator for minimum GPU memory (#8925)	2024-09-29 02:50:51 +00:00
Cyrus Leung	6ffa3f314c	[CI/Build] Avoid CUDA initialization (#8534)	2024-09-18 10:38:11 +00:00
youkaichao	ed6f002d33	[cuda][misc] error on empty CUDA_VISIBLE_DEVICES (#7924)	2024-08-27 12:06:11 -07:00
youkaichao	70c094ade6	[misc][cuda] improve pynvml warning (#7852)	2024-08-25 14:30:09 -07:00
youkaichao	ad28a74beb	[misc][cuda] add warning for pynvml user (#7675)	2024-08-20 00:35:09 -07:00
youkaichao	eed020f673	[misc] use nvml to get consistent device name (#7582)	2024-08-16 21:15:13 -07:00
Cyrus Leung	9ba85bc152	[mypy] Misc. typing improvements (#7417)	2024-08-13 09:20:20 +08:00
youkaichao	639159b2a6	[distributed][misc] add specialized method for cuda platform (#7249)	2024-08-07 08:54:52 -07:00
Benjamin Muskalla	b422d4961a	[CI/Build] Enable mypy typing for remaining folders (#6268)	2024-07-10 22:15:55 +08:00
youkaichao	a3c9435d93	[hardware][cuda] use device id under CUDA_VISIBLE_DEVICES for get_device_capability (#6216)	2024-07-08 20:02:15 -07:00
youkaichao	482045ee77	[hardware][misc] introduce platform abstraction (#6080)	2024-07-02 20:12:22 -07:00

38 Commits