Logo
Explore Help
Sign In
xinyun/vllm
1
0
Fork 0
You've already forked vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2025-12-24 22:36:01 +08:00
Code Issues Packages Projects Releases Wiki Activity
vllm/tests/distributed
History
Lily Liu 7041de4384
[Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628)
Co-authored-by: LiuXiaoxuanPKU <llilyliupku@gmail.com>, bong-furiosa <bongwon.jang@furiosa.ai>
2024-06-28 15:28:49 -07:00
..
__init__.py
[CI/Build] Move test_utils.py to tests/utils.py (#4425)
2024-05-13 23:50:09 +09:00
test_basic_distributed_correctness.py
[Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628)
2024-06-28 15:28:49 -07:00
test_chunked_prefill_distributed.py
[CI/Test] improve robustness of test (vllm_runner) (#5357)
2024-06-08 08:59:20 +00:00
test_comm_ops.py
[Distributed] Add send and recv helpers (#5719)
2024-06-23 14:42:28 -07:00
test_custom_all_reduce.py
[Distributed] Add send and recv helpers (#5719)
2024-06-23 14:42:28 -07:00
test_parallel_state.py
[Distributed] Make it clear that % should not be in tensor dict keys. (#5927)
2024-06-28 15:20:22 +00:00
test_pynccl.py
[Distributed] Add send and recv helpers (#5719)
2024-06-23 14:42:28 -07:00
test_same_node.py
[Core][Distributed] add same-node detection (#5369)
2024-06-11 10:53:59 -07:00
test_shm_broadcast.py
[bugfix][distributed] fix shm broadcast when the queue size is full (#5801)
2024-06-25 21:56:02 -07:00
test_utils.py
[Hardware][AMD][CI/Build][Doc] Upgrade to ROCm 6.1, Dockerfile improvements, test fixes (#5422)
2024-06-25 15:56:15 -07:00
Powered by Gitea Version: 1.23.1 Page: 576ms Template: 4ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API