xinyun/vllm
mirror of https://git.datalinker.icu/vllm-project/vllm.git synced 2026-01-20 06:24:27 +08:00
vllm/vllm/platforms
Daifeng Li fa78de9dc3
Quantization: support FP4 quantized models on AMD CDNA2/CDNA3 GPUs (#22527)
Signed-off-by: feng <fengli1702@gmail.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
2025-08-22 20:53:21 -06:00
File | Last commit | Date
__init__.py | [TPU] Support Pathways in vLLM (#21417) | 2025-07-30 10:02:12 -07:00
cpu.py | [Hardware][IBM Z]Enable v1 for s390x and s390x dockerfile fixes (#22725) | 2025-08-19 04:40:37 +00:00
cuda.py | [Kernel] Add FP8 support with FlashMLA backend (#22668) | 2025-08-22 02:26:32 +00:00
interface.py | [Kernel] Add FP8 support with FlashMLA backend (#22668) | 2025-08-22 02:26:32 +00:00
neuron.py | [Refactor]Abstract Platform Interface for Distributed Backend and Add xccl Support for Intel XPU (#19410) | 2025-07-07 04:32:32 +00:00
rocm.py | Quantization: support FP4 quantized models on AMD CDNA2/CDNA3 GPUs (#22527) | 2025-08-22 20:53:21 -06:00
tpu.py | [Kernel] Add FP8 support with FlashMLA backend (#22668) | 2025-08-22 02:26:32 +00:00
xpu.py | [XPU]avoid circular import during XPU init (#23017) | 2025-08-16 05:16:34 +00:00