[Doc]: Improve feature tables (#13224)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Harry Mellor 2025-02-18 10:52:39 +00:00 committed by GitHub
parent 8cf97f8661
commit 2358ca527b
5 changed files with 143 additions and 129 deletions

@ -0,0 +1,8 @@
.vertical-table-header th.head:not(.stub) {
writing-mode: sideways-lr;
white-space: nowrap;
max-width: 0;
p {
margin: 0;
}
}

@ -78,8 +78,12 @@ html_theme_options = {
'use_repository_button': True, 'use_repository_button': True,
'use_edit_page_button': True, 'use_edit_page_button': True,
} }
# Add any paths that contain custom static files (such as style sheets) here,
# relative to this directory. They are copied after the builtin static files,
# so a file named "default.css" will overwrite the builtin "default.css".
html_static_path = ["_static"] html_static_path = ["_static"]
html_js_files = ["custom.js"] html_js_files = ["custom.js"]
html_css_files = ["custom.css"]
myst_url_schemes = { myst_url_schemes = {
'http': None, 'http': None,
@ -121,11 +125,6 @@ if READTHEDOCS_VERSION_TYPE == "tag":
if os.path.exists(header_file): if os.path.exists(header_file):
os.remove(header_file) os.remove(header_file)
# Add any paths that contain custom static files (such as style sheets) here,
# relative to this directory. They are copied after the builtin static files,
# so a file named "default.css" will overwrite the builtin "default.css".
# html_static_path = ['_static']
# Generate additional rst documentation here. # Generate additional rst documentation here.
def setup(app): def setup(app):
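
The hunk above registers a stylesheet through `html_css_files`. Sphinx resolves such entries relative to the directories listed in `html_static_path`, so the new rules presumably live in `_static/custom.css`. The sketch below restates the three settings from the diff and adds a helper, `expected_asset_paths`, which is a hypothetical name used only for illustration and is not part of Sphinx or of this commit. It lists where each asset would be expected relative to the docs source directory.

```python
from pathlib import PurePosixPath

# Settings as they appear on the new side of the diff.
html_static_path = ["_static"]
html_js_files = ["custom.js"]
html_css_files = ["custom.css"]


def expected_asset_paths(static_dirs, asset_files):
    """Hypothetical helper: join each static dir with each asset name.

    This only mirrors the documented lookup rule (assets are found
    relative to html_static_path); it does not call into Sphinx.
    """
    return [str(PurePosixPath(d) / f) for d in static_dirs for f in asset_files]


css_paths = expected_asset_paths(html_static_path, html_css_files)
js_paths = expected_asset_paths(html_static_path, html_js_files)
print(css_paths, js_paths)
```

A check like this can be handy in a docs CI step to confirm the referenced files exist before building, assuming the layout matches the paths above.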

@ -4,8 +4,14 @@
The tables below show mutually exclusive features and the support on some hardware. The tables below show mutually exclusive features and the support on some hardware.
The symbols used have the following meanings:
- ✅ = Full compatibility
- 🟠 = Partial compatibility
- ❌ = No compatibility
:::{note} :::{note}
Check the '✗' with links to see tracking issue for unsupported feature/hardware combination. Check the ❌ or 🟠 with links to see tracking issue for unsupported feature/hardware combination.
::: :::
## Feature x Feature ## Feature x Feature
@ -29,6 +35,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
:header-rows: 1 :header-rows: 1
:stub-columns: 1 :stub-columns: 1
:widths: auto :widths: auto
:class: vertical-table-header
- * Feature - * Feature
* [CP](#chunked-prefill) * [CP](#chunked-prefill)
@ -48,7 +55,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* beam-search * beam-search
* <abbr title="Guided Decoding">guided dec</abbr> * <abbr title="Guided Decoding">guided dec</abbr>
- * [CP](#chunked-prefill) - * [CP](#chunked-prefill)
* *
* *
* *
* *
@ -66,7 +73,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
- * [APC](#automatic-prefix-caching) - * [APC](#automatic-prefix-caching)
* ✅ * ✅
* *
* *
* *
* *
@ -82,9 +89,9 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
- * [LoRA](#lora-adapter) - * [LoRA](#lora-adapter)
* [](gh-pr:9057)
* ✅ * ✅
* * ✅
* ✅
* *
* *
* *
@ -102,7 +109,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* *
* *
* *
* *
@ -118,9 +125,9 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
- * [SD](#spec_decode) - * [SD](#spec_decode)
* ✅ * ✅
* ✅ * ✅
* ✗ * ❌
* ✅
* ✅ * ✅
*
* *
* *
* *
@ -138,7 +145,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* *
* *
* *
* *
@ -150,13 +157,13 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
- * <abbr title="Pooling Models">pooling</abbr> - * <abbr title="Pooling Models">pooling</abbr>
* *
* *
* *
* *
* *
* *
* *
* *
* *
* *
@ -167,14 +174,14 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* *
* *
- * <abbr title="Encoder-Decoder Models">enc-dec</abbr> - * <abbr title="Encoder-Decoder Models">enc-dec</abbr>
* ✗ * ❌
* [](gh-issue:7366) * [](gh-issue:7366)
* ✗ * ❌
* ✗ * ❌
* [](gh-issue:7366) * [](gh-issue:7366)
* ✅
* ✅ * ✅
* ✅ * ✅
*
* *
* *
* *
@ -190,9 +197,9 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ✗ * ❌
* ✅
* ✅ * ✅
*
* *
* *
* *
@ -205,12 +212,12 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* [](gh-pr:8199)
* ✅
* ✗
* ✅ * ✅
* ✅ * ✅
* * ❌
* ✅
* ✅
* ✅
* *
* *
* *
@ -222,49 +229,49 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ✗ * ❌
* ✅
* ❌
* ❌
* ✅ * ✅
* ✗
* ✗
* ✅ * ✅
* ✅ * ✅
*
* *
* *
* *
* *
* *
- * multi-step - * multi-step
* *
* ✅ * ✅
* ✗ * ❌
* ✅
* ❌
* ✅
* ❌
* ❌
* ✅ * ✅
* ✗
* ✅ * ✅
* ✗
* ✗
* ✅ * ✅
* [](gh-issue:8198)
* ✅ * ✅
*
* *
* *
* *
* *
- * <abbr title="Multimodal Inputs">mm</abbr> - * <abbr title="Multimodal Inputs">mm</abbr>
* ✅ * ✅
* [](gh-pr:8348) * [🟠](gh-pr:8348)
* [✗](gh-pr:7199) * [🟠](gh-pr:4194)
* ? *
* ? *
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ? *
* *
* *
* *
* *
@ -273,16 +280,16 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* [](gh-issue:6137) * [](gh-issue:6137)
* ✅ * ✅
* *
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ? * ❔
* [](gh-issue:7968) * [](gh-issue:7968)
* ✅
* ✅ * ✅
*
* *
* *
- * beam-search - * beam-search
@ -290,35 +297,35 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* [](gh-issue:6137) * [](gh-issue:6137)
* ✅ * ✅
* *
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ? * ❔
* [](gh-issue:7968) * [](gh-issue:7968)
* ? * ❔
* ✅
* ✅ * ✅
*
* *
- * <abbr title="Guided Decoding">guided dec</abbr> - * <abbr title="Guided Decoding">guided dec</abbr>
* ✅ * ✅
* ✅ * ✅
* ? *
* ? *
* [](gh-issue:11484) * [](gh-issue:11484)
* ✅ * ✅
* *
* ? *
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* [](gh-issue:9893) * [](gh-issue:9893)
* ? * ❔
* ✅
* ✅ * ✅
* ✅ * ✅
*
::: :::
(feature-x-hardware)= (feature-x-hardware)=
@ -339,7 +346,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* CPU * CPU
* AMD * AMD
- * [CP](#chunked-prefill) - * [CP](#chunked-prefill)
* [](gh-issue:2729) * [](gh-issue:2729)
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
@ -347,7 +354,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
- * [APC](#automatic-prefix-caching) - * [APC](#automatic-prefix-caching)
* [](gh-issue:3687) * [](gh-issue:3687)
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
@ -368,7 +375,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* [](gh-issue:8475) * [](gh-issue:8475)
* ✅ * ✅
- * [SD](#spec_decode) - * [SD](#spec_decode)
* ✅ * ✅
@ -384,7 +391,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* *
* ✅ * ✅
- * <abbr title="Pooling Models">pooling</abbr> - * <abbr title="Pooling Models">pooling</abbr>
* ✅ * ✅
@ -393,7 +400,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ? *
- * <abbr title="Encoder-Decoder Models">enc-dec</abbr> - * <abbr title="Encoder-Decoder Models">enc-dec</abbr>
* ✅ * ✅
* ✅ * ✅
@ -401,7 +408,7 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* *
- * <abbr title="Multimodal Inputs">mm</abbr> - * <abbr title="Multimodal Inputs">mm</abbr>
* ✅ * ✅
* ✅ * ✅
@ -432,15 +439,15 @@ Check the '✗' with links to see tracking issue for unsupported feature/hardwar
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* *
* *
- * multi-step - * multi-step
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* ✅ * ✅
* [](gh-issue:8477) * [](gh-issue:8477)
* ✅ * ✅
- * best-of - * best-of
* ✅ * ✅

@ -20,93 +20,93 @@ The table below shows the compatibility of various quantization implementations
* AWS Inferentia * AWS Inferentia
* Google TPU * Google TPU
- * AWQ - * AWQ
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
- * GPTQ - * GPTQ
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
- * Marlin (GPTQ/AWQ/FP8) - * Marlin (GPTQ/AWQ/FP8)
* *
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
* *
- * INT8 (W8A8) - * INT8 (W8A8)
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* ✅︎ * ✅︎
* *
* *
- * FP8 (W8A8) - * FP8 (W8A8)
* *
* *
* *
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
- * AQLM - * AQLM
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
* *
- * bitsandbytes - * bitsandbytes
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
* *
- * DeepSpeedFP - * DeepSpeedFP
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
* *
- * GGUF - * GGUF
* ✅︎ * ✅︎
* ✅︎ * ✅︎
@ -114,16 +114,16 @@ The table below shows the compatibility of various quantization implementations
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* ✅︎ * ✅︎
* *
* *
* *
* *
::: :::
- Volta refers to SM 7.0, Turing to SM 7.5, Ampere to SM 8.0/8.6, Ada to SM 8.9, and Hopper to SM 9.0. - Volta refers to SM 7.0, Turing to SM 7.5, Ampere to SM 8.0/8.6, Ada to SM 8.9, and Hopper to SM 9.0.
- "✅︎" indicates that the quantization method is supported on the specified hardware. - ✅︎ indicates that the quantization method is supported on the specified hardware.
- "✗" indicates that the quantization method is not supported on the specified hardware. - ❌ indicates that the quantization method is not supported on the specified hardware.
:::{note} :::{note}
This compatibility chart is subject to change as vLLM continues to evolve and expand its support for different hardware platforms and quantization methods. This compatibility chart is subject to change as vLLM continues to evolve and expand its support for different hardware platforms and quantization methods.

@ -28,10 +28,10 @@ The selected option sets the default pooler used to extract the final hidden sta
- * Embedding (`embed`) - * Embedding (`embed`)
* `LAST` * `LAST`
* ✅︎ * ✅︎
* *
- * Classification (`classify`) - * Classification (`classify`)
* `LAST` * `LAST`
* *
* ✅︎ * ✅︎
- * Sentence Pair Scoring (`score`) - * Sentence Pair Scoring (`score`)
* \* * \*
@ -39,8 +39,8 @@ The selected option sets the default pooler used to extract the final hidden sta
* \* * \*
- * Reward Modeling (`reward`) - * Reward Modeling (`reward`)
* `ALL` * `ALL`
* *
* *
::: :::
\*The default pooler is always defined by the model. \*The default pooler is always defined by the model.