ComfyUI/comfy at e42682b24ef033a93001ba27cc5c5aa461a61d8d - ComfyUI - 丝路新云-代码仓

xinyun/ComfyUI

mirror of https://git.datalinker.icu/comfyanonymous/ComfyUI synced 2026-03-16 17:17:27 +08:00

History

rattus128 e42682b24e

Reduce Peak WAN inference VRAM usage (#9898 )

* flux: Do the xq and xk ropes one at a time

This was doing independendent interleaved tensor math on the q and k
tensors, leading to the holding of more than the minimum intermediates
in VRAM. On a bad day, it would VRAM OOM on xk intermediates.

Do everything q and then everything k, so torch can garbage collect
all of qs intermediates before k allocates its intermediates.

This reduces peak VRAM usage for some WAN2.2 inferences (at least).

* wan: Optimize qkv intermediates on attention

As commented. The former logic computed independent pieces of QKV in
parallel which help more inference intermediates in VRAM spiking
VRAM usage. Fully roping Q and garbage collecting the intermediates
before touching K reduces the peak inference VRAM usage.

2025-09-16 19:21:14 -04:00

..

Add encoder part of whisper large v3 as an audio encoder model. (#9894 )

2025-09-16 01:19:50 -04:00

Replace print with logging (#6138 )

2024-12-20 16:24:55 -05:00

LoRA Trainer: LoRA training node in weight adapter scheme (#8446 )

2025-06-13 19:25:59 -04:00

Uni pc sampler now works with audio and video models.

2025-01-18 05:27:58 -05:00

Add Hunyuan 3D 2.1 Support (#8714 )

2025-09-04 20:36:20 -04:00

Fix depending on asserts to raise an exception in BatchedBrownianTree and Flash attn module (#9884 )

2025-09-15 20:05:03 -04:00

Reduce Peak WAN inference VRAM usage (#9898 )

2025-09-16 19:21:14 -04:00

Silence clip tokenizer warning. (#8934 )

2025-07-16 14:42:07 -04:00

Controlnet refactor.

2024-06-27 18:43:11 -04:00

Improvements to the TAESD3 implementation.

2024-06-16 02:04:24 -04:00

Remove single quote pattern to avoid wrong matches (#9842 )

2025-09-13 16:59:19 -04:00

Fix #9537 (#9576 )

2025-08-27 12:45:02 -04:00

checkpoint_pickle.py

Remove pytorch_lightning dependency.

2023-06-13 10:11:33 -04:00

cli_args.py

Print all fast options in --help (#9737 )

2025-09-06 01:05:05 -04:00

clip_config_bigg.json

Fix potential issue with non clip text embeddings.

2024-07-30 14:41:13 -04:00

clip_model.py

USO style reference. (#9677 )

2025-09-02 15:36:22 -04:00

clip_vision_config_g.json

Add support for clip g vision model to CLIPVisionLoader.

2023-08-18 11:13:29 -04:00

clip_vision_config_h.json

Add support for unCLIP SD2.x models.

2023-04-01 23:19:15 -04:00

clip_vision_config_vitl_336_llava.json

Support llava clip vision model.

2025-03-06 00:24:43 -05:00

clip_vision_config_vitl_336.json

support clip-vit-large-patch14-336 (#4042 )

2024-07-17 13:12:50 -04:00

clip_vision_config_vitl.json

Add support for unCLIP SD2.x models.

2023-04-01 23:19:15 -04:00

clip_vision_siglip_384.json

Support new flux model variants.

2024-11-21 08:38:23 -05:00

clip_vision_siglip_512.json

Support 512 siglip model.

2025-04-05 07:01:01 -04:00

clip_vision.py

Some changes to the previous hunyuan PR. (#9725 )

2025-09-04 20:39:02 -04:00

conds.py

Add some warnings and prevent crash when cond devices don't match. (#9169 )

2025-08-04 04:20:12 -04:00

context_windows.py

Make step index detection much more robust (#9392 )

2025-08-17 18:54:07 -04:00

controlnet.py

Support qwen inpaint controlnet. (#9772 )

2025-09-08 17:30:26 -04:00

diffusers_convert.py

Remove useless code.

2025-01-24 06:15:54 -05:00

diffusers_load.py

load_unet -> load_diffusion_model with a model_options argument.

2024-08-12 23:20:57 -04:00

float.py

Clamp output when rounding weight to prevent Nan.

2024-10-19 19:07:10 -04:00

gligen.py

Remove some useless code. (#8812 )

2025-07-06 07:07:39 -04:00

hooks.py

Hooks Part 2 - TransformerOptionsHook and AdditionalModelsHook (#6377 )

2025-01-11 12:20:23 -05:00

latent_formats.py

Add support for Chroma Radiance (#9682 )

2025-09-13 17:58:43 -04:00

lora_convert.py

Implement the USO subject identity lora. (#9674 )

2025-09-01 18:54:02 -04:00

lora.py

Support the omnigen2 umo lora. (#9886 )

2025-09-15 18:10:55 -04:00

model_base.py

Add support for Chroma Radiance (#9682 )

2025-09-13 17:58:43 -04:00

model_detection.py

Add support for Chroma Radiance (#9682 )

2025-09-13 17:58:43 -04:00

model_management.py

Fix amd_min_version crash when cpu device. (#9754 )

2025-09-07 21:16:29 -04:00

model_patcher.py

USO style reference. (#9677 )

2025-09-02 15:36:22 -04:00

model_sampling.py

Basic initial support for cosmos predict2 text to image 2B and 14B models. (#8517 )

2025-06-13 07:05:23 -04:00

ops.py

Enable Convolution AutoTuning (#9301 )

2025-09-01 20:33:50 -04:00

options.py

Only parse command line args when main.py is called.

2023-09-13 11:38:20 -04:00

patcher_extension.py

Implement EasyCache and Invent LazyCache (#9496 )

2025-08-22 22:41:08 -04:00

pixel_space_convert.py

Changes to the previous radiance commit. (#9851 )

2025-09-13 18:03:34 -04:00

rmsnorm.py

Add warning when using old pytorch. (#9347 )

2025-08-15 00:22:26 -04:00

sample.py

Auto reshape 2d to 3d latent for single image generation on video model.

2024-12-29 02:26:49 -05:00

sampler_helpers.py

Added context window support to core sampling code (#9238 )

2025-08-13 21:33:05 -04:00

samplers.py

Add DPM++ 2M SDE Heun (RES) sampler (#9542 )

2025-08-27 19:07:31 -04:00

sd1_clip_config.json

Fix potential issue with non clip text embeddings.

2024-07-30 14:41:13 -04:00

sd1_clip.py

Disable prompt weights for qwen. (#9438 )

2025-08-20 01:08:11 -04:00

sd.py

Changes to the previous radiance commit. (#9851 )

2025-09-13 18:03:34 -04:00

sdxl_clip.py

Add a T5TokenizerOptions node to set options for the T5 tokenizer. (#7803 )

2025-04-25 19:36:00 -04:00

supported_models_base.py

Mixed precision diffusion models with scaled fp8.

2024-10-21 18:12:51 -04:00

supported_models.py

Changes to the previous radiance commit. (#9851 )

2025-09-13 18:03:34 -04:00

utils.py

Add WAN ATI support (#8874 )

2025-07-24 20:59:19 -04:00