ComfyUI/comfy at f17251bec65b5760cfedec29eace7d77f4b35130 - ComfyUI - 丝路新云-代码仓

xinyun/ComfyUI

mirror of https://git.datalinker.icu/comfyanonymous/ComfyUI synced 2026-06-21 15:47:00 +08:00

History

rattus f17251bec6

Account for the VRAM cost of weight offloading (#10733 )

* mm: default to 0 for NUM_STREAMS

Dont count the compute stream as an offload stream. This makes async
offload accounting easier.

* mm: remove 128MB minimum

This is from a previous offloading system requirement. Remove it to
make behaviour of the loader and partial unloader consistent.

* mp: order the module list by offload expense

Calculate an approximate offloading temporary VRAM cost to offload a
weight and primary order the module load list by that. In the simple
case this is just the same as the module weight, but with Loras, a
weight with a lora consumes considerably more VRAM to do the Lora
application on-the-fly.

This will slightly prioritize lora weights, but is really for
proper VRAM offload accounting.

* mp: Account for the VRAM cost of weight offloading

when checking the VRAM headroom, assume that the weight needs to be
offloaded, and only load if it has space for both the load and offload
 * the number of streams.

As the weights are ordered from largest to smallest by offload cost
this is guaranteed to fit in VRAM (tm), as all weights that follow
will be smaller.

Make the partial unload aware of this system as well by saving the
budget for offload VRAM to the model state and accounting accordingly.
Its possible that partial unload increases the size of the largest
offloaded weights, and thus needs to unload a little bit more than
asked to accomodate the bigger temp buffers.

Honor the existing codes floor on model weight loading of 128MB by
having the patcher honor this separately withough regard to offloading.
Otherwise when MM specifies its 128MB minimum, MP will see the biggest
weights, and budget that 128MB to only offload buffer and load nothing
which isnt the intent of these minimums. The same clamp applies in
case of partial offload of the currently loading model.

2025-11-27 01:03:03 -05:00

..

Support the HuMo model. (#9903 )

2025-09-17 00:12:48 -04:00

Add better error message for common error. (#10846 )

2025-11-23 04:55:22 -05:00

LoRA Trainer: LoRA training node in weight adapter scheme (#8446 )

2025-06-13 19:25:59 -04:00

Uni pc sampler now works with audio and video models.

2025-01-18 05:27:58 -05:00

Add Hunyuan 3D 2.1 Support (#8714 )

2025-09-04 20:36:20 -04:00

Fix depending on asserts to raise an exception in BatchedBrownianTree and Flash attn module (#9884 )

2025-09-15 20:05:03 -04:00

block info (#10841 )

2025-11-26 20:28:44 -08:00

Silence clip tokenizer warning. (#8934 )

2025-07-16 14:42:07 -04:00

Controlnet refactor.

2024-06-27 18:43:11 -04:00

Improvements to the TAESD3 implementation.

2024-06-16 02:04:24 -04:00

Z Image model. (#10892 )

2025-11-25 18:41:45 -05:00

Fix loras not working on mixed fp8. (#10899 )

2025-11-26 00:07:58 -05:00

checkpoint_pickle.py

Remove pytorch_lightning dependency.

2023-06-13 10:11:33 -04:00

cli_args.py

--disable-api-nodes now sets CSP header to force frontend offline. (#10829 )

2025-11-21 17:51:55 -05:00

clip_config_bigg.json

Fix potential issue with non clip text embeddings.

2024-07-30 14:41:13 -04:00

clip_model.py

USO style reference. (#9677 )

2025-09-02 15:36:22 -04:00

clip_vision_config_g.json

Add support for clip g vision model to CLIPVisionLoader.

2023-08-18 11:13:29 -04:00

clip_vision_config_h.json

Add support for unCLIP SD2.x models.

2023-04-01 23:19:15 -04:00

clip_vision_config_vitl_336_llava.json

Support llava clip vision model.

2025-03-06 00:24:43 -05:00

clip_vision_config_vitl_336.json

support clip-vit-large-patch14-336 (#4042 )

2024-07-17 13:12:50 -04:00

clip_vision_config_vitl.json

Add support for unCLIP SD2.x models.

2023-04-01 23:19:15 -04:00

clip_vision_siglip_384.json

Support new flux model variants.

2024-11-21 08:38:23 -05:00

clip_vision_siglip_512.json

Support 512 siglip model.

2025-04-05 07:01:01 -04:00

clip_vision.py

Some changes to the previous hunyuan PR. (#9725 )

2025-09-04 20:39:02 -04:00

conds.py

Add some warnings and prevent crash when cond devices don't match. (#9169 )

2025-08-04 04:20:12 -04:00

context_windows.py

Make step index detection much more robust (#9392 )

2025-08-17 18:54:07 -04:00

controlnet.py

Fix Race condition in --async-offload that can cause corruption (#10501 )

2025-10-29 17:17:46 -04:00

diffusers_convert.py

Remove useless code.

2025-01-24 06:15:54 -05:00

diffusers_load.py

load_unet -> load_diffusion_model with a model_options argument.

2024-08-12 23:20:57 -04:00

float.py

Clamp output when rounding weight to prevent Nan.

2024-10-19 19:07:10 -04:00

gligen.py

Remove some useless code. (#8812 )

2025-07-06 07:07:39 -04:00

hooks.py

Hooks Part 2 - TransformerOptionsHook and AdditionalModelsHook (#6377 )

2025-01-11 12:20:23 -05:00

latent_formats.py

Add cheap latent preview for flux 2. (#10907 )

2025-11-26 04:00:43 -05:00

lora_convert.py

Implement the USO subject identity lora. (#9674 )

2025-09-01 18:54:02 -04:00

lora.py

Support the omnigen2 umo lora. (#9886 )

2025-09-15 18:10:55 -04:00

model_base.py

Fix Flux2 reference image mem estimation. (#10905 )

2025-11-26 02:36:19 -05:00

model_detection.py

Z Image model. (#10892 )

2025-11-25 18:41:45 -05:00

model_management.py

Account for the VRAM cost of weight offloading (#10733 )

2025-11-27 01:03:03 -05:00

model_patcher.py

Account for the VRAM cost of weight offloading (#10733 )

2025-11-27 01:03:03 -05:00

model_sampling.py

Refactor model sampling sigmas code. (#10250 )

2025-10-08 17:49:02 -04:00

nested_tensor.py

WIP way to support multi multi dimensional latents. (#10456 )

2025-10-23 21:21:14 -04:00

ops.py

Fix loras not working on mixed fp8. (#10899 )

2025-11-26 00:07:58 -05:00

options.py

Only parse command line args when main.py is called.

2023-09-13 11:38:20 -04:00

patcher_extension.py

Fix order of inputs nested merge_nested_dicts (#10362 )

2025-10-15 16:47:26 -07:00

pixel_space_convert.py

Changes to the previous radiance commit. (#9851 )

2025-09-13 18:03:34 -04:00

quant_ops.py

Fix loras not working on mixed fp8. (#10899 )

2025-11-26 00:07:58 -05:00

rmsnorm.py

Add warning when using old pytorch. (#9347 )

2025-08-15 00:22:26 -04:00

sample.py

Fix mistake. (#10484 )

2025-10-25 23:07:29 -04:00

sampler_helpers.py

Added context window support to core sampling code (#9238 )

2025-08-13 21:33:05 -04:00

samplers.py

WIP way to support multi multi dimensional latents. (#10456 )

2025-10-23 21:21:14 -04:00

sd1_clip_config.json

Fix potential issue with non clip text embeddings.

2024-07-30 14:41:13 -04:00

sd1_clip.py

Lower vram usage for flux 2 text encoder. (#10887 )

2025-11-25 14:58:39 -05:00

sd.py

Z Image model. (#10892 )

2025-11-25 18:41:45 -05:00

sdxl_clip.py

Add a T5TokenizerOptions node to set options for the T5 tokenizer. (#7803 )

2025-04-25 19:36:00 -04:00

supported_models_base.py

Mixed Precision Quantization System (#10498 )

2025-10-28 16:20:53 -04:00

supported_models.py

Adjustments to Z Image. (#10893 )

2025-11-25 19:02:51 -05:00

utils.py

WIP way to support multi multi dimensional latents. (#10456 )

2025-10-23 21:21:14 -04:00