RAMPressure caching may need to purge the same model that you are
currently trying to offload for VRAM freeing. In this case, the
RAMPressure cache takes priority and needs to be able to pull the
trigger on dumping the whole model and freeing the ModelPatcher in
question. To do this, defer the actual transfer of model weights from
GPU to RAM to model_management state rather than making it part of
ModelPatcher. This is done as a list of weakrefs.
If the RAM cache decides to free the model you are currently unloading,
the ModelPatcher and refs simply disappear in the middle of the
unloading process, and both RAM and VRAM will be freed.
The unpatcher now queues the individual leaf modules to be offloaded
one-by-one so that RAM levels can be monitored.
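A minimal sketch of this deferral, with hypothetical names
(`offload_queue`, `queue_offload`, `drain_offload_queue`, and the
`have_ram_headroom` callback are illustrative, not the real API):

```python
import weakref

# Module-level state in model_management, not on the ModelPatcher, so the
# RAM cache can free the patcher mid-unload without dangling strong refs.
offload_queue = []

def queue_offload(leaf_modules):
    # Hold only weak references to the modules awaiting GPU->RAM transfer.
    offload_queue.extend(weakref.ref(m) for m in leaf_modules)

def drain_offload_queue(have_ram_headroom):
    # Move leaf modules one-by-one so RAM levels can be checked between
    # each transfer.
    while offload_queue:
        module = offload_queue.pop(0)()
        if module is None:
            continue  # RAM cache purged the model; RAM and VRAM already free
        if not have_ram_headroom():
            break  # let the caller free some RAM before continuing
        module.to("cpu")
```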
Note that the UnloadPartially potentially done as part of a load will
not be freeable this way; however, it shouldn't be anyway, as that is
the currently active model, and the RAM cache cannot save you if you
can't even fit the one model you are currently trying to use.
This is currently put together as a list of indexes, assuming
current_loaded_models doesn't change. However, we might need to purge a
model as part of the offload process, which means this list can change
in the middle of the freeing process. Handle this by taking independent
refs to the LoadedModel objects and doing safe by-value deletion from
current_loaded_models.
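A minimal sketch of the by-value deletion, with hypothetical names
(`free_models_by_value` stands in for the real call site):

```python
def free_models_by_value(current_loaded_models, selected_indexes):
    # Snapshot independent strong refs up front; the indexes can go stale
    # if a model is purged while we are freeing.
    to_free = [current_loaded_models[i] for i in selected_indexes]
    for loaded in to_free:
        loaded.model_unload()
        try:
            current_loaded_models.remove(loaded)  # by value, not by index
        except ValueError:
            pass  # already removed by a purge mid-process
```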
Currently this hard-assumes that the caller of model_unload will keep
current_loaded_models in sync. With RAMPressureCache it is possible for
garbage collection to occur in the middle of the model free process,
which can split these two steps.
Move the headroom logic into the RAM cache to make it a little easier to
call to "free me some RAM".
Rename the API to free_ram().
Split off the clean_list creation into a completely separate function to
avoid any stray strong reference to the content-to-be-freed on the
stack.
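A minimal sketch of that split, assuming hypothetical cache internals
(`candidates` and `evict` are illustrative):

```python
import weakref

def _build_clean_list(cache, bytes_needed):
    # Build the eviction list in its own stack frame so any strong refs
    # to the content-to-be-freed die when this function returns.
    return [weakref.ref(entry) for entry in cache.candidates(bytes_needed)]

def free_ram(cache, bytes_needed):
    # The headroom logic lives in the cache itself: "free me some RAM".
    for ref in _build_clean_list(cache, bytes_needed):
        entry = ref()
        if entry is not None:
            cache.evict(entry)
```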
* feat(security): add System User protection with `__` prefix
Add protected namespace for custom nodes to store sensitive data
(API keys, licenses) that cannot be accessed via HTTP endpoints.
Key changes:
- New API: get_system_user_directory() for internal access
- New API: get_public_user_directory() with structural blocking
- 3-layer defense: header validation, path blocking, creation prevention
- 54 tests covering security, edge cases, and backward compatibility
System Users use `__` prefix (e.g., __system, __cache) following
Python's private member convention. They exist in user_directory/
but are completely blocked from /userdata HTTP endpoints.
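A minimal sketch of the structural-blocking layer, with hypothetical
signatures (the real change also adds header validation and creation
prevention on top of this):

```python
import os

SYSTEM_USER_PREFIX = "__"

def get_system_user_directory(user_root: str, name: str) -> str:
    # Internal-only accessor for trusted code, never exposed over HTTP.
    return os.path.join(user_root, SYSTEM_USER_PREFIX + name)

def get_public_user_directory(user_root: str, user_id: str) -> str:
    # Structurally refuse the protected namespace so /userdata endpoints
    # can never resolve a path into it.
    if user_id.startswith(SYSTEM_USER_PREFIX):
        raise PermissionError("system users are not reachable over HTTP")
    return os.path.join(user_root, user_id)
```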
* style: remove unused imports
* Support video tiny VAEs
* Fix lighttaew scaling
* Also support video taes in previews
Only the first frame for now, as live preview playback is currently only available through the VHS custom nodes.
* Support Wan 2.1 lightVAE
* Relocate elif block and set Wan VAE dim directly without using pruning rate for lightvae
* mm: default to 0 for NUM_STREAMS
Don't count the compute stream as an offload stream. This makes async
offload accounting easier.
* mm: remove 128MB minimum
This is a holdover from a previous offloading system requirement. Remove
it to make the behaviour of the loader and partial unloader consistent.
* mp: order the module list by offload expense
Calculate an approximate temporary VRAM cost of offloading a weight and
primarily order the module load list by that. In the simple case this is
just the module weight size, but a weight with a lora consumes
considerably more VRAM to do the lora application on-the-fly.
This will slightly prioritize lora weights, but is really for proper
VRAM offload accounting.
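A minimal sketch of the ordering, with hypothetical helpers
(`offload_cost` and the `lora_patches` mapping are illustrative):

```python
import torch.nn as nn

def module_weight_size(module: nn.Module) -> int:
    return sum(p.numel() * p.element_size() for p in module.parameters())

def offload_cost(module: nn.Module, lora_patches: dict) -> int:
    # Base cost is the weight itself; pending lora patches add the
    # temporary buffers needed to apply them on-the-fly.
    cost = module_weight_size(module)
    for patch in lora_patches.get(module, []):
        cost += patch.numel() * patch.element_size()
    return cost

def order_by_offload_expense(modules: list, lora_patches: dict) -> list:
    # Most expensive first: every weight that follows is smaller, so the
    # VRAM budgeted for the largest offload covers all of them.
    return sorted(modules, key=lambda m: offload_cost(m, lora_patches),
                  reverse=True)
```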
* mp: Account for the VRAM cost of weight offloading
When checking the VRAM headroom, assume that the weight needs to be
offloaded, and only load it if there is space for both the load and the
offload cost times the number of streams.
As the weights are ordered from largest to smallest by offload cost,
this is guaranteed to fit in VRAM (tm), as all weights that follow will
be smaller.
Make the partial unload aware of this system as well by saving the
budget for offload VRAM to the model state and accounting accordingly.
It's possible that a partial unload increases the size of the largest
offloaded weights, and thus needs to unload a little more than asked to
accommodate the bigger temp buffers.
Honor the existing code's 128MB floor on model weight loading by having
the patcher honor it separately, without regard to offloading.
Otherwise, when MM specifies its 128MB minimum, MP will see the biggest
weights and budget that 128MB entirely to the offload buffer while
loading nothing, which isn't the intent of these minimums. The same
clamp applies in the case of a partial offload of the currently loading
model.
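A minimal sketch of the headroom accounting under those rules (all names
are illustrative):

```python
MIN_WEIGHT_BYTES = 128 * 1024 * 1024  # existing 128MB floor on weights

def can_load(weight_size, largest_offload_cost, free_vram, num_streams):
    # Reserve one offload-sized temp buffer per offload stream on top of
    # the weight itself.
    return free_vram >= weight_size + largest_offload_cost * num_streams

def weight_budget(total_budget, largest_offload_cost, num_streams):
    # Clamp so a small budget is never consumed entirely by the offload
    # buffer reservation, leaving nothing for actual weights.
    budget = total_budget - largest_offload_cost * num_streams
    return max(budget, MIN_WEIGHT_BYTES)
```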
* Create nodes_dataset.py
* Add encoded dataset caching mechanism
* make training node work with our dataset system
* allow trainer node to get datasets of different resolutions
* move all dataset-related implementation to nodes_dataset
* Rewrite dataset system with new io schema
* Rewrite training system with new io schema
* add UI progress bar
* Add outputs' id/name
* Fix bad id/naming
* use a single process instead of an input list when not needed
* fix wrong output_list flag
* use torch.load/save and fix bad behaviors (see the sketch below)
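A minimal sketch of the encoded-dataset cache round-trip, with a
hypothetical cache path and helper names:

```python
import os
import torch

def cache_encoded(latents: torch.Tensor, cache_path: str) -> None:
    # Persist encoded latents as native tensors instead of ad-hoc formats.
    torch.save(latents, cache_path)

def load_encoded(cache_path: str):
    if os.path.exists(cache_path):
        # weights_only=True avoids unpickling arbitrary objects.
        return torch.load(cache_path, map_location="cpu", weights_only=True)
    return None
```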