ComfyUI

mirror of https://git.datalinker.icu/comfyanonymous/ComfyUI synced 2026-01-27 05:47:27 +08:00

Author	SHA1	Message	Date
Alexander Piskun	163b629c70	use new API client in Pixverse and Ideogram nodes (#10543 )	2025-10-29 23:49:03 -07:00
Jedrzej Kosinski	998bf60beb	Add units/info for the numbers displayed on 'load completely' and 'load partially' log messages (#10538 )	2025-10-29 19:37:06 -04:00
comfyanonymous	906c089957	Fix small performance regression with fp8 fast and scaled fp8. (#10537 )	2025-10-29 19:29:01 -04:00
comfyanonymous	25de7b1bfa	Try to fix slow load issue on low ram hardware with pinned mem. (#10536 )	2025-10-29 17:20:27 -04:00
rattus	ab7ab5be23	Fix Race condition in --async-offload that can cause corruption (#10501 ) * mm: factor out the current stream getter Make this a reusable function. * ops: sync the offload stream with the consumption of w&b This sync is necessary as pytorch will queue cuda async frees on the same stream that created the tensor. In the case of async offload, this will be the offload stream. Weights and biases can go out of scope in python, which then triggers the pytorch garbage collector to queue the free operation on the offload stream, possibly before the compute stream has used the weight. This causes a use after free on weight data, leading to total corruption of some workflows. So sync the offload stream with the compute stream after the weight has been used so the free has to wait for the weight to be used. The cast_bias_weight is extended in a backwards compatible way with the new behaviour opt-in on a defaulted parameter. This handles custom node packs calling cast_bias_weight and defeatures async-offload for them (as they do not handle the race). The pattern is now: cast_bias_weight(... , offloadable=True) #This might be offloaded thing(weight, bias, ...) uncast_bias_weight(...) * controlnet: adopt new cast_bias_weight synchronization scheme This is necessary for safe async weight offloading. * mm: sync the last stream in the queue, not the next Currently this peeks ahead to sync the next stream in the queue of streams with the compute stream. This doesn't allow a lot of parallelization, as the end result is you can only get one weight load ahead regardless of how many streams you have. Rotate the loop logic here to synchronize the end of the queue before returning the next stream. This allows weights to be loaded ahead of the compute stream's position.	2025-10-29 17:17:46 -04:00
comfyanonymous	ec4fc2a09a	Fix case of weights not being unpinned. (#10533 )	2025-10-29 15:48:06 -04:00
comfyanonymous	1a58087ac2	Reduce memory usage for fp8 scaled op. (#10531 )	2025-10-29 15:43:51 -04:00
Alexander Piskun	6c14f3afac	use new API client in Luma and Minimax nodes (#10528 )	2025-10-29 11:14:56 -07:00
comfyanonymous	e525673f72	Fix issue. (#10527 )	2025-10-29 00:37:00 -04:00
comfyanonymous	3fa7a5c04a	Speed up offloading using pinned memory. (#10526 ) To enable this feature use: --fast pinned_memory	2025-10-29 00:21:01 -04:00
Alexander Piskun	210f7a1ba5	convert nodes_recraft.py to V3 schema (#10507 )	2025-10-28 14:38:05 -07:00
rattus	d202c2ba74	execution: Allow subgraph nodes to execute multiple times (#10499 ) In the case of --cache-none, lazy and subgraph execution can cause anything to be run multiple times per workflow. If that rerun node is itself a subgraph generator, this will crash for two reasons. pending_subgraph_results[] does not clean up entries after their use. So when a pending_subgraph_result is consumed, remove it from the list so that if the corresponding node is fully re-executed the lookup misses and it falls through to execute the node as it should. Secondly, there is an explicit enforcement against dups in the addition of subgraph nodes as ephemerals to the dynprompt. Remove this enforcement as the use case is now valid.	2025-10-28 16:22:08 -04:00
contentis	8817f8fc14	Mixed Precision Quantization System (#10498 ) * Implement mixed precision operations with a registry design and metadata for quant spec in checkpoint. * Updated design using Tensor Subclasses * Fix FP8 MM * An actually functional POC * Remove CK reference and ensure correct compute dtype * Update unit tests * ruff lint * Implement mixed precision operations with a registry design and metadata for quant spec in checkpoint. * Updated design using Tensor Subclasses * Fix FP8 MM * An actually functional POC * Remove CK reference and ensure correct compute dtype * Update unit tests * ruff lint * Fix missing keys * Rename quant dtype parameter * Rename quant dtype parameter * Fix unittests for CPU build	2025-10-28 16:20:53 -04:00
comfyanonymous	22e40d2ace	Tell users to update their nvidia drivers if portable doesn't start. (#10518 )	2025-10-28 15:08:08 -04:00
comfyanonymous	3bea4efc6b	Tell users to update nvidia drivers if problem with portable. (#10510 )	2025-10-28 04:45:45 -04:00
comfyanonymous	8cf2ba4ba6	Remove comfy api key from queue api. (#10502 )	2025-10-28 03:23:52 -04:00
comfyanonymous	b61a40cbc9	Bump stable portable to cu130 python 3.13.9 (#10508 )	2025-10-28 03:21:45 -04:00
comfyanonymous	f2bb3230b7	ComfyUI version v0.3.67 v0.3.67	2025-10-28 03:03:59 -04:00
Jedrzej Kosinski	614b8d3345	frontend bump to 1.28.8 (#10506 )	2025-10-28 03:01:13 -04:00
ComfyUI Wiki	6abc30aae9	Update template to 0.2.4 (#10505 )	2025-10-28 01:56:30 -04:00
Alexander Piskun	55bad30375	feat(api-nodes): add LTXV API nodes (#10496 )	2025-10-27 22:25:29 -07:00
ComfyUI Wiki	c305deed56	Update template to 0.2.3 (#10503 )	2025-10-27 22:24:16 -07:00
comfyanonymous	601ee1775a	Add a bat to run comfyui portable without api nodes. (#10504 )	2025-10-27 23:54:00 -04:00
comfyanonymous	c170fd2db5	Bump portable deps workflow to torch cu130 python 3.13.9 (#10493 )	2025-10-26 20:23:01 -04:00
Alexander Piskun	9d529e5308	fix(api-nodes): random issues on Windows by capturing general OSError for retries (#10486 )	2025-10-25 23:51:06 -07:00
comfyanonymous	f6bbc1ac84	Fix mistake. (#10484 )	2025-10-25 23:07:29 -04:00
comfyanonymous	098a352f13	Add warning for torch-directml usage (#10482 ) Added a warning message about the state of torch-directml.	2025-10-25 20:05:22 -04:00
Alexander Piskun	e86b79ab9e	convert Gemini API nodes to V3 schema (#10476 )	2025-10-25 14:35:30 -07:00
comfyanonymous	426cde37f1	Remove useless function (#10472 )	2025-10-24 19:56:51 -04:00
Alexander Piskun	dd5af0c587	convert Tripo API nodes to V3 schema (#10469 )	2025-10-24 15:48:34 -07:00
Alexander Piskun	388b306a2b	feat(api-nodes): network client v2: async ops, cancellation, downloads, refactor (#10390 ) * feat(api-nodes): implement new API client for V3 nodes * feat(api-nodes): implement new API client for V3 nodes * feat(api-nodes): implement new API client for V3 nodes * converted WAN nodes to use new client; polishing * fix(auth): do not leak authentication for the absolute urls * convert BFL API nodes to use new API client; remove deprecated BFL nodes * converted Google Veo nodes * fix(Veo3.1 model): take into account "generate_audio" parameter	2025-10-23 22:37:16 -07:00
ComfyUI Wiki	24188b3141	Update template to 0.2.2 (#10461 ) Fix template typo issue	2025-10-24 01:36:30 -04:00
comfyanonymous	1bcda6df98	WIP way to support multi multi dimensional latents. (#10456 )	2025-10-23 21:21:14 -04:00
comfyanonymous	a1864c01f2	Small readme improvement. (#10442 )	2025-10-22 17:26:22 -04:00
rattus	4739d7717f	execution: fold in dependency aware caching / Fix --cache-none with loops/lazy etc (Resubmit) (#10440 ) * execution: fold in dependency aware caching This makes --cache-none compatible with lazy and expanded subgraphs. Currently the --cache-none option is powered by the DependencyAwareCache. The cache attempts to maintain a parallel copy of the execution list data structure, however it is only set up once at the start of execution and does not get meaningful updates to the execution list. This causes multiple problems when --cache-none is used with lazy and expanded subgraphs as the DAC does not accurately update its copy of the execution data structure. DAC has an attempt to handle subgraphs (ensure_subcache), however this does not accurately connect to nodes outside the subgraph. The current semantics of DAC are to free a node ASAP after the dependent nodes are executed. This means that if a subgraph refs such a node it will be requeued and re-executed by the execution_list but DAC won't see it in its to-free lists anymore and will leak memory. Rather than try and cover all the cases where the execution list changes from inside the cache, move the whole problem to the executor, which maintains an always up-to-date copy of the wanted data structure. The executor now has a fast-moving run-local cache of its own. Each _to node has its own mini cache, and the cache is unconditionally primed at the time of add_strong_link. add_strong_link is called for all of static workflows, lazy links and expanded subgraphs so it's the singular source of truth for output dependencies. In the case of a cache-hit, the executor cache will hold the non-none value (it will respect updates if they happen somehow as well). In the case of a cache-miss, the executor caches a None and will wait for a notification to update the value when the node completes.
When a node completes execution, it simply releases its mini-cache and in turn its strong refs on its direct ancestor outputs, allowing for ASAP freeing (same as the DependencyAwareCache but a little more automatic). This now allows for re-implementation of --cache-none with no cache at all. The dependency aware cache was also observing the dependency semantics for the objects and UI cache which is not accurate (this entire logic was always outputs specific). This also prepares for more complex caching strategies (such as RAM pressure based caching), where a cache can implement any freeing strategy completely independently of the DependencyAwareness requirement. * main: re-implement --cache-none as no cache at all The execution list now tracks the dependency aware caching more correctly than the DependencyAwareCache. Change it to a cache that does nothing. * test_execution: add --cache-none to the test suite --cache-none is now expected to work universally. Run it through the full unit test suite. Propagate the server parameterization for whether or not the server is capable of caching, so that the minority of tests that specifically check for cache hits can if else. Hard assert NOT caching in the else to give some coverage of --cache-none expected behaviour to not actually cache.	2025-10-22 15:49:05 -04:00
Jedrzej Kosinski	f13cff0be6	Add custom node published subgraphs endpoint (#10438 ) * Add get_subgraphs_dir to ComfyExtension and PUBLISHED_SUBGRAPH_DIRS to nodes.py * Created initial endpoints, although the returned paths are a bit off currently * Fix path and actually return real data * Sanitize returned /api/global_subgraphs entries * Remove leftover function from early prototyping * Remove added whitespace * Add None check for sanitize_entry	2025-10-21 23:16:16 -04:00
comfyanonymous	9cdc64998f	Only disable cudnn on newer AMD GPUs. (#10437 )	2025-10-21 19:15:23 -04:00
comfyanonymous	560b1bdfca	ComfyUI version v0.3.66 v0.3.66	2025-10-21 01:12:32 -04:00
comfyanonymous	b7992f871a	Revert "execution: fold in dependency aware caching / Fix --cache-none with l…" (#10422 ) This reverts commit b1467da4803017a418c32c159525767f45871ca3.	2025-10-20 19:03:06 -04:00
comfyanonymous	2c2aa409b0	Log message for cudnn disable on AMD. (#10418 )	2025-10-20 15:43:24 -04:00
ComfyUI Wiki	a4787ac83b	Update template to 0.2.1 (#10413 ) * Update template to 0.1.97 * Update template to 0.2.1	2025-10-20 15:28:36 -04:00
Christian Byrne	b5c59b763c	Deprecation warning on unused files (#10387 ) * only warn for unused files * include internal extensions	2025-10-19 13:05:46 -07:00
comfyanonymous	b4f30bd408	Pytorch is stupid. (#10398 )	2025-10-19 01:25:35 -04:00
comfyanonymous	dad076aee6	Speed up chroma radiance. (#10395 )	2025-10-18 23:19:52 -04:00
comfyanonymous	0cf33953a7	Fix batch size above 1 giving bad output in chroma radiance. (#10394 )	2025-10-18 23:15:34 -04:00
comfyanonymous	5b80addafd	Turn off cuda malloc by default when --fast autotune is turned on. (#10393 )	2025-10-18 22:35:46 -04:00
comfyanonymous	9da397ea2f	Disable torch compiler for cast_bias_weight function (#10384 ) * Disable torch compiler for cast_bias_weight function * Fix torch compile.	2025-10-17 20:03:28 -04:00
comfyanonymous	92d97380bd	Update Python 3.14 installation instructions (#10385 ) Removed mention of installing pytorch nightly for Python 3.14.	2025-10-17 18:22:59 -04:00
Alexander Piskun	99ce2a1f66	convert nodes_controlnet.py to V3 schema (#10202 )	2025-10-17 14:13:05 -07:00
rattus128	b1467da480	execution: fold in dependency aware caching / Fix --cache-none with loops/lazy etc (#10368 ) * execution: fold in dependency aware caching This makes --cache-none compatible with lazy and expanded subgraphs. Currently the --cache-none option is powered by the DependencyAwareCache. The cache attempts to maintain a parallel copy of the execution list data structure, however it is only set up once at the start of execution and does not get meaningful updates to the execution list. This causes multiple problems when --cache-none is used with lazy and expanded subgraphs as the DAC does not accurately update its copy of the execution data structure. DAC has an attempt to handle subgraphs (ensure_subcache), however this does not accurately connect to nodes outside the subgraph. The current semantics of DAC are to free a node ASAP after the dependent nodes are executed. This means that if a subgraph refs such a node it will be requeued and re-executed by the execution_list but DAC won't see it in its to-free lists anymore and will leak memory. Rather than try and cover all the cases where the execution list changes from inside the cache, move the whole problem to the executor, which maintains an always up-to-date copy of the wanted data structure. The executor now has a fast-moving run-local cache of its own. Each _to node has its own mini cache, and the cache is unconditionally primed at the time of add_strong_link. add_strong_link is called for all of static workflows, lazy links and expanded subgraphs so it's the singular source of truth for output dependencies. In the case of a cache-hit, the executor cache will hold the non-none value (it will respect updates if they happen somehow as well). In the case of a cache-miss, the executor caches a None and will wait for a notification to update the value when the node completes.
When a node completes execution, it simply releases its mini-cache and in turn its strong refs on its direct ancestor outputs, allowing for ASAP freeing (same as the DependencyAwareCache but a little more automatic). This now allows for re-implementation of --cache-none with no cache at all. The dependency aware cache was also observing the dependency semantics for the objects and UI cache which is not accurate (this entire logic was always outputs specific). This also prepares for more complex caching strategies (such as RAM pressure based caching), where a cache can implement any freeing strategy completely independently of the DependencyAwareness requirement. * main: re-implement --cache-none as no cache at all The execution list now tracks the dependency aware caching more correctly than the DependencyAwareCache. Change it to a cache that does nothing. * test_execution: add --cache-none to the test suite --cache-none is now expected to work universally. Run it through the full unit test suite. Propagate the server parameterization for whether or not the server is capable of caching, so that the minority of tests that specifically check for cache hits can if else. Hard assert NOT caching in the else to give some coverage of --cache-none expected behaviour to not actually cache.	2025-10-17 13:55:15 -07:00
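
The run-local "mini cache" idea described in commits b1467da480 and 4739d7717f can be illustrated with a small toy model. This is only a sketch of the idea as worded in the commit message, not ComfyUI's executor code: the class `RunLocalCache` and the methods `notify_complete`, `inputs_for` and `holders_of` are hypothetical names invented here, and only `add_strong_link` borrows its name from the message. Each consuming node holds strong references to the outputs it depends on, a miss is stored as `None` until the producer finishes, and a finished node drops its own mini cache so that ancestor outputs nobody else needs become collectable.

```python
from typing import Any, Dict, List, Optional


class RunLocalCache:
    """Toy model of a per-consumer mini cache (hypothetical, not ComfyUI code)."""

    def __init__(self) -> None:
        # consumer node -> {producer node: output, or None while still pending}
        self._mini: Dict[str, Dict[str, Optional[Any]]] = {}

    def add_strong_link(self, from_node: str, to_node: str,
                        cached: Optional[Any] = None) -> None:
        # Prime unconditionally: a cache hit stores the value, a miss stores None.
        self._mini.setdefault(to_node, {})[from_node] = cached

    def notify_complete(self, node: str, output: Any) -> None:
        # Fill every slot that was waiting on this node's output.
        for inputs in self._mini.values():
            if node in inputs:
                inputs[node] = output
        # The finished node releases its own mini cache, and with it the
        # strong references it held on its direct ancestors' outputs.
        self._mini.pop(node, None)

    def inputs_for(self, node: str) -> Dict[str, Optional[Any]]:
        return dict(self._mini.get(node, {}))

    def holders_of(self, from_node: str) -> List[str]:
        # Consumers that still hold a reference to from_node's output.
        return sorted(to for to, inputs in self._mini.items() if from_node in inputs)


cache = RunLocalCache()
cache.add_strong_link("load", "sample")
cache.add_strong_link("load", "decode")
cache.add_strong_link("sample", "decode")

cache.notify_complete("load", "MODEL")
cache.notify_complete("sample", "LATENT")
print(cache.holders_of("load"))   # only "decode" still references the model
cache.notify_complete("decode", "IMAGE")
print(cache.holders_of("load"))   # nothing holds it any more
```

In this model no separate cache has to track when an output may be freed: the references disappear as a side effect of consumers completing, which is the property the commit message relies on to re-implement --cache-none as a cache that does nothing.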

4150 Commits
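
The first fix described in commit d202c2ba74 (removing a pending subgraph result once it has been consumed) can be pictured with a toy model. This is a sketch under assumptions, not the real executor: only the name `pending_subgraph_results` comes from the commit message, while `run_twice`, `execute` and the string results are hypothetical. The toy shows only the stale lookup that the message describes; it does not reproduce the actual crash.

```python
from typing import Dict, List


def run_twice(consume_entries: bool) -> List[str]:
    """Fully execute the same subgraph-generating node twice and log each step."""
    pending_subgraph_results: Dict[str, str] = {}
    log: List[str] = []

    def execute(node_id: str) -> None:
        if node_id in pending_subgraph_results:
            # An earlier expansion left a result behind: resolve it.
            if consume_entries:
                # Fixed behaviour: remove the entry when it is consumed.
                result = pending_subgraph_results.pop(node_id)
            else:
                # Old behaviour: the entry stays behind after use.
                result = pending_subgraph_results[node_id]
            log.append(f"resolved {result}")
        else:
            # Lookup missed: run the node, which expands into a subgraph.
            pending_subgraph_results[node_id] = f"subgraph-of-{node_id}"
            log.append(f"expanded {node_id}")

    for _ in range(2):   # two full executions of the same node
        execute("gen")   # should run the node and expand it
        execute("gen")   # should resolve the expansion's result
    return log


print(run_twice(consume_entries=True))
print(run_twice(consume_entries=False))
```

With `consume_entries=True` the second execution misses the lookup and expands the node again, as the commit message intends. With `consume_entries=False` the stale entry is found and the node is never re-executed.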