rattus 3f382a4f98
quant ops: Dequantize weight in-place (#10935)
In flux2 these weights are huge (200MB). As plain_tensor is a throw-away
deep copy, do this multiplication in-place to save VRAM.
2025-11-27 08:06:30 -08:00
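The saving described in the commit message can be illustrated with a small PyTorch sketch. This is not the code from the commit: the function names `dequantize_out_of_place` and `dequantize_in_place` are hypothetical, the tensors are tiny CPU stand-ins for a real weight, and it assumes the throw-away copy already has the target dtype so the product can be written back into it.

```python
import torch


def dequantize_out_of_place(plain_tensor: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    # `*` allocates a new tensor for the product, so the copy and the
    # result are both alive at the same time.
    return plain_tensor * scale


def dequantize_in_place(plain_tensor: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    # `mul_` writes the product into plain_tensor's existing storage.
    # This is only safe because the caller owns plain_tensor as a
    # throw-away copy (assumption taken from the commit message).
    return plain_tensor.mul_(scale)


# Hypothetical stand-ins for a stored weight and its scale.
quantized = torch.tensor([[1.0, -2.0], [3.0, 4.0]])
scale = torch.tensor(0.5)

plain_a = quantized.clone()
out_a = dequantize_out_of_place(plain_a, scale)

plain_b = quantized.clone()
out_b = dequantize_in_place(plain_b, scale)

same_values = torch.equal(out_a, out_b)
allocated_new_buffer = out_a.data_ptr() != plain_a.data_ptr()
reused_buffer = out_b.data_ptr() == plain_b.data_ptr()
```

Both variants produce the same values. The in-place one avoids the extra result buffer, which for a weight of the size mentioned above is roughly one fewer weight-sized allocation at peak. The stored `quantized` tensor is untouched in either case because the multiplication runs on the clone.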