Update readme.md

Jukka Seppänen 2024-08-07 02:15:27 +03:00 committed by GitHub
parent 2ae70dd82e
commit 49767f1cda
GPG Key ID: B5690EEEBB952194

@@ -4,6 +4,13 @@ Currently requires diffusers with PR: https://github.com/huggingface/diffusers/p
This is specified in requirements.txt
Uses the same T5 model as SD3 and Flux; fp8 works fine too. Memory requirements depend mostly on the video length.
VAE decoding seems to be the only stage that takes a lot of VRAM when everything is offloaded, peaking momentarily at around 13-14GB.
Sampling itself takes only around 5-6GB.
Hacked in img2img to attempt a vid2vid workflow; it works interestingly with some inputs but is highly experimental.
https://github.com/user-attachments/assets/e6951ef4-ea7a-4752-94f6-cf24f2503d83
https://github.com/user-attachments/assets/9e41f37b-2bb3-411c-81fa-e91b80da2559