
Conversation

ashen-sensored

No description provided.

@ashen-sensored
Author

By shifting the guidance start time, we let the vanilla UNet lay out the foundation at high noise before ControlNet's correction is applied, which makes it possible to retain most of the information from the original generation.
[images attached]
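
Conceptually, the new setting gates when ControlNet's conditioning participates in the sampling loop. A minimal sketch of that gating, assuming the start/end values are compared against the normalized step index (the names below are illustrative, not the extension's actual code):

```python
# Illustrative sketch only, not the extension's actual code: decide whether
# ControlNet conditioning should be applied at a given sampling step.

def controlnet_active(step: int, total_steps: int,
                      guidance_start: float = 0.0,
                      guidance_end: float = 1.0) -> bool:
    progress = step / total_steps          # 0.0 at the first step, ~1.0 at the last
    return guidance_start <= progress <= guidance_end

# With Guidance Start = 0.19 and 20 steps, the first few steps are left to the
# vanilla UNet so the original composition can form before ControlNet corrects it.
steps = 20
active = [controlnet_active(s, steps, guidance_start=0.19) for s in range(steps)]
print(active.index(True))  # first controlled step: 4
```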

@ashen-sensored
Author

ashen-sensored commented Feb 26, 2023

Demo:
Guidance Start: 0 (default behavior)
[image]

Guidance Start: 0.19
[image]

@Mikubill
Owner

Looks good. Also need to make some changes in API handler

@ashen-sensored
Author

> Looks good. Also need to make some changes in API handler

The changes for the new parameter have been applied to api.py and xyz_grid_support.py.
I did a directory search and I think I covered all the related locations.
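
For API users, the net effect is one extra field per ControlNet unit. A hypothetical request sketch is below; the field name, unit structure, and route are assumptions for illustration and may not match what api.py actually exposes:

```python
# Hypothetical illustration only; check api.py for the real field names and route.
import requests

payload = {
    "prompt": "a photo of a hand",
    "steps": 20,
    "controlnet_units": [{          # assumed unit structure
        "module": "depth",
        "model": "control_sd15_depth",
        "weight": 1.0,
        "guidance_start": 0.19,     # the new parameter from this PR
    }],
}
requests.post("http://127.0.0.1:7860/controlnet/txt2img", json=payload)  # assumed route
```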

@aiton-sd
Contributor

aiton-sd commented Feb 27, 2023

The following change is probably required:

`params = [None] * 14` → `params = PARAM_COUNT`
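
Presumably the point is to stop hard-coding the per-unit argument count so the xyz grid support doesn't break every time a slider such as Guidance Start is added. A rough sketch of that intent (the constant's value and surrounding lines are assumptions, not the file's actual contents):

```python
# Sketch of the suggested direction, not xyz_grid_support.py's exact code.
PARAM_COUNT = 15  # assumed: one slot per ControlNet unit argument, including the new one

# before: breaks silently whenever another argument is added
params = [None] * 14

# after: the padding list tracks the shared constant
params = [None] * PARAM_COUNT
```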

gitadmin0608 and others added 2 commits February 26, 2023 23:33
Merge branch 'main' into feature/guidance-start

# Conflicts:
#	scripts/api.py
@Mikubill Mikubill merged commit 3752046 into Mikubill:main Feb 27, 2023
@ashen-sensored ashen-sensored deleted the feature/guidance-start branch February 27, 2023 08:51
@enn-nafnlaus

enn-nafnlaus commented Feb 27, 2023

I think when guidance was added (I updated the ControlNet extension for the first time last night), it broke the ability to use ControlNet in batches. I need to do some more testing, but when doing prompt-travel with ControlNet enabled, which used to work just fine, I was only seeing the effects of ControlNet on the first one or two images of the prompt travel. That would make sense given how Guidance works, if there's a counter that only gets reset when Generate is clicked.

I also think that the way these Guidance bars are laid out is... confusing and misleading. When messing around with them the first time, I was so confused about why I was getting radically different results between 0.16 and 0.17. It's not at all obvious from the name that the value is actually a percentage of your steps, which gets multiplied out and then converted to an integer. That isn't an intuitive way to do it, and it requires the user to do math to figure out what number to set versus how many steps they want it to run for. It also works differently from the bracket notation in AUTOMATIC1111 itself, where you specify the number of steps.

That said, it's undeniably a cool addition!

ED: Come to think of it, I only had the one guidance bar, for the guidance end. Looks like I need to update again and see if the first bug got fixed in the process of the second bar getting added...
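
For what it's worth, the jump between nearby values is easy to reproduce with the fraction-to-step conversion. A rough illustration, assuming the value is simply multiplied by the step count and truncated (the extension's exact rounding may differ), using a hypothetical 12-step run:

```python
# Rough illustration of the complaint; the extension's exact rounding may differ.
def first_guided_step(guidance_start: float, steps: int) -> int:
    return int(guidance_start * steps)   # fraction of steps, truncated to an integer

# At 12 sampling steps, 0.16 and 0.17 land on different integer steps:
print(first_guided_step(0.16, 12))  # 1 -> ControlNet skips only the first step
print(first_guided_step(0.17, 12))  # 2 -> ControlNet skips the first two steps
```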

@Magicalore

How do you even make it work? No matter what I do I get nothing good...

@enn-nafnlaus

How does ControlNet not work for you? It works fine for me, right out of the box. Try following a tutorial step-by-step and tell us which one, and at which step it goes wrong for you.

@Magicalore

> How does ControlNet not work for you? It works fine for me, right out of the box. Try following a tutorial step-by-step and tell us which one, and at which step it goes wrong for you.

ControlNet works fine. I'm trying to get better hands to appear in the image by merging a depth map of some hands and the openpose model together. I tried Guidance Start at 0 and at 0.19 and I get far worse results; even without openpose enabled, it cannot recognize from the depth map that these are hands.
[screenshots attached]

@catboxanon
Contributor

catboxanon commented Feb 27, 2023

I think the example here wasn't made very clear because it was broken up into several comments with little explanation. This is what it's doing.

The original generated image is below; no ControlNet is used:
[image]

The hand is obviously a mess. So, they took this image into Blender and created a hand depth pass as a guide. This is the depth pass they used:
[image]

They then used this depth pass with ControlNet. However, the default settings make ControlNet affect the output during the entire generation. This means that all of the empty information in the depth pass (the black area) is accounted for, which degrades the output so it no longer matches the original prompt. This is that output:
[image]

So, why don't we delay ControlNet from kicking in, let the original noise do its thing, and then use it to fix the hand? That's why this PR was created. Now, when ControlNet is delayed (in this case only a few steps in, since the value used is 0.19, which relative to 20 steps is not that much), the original composition can play out, but the hand can still be fixed. This is that image:
[image]

tl;dr: This seems to be a way to implement passes that only control certain elements, without destroying the original image. I don't think this will be very useful if you're generating something completely from scratch, i.e. trying to use the hand depth pass on a completely different seed; it requires knowledge of what is generated normally. A depth pass from Blender is also a bit overkill imo -- think of using a different module like scribble instead.

@Magicalore

Magicalore commented Feb 27, 2023

Oh, thank you! Yes, my bad, I misunderstood how this worked!

@aleksusklim

A quick and simple question for whoever has a deep understanding of the ControlNet structure:

– Why can't we have a spatial weight on it, i.e. a "mask" applied to the ControlNet itself?
Then we would be able to mask out everything except the hand on the depth map, and it theoretically would not mess with the other parts of the image.

Is this not physically possible? Isn't the weight applied to every pixel/latent independently (even controlling 8×8 squares would be great!)?
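
Purely as a sketch of the idea being asked about (this is not an existing feature of the extension): if the ControlNet residuals are additive feature maps, a single-channel mask could in principle be resized to the latent resolution and multiplied in before the residual is added to the UNet features. Tensor shapes and the residual interface below are assumptions.

```python
# Speculative sketch of a spatial mask on ControlNet residuals -- NOT a feature
# of the extension; tensor shapes and the residual interface are assumptions.
import torch
import torch.nn.functional as F

def masked_control_residual(residual: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    """Scale a ControlNet residual (B, C, H, W) by a one-channel mask (1, 1, h, w)."""
    # Latents are ~8x smaller than the image, so each mask pixel already covers
    # roughly an 8x8 image-space square once resized to the residual resolution.
    m = F.interpolate(mask, size=residual.shape[-2:], mode="bilinear", align_corners=False)
    return residual * m

# Example: keep ControlNet influence only in the lower-right quadrant (e.g. the hand).
residual = torch.randn(1, 320, 64, 64)
mask = torch.zeros(1, 1, 64, 64)
mask[..., 32:, 32:] = 1.0
out = masked_control_residual(residual, mask)
```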

@AbyszOne

> I think the example here wasn't made very clear because it was broken up into several comments with little explanation. This is what it's doing. [...]

Although that does help, wouldn't it be enough to edit the depth map of the image itself? With the same settings you would generate the same image, only edited. It is also useful to leave the same image in img2img as a guide + pose + depth edit (or just scribble).
In any case, I've found much more unique uses for this feature. Thanks for the addition. 👍

@ashen-sensored ashen-sensored restored the feature/guidance-start branch February 27, 2023 20:42