SD3 Controlnet
from https://huggingface.co/InstantX/SD3-Controlnet-Pose
control image | weight=0.0 | weight=0.3 | weight=0.5 | weight=0.7 | weight=0.9 |
---|---|---|---|---|---|
Please ensure that the version of diffusers >= 0.30.0.dev0.
# Demo
python import torch from diffusers import StableDiffusion3ControlNetPipeline from diffusers.models import SD3ControlNetModel, SD3MultiControlNetModel from diffusers.utils import load_image
load pipeline
controlnet = SD3ControlNetModel.frompretrained("InstantX/SD3-Controlnet-Pose") pipe = StableDiffusion3ControlNetPipeline.frompretrained( "stabilityai/stable-diffusion-3-medium-diffusers", controlnet=controlnet ) pipe.to("cuda", torch.float16)
config
controlimage = loadimage("https://huggingface.co/InstantX/SD3-Controlnet-Pose/resolve/main/pose.jpg") prompt = 'Anime style illustration of a girl wearing a suit. A moon in sky. In the background we see a big rain approaching. text "InstantX" on image' nprompt = 'NSFW, nude, naked, porn, ugly' image = pipe( prompt, negativeprompt=nprompt, controlimage=controlimage, controlnetconditioning_scale=0.5, ).images[0] image.save('image.jpg') ```
Limitation
Due to the fact that only 1024*1024 pixel resolution was used during the training phase, the inference performs best at this size, with other sizes yielding suboptimal results. We will initiate multi-resolution training in the future, and at that time, we will open-source the new weights.
评论