Super-resolution in DeepFloyd is bugged for any non-64x64 to 256x256 upscales #3289

AmericanPresidentJimmyCarter · 2023-04-29T16:40:34Z

Describe the bug

Superresolution code is currently bugged because it does not accept height and width, and rather just assigns a height and width of 256x256. This results in squished images when you try to super resolution anything from the first stage model that is not 64x64.

diffusers/src/diffusers/pipelines/deepfloyd_if/pipeline_if_superresolution.py

Lines 809 to 810 in eade430

    
           height = self.unet.config.sample_size 
        
           width = self.unet.config.sample_size

Solution: Just allow height and width to be passed in. I forked the class and did this manually and it works fine.

Reproduction

from diffusers import DiffusionPipeline
from diffusers.utils import pt_to_pil
import torch

# stage 1
stage_1 = DiffusionPipeline.from_pretrained("DeepFloyd/IF-I-XL-v1.0", variant="fp16", torch_dtype=torch.float16)
stage_1.enable_model_cpu_offload()

# stage 2
stage_2 = DiffusionPipeline.from_pretrained(
    "DeepFloyd/IF-II-L-v1.0", text_encoder=None, variant="fp16", torch_dtype=torch.float16
)
stage_2.enable_model_cpu_offload()


prompt = 'a photo of a kangaroo wearing an orange hoodie and blue sunglasses standing in front of the eiffel tower holding a sign that says "very deep learning"'
generator = torch.manual_seed(1)

# text embeds
prompt_embeds, negative_embeds = stage_1.encode_prompt(prompt)

# stage 1
image = stage_1(
    prompt_embeds=prompt_embeds,
    negative_prompt_embeds=negative_embeds,
    generator=generator,
    output_type="pt",
    height=96,
    width=64,
).images
pt_to_pil(image)[0].save("./if_stage_I.png")

# stage 2
image = stage_2(
    image=image,
    prompt_embeds=prompt_embeds,
    negative_prompt_embeds=negative_embeds,
    generator=generator,
    output_type="pt",
).images
pt_to_pil(image)[0].save("./if_stage_II.png")

Logs

No response

System Info

py 3.10.6, diffusers on latest main

patrickvonplaten · 2023-05-03T12:25:55Z

Yes I think we should indeed allow this! Continuing discussion on #3298

AmericanPresidentJimmyCarter added the bug Something isn't working label Apr 29, 2023

AmericanPresidentJimmyCarter mentioned this issue Apr 30, 2023

Deepfloyd aspect ratios #3291

Closed

devxpy mentioned this issue May 1, 2023

Allow arbitrary aspect ratio in IFSuperResolutionPipeline #3298

Merged

williamberman closed this as completed in #3298 May 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Super-resolution in DeepFloyd is bugged for any non-64x64 to 256x256 upscales #3289

Super-resolution in DeepFloyd is bugged for any non-64x64 to 256x256 upscales #3289

AmericanPresidentJimmyCarter commented Apr 29, 2023

patrickvonplaten commented May 3, 2023

Super-resolution in DeepFloyd is bugged for any non-64x64 to 256x256 upscales #3289

Super-resolution in DeepFloyd is bugged for any non-64x64 to 256x256 upscales #3289

Comments

AmericanPresidentJimmyCarter commented Apr 29, 2023

Describe the bug

Reproduction

Logs

System Info

patrickvonplaten commented May 3, 2023