I’m still not clear if it’s going to deliver the unique layers to you? If you se...

jamilton · 2025-12-19T23:43:36 1766187816

The linked GitHub readme says it outputs a powerpoint file of the layers.

Llamamoe · 2025-12-19T23:51:07 1766188267

...of all the possible formats, it outputs.. a powerpoint presentation..? What.

dragonwriter · 2025-12-20T04:06:49 1766203609

The github repo includes (among other things) a script (relying on python-pptx) to output decomposed layer images into a pptx file “where you can edit and move these layers flexibly.” (I've never user Powerpoint for this, but maybe it is good enough for this and ubiquitous enough that this is sensible?)

djfobbz · 2025-12-20T00:15:15 1766189715

Lol, right?!?! I would've expected sequential PNGs followed by SVGs once the model improved.

CamperBob2 · 2025-12-20T00:32:35 1766190755

That's what the example code at https://old.reddit.com/r/StableDiffusion/comments/1pqnghp/qw... generates. You get 0.png, 1.png ... n.png, where n= the requested number of layers-1.

It'll drop a 600W RTX 6000 to its knees for about a minute, but it does work.

dvrp · 2025-12-20T01:37:06 1766194626

I saw some people at a company called Pruna AI got it down to 8 seconds with Cloudflare/Replicate, but I don't know if it was on consumer hardware or an A100/H100/H200, and I don't know if the inference optimization is open-source yet.

oefrha · 2025-12-20T02:02:18 1766196138

I don't see the word powerpoint anywhere in https://github.com/QwenLM/Qwen-Image-Layered, I only see a code snippet saving a bunch of PNGs:

  with torch.inference_mode():
      output = pipeline(**inputs)
      output_image = output.images[0]
  
  for i, image in enumerate(output_image):
      image.save(f"{i}.png")

Unless it's a joke that went over my head or you're talking about some other GitHub readme (there's only one GitHub link in TFA), posting an outright lie like this is not cool.

dragonwriter · 2025-12-20T04:02:00 1766203320

> I don't see the word powerpoint anywhere in https://github.com/QwenLM/Qwen-Image-Layered,

The word "powerpoint" is not there, however this text is:

“The following scripts will start a Gradio-based web interface where you can decompose an image and export the layers into a pptx file, where you can edit and move these layers flexibly.”

oefrha · 2025-12-20T04:22:45 1766204565

Oh okay I missed it, sorry. But that’s just using a separate python-pptx package to export the generated list of images to a .pptx file, not something inherent to the model.