I added voxel diffusion to Minecraft

StableDiffusion

／u／Timothy_Barnes

6 April 2025 at 00:20


Huge update to the ComfyUI Inpaint Crop and Stitch nodes to inpaint only on masked area. (incl. workflow)

StableDiffusion

／u／elezet4

6 April 2025 at 09:23


Hi folks,

I've just published a huge update to the Inpaint Crop and Stitch nodes.

"✂️ Inpaint Crop" crops the image around the masked area, taking care of pre-resizing the image if desired, extending it for outpainting, filling mask holes, growing or blurring the mask, cutting around a larger context area, and resizing the cropped area to a target resolution.

The cropped image can be used in any standard workflow for sampling.

Then, the "✂️ Inpaint Stitch" node stitches the inpainted image back into the original image without altering unmasked areas.
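To make the crop-then-stitch idea concrete, here is a minimal NumPy sketch. It is not the nodes' actual implementation: the function names, the `context_factor` parameter, and the simple alpha blend are assumptions chosen for illustration, and images are assumed to be float arrays of shape (H, W, 3) with a float mask of shape (H, W).

```python
import numpy as np

def crop_around_mask(image, mask, context_factor=1.5):
    """Crop image and mask to the masked area, grown by context_factor.

    Hypothetical helper: returns (cropped_image, cropped_mask, box).
    """
    ys, xs = np.nonzero(mask > 0)
    if ys.size == 0:
        raise ValueError("mask is empty")
    y0, y1 = ys.min(), ys.max() + 1
    x0, x1 = xs.min(), xs.max() + 1
    # Grow the bounding box around its centre to include some context.
    pad_y = int(round((y1 - y0) * (context_factor - 1) / 2))
    pad_x = int(round((x1 - x0) * (context_factor - 1) / 2))
    y0, y1 = max(0, y0 - pad_y), min(image.shape[0], y1 + pad_y)
    x0, x1 = max(0, x0 - pad_x), min(image.shape[1], x1 + pad_x)
    box = (int(y0), int(y1), int(x0), int(x1))
    return image[y0:y1, x0:x1].copy(), mask[y0:y1, x0:x1].copy(), box

def stitch(original, inpainted_crop, crop_mask, box):
    """Blend the inpainted crop back; pixels where the mask is 0 stay untouched."""
    y0, y1, x0, x1 = box
    result = original.copy()
    alpha = crop_mask[..., None]  # broadcast the float mask over the channels
    region = result[y0:y1, x0:x1]
    result[y0:y1, x0:x1] = alpha * inpainted_crop + (1 - alpha) * region
    return result
```

In this sketch the sampler would run on the cropped image between the two calls; any resizing of the crop to a target resolution (and back) is left out for brevity.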

The main advantages of inpainting only in a masked area with these nodes are:

It is much faster than sampling the whole image.
It enables setting the right amount of context from the image for the prompt to be more accurately represented in the generated picture. Using this approach, you can navigate the tradeoffs between detail and speed, context and speed, and accuracy of representation of the prompt and context.
It enables upscaling before sampling in order to generate more detail, then stitching back in the original picture.
It enables downscaling before sampling if the area is too large, in order to avoid artifacts such as double heads or double bodies.
It enables forcing a specific resolution (e.g. 1024x1024 for SDXL models).
It does not modify the unmasked part of the image, not even passing it through VAE encode and decode.
It takes care of blending automatically.

What's New?

This update does not break old workflows, but it introduces new, improved versions of the nodes that you'd have to switch to: '✂️ Inpaint Crop (Improved)' and '✂️ Inpaint Stitch (Improved)'.

The improvements are:

Stitching is now way more precise. In the previous version, stitching an image back into place could shift it by one pixel. That will not happen anymore.
Images are now cropped before being resized. In the past, they were resized before being cropped. This triggered crashes when the input image was large and the masked area was small.
Images are now not extended more than necessary. In the past, they were extended x3, which was memory inefficient.
The cropped area will stay inside of the image if possible. In the past, the cropped area was centered around the mask and would go out of the image even if not needed.
Fill mask holes will now keep the mask as float values. In the past, it turned the mask into binary (yes/no only).
Added a high-pass filter for the mask that ignores values below a threshold. In the past, a mask with a value of 0.01 (basically black / no mask) would sometimes be treated as masked, which was very confusing to users.
In the (now rare) case that extending out of the image is needed, instead of mirroring the original image, the edges are extended. Mirroring caused confusion among users in the past.
Integrated preresize and extend for outpainting into the crop node. In the past, they were external and could interact weirdly with other features, e.g. expanding for outpainting in all four directions while having "fill_mask_holes" enabled would cause the mask to be fully set across the whole image.
Now works when passing one mask for several images or one image for several masks.
Streamlined many options, e.g. merged the blur and blend features in a single parameter, removed the ranged size option, removed context_expand_pixels as factor is more intuitive, etc.
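Two of the improvements above, keeping the crop window inside the image and ignoring near-zero mask values, can be sketched as follows. This is an illustrative approximation, not the nodes' code: `place_window`, `highpass_mask`, and the default threshold of 0.1 are hypothetical.

```python
import numpy as np

def place_window(center, size, limit):
    """Return (start, end) of a window of `size` centred on `center`.

    When the window fits (size <= limit) it is shifted so it stays
    inside [0, limit]; otherwise it is left centred and overflows.
    """
    start = center - size // 2
    if size <= limit:
        start = max(0, min(start, limit - size))
    return start, start + size

def highpass_mask(mask, threshold=0.1):
    """Zero out mask values below `threshold`; keep the rest as floats."""
    mask = np.asarray(mask, dtype=np.float32)
    return np.where(mask < threshold, 0.0, mask).astype(np.float32)
```

For example, `place_window(10, 64, 512)` gives `(0, 64)` rather than a window starting at a negative coordinate, and `highpass_mask` leaves a 0.5 mask value at 0.5 instead of rounding it to 1.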

The Inpaint Crop and Stitch nodes can be downloaded using ComfyUI-Manager, just look for "Inpaint-CropAndStitch" and install the latest version. The GitHub repository is here.

Video Tutorial

There's a full video tutorial on YouTube: https://www.youtube.com/watch?v=mI0UWm7BNtQ . It is for the previous version of the nodes but is still useful for seeing how to connect the nodes and use the context mask.

Examples

'Crop' outputs the cropped image and mask. You can do whatever you want with them (except resizing). Then, 'Stitch' merges the resulting image back in place.

(drag and droppable png workflow)

Another example, this one with Flux, this time using a context mask to specify the area of relevant context.

(drag and droppable png workflow)

Want to say thanks? Just share these nodes, use them in your workflow, and please star the github repository.

Enjoy!


Any time you pay money to someone in this community, you are doing everyone a disservice. Aggressively pirate "paid" diffusion models for the good of the community and because it's the morally correct thing to do.

StableDiffusion

／u／Parogarr

6 April 2025 at 10:21

I have never charged a dime for any LORA I have ever made, nor would I ever, because every AI model is trained on copyrighted images. This is supposed to be an open source/sharing community. I 100% fully encourage people to leak and pirate any diffusion model they want and to never pay a dime. When things are set to "generation only" on CivitAI like Illustrious 2.0, and you have people like the makers of illustrious holding back releases or offering "paid" downloads, they are trying to destroy what is so valuable about enthusiast/hobbyist AI. That it is all part of the open source community.

"But it costs money to train"

Yeah, no shit. I've rented H100 and H200s. I know it's very expensive. But the point is you do it for the love of the game, or you probably shouldn't do it at all. If you're after money, go join Open AI or Meta. You don't deserve a dime for operating on top of a community that was literally designed to be open.

The point: AI is built upon pirated work. Whether you want to admit it or not, we're all pirates. Pirates who charge pirates should have their boat sunk via cannon fire. It's obscene and outrageous how people try to grift open-source-adjacent communities.

You created a model that was built on another person's model that was built on another person's model that was built using copyrighted material. You're never getting a dime from me. Release your model or STFU and wait for someone else to replace you. NEVER GIVE MONEY TO GRIFTERS.

As soon as someone makes a very popular model, they try to "cash out" and use hype/anticipation to delay releasing a model to start milking and squeezing people to buy "generations" on their website or to buy the "paid" or "pro" version of their model.

IF PEOPLE WANTED TO ENTRUST THEIR PRIVACY TO ONLINE GENERATORS THEY WOULDN'T BE INVESTING IN HARDWARE IN THE FIRST PLACE. NEVER FORGET WHAT AI DUNGEON DID. THE HEART OF THIS COMMUNITY HAS ALWAYS BEEN IN LOCAL GENERATION. GRIFTERS WHO TRY TO WOO YOU INTO SACRIFICING YOUR PRIVACY DESERVE NONE OF YOUR MONEY.


This Studio Ghibli Wan LoRA by @seruva19 produces very beautiful output and they shared a detailed guide on how they trained it w/ a 3090

StableDiffusion

／u／PetersOdyssey

5 April 2025 at 23:45


You can find the guide here.


Looks like Hi3DGen is better than the other 3D generators out there.

StableDiffusion

／u／Plenty_Big4560

6 April 2025 at 08:14


I used Wan2.1, Flux, and local TTS to make a Spongebob bank robbery video:

StableDiffusion

／u／CreepyMan121

6 April 2025 at 02:47


My Krita workflow (NoobAI + Illustrious)

StableDiffusion

／u／Kernubis

6 April 2025 at 12:52


I want to share my creative workflow in Krita.

I don't use regions; I prefer to guide my generations with brushes and colors, then I prompt about it to help the checkpoint understand what it is seeing on the canvas.

I often create a filter layer with some noise and play with its opacity and graininess; this adds tons of detail.

The first pass is done with NoobAI, because it produces far more creative camera angles and is more dynamic than many other checkpoints, even though it's much less sharp.

After this I do a second pass at a denoise of about 25% with another checkpoint and tons of LoRAs. As you can see, I used T-Illunai this time, with many wonderful LoRAs.

I hope this was helpful and that my workflow sparks some creative ideas :)


Updated my Nunchaku workflow V2 to support ControlNets and batch upscaling, now with First Block Cache. 3.6 second Flux images!

StableDiffusion

／u／jib_reddit

6 April 2025 at 13:33

It can make a 10-step 1024x1024 Flux image in 3.6 seconds (on an RTX 3090) with a First Block Cache of 0.150.

Then upscale to 2024X2024 in 13.5 seconds.

My custom SVDQuant finetune is here: https://civitai.com/models/686814/jib-mix-flux


Bladeborne Rider

StableDiffusion

／u／HailoKnight

6 April 2025 at 11:57

Bladeborne Rider - By HailoKnight

"Forged in battle, bound by steel — she rides where legends are born."

Ride into battle with my latest Illustrious LoRA!

It never ceases to amaze me how far these models let us push creativity!

And the best part of it is to see what you guys can make with it! :O

Example prompt used:
"Flatline, Flat vector illustration,,masterpiece, best quality, good quality, very aesthetic, absurdres, newest, 8K, depth of field, focused subject, dynamic close up angle, close up, Beautiful Evil ghost woman, long white hair, see through, glowing blue eyes, wearing a dress,, dynamic close up pose, blue electricity sparks, riding a blue glowing skeleton horse in to battle, sitting on the back of a see through skeleton horse, wielding a glowing sword, holofoil glitter, faint, glowing, otherworldly glow, graveyard in background"

Hope you can enjoy!

You can find the lora here:
https://www.shakker.ai/modelinfo/dbc7e311c4644d8abcbded2e74543233?from=personal_page&versionUuid=a227c9c83ddb40a890c76fb0abaf4c17


Do you edit your AI images after generation? Here's a before and after comparison

StableDiffusion

／u／Ztox_

6 April 2025 at 01:19


Hey everyone! This is my second post here — I’ve been experimenting a lot lately and just started editing my AI-generated images.

In the image I’m sharing, the right side is the raw output from Stable Diffusion. While it looks impressive at first, I feel like it has too much detail — to the point that it starts looking unnatural or even a bit absurd. That’s something I often notice with AI images: the extreme level of detail can feel artificial or inhuman.

On the left side, I edited the image using Forge and a bit of Krita. I mainly focused on removing weird artifacts, softening some overly sharp areas, and dialing back that “hyper-detailed” look to make it feel more natural and human.

I’d love to know:
– Do you also edit your AI images after generation?
– Or do you usually keep the raw outputs as they are?
– Any tips or tools you recommend?

Thanks for checking it out! I’m still learning, so any feedback is more than welcome 😊

My CivitAI: espadaz Creator Profile | Civitai


Wake up 3060 12gb! We have OpenAI closed models to burn.

StableDiffusion

／u／-Ellary-

5 April 2025 at 18:23


Wan2.1 I2V is good at understanding what it is seeing

StableDiffusion

／u／Leading_Hovercraft82

6 April 2025 at 12:59


Wan 2.1 I2V (So this is the 2nd version with Davinci 2x Upscaling)

StableDiffusion

／u／cyboghostginx

5 April 2025 at 20:06


Check it out


A1111 suddenly stopped working for me after 1 yr?

StableDiffusion

／u／AiSuperHarem

6 April 2025 at 12:59

https://preview.redd.it/7577qecrp7te1.png?width=2560&format=png&auto=webp&s=092d3421d904385f482d67b04442cd1fe7dda9f6

Hi, I've been using A1111 with SD 1.5 for over a year, but recently I started getting this error. Can I get some help? I'm also now prompted to log in to GitHub, which didn't happen until recently...


Wan 2.1-Fun 1.3b Really doing some heavy lifting

StableDiffusion

／u／Comed_Ai_n

6 April 2025 at 11:27


Images created with Flux Dev. Animated with Wan 2.1-Fun 1.3b with keyframes at the beginning, middle and end.

Prompt: The cosmic entity slowly emerges from the darkness. Its form, a nightmarish blend of organic and arcane, shifts subtly. Tentacles writhe behind its head, their crimson tips glowing faintly. Its eyes blinks slowly, the pink iris reflecting the starlight. Golden, jagged horns gleam as they catch the cosmic star light in outer space.


I read that 1% of TV static comes from radiation from the Big Bang. Is there any way to use TV static as latent noise to generate images with Stable Diffusion?

StableDiffusion

／u／More_Bid_2197

5 April 2025 at 15:49


See Static? You’re Seeing The Last Remnants of The Big Bang

One percent of your old TV's static comes from CMBR (Cosmic Microwave Background Radiation). CMBR is the electromagnetic radiation left over from the Big Bang. We humans, 13.8 billion years later, are still seeing the leftover energy from that event.
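One way to approach the question, sketched here under assumptions: capture some static as grayscale frames, then rescale the pixel values to zero mean and unit variance so they roughly match the statistics a sampler expects from latent noise. The function name and the default `latent_shape` of (4, 64, 64), which corresponds to a 512x512 image in SD 1.x, are choices made for this illustration. Standardising only fixes the mean and variance; captured static is not guaranteed to be Gaussian or uncorrelated, so results may differ from ordinary random noise.

```python
import numpy as np

def static_to_latent_noise(frames, latent_shape=(4, 64, 64)):
    """Turn captured static pixel values into standardised latent noise.

    `frames` is any array-like of pixel intensities (e.g. grayscale video
    frames). Hypothetical helper: uses the first prod(latent_shape) samples.
    """
    samples = np.asarray(frames, dtype=np.float64).ravel()
    needed = int(np.prod(latent_shape))
    if samples.size < needed:
        raise ValueError(f"need {needed} samples, got {samples.size}")
    samples = samples[:needed]
    std = samples.std()
    if std == 0:
        raise ValueError("static has no variation")
    # Shift and scale to zero mean and unit variance.
    samples = (samples - samples.mean()) / std
    return samples.reshape(latent_shape).astype(np.float32)
```

The resulting array would still need to be converted to whatever tensor type the frontend uses and fed in wherever it accepts custom initial noise, which varies by tool.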


Looking for an extension but forgot the name

StableDiffusion

／u／Thick-Prune7053

6 April 2025 at 12:33

I stopped using Stable Diffusion for over a year and did a clean install, but now I've forgotten the name of a useful extension I had. It lets you delete checkpoints/LoRAs easily and gives you prompts for the LoRA you're using.


Changed Drive Letter, now getting "Fatal error in launcher: Unable to create process using"

StableDiffusion

／u／HyeVltg3

6 April 2025 at 14:31

Can anyone make sense of what's going on? My next step is to scrap it and start from scratch, but if there's a simple fix that would be great too!

--------------------------

F:\SD-JAN2025\venv\Scripts>activate.bat

(venv) F:\SD-JAN2025\venv\Scripts>pip3 uninstall torch
Fatal error in launcher: Unable to create process using '"G:\SD-JAN2025\venv\Scripts\python.exe" "F:\SD-JAN2025\venv\Scripts\pip3.exe" uninstall torch': The system cannot find the file specified.

(venv) F:\SD-JAN2025\venv\Scripts>pip uninstall torch

Fatal error in launcher: Unable to create process using '"G:\SD-JAN2025\venv\Scripts\python.exe" "F:\SD-JAN2025\venv\Scripts\pip.exe" uninstall torch': The system cannot find the file specified.

(venv) F:\SD-JAN2025\venv\Scripts>py pip uninstall torch

C:\Users\*user*\AppData\Local\Programs\Python\Python312\python.exe: can't open file 'F:\\SD-JAN2025\\venv\\Scripts\\pip': [Errno 2] No such file or directory

(venv) F:\SD-JAN2025\venv\Scripts>pip uninstall torch

Fatal error in launcher: Unable to create process using '"G:\SD-JAN2025\venv\Scripts\python.exe" "F:\SD-JAN2025\venv\Scripts\pip.exe" uninstall torch': The system cannot find the file specified.

(venv) F:\SD-JAN2025\venv\Scripts>where python

F:\SD-JAN2025\venv\Scripts\python.exe

C:\Users\*user*\AppData\Local\Programs\Python\Python310\python.exe

C:\Users\*user*\AppData\Local\Programs\Python\Python312\python.exe

C:\Users\*user*\AppData\Local\Microsoft\WindowsApps\python.exe

(venv) F:\SD-JAN2025\venv\Scripts>deactivate.bat

F:\SD-JAN2025\venv\Scripts>where python

F:\SD-JAN2025\venv\Scripts\python.exe

C:\Users\*user*\AppData\Local\Programs\Python\Python310\python.exe

C:\Users\*user*\AppData\Local\Programs\Python\Python312\python.exe

C:\Users\*user*\AppData\Local\Microsoft\WindowsApps\python.exe

-------------------------------------

Fatal error in launcher: Unable to create process using '"G:

still points to my old drive letter, G.
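For context, the `pip.exe` and `pip3.exe` launchers in a Windows venv embed the absolute path of the interpreter they were created for, which is consistent with the error still naming G:. The sketch below is a hypothetical diagnostic (the function name and the choice of files to scan are assumptions): it lists which files in a venv still contain the old prefix.

```python
from pathlib import Path

def find_stale_paths(venv_dir, old_prefix):
    """Return names of venv files that still contain `old_prefix`.

    Checks pyvenv.cfg and every file in the Scripts folder, reading them
    as raw bytes so that .exe launchers can be searched as well.
    """
    venv = Path(venv_dir)
    needle = old_prefix.encode("utf-8").lower()
    candidates = [venv / "pyvenv.cfg"]
    scripts = venv / "Scripts"
    if scripts.is_dir():
        candidates += sorted(p for p in scripts.iterdir() if p.is_file())
    stale = []
    for path in candidates:
        if path.is_file() and needle in path.read_bytes().lower():
            stale.append(path.name)
    return stale
```

Calling something like `find_stale_paths(r"F:\SD-JAN2025\venv", r"G:\SD-JAN2025")` would show how many files are affected. Two common workarounds are to run pip through the interpreter directly (`python.exe -m pip uninstall torch` from the venv's Scripts folder), which bypasses the launcher, or to delete and recreate the venv on the new drive letter.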


Is there any way to improve the Trellis model?

StableDiffusion

／u／dinhchicong

6 April 2025 at 09:43

Hi everyone,
It’s been about 4 months since TRELLIS was released, and it has been super useful for my work—especially for generating 3D models in Gaussian Splatting format from .ply files.

Recently, I’ve been digging deeper into how Trellis works to see if there are ways to improve the output quality. Specifically, I’m exploring ways to evaluate and enhance rendered images from 360-degree angles, aiming for sharper and more consistent results. (Previously, I mainly focused on improving image quality by using better image generation models like Flux-Pro 1.1 or optimizing evaluation metrics.)

I also came across Hunyuan3D V2, which looks promising—but unfortunately, it doesn’t support exporting to Gaussian Splatting format.

Has anyone here tried improving Trellis, or has any idea how to enhance the 3D generation pipeline? Maybe we can brainstorm together for the benefit of the community.

Example trellis + flux pro 1.1:

Prompt: 3D butterfly with colourful wings

Image from Flux pro 1.1

Output trellis

