While we wait for the Flux.1 Dev controlnet models for ComfyUI, how about some composition control using image to image? Turns out that works just great, only often I find myself need longer prompts as they seem to generate better images. Or maybe if I could just ask for what I want, even if I’m not really sure? If only there was an AI to help with that?
Links, as shown in the video:
ComfyUI Flux workflows - https://comfyanonymous.github.io/ComfyUI_examples/flux/
LLM Party - https://github.com/heshengtao/comfyui_LLM_party
Ollama - https://github.com/ollama/ollama
Want to support the channel?
https://www.patreon.com/NerdyRodent
https://www.patreon.com/posts/ai-enhanced-flux-109665789
== Beginners Guides for ComfyUI ==
1. Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
2. Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
3. ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
An AI generated song I made using Udio, along with some AI generated images using Flux.1
Learn more about Flux.1 in ComfyUI at https://youtu.be/DLUx-mK4g0c
Want to support the channel?
https://www.patreon.com/NerdyRodent
#shorts #flux1 #stablediffusion
Flux.1, AuraFlow 0.2 and AuraSR are some of the latest AI image models you can run at home for free on your own PC. Installing Flux.1 Schnell or Dev for ComfyUI is super easy, as ComfyUI has day 0 native support - simply download the models and run!
AuraFlow & Flux work best with at least 24GB VRAM, but fp8 options are available for Flux which use less than 14GB! Flux has both schnell and dev versions available, with Apache and non-commercial licences respectively. Run for free on your own PC, or use their pro service!
Which is the best? Take a look and find out what each of them can do!
Links -
https://huggingface.co/fal/AuraFlow-v0.2
https://blog.fal.ai/aurasr-v2/
https://huggingface.co/black-forest-labs/FLUX.1-schnell
https://comfyanonymous.github.io/ComfyUI_examples/flux/
https://blog.fal.ai/flux-the-largest-open-sourced-text2img-model-now-available-on-fal/
Smaller file alternatives - https://huggingface.co/Kijai/flux-fp8/tree/main
Want to support the channel?
https://www.patreon.com/NerdyRodent
Patreons get loads of extra Flux.1 goodies too - upscale, inpaint, high res fix and more!
https://www.patreon.com/posts/flux-is-awesome-109279424
== Start Learning ComfyUI in 3 Easy Steps! ==
1. Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
2. Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
3. ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
Wave goodbye to InsightFace and it's non-commercial use license and welcome in the new OPEN SOURCE options for LivePortrait in ComfyUI. Works great with static images, video-to-video and also via your webcam. Yes, you can use your face to change the expression on another image :)
No need to install InsightFace. Just Open Source and run!
Want to support the channel?
https://www.patreon.com/NerdyRodent
https://www.patreon.com/posts/liveportrait-108928162
Links:
https://github.com/KwaiVGI/LivePortrait
https://liveportrait.github.io/
https://github.com/kijai/ComfyUI-LivePortraitKJ
https://github.com/yoyo-nb/Thin-Plate-Spline-Motion-Model
https://www.pexels.com/video/woman-wearing-red-dress-7626887/
== Beginners Start Here! ==
1. Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
2. Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
3. ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
== More things! ==
* 1-step SDXL - https://youtu.be/LAQYZWbmkwA
* PixArt in ComfyUI - https://youtu.be/TQduyxvzX4Q
Chapters:
0:00 LivePortrait Intro
0:53 LivePortrait Install
2:06 LivePortrait Image Animation
6:42 LivePortrait Video to video (v2v)
8:56 LivePortrait Webcam
Lots of new free stuff for ComfyUI this week including the new ControlNet Union ProMax Model, Pixart Sigma going to 900M, a better background remover and more!
Want to support the channel?
https://www.patreon.com/NerdyRodent
https://www.patreon.com/posts/controlnet-union-108501230
Repos:
https://huggingface.co/dataautogpt3/PixArt-Sigma-900M/tree/main
https://github.com/john-mnz/ComfyUI-Inspyrenet-Rembg
https://huggingface.co/xinsir/controlnet-union-sdxl-1.0
https://github.com/city96/ComfyUI_ExtraModels
https://github.com/xinsir6/ControlNetPlus
https://huggingface.co/xinsir/controlnet-union-sdxl-1.0/tree/main
Learn more!
New Samplers & Schedulers in ComfyUI: https://youtu.be/-GXJDz8i-Wo
PixArt Sigma: https://youtu.be/TQduyxvzX4Q
== Start your ComfyUI Journey! ==
1. Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
2. Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
3. ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
A Stable Diffusion image model with a truly open source license? Excellent! Count me in :) AuraFlow claims to be exactly that, and it's supported directly in ComfyUI... but what are the image generations like in this 0.1, beta test release?
Want to support the channel?
https://www.patreon.com/NerdyRodent
Patreon post for this video:
https://www.patreon.com/posts/auraflow-truly-108017973
AuraFlow - https://huggingface.co/fal/AuraFlow
== More Stable Diffusion Stuff! ==
* Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
* Faster Stable Diffusions with the LCM LoRA - https://youtu.be/zrxd95Mxz24
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
* Make A Consistent Character in ANY pose - https://youtu.be/SacK9tMVNUA
Stable Diffusion 3 has a new license, so it looks like we're now good to go with commercial use... up to a limit. Do check out the information for yourself, available at - https://stability.ai/news/license-update
Here I do some basic prompt tests, compare outputs vs Pixart Sigma, try using SD3 as a refiner and also via high-res fix.
Want to support the channel?
https://www.patreon.com/NerdyRodent
https://www.patreon.com/posts/sd3-licence-fix-107686733
* ComfyUI Extra Models - https://github.com/city96/ComfyUI_ExtraModels
== Learn more things! ==
1. Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
2. Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
3. ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
4. Pixart Sigma - https://youtu.be/TQduyxvzX4Q
A whole bunch of updates went into ComfyUI recently, and with them we get a selection of new samplers such as EulerCFG++ and DEIS, as well as the new GITS scheduler. See them all in action, then try it yourself at home!
Want to support the channel?
https://www.patreon.com/NerdyRodent
* DEIS, GITS, iPNDM - https://github.com/zju-pi/diff-sampler
* CFG++ - https://arxiv.org/abs/2406.08070
== Learn More Stuff! ==
* Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
* Faster Stable Diffusions with the LCM LoRA - https://youtu.be/zrxd95Mxz24
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
* Make A Consistent Character in ANY pose - https://youtu.be/SacK9tMVNUA
If you've been hunting for an AI art model with a reasonable licence, then perhaps Lumina, Pixart or Hunyuan may be of interest? Each one has it's own little quirks and today they're pitted against each other in another battle of the prompts!
Want to support the channel?
https://www.patreon.com/NerdyRodent
Lumina Next - https://github.com/Alpha-VLLM/Lumina-T2X
Hunyuan - https://github.com/Tencent/HunyuanDiT
Hunyuan Video - https://youtu.be/oDK0-KesWQo
Pixart Sigma - https://github.com/PixArt-alpha/PixArt-sigma
Pixart Sigma Video - https://youtu.be/TQduyxvzX4Q
Music created using udio.
== More Stable Diffusion Stuff! ==
* Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
* Faster Stable Diffusions with the LCM LoRA - https://youtu.be/zrxd95Mxz24
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
* Make A Consistent Character in ANY pose - https://youtu.be/SacK9tMVNUA
Time for a prompt showdown! Try these at home using your favourite model (such as Stable Diffusion 3) and see how they compare to both Pixart Sigma and HunYuan DiT. A wide range of styles are covered, and the prompts are listed below for your copy-and-paste ease :)
Want to support the channel?
https://www.patreon.com/NerdyRodent
https://www.patreon.com/posts/prompt-showdown-106258102
Update: HunYuan v1.1 is out now too - and it's even better!
https://huggingface.co/Tencent-Hunyuan/HunyuanDiT-v1.1
= Prompt Showdown! =
Negative Prompt:
many hands, really wobbly, distorted and blurry fingers and hands.
Positive Prompts:
1:
2: A woman sleeping on the grass.
3: An avocado chair
4: Anime art style blue rabbit flying through the air wearing a red cape and goggles
5: Vector-art style business logo, SVG, simple, plain, psychic rodent emoji.
6: A fantasy-art style rodent mage, level 12
7: Chibi rodent scientist discovering where he left his pencil
8: A giraffe engineer, epic Shōnen manga style, high detail Seinen
9: Kemono style illustration of a rabbit doctor wearing a blue surgeon's gown walking down the path of an old cemetery. Green eyes, brown fur, fine ears, solemn feel. misty path. dignified, austere. fog, gloom. haunted vibe.
10: Cthulhu stands over a kitten, oil painting style. A classical artwork, vintage, old. The scene is set as the beast towers over the tiny, but dreadful, all-black kitten. The bright summer sky is in high contrast to the dark, evil shadows that lurk beneath. The style is reminiscent of Jacques Stella and Pompeo Batoni. In the background is an ancient fiery temple of doom, hewn from the very rock face itself by the kitten. The image quality is astounding.
11: A cubist art style kangaroo druid in the forest, cubism, quality artwork, soft pastel shades
12: In the Chinese ink painting style, a little deer stands leisurely in the lush bamboo forest. The morning light shines through the bamboo leaves, casting mottled light and shadows. The little deer quietly drinks the clear stream water, and the bamboo leaves are swaying in the wind. The whole picture exudes a sense of tranquillity and harmony.
13: A stone statue of a deer relaxing on top of a colourful bed in an old, Victorian house. Professional photo, bokeh.
14: A childish doodle of a bad kitten
15: Paper-cut art style tiger emerging from a bunch of flowers, bold colours, depth, shading, 3d effect, pop-up
16: A ghostly face is peering through the window into my house from outside! The ghost looks very scary but may be wearing a mask, and a sense of evil can be felt. charcoal art, shading, sketch.
17: 在一个充满创意的画廊里,一幅引人注目的画作展示了一只时髦的啮齿动物。画作采用了印象派风格,色彩斑斓,笔触细腻,充满生动的光影效果。画中的小鼠戴着复古圆框眼镜,穿着时尚的格子衬衫和牛仔裤,手持一杯咖啡,悠闲地站在一片繁茂的花园中。背景是柔和的色彩斑点和模糊的树影,营造出宁静与优雅的氛围,突显了小鼠的独特魅力。
== More AI Things! ==
* Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
* Easy Consistent Character in ANY pose - https://youtu.be/SacK9tMVNUA
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
Just a quick video to show some of the language understanding capabilities of two image generators. This time it's HunYuan DiT vs the base SDXL 1.0 model. Some people wondered how the Chinese vs English language negative prompts would impact the SDXL generation, and so here is the answer! Also included are a few other languages, so if you're got your score card from the original video, you can now do some updates ;)
Just to handle two things at once, I've also included the "extended version" for one of the pieces of outro music I've made on Udio!
Full video: https://youtu.be/oDK0-KesWQo
Udio song link: https://www.udio.com/songs/hmeqhhQiEniVK6rUapCrFs
Want to support the channel?
https://www.patreon.com/NerdyRodent
#shorts #sdxl #ai
Overall, in HunYuan-DiT's tests, it scores 59.0% vs 56.7% for Stable Diffusion 3, making it objectively better. More than 50 professional evaluators performed the evaluations, so it must be true... or is it? For you to be able to judge for yourself, I pit HunYuan-DiT against the weaker (but freely available) SDXL in a battle for AI supremacy!
Want to support the channel?
https://www.patreon.com/NerdyRodent
https://www.patreon.com/posts/hunyuan-dit-than-105823648
GitHub Repo:
https://github.com/Tencent/HunyuanDiT
== More Nerdy Stuff! ==
* Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
* Faster Stable Diffusions with Hyper SDXL - https://youtu.be/LAQYZWbmkwA
* Make A Consistent Character in ANY pose - https://youtu.be/SacK9tMVNUA
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
Combine an LLM with Stable Diffusion and you can Omost generate the image you ask for!
Want to support the channel?
https://www.patreon.com/NerdyRodent
Links:
https://github.com/lllyasviel/Omost
https://www.patreon.com/posts/your-image-is-105386033
== More Stable Diffusion Stuff! ==
* Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
* Faster Stable Diffusions with the LCM LoRA - https://youtu.be/zrxd95Mxz24
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
* Make A Consistent Character in ANY pose - https://youtu.be/SacK9tMVNUA
Khoj is your AI "second brain". Get answers to your questions, whether they be online or in your own notes. Use local or hosted LLMs. Self-host locally or use our cloud instance. Access from Obsidian, Emacs, Desktop app, Web or Whatsapp. Make agents, do automations and more! RAG made easy :)
Want to support the channel?
https://www.patreon.com/NerdyRodent
Links:
https://github.com/khoj-ai/khoj
https://khoj.dev/
== More Links! ==
* Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
* Faster Stable Diffusions with the LCM LoRA - https://youtu.be/zrxd95Mxz24
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
* Make A Consistent Character in ANY pose - https://youtu.be/SacK9tMVNUA
Chapters
0:00 Khoj introduction
10:13 Khoj installation
Faces. Most of us have at least 1, and they're great fun to play with. This AI stuff seems to be zooming ahead really quickly, so I figured why not put a bunch of face things together, both old and new, and see how quickly it can generate? Thus, Refacer was born. Powered by SDXL, you can pop in any face and change it to whatever style takes your fancy! Realistic to anime, painting to realistic, cartoon to pixel art - it's all down to your choice of prompt and model :)
Want to support the channel?
https://www.patreon.com/NerdyRodent
GitHub:
https://github.com/nerdyrodent/AVeryComfyNerd
== More Stable Diffusion Stuff! ==
* Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
* Photomaker - https://youtu.be/ZTck128jfFY
* Hyper SDXL - https://youtu.be/LAQYZWbmkwA
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
* Reposer = A Consistent Character In ANY Pose - https://youtu.be/SacK9tMVNUA
Yes. There's nothing new about speaking to an AI and having it speak back, however, when the responses are this fast then things start to become a bit more fun - especially with an AI as unhinged as this one...
dnhkng GlaDOS is free, open-source goodness, meaning you can download and run on your own computer in the comfort of your own home. Everything runs locally, so no API keys to mess around with either. Yay!
NB. This is not a valve product, and yes - they probably need a better name for the project ;)
Want to support the channel?
https://www.patreon.com/NerdyRodent
Links:
https://github.com/dnhkng/GlaDOS
== More Stable Diffusion Stuff! ==
* Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
* Faster Stable Diffusions with the LCM LoRA - https://youtu.be/zrxd95Mxz24
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
* Make A Consistent Character in ANY pose - https://youtu.be/SacK9tMVNUA
== Contents ==
0:00 - dnhkng GlaDOS Intro
4:43 - dnhkng GlaDOS Install
The new Hyper-SD models are FREE and there are THREE ComfyUI workflows to play with! Use the amazing 1-step unet, or speed up existing models by using the LoRAs.
Better than LCM? Take a look for yourself and see!
Want to support the channel?
https://www.patreon.com/NerdyRodent
Links:
https://huggingface.co/ByteDance/Hyper-SD
== More Stable Diffusion Stuff! ==
* Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI for Beginners - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflows for Beginners - https://youtu.be/VM9snsuoqBc
* Faster Stable Diffusions with the LCM LoRA - https://youtu.be/zrxd95Mxz24
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
* Make A Consistent Character in ANY pose - https://youtu.be/SacK9tMVNUA
Pixart Sigma was released recently, and while the main repo takes a little tweaking to run nicely on 24GB VRAM, ComfyUI comes to the rescue making it easy to run in just 6 GB!
Like with ELLA, T5 encodings replace CLIP leading to increased prompt adherence. Inthis video I show you how to get going for FREE in ComfyUI, and compare SDXL vs Pixart Sigma.
All this without one of those SD3 “no commercial use” licenses!
Want to support the channel? Get workflows and more!
https://www.patreon.com/NerdyRodent
Links
https://github.com/PixArt-alpha/PixArt-sigma
https://github.com/city96/ComfyUI_ExtraModels
https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
https://github.com/PixArt-alpha/PixArt-alpha/blob/master/asset/docs/pixart_comfyui.md
https://huggingface.co/PixArt-alpha/PixArt-Sigma/blob/main/PixArt-Sigma-XL-2-1024-MS.pth
== More Stable Diffusion Stuff! ==
* Installing Anaconda for MS Windows Beginners - https://youtu.be/OjOn0Q_U8cY
* Installing ComfyUI - https://youtu.be/2r3uM_b3zA8
* ComfyUI Workflow Creation Essentials For Beginners - https://youtu.be/VM9snsuoqBc
* Faster Stable Diffusions with the LCM LoRA - https://youtu.be/zrxd95Mxz24
* Make an Animated, Talking Avatar - https://youtu.be/Z7TLukqckR0
* One Image Gets You a Consistent Character in ANY pose - https://youtu.be/SacK9tMVNUA