Hunyuan-DiT Released V1.2, Caption model, 6GB GPU VRAM Inference scripts

submitted by /u/Cheap_Fan_7827

Gen-3 Alpha Text to Video is Now Available to Everyone

Runway has launched Gen-3 Alpha, a powerful text-to-video AI model now generally available. Previously, it was only accessible to partners and testers. This tool allows users to generate high-fidelity videos from text prompts with remarkable detail and control. Gen-3 Alpha offers improved quality and realism compared to recent competitors Luma and Kling. It's designed for artists and creators, enabling them to explore novel concepts and scenarios.

  • Text to Video (released), Image to Video and Video to Video (coming soon)
  • Offers fine-grained temporal control for complex scene changes and transitions
  • Trained on a new infrastructure for large-scale multimodal learning
  • Major improvement in fidelity, consistency, and motion
  • Paid plans are currently prioritized. Free limited access should be available later.
  • RunwayML historically co-created Stable Diffusion and released SD 1.5.

Source: X - RunwayML

PS: If you enjoyed this post, you'll love the free newsletter. Short daily summaries of the best AI news and insights from 300+ media sources, to save time and stay ahead.

https://reddit.com/link/1dt561j/video/6u4d2xhiaz9d1/player

submitted by /u/Altruistic_Gibbon907

What tool to keep images organized?

I was wondering what tools you guys are using (if any) to keep your AI image library organised.

I saw https://breadboard.me/ and https://github.com/RupertAvery/DiffusionToolkit, but the first one lacks a tree-like folder/album structure, and the second one has albums but doesn't seem to support a multi-level folder structure.

What I want is one folder per project, then a subfolder for each stage of the project. What I call a project is "a set of related images with a consistent character".

Basically something like this:

Project Name -- main folder of the project
-ideas -- folder to mess with ideas
-to fix -- composition needs fixes
-low res -- composition is ok but in low res
-upscaled -- final high quality images

At the moment I'm using https://github.com/zanllp/sd-webui-infinite-image-browsing, but it's not ideal: its folders are just references to the file system folders, so you have to create the folders and move the files manually in the file system, which is a bit slow.
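Until a tool handles this natively, the manual folder creation and file moving could be scripted. The following is only a minimal sketch under stated assumptions: the stage names come from the layout above, while `create_project` and `promote` are hypothetical helper names made up for illustration and are not part of any of the tools mentioned.

```python
from pathlib import Path
import shutil

# Stage folders, copied from the layout described in the post.
STAGES = ("ideas", "to fix", "low res", "upscaled")


def create_project(root: Path, name: str) -> Path:
    """Create <root>/<name> with one subfolder per stage (hypothetical helper).

    Safe to run again on an existing project: existing folders are kept.
    """
    project = root / name
    for stage in STAGES:
        (project / stage).mkdir(parents=True, exist_ok=True)
    return project


def promote(image: Path, project: Path, stage: str) -> Path:
    """Move an image into another stage folder of the project (hypothetical helper)."""
    if stage not in STAGES:
        raise ValueError(f"unknown stage: {stage!r}")
    target = project / stage / image.name
    if target.exists():
        # Refuse to overwrite an image that is already in the target stage.
        raise FileExistsError(target)
    shutil.move(str(image), str(target))
    return target
```

For example, `create_project(Path("library"), "Project Name")` would build the four stage folders, and `promote(some_image, project, "upscaled")` would move a finished image into the last stage. Because the files stay in ordinary file system folders, a browser that just references those folders should still see them.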

Does anyone have a tool recommendation for this kind of workflow, or even just remarks and advice to improve it? I'm still very new to SD, so all advice is very welcome!

submitted by /u/raphh

SDXL/Pony models focused on extremely believable selfie/phone camera shots, NON-PROFESSIONAL

It seems that all the models I've tried (realisticvision, juggernaut, etc.) can make realistic images, but they all look "too fake" and professional, if that even makes sense. Are there realistic models out there finetuned on selfies, webcam shots, low-quality phone shots, etc.? Something an old iPhone 6 would shoot, or even older, I don't know...

submitted by /u/Relative_Bit_7250

Forge vs Invoke - Which is Better for Remote Phone use?

So my usual preference for SD is ComfyUI, but it’s kind of tough to use on my phone. So I tried A1111, but I find that the Gradio link crashes within 1 to 4 hours. I’ve played with Invoke a little bit locally, but had trouble getting it to share models nicely.

I’d like to find a UI that works really well on a smaller screen, has a solid web-use framework, and - if possible - allows for multiple denoising stages.

Thanks for any advice.

submitted by /u/CrypticTryptic