Vista Normal

Hay nuevos artículos disponibles. Pincha para refrescar la página.
AnteayerIA

This week in Stable Diffusion - all the major developments in a nutshell

1 Octubre 2024 at 09:22
  • Interesting find of the week: Kat, an engineer who built a tool to visualize time-based media with gestures.
  • Flux updates:
    • Outpainting: ControlNet Outpainting using FLUX.1 Dev in ComfyUI demonstrated, with workflows provided for implementation.
    • Fine-tuning: Flux fine-tuning can now be performed with 10GB of VRAM, making it more accessible to users with mid-range GPUs.
    • Quantized model: Flux-Dev-Q5_1.gguf quantized model significantly improves performance on GPUs with 12GB VRAM, such as the NVIDIA RTX 3060.
    • New Controlnet models: New depth, upscaler, and surface normals models released for image enhancement in Flux.
    • CLIP and Long-CLIP models: Fine-tuned versions of CLIP-L and Long-CLIP models now fully integrated with the HuggingFace Diffusers pipeline.
  • James Cameron joins Stability.AI: Renowned filmmaker James Cameron has joined Stability AI's Board of Directors, bringing his expertise in merging cutting-edge technology with storytelling to the AI company.
  • Put This On Your Radar:
    • MIMO: Controllable character video synthesis model for creating realistic character videos with controllable attributes.
    • Google's Zero-Shot Voice Cloning: New technique that can clone voices using just a few seconds of audio sample.
    • Leonardo AI's Image Upscaling Tool: New high-definition image enlargement feature rivaling existing tools like Magnific.
    • PortraitGen: AI portrait video editing tool enabling multi-modal portrait editing, including text-based and image-based effects.
    • FaceFusion 3.0.0: Advanced face swapping and editing tool with new features like "Pixel Boost" and face editor.
    • CogVideoX-I2V Workflow Update: Improved image-to-video generation in ComfyUI with better output quality and efficiency.
    • Ctrl-X: New tool for image generation with structure and appearance control, without requiring additional training or guidance.
    • Invoke AI 5.0: Major update to open-source image generation tool with new features like Control Canvas and Flux model support.
    • JoyCaption: Free and open uncensored vision-language model (Alpha One Release) for training diffusion models.
    • ComfyUI-Roboflow: Custom node for image analysis in ComfyUI, integrating Roboflow's capabilities.
    • Tiled Diffusion with ControlNet Upscaling: Workflow for generating high-resolution images with fine control over details in ComfyUI.
    • 2VEdit: Video editing tool that transforms entire videos by editing just the first frame.
    • Flux LoRA showcase: New FLUX LoRA models including Simple Vector Flux, How2Draw, Coloring Book, Amateur Photography v5, Retro Comic Book, and RealFlux 1.0b.

📰 Full newsletter with relevant links, context, and visuals available in the original document.

🔔 If you're having a hard time keeping up in this domain - consider subscribing. We send out our newsletter every Sunday.

submitted by /u/OkSpot3819
[link] [comments]

This week in SD - all the major developments in a nutshell

24 Septiembre 2024 at 09:15
  • Interesting find of the week: Sougwen Chung, a Chinese-Canadian artist pioneering human-machine collaboration in art. Her "Assembly Lines" project features robotic art assistants that sync with her brainwaves to create paintings together.
  • Flux updates:
    • GPU compatibility: Successful generation on AMD GPU (RX 6600 XT), overcoming compatibility issues using Zluda.
    • CFG improvements: Support for negative prompting and values >1 without image degradation, based on PuLID team's work.
    • Consistent character frames: Technique using Flux and ControlNet for generating multiple consistent frames.
    • LoRA and DoRA training: Insights on training models using OneTrainer with Flux.1 architecture, including detailed configuration settings.
    • ComfyUI Flux pipeline: Clean and organized workflow for Stable Diffusion image generation using Flux.
    • Seamless outpainting: New workflow for precise background and human feature outpainting using Flux models in ComfyUI.
  • Lions Gate x Runway: Lionsgate partners with AI firm Runway to develop exclusive AI models based on its film and TV library, focusing on integrating AI into pre- and post-production workflows.
  • EA x AI: Electronic Arts positions AI as core to its business strategy, with over 100 AI projects in development across efficiency, expansion, and transformation areas.
  • Put This On Your Radar:
    • Tripo 3D (Version 2.0): Text-to-3D model generation tool releases version 2.0 with significantly improved mesh quality.
    • CogStudio: Advanced web interface for AI video generation based on the CogVideo model.
    • OmniGen: New unified multimodal AI model combining text and image generation capabilities.
    • Differential diffusion technique for AnimateDiff: Technique for creating more stable backgrounds in AI-generated videos.
    • Pony and non-pony AI model merging technique: New method for merging specialized AI models to expand capabilities.
    • Image and sound generation workflow: Workflow for generating both images and corresponding sound effects from a single prompt using Stable Diffusion and Stable Audio.
    • CogVideoX-5B: Open-source image-to-video model weights released for generating short video clips from input images.
    • CogVideoX-Fun: Open-source text/image/video-to-video model by Alibaba PAI with enhanced video generation capabilities.
    • ComfyUI workflow for replacing video backgrounds with Flux model: Workflow demonstrating how to replace backgrounds in videos using the Flux model.
    • Multi-face swap workflow for ComfyUI: Workflow for swapping multiple faces in a single image with customizable options.
    • Audio reactive particle simulator in ComfyUI: Workflow demonstrating an audio-reactive particle simulation system for creating visually dynamic content.
    • KLING 1.5: Update to KLING with motion control and general improvements.
  • Flux LoRA showcase: New FLUX LoRA models including Miniature People, Omegle Webcam, Gesture Drawing, Jigsaw, and SameFace Fix.

📰 Full newsletter with relevant links, context, and visuals.

🔔 If you're having a hard time keeping up in this domain - considering subscribing. We send out our newsletter every Sunday.

submitted by /u/OkSpot3819
[link] [comments]
❌
❌