tgoop.com/RIMLLab/139
Last Update:
π Compositional Learning Journal Club
Join us this week for an in-depth discussion on Compositional Learning in the context of cutting-edge text-to-image generative models. We will explore recent breakthroughs and challenges, focusing on how these models handle compositional tasks and where improvements can be made.
β
This Week's Presentation:
πΉ Title: InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
πΈ Presenter: Amir Kasaei
π Abstract:
Recent advancements in diffusion models, like Stable Diffusion, have shown impressive image generation capabilities, but ensuring precise alignment with text prompts remains a challenge. This presentation introduces Initial Noise Optimization (InitNO), a method that refines initial noise to improve semantic accuracy in generated images. By evaluating and guiding the noise using cross-attention and self-attention scores, the approach effectively enhances image-prompt alignment, as demonstrated through rigorous experimentation.
π Paper: InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
Session Details:
- π
Date: Sunday
- π Time: 5:00 - 6:00 PM
- π Location: Online at vc.sharif.edu/ch/rohban
We look forward to your participation! βοΈ
BY RIML Lab
Share with your friend now:
tgoop.com/RIMLLab/139