tgoop.com/RIMLLab/137
Last Update:
๐ Compositional Learning Journal Club
Join us this week for an in-depth discussion on Compositional Learning in the context of cutting-edge text-to-image generative models. We will explore recent breakthroughs and challenges, focusing on how these models handle compositional tasks and where improvements can be made.
โ
This Week's Presentation:
๐น Title: A semiotic methodology for assessing the compositional effectiveness of generative text-to-image models
๐ธ Presenter: Amir Kasaei
๐ Abstract:
A new methodology for evaluating text-to-image generation models is being proposed, addressing limitations in current evaluation techniques. Existing methods, which use metrics such as fidelity and CLIPScore, often combine criteria like position, action, and photorealism in their assessments. This new approach adapts model analysis from visual semiotics, establishing distinct visual composition criteria. It highlights three key dimensions: plastic categories, multimodal translation, and enunciation, each with specific sub-criteria. The methodology is tested on Midjourney and DALLยทE, providing a structured framework that can be used for future quantitative analyses of generated images.
๐ Paper: A semiotic methodology for assessing the compositional effectiveness of generative text-to-image models
Session Details:
- ๐
Date: Sunday
- ๐ Time: 5:00 - 6:00 PM
- ๐ Location: Online at vc.sharif.edu/ch/rohban
We look forward to your participation! โ๏ธ
BY RIML Lab
Share with your friend now:
tgoop.com/RIMLLab/137