Presentation
Separate-and-Enhance: Compositional Finetuning for Text-to-image Diffusion Models
Description
This work aims to improve the compositional capability of text-to-image models. Unlike previous approaches that require heavy test-time adaptation per prompt, we propose a compositional finetuning framework with two novel objectives. Through comprehensive evaluations, our model demonstrates superior performance in image realism, text-image alignment, and adaptability to novel concepts.

Event Type
Technical Paper
Time
Thursday, 1 August 2024, 9:30am - 9:40am MDT
Location
Mile High 1
ACM Digital Library
Journal Papers' PDFs
Conference Papers' PDFs
Session Time & Location
Sunday, 28 July 2024, 6:00pm - 8:45pm MDT, Bluebird Ballroom
Thursday, 1 August 2024, 9:00am - 10:30am MDT, Mile High 1
Research & Education
Livestreamed
Recorded
Full Conference
Full Conference Supporter
Virtual Access
Exhibitor Full Conference