Presentation
LGTM: Local-to-Global Text-driven Human Motion Diffusion Model
Description
We introduce LGTM, a novel Local-to-Global pipeline for text-to-motion generation based on a diffusion model. It uses LLMs to decompose a motion description into body-part-level descriptions, encodes each one individually together with the corresponding body-part motion, and then optimizes the whole-body motion with an attention encoder. As a result, the generated motion matches the local semantics of the text well.
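
The local-to-global idea in the description can be illustrated with a toy sketch. This is not the authors' implementation: the body-part names, the element-wise fusion in `encode_part`, and the projection-free attention in `attend` are all assumptions made for illustration, and the LLM decomposition and diffusion denoising steps are omitted entirely.

```python
import math

# Hypothetical part list; the listing does not enumerate the parts LGTM uses.
BODY_PARTS = ["head", "torso", "left_arm", "right_arm", "left_leg", "right_leg"]


def encode_part(text_vec, motion_vec):
    """Local step (assumed): fuse one part's text embedding with that
    part's motion features by element-wise addition."""
    return [t + m for t, m in zip(text_vec, motion_vec)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(tokens):
    """Global step (assumed): single-head scaled dot-product self-attention
    across part tokens, with no learned projections."""
    dim = len(tokens[0])
    mixed = []
    for query in tokens:
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in tokens
        ]
        weights = softmax(scores)
        # Each output token is a weighted average of all part tokens.
        mixed.append(
            [sum(w * tok[i] for w, tok in zip(weights, tokens)) for i in range(dim)]
        )
    return mixed


def local_to_global(part_texts, part_motions):
    """Encode every body part on its own, then let the parts exchange
    information through attention."""
    local = [encode_part(part_texts[p], part_motions[p]) for p in BODY_PARTS]
    return dict(zip(BODY_PARTS, attend(local)))
```

Calling `local_to_global` with two dictionaries that map each part name to an equal-length list of floats returns one mixed vector per part. In a real model the fusion and attention would use learned weights; the sketch only shows the order of operations, per-part encoding first and whole-body mixing second.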

Event Type
Technical Paper
Time
Wednesday, 31 July 2024, 10:45am - 10:55am MDT
Location
Mile High 4
ACM Digital Library
Journal Papers' PDFs
Conference Papers' PDFs
Session Time & Location
Sunday, 28 July 2024, 6:00pm - 8:45pm MDT, Bluebird Ballroom
Wednesday, 31 July 2024, 10:45am - 12:15pm MDT, Mile High 4
Research & Education
Livestreamed
Recorded
Animation
Full Conference
Full Conference Supporter
Virtual Access
Exhibitor Full Conference
Wednesday