Presentation
Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion
Session
Consistent Text-to-Image
Description
Our method generates Streetscapes, long sequences of views through a synthesized city-scale scene. We build on video diffusion models, but use them in an autoregressive framework that scales easily to long camera trajectories. We train our system on Google Street View data, which allows us to control generation by scene layout and camera pose.
![](/wp-content/linklings_snippets/representative_images/g9RLQfQ6jwNiw6GE.jpg)
Event Type
Technical Paper
Time
Monday, 29 July 2024, 2:30pm - 2:40pm MDT
Location
Mile High 1
ACM Digital Library
Journal Papers' PDFs
Conference Papers' PDFs
Session Time & Location
Research & Education
Livestreamed
Recorded
AI
Machine Learning
Rendering
Full Conference
Full Conference Supporter
Virtual Access
Exhibitor Full Conference
Monday