Artificial Intelligence · 4 min read · April 17, 2026

Creo: Staged Image Generation Restores User Control

Multi-stage text-to-image system scaffolds creation from sketch to final output, letting users lock decisions and avoid premature commitment.

Source: arxiv/cs.AI · Zoe De Simone, Angie Boggust, Fredo Durand, Ashia Wilson, Arvind Satyanarayan

Creo breaks text-to-image generation into progressive stages with intermediate abstractions, enabling users to edit incrementally and maintain creative flexibility.

  • One-shot text-to-image (T2I) systems bake in fine-grained details from the first output, committing users to decisions before they are ready to make them.
  • Creo progresses from rough sketches through intermediate stages to high-resolution final images.
  • Users can edit at each stage and lock regions to prevent unintended changes in later edits.
  • Edits are applied as diffs rather than by regenerating the full image, which reduces visual drift as resolution increases across stages.
  • Study participants reported stronger ownership and traced their decisions through the build-up process.
  • Embedding analysis shows Creo outputs exhibit greater diversity than one-shot baseline results.
  • Locking mechanism preserves prior decisions, allowing targeted edits to specific regions or attributes.
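
The embedding-diversity point above can be made concrete with a small sketch. The summary does not say which embedding model or diversity metric the authors used, so this assumes one common choice, mean pairwise cosine distance over a set of output embeddings. The function names and the toy vectors are hypothetical and are not taken from the paper.

```python
import math
from itertools import combinations
from typing import List, Sequence


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def mean_pairwise_distance(embeddings: List[Sequence[float]]) -> float:
    """Average cosine distance over all unordered pairs (assumed metric)."""
    pairs = list(combinations(embeddings, 2))
    if not pairs:
        raise ValueError("need at least two embeddings")
    return sum(cosine_distance(a, b) for a, b in pairs) / len(pairs)


# Toy embeddings, made up for illustration only.
clustered = [[1.0, 0.0], [1.0, 0.0]]  # identical outputs
spread = [[1.0, 0.0], [0.0, 1.0]]     # orthogonal outputs

clustered_score = mean_pairwise_distance(clustered)
spread_score = mean_pairwise_distance(spread)
```

Under this assumed metric, a higher score means the outputs sit farther apart in embedding space, which is one way to read "greater diversity".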

Frequently asked questions

  • Creo uses a locking mechanism that preserves decisions from earlier stages. When you lock a region or attribute, edits in subsequent stages affect only unlocked areas. Instead of regenerating the entire image, the system applies targeted diffs, reducing visual drift and keeping prior decisions intact.
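
As a rough illustration of locking combined with targeted diffs, the sketch below applies a per-pixel diff only where a lock mask is off. It is a minimal model under stated assumptions, not Creo's implementation: the grayscale grid, the additive diff, and the name `apply_diff` are all hypothetical, and the actual system presumably works on generated image stages rather than on raw pixel lists.

```python
from typing import List

Image = List[List[int]]  # hypothetical stand-in: grayscale pixel grid
Mask = List[List[bool]]  # True means the pixel is locked


def apply_diff(image: Image, diff: Image, locked: Mask) -> Image:
    """Return a new image with the diff added only to unlocked pixels."""
    return [
        [
            pixel if is_locked else pixel + delta
            for pixel, delta, is_locked in zip(img_row, diff_row, lock_row)
        ]
        for img_row, diff_row, lock_row in zip(image, diff, locked)
    ]


image = [[10, 10], [10, 10]]
diff = [[5, 5], [5, 5]]
locked = [[True, False], [False, True]]

# Locked pixels keep their earlier values; unlocked pixels take the edit.
result = apply_diff(image, diff, locked)
```

Because the function returns a new grid and leaves the input untouched, the earlier stage remains available for comparison or rollback.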
