Artificial Intelligence · 4 min read · April 17, 2026
Creo: Staged Image Generation Restores User Control
Multi-stage text-to-image system scaffolds creation from sketch to final output, letting users lock decisions and avoid premature commitment.
Source: arxiv/cs.AI · Zoe De Simone, Angie Boggust, Fredo Durand, Ashia Wilson, Arvind Satyanarayan
Creo breaks text-to-image generation into progressive stages with intermediate abstractions, enabling users to edit incrementally and maintain creative flexibility.
- One-shot T2I systems embed fine-grained details up front, committing users to decisions before they are ready to make them.
- Creo progresses from rough sketches through intermediate stages to high-resolution final images.
- Users can edit at each stage and lock regions to prevent unintended changes in later edits.
- Diffs replace full regeneration, reducing visual drift as resolution increases across stages.
- Study participants reported a stronger sense of ownership and could trace their decisions through the build-up process.
- Embedding analysis shows that Creo outputs are more diverse than one-shot baseline results.
- The locking mechanism preserves prior decisions, allowing targeted edits to specific regions or attributes.
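The summary does not say how the embedding analysis measures diversity. One common way to quantify it, shown here purely as an illustrative assumption rather than the authors' method, is the mean pairwise cosine distance over a set of image embeddings: the higher the value, the more the outputs differ from one another. The function names and toy vectors below are hypothetical.

```python
import math
from itertools import combinations


def cosine_distance(a, b):
    """Cosine distance (1 - cosine similarity) between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def mean_pairwise_distance(embeddings):
    """Average cosine distance over all unordered pairs of embeddings."""
    pairs = list(combinations(embeddings, 2))
    if not pairs:
        return 0.0  # fewer than two embeddings: no diversity to measure
    return sum(cosine_distance(a, b) for a, b in pairs) / len(pairs)


# Toy 2-D "embeddings" (assumed data, not results from the study).
tight = [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0]]    # identical outputs
spread = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]  # varied outputs

tight_score = mean_pairwise_distance(tight)
spread_score = mean_pairwise_distance(spread)
```

Under this assumed metric, a set of near-identical images scores close to zero, while a varied set scores higher.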
Frequently asked questions
- Creo uses a locking mechanism that preserves decisions from earlier stages. When you lock a region or attribute, edits in subsequent stages affect only unlocked areas. Instead of regenerating the entire image, the system applies targeted diffs, reducing visual drift and keeping prior decisions intact.
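The locking behavior described above can be sketched as a masked diff. This is a minimal illustration under stated assumptions, not Creo's implementation: the image is a small grid of integers, a diff is a mapping from pixel coordinates to new values, and `apply_diff` is a hypothetical helper that skips any locked pixel.

```python
def apply_diff(image, diff, locked):
    """Return a copy of `image` with `diff` applied only to unlocked pixels.

    image  : list of rows, each a list of pixel values
    diff   : dict mapping (row, col) -> new pixel value
    locked : list of rows of booleans; True means the pixel must not change
    """
    out = [row[:] for row in image]  # copy so the earlier stage stays intact
    for (r, c), value in diff.items():
        if not locked[r][c]:
            out[r][c] = value
    return out


# Hypothetical 2x3 image with the top-left pixel locked.
image = [[0, 0, 0],
         [0, 0, 0]]
locked = [[True, False, False],
          [False, False, False]]

# A later-stage edit proposes three changes, one of them on the locked pixel.
diff = {(0, 0): 9, (0, 1): 5, (1, 2): 7}

result = apply_diff(image, diff, locked)
```

Because only the pixels named in the diff are touched, and locked pixels are skipped even then, everything else carries over unchanged from the previous stage, which is the intuition behind reduced visual drift.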