AlignVAR improves image super-resolution with visual autoregression, 10x faster than diffusion models
Researchers propose AlignVAR, a visual autoregressive framework for image super-resolution that addresses critical consistency problems in existing VAR models. The approach combines spatial consistency autoregression and hierarchical consistency constraints to achieve 10x faster inference with 50% fewer parameters than leading diffusion-based methods.