Analyzing and Improving Fast Sampling of Text-to-Image Diffusion Models

arXiv:2603.00763v1 Announce Type: new

Abstract: Text-to-image diffusion models have achieved unprecedented success but still struggle to produce high-quality results under limited sampling budgets. Existing training-free sampling acceleration methods are typically developed independently, leaving the overall performance and compatibility among these methods unexplored. In this paper, we bridge this gap by systematically elucidating the design space, and our comprehensive experiments identify the sampling time schedule as the most pivotal factor. Inspired by the geometric properties of diffusion models revealed through the Frenet-Serret formulas, we propose the constant total rotation schedule (TORS), a scheduling strategy that ensures uniform geometric variation along the sampling trajectory. TORS outperforms previous training-free acceleration methods and produces high-quality images with 10 sampling steps on Flux.1-Dev and Stable Diffusion 3.5. Extensive experiments underscore the adaptability of our method to unseen models, hyperparameters, and downstream applications.
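The abstract does not spell out how TORS is computed, but the core idea of "uniform geometric variation along the sampling trajectory" can be illustrated with a small sketch. Assuming one has a dense reference trajectory (e.g., from a many-step run), a per-step rotation can be measured as the angle between consecutive tangent directions, and the few sampling steps can then be placed at equal increments of cumulative rotation. The function name and the circle-based example below are hypothetical and not from the paper:

```python
import numpy as np

def constant_rotation_schedule(traj, n_steps):
    """Illustrative sketch: pick n_steps indices into a dense reference
    trajectory so that cumulative rotation (angle between consecutive
    tangent directions) grows by equal increments between chosen steps.

    traj: (T, D) array of states along a dense trajectory.
    Returns a sorted array of unique indices into the cumulative-rotation
    grid (length T-1)."""
    tangents = np.diff(traj, axis=0)                      # (T-1, D) step directions
    tangents /= np.linalg.norm(tangents, axis=1, keepdims=True)
    # Turning angle between each pair of consecutive tangents (Frenet-style proxy).
    cos = np.clip(np.sum(tangents[:-1] * tangents[1:], axis=1), -1.0, 1.0)
    angles = np.arccos(cos)                               # (T-2,) per-step rotation
    cum = np.concatenate([[0.0], np.cumsum(angles)])      # cumulative rotation, (T-1,)
    # Invert the cumulative-rotation curve at equally spaced targets.
    targets = np.linspace(0.0, cum[-1], n_steps)
    idx = np.clip(np.searchsorted(cum, targets), 0, len(cum) - 1)
    return np.unique(idx)

# Sanity check on a half-circle, where rotation per step is constant,
# so a constant-rotation schedule should space the steps uniformly.
theta = np.linspace(0.0, np.pi, 101)
traj = np.stack([np.cos(theta), np.sin(theta)], axis=1)
steps = constant_rotation_schedule(traj, 10)
```

On a trajectory with constant curvature the schedule degenerates to uniform spacing; the interesting case is a real diffusion trajectory, where rotation concentrates in some regions and the schedule allocates more steps there.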