Intrinsic Task Symmetry Drives Generalization in Algorith...

Intrinsic Task Symmetry Drives Generalization in Algorithmic Tasks

arXiv:2603.01968v1 Announce Type: cross Abstract: Grokking, the sudden transition from memorization to generalization, is characterized by the emergence of low-dimensional representations, yet the mechanism underlying this organization remains elusive. We propose that intrinsic task symmetries primarily drive grokking and shape the geometry of the model's representation space. We identify a consistent three-stage training dynamic underlying grokking: (i) memorization, (ii) symmetry acquisition, and (iii) geometric organization. We show that generalization emerges during the symmetry acquisition phase, after which representations reorganize into a structured, task-aligned geometry. We validate this symmetry-driven account across diverse algorithmic domains, including algebraic, structural, and relational reasoning tasks. Building on these findings, we introduce a symmetry-based diagnostic that anticipates the onset of generalization and propose strategies to accelerate it. Together, our results establish intrinsic symmetry as the key factor enabling neural networks to move beyond memorization and achieve robust algorithmic reasoning.

相关推荐

MatRIS: Toward Reliable and Efficient Pretrained Machine Learning Interaction Potentials

Self-Correction Inside the Model: Leveraging Layer Attention to Mitigate Hallucinations in Large Vision Language Models

SesaHand: Enhancing 3D Hand Reconstruction via Controllable Generation with Semantic and Structural Alignment

成都六类人才可享租房补贴，最高补贴比例达100%

ST京蓝：公司股票核查结束，将于3月4日复牌

CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production