BiJEPA: Bi-directional Joint Embedding Predictive Architecture for Symmetric Representation Learning
arXiv:2603.00049v1 Announce Type: new Abstract: Self-Supervised Learning (SSL) has shifted from pixel-level reconstruction to latent-space prediction, spearheaded by the Joint Embedding Predictive Architecture (JEPA). While effective, standard JEPA models typically rely on a uni-directional prediction mechanism (e.g., Context $\to$ Target), neglecting the informative signal inherent in the inverse relationship, which can degrade performance. In this work, we propose \textbf{BiJEPA}, a \textit{Bi-Directional Joint Embedding Predictive Architecture} that enforces cycle-consistent predictability between data segments. We address the inherent instability of symmetric prediction (representation explosion) by introducing a critical norm-regularization mechanism on the representation vectors. We evaluate BiJEPA on three distinct modalities: synthetic periodic signals, chaotic Lorenz attractor trajectories, and high-dimensional image data (MNIST). Our results demonstrate that BiJEPA achieves stable convergence without collapse, captures the semantic structure of chaotic systems, and learns robust temporal and spatial representations capable of generation and generalisation, offering a more holistic approach to representation learning.
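The abstract does not spell out the training objective, but the two key ingredients it names — cycle-consistent prediction in both directions and a norm penalty that prevents representation explosion — can be sketched as follows. This is a minimal illustration, not the paper's implementation: the module shapes, the stop-gradient on the prediction targets, and the unit-norm target in the regularizer are all assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class BiJEPASketch(nn.Module):
    """Toy bi-directional predictive objective (illustrative only).

    A shared encoder maps context and target segments to embeddings;
    two predictors map each embedding toward the other, giving a
    symmetric (Context -> Target and Target -> Context) loss.
    """

    def __init__(self, in_dim: int = 16, emb_dim: int = 8):
        super().__init__()
        self.encoder = nn.Linear(in_dim, emb_dim)
        self.fwd = nn.Linear(emb_dim, emb_dim)  # predicts target from context
        self.bwd = nn.Linear(emb_dim, emb_dim)  # predicts context from target

    def loss(self, ctx: torch.Tensor, tgt: torch.Tensor, lam: float = 0.1):
        z_c, z_t = self.encoder(ctx), self.encoder(tgt)
        # Symmetric prediction in latent space; targets are detached
        # (a common stabilization choice, assumed here, not confirmed).
        pred = (F.mse_loss(self.fwd(z_c), z_t.detach())
                + F.mse_loss(self.bwd(z_t), z_c.detach()))
        # Norm regularization: pull embedding norms toward 1 so the
        # symmetric objective cannot be trivially reduced by letting
        # representations grow without bound ("representation explosion").
        norm_reg = ((z_c.norm(dim=-1) - 1.0) ** 2).mean() \
                 + ((z_t.norm(dim=-1) - 1.0) ** 2).mean()
        return pred + lam * norm_reg


# Usage: one gradient step on random "context"/"target" segments.
torch.manual_seed(0)
model = BiJEPASketch()
ctx, tgt = torch.randn(4, 16), torch.randn(4, 16)
total = model.loss(ctx, tgt)
total.backward()
print(float(total))
```

Without the `norm_reg` term, both prediction losses can be driven down while the embedding norms drift, which is the instability the abstract attributes to symmetric prediction; the penalty anchors the scale of the latent space.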