Wasserstein Distances Made Explainable: Insights Into Dataset Shifts and Transport Phenomena

arXiv:2505.06123v2

Abstract: Wasserstein distances provide a powerful framework for comparing data distributions. They can be used to analyze processes over time or to detect inhomogeneities within data. However, simply calculating the Wasserstein distance or analyzing the corresponding transport plan (or coupling) may not be sufficient for understanding what factors contribute to a high or low Wasserstein distance. In this work, we propose a novel solution based on Explainable AI that allows us to efficiently and accurately attribute Wasserstein distances to various data components, including data subgroups, input features, or interpretable subspaces. Our method achieves high accuracy across diverse datasets and Wasserstein distance specifications, and its practical utility is demonstrated in three use cases.
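To make the idea of attributing a Wasserstein distance to input features concrete, here is a minimal sketch. It is not the method proposed in the paper, only a naive baseline under stated assumptions: two equal-size samples with uniform weights, a squared Euclidean ground cost, and a brute-force search over all pairings that is only feasible for a handful of points. Because the squared Euclidean cost is a sum over features, the squared 2-Wasserstein distance under the optimal pairing splits into one term per feature. The function name `w2_squared_with_attribution` and the toy data are hypothetical.

```python
from itertools import permutations


def w2_squared_with_attribution(xs, ys):
    """Squared 2-Wasserstein distance between two equal-size, uniformly
    weighted samples, plus its additive split over features.

    Brute force over all pairings: illustration only, tiny samples only.
    """
    n, d = len(xs), len(xs[0])
    assert len(ys) == n, "this sketch assumes equal sample sizes"

    def cost(perm):
        # Total squared Euclidean cost of matching xs[i] with ys[perm[i]].
        return sum(
            (xs[i][k] - ys[j][k]) ** 2
            for i, j in enumerate(perm)
            for k in range(d)
        )

    # For uniform weights and equal sizes, an optimal coupling is a permutation.
    plan = min(permutations(range(n)), key=cost)

    # The cost is a sum over features, so each feature gets its own share.
    per_feature = [
        sum((xs[i][k] - ys[j][k]) ** 2 for i, j in enumerate(plan)) / n
        for k in range(d)
    ]
    return sum(per_feature), per_feature, plan


# Toy data (hypothetical): the target is the source shifted by 2 along feature 0.
source = [(0, 0), (1, 0), (0, 1), (1, 1)]
target = [(2, 0), (3, 0), (2, 1), (3, 1)]

total, per_feature, plan = w2_squared_with_attribution(source, target)
print(total, per_feature, plan)
```

In this toy case the whole distance is assigned to feature 0, the only feature that was shifted. A split like this depends on the transport plan and on the cost being additive across features, which is one reason more general attribution approaches such as the one the abstract describes are of interest.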
