Do LLMs Adhere to Label Definitions? Examining Their Rece...

Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions

arXiv:2509.02452v3 Announce Type: replace-cross Abstract: Do LLMs genuinely incorporate external definitions, or do they primarily rely on their parametric knowledge? To address these questions, we conduct controlled experiments across multiple explanation benchmark datasets (general and domain-specific) and label definition conditions, including expert-curated, LLM-generated, perturbed, and swapped definitions. Our results reveal that while explicit label definitions can enhance accuracy and explainability, their integration into an LLM's task-solving processes is neither guaranteed nor consistent, suggesting reliance on internalized representations in many cases. Models often default to their internal representations, particularly in general tasks, whereas domain-specific tasks benefit more from explicit definitions. These findings underscore the need for a deeper understanding of how LLMs process external knowledge alongside their pre-existing capabilities.

相关推荐

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

CASCADE: LLM-Powered JavaScript Deobfuscator at Google

Characteristic Root Analysis and Regularization for Linear Time Series Forecasting

Mechanistic Indicators of Understanding in Large Language Models

Incentive-Aligned Multi-Source LLM Summaries

A Comprehensive Benchmark for Electrocardiogram Time-Series