Re4: Scientific Computing Agent with Rewriting, Resolutio...

Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision

arXiv:2508.20729v2 Announce Type: replace Abstract: Large language models (LLMs) serve as an active and promising field of generative artificial intelligence and have demonstrated abilities to perform complex tasks in multiple domains, including mathematical and scientific reasoning. In this work, we construct a novel agent framework for solving representative problems in scientific computing. The proposed agent, incorporating a "rewriting-resolution-review-revision" logical chain via three reasoning LLMs (functioning as the Consultant, Reviewer, and Programmer, respectively), is integrated in a collaborative and interactive manner. The Consultant module endows the agent with knowledge transfer capabilities to link problems to professional domain insights, thereby rewriting problem descriptions through text augmentation. The Programmer module is responsible for generating and executing well-structured code to deliver the problem resolution. The Reviewer module equips the agent with the capacity for self-debugging and self-refinement through interactive feedback with code runtime outputs. By leveraging the end-to-end review mechanism, the executable code provided by the Programmer attains the iterative revision. A comprehensive evaluation is conducted on the performance of the proposed agent framework in solving partial differential equations (PDEs), ill-conditioned linear systems, and data-driven physical analysis problems. Compared to single-model, this collaborative framework significantly improves the bug-free code generation rate and reduces the occurrence of non-physical solutions, thereby establishing a highly reliable framework for autonomous code generation based on natural language descriptions. The review mechanism improved the average execution success rate of the modern reasoning models. Our code is available at https://github.com/ChengAo21/Re4_Sci_Agent

相关推荐

A Message Passing Realization of Expected Free Energy Minimization

Re4: Scientific Computing Agent with Rewriting, Resolution, Review and Revision

Direct low-field MRI super-resolution using undersampled k-space

Act Like a Pathologist: Tissue-Aware Whole Slide Image Reasoning

Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning

Collab-REC: An LLM-based Agentic Framework for Balancing Recommendations in Tourism