Code Aesthetics with Agentic Reward Feedback

Xiao, Bang; Jiang, Lingjie; Huang, Shaohan; Lv, Tengchao; Huang, Yupan; Wu, Xun; Cui, Lei; Wei, Furu


arXiv:2510.23272 (cs)

[Submitted on 27 Oct 2025]



Abstract: Large Language Models (LLMs) have become valuable assistants for developers in code-related tasks. While LLMs excel at traditional programming tasks such as code generation and bug fixing, they struggle with visually oriented coding tasks, often producing suboptimal aesthetics. In this paper, we introduce a new pipeline to enhance the aesthetic quality of LLM-generated code. We first construct AesCode-358K, a large-scale instruction-tuning dataset focused on code aesthetics. Next, we propose agentic reward feedback, a multi-agent system that evaluates executability, static aesthetics, and interactive aesthetics. Building on this, we develop GRPO-AR, which integrates these signals into the GRPO algorithm for joint optimization of functionality and code aesthetics. Finally, we develop OpenDesign, a benchmark for assessing code aesthetics. Experimental results show that combining supervised fine-tuning on AesCode-358K with reinforcement learning using agentic reward feedback significantly improves performance on OpenDesign and also enhances results on existing benchmarks such as PandasPlotBench. Notably, our AesCoder-4B surpasses GPT-4o and GPT-4.1, and achieves performance comparable to large open-source models with 480B-685B parameters, underscoring the effectiveness of our approach.
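To make the idea of feeding several agent signals into GRPO concrete, here is a minimal sketch. It is not the paper's implementation: the abstract does not say how the three signals are combined, so the weighted sum, the weight names, and the `AgentScores` container are all hypothetical. The only standard part is the group-relative advantage, which in GRPO normalizes each sampled response's reward by the mean and standard deviation of its group.

```python
from dataclasses import dataclass
from statistics import mean, pstdev


@dataclass
class AgentScores:
    """Hypothetical container for the three signals named in the abstract."""
    executability: float           # e.g. 1.0 if the code runs, else 0.0 (assumed scale)
    static_aesthetics: float       # assumed to lie in [0, 1]
    interactive_aesthetics: float  # assumed to lie in [0, 1]


def combined_reward(s: AgentScores,
                    w_exec: float = 1.0,
                    w_static: float = 1.0,
                    w_inter: float = 1.0) -> float:
    """Assumed aggregation: a plain weighted sum of the three signals."""
    return (w_exec * s.executability
            + w_static * s.static_aesthetics
            + w_inter * s.interactive_aesthetics)


def group_relative_advantages(rewards: list[float], eps: float = 1e-8) -> list[float]:
    """GRPO-style advantage: standardize each reward within its sampled group."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std over the group
    return [(r - mu) / (sigma + eps) for r in rewards]


# One prompt, three sampled responses scored by the (hypothetical) agents.
group = [
    AgentScores(1.0, 0.8, 0.6),
    AgentScores(1.0, 0.4, 0.2),
    AgentScores(0.0, 0.0, 0.0),  # code failed to execute
]
rewards = [combined_reward(s) for s in group]
advantages = group_relative_advantages(rewards)
```

Under these assumptions, the response that runs and scores best on both aesthetic signals receives the largest positive advantage, while the non-executable one is pushed down relative to the rest of its group.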

Comments:	30 pages, 7 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2510.23272 [cs.CL]
	(or arXiv:2510.23272v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2510.23272

Submission history

From: Bang Xiao
[v1] Mon, 27 Oct 2025 12:32:33 UTC (9,358 KB)
