


Shaohui Peng
Publications
- 2026
[c24]Jiaming Guo, Rui Zhang, Zerun Li, Yunkai Gao, Shaohui Peng, Siming Lan, Xing Hu, Zidong Du, Xishan Zhang, Ling Li:
Efficient Diffusion Planning with Temporal Diffusion. AAAI 2026: 21450-21458
[c23]Xinguo Zhu, Shaohui Peng, Jiaming Guo, Yunji Chen, Qi Guo, Yuanbo Wen, Hang Qin, Ruizhi Chen, Qirui Zhou, Ke Gao, Yanjun Wu, Chen Zhao, Ling Li:
QiMeng-Kernel: Macro-Thinking Micro-Coding Paradigm for LLM-Based High-Performance GPU Kernel Generation. AAAI 2026: 29168-29176
- 2025
[j4]Yansong Pan, Rui Zhang, Jiaming Guo, Shaohui Peng, Fan Wu, Kaizhao Yuan, Yunkai Gao, Siming Lan, Ruizhi Chen, Ling Li, Xing Hu, Zidong Du, Zihao Zhang, Xin Zhang, Wei Li, Qi Guo, Yunji Chen:
Morphology generalizable reinforcement learning via multi-level graph features. Neurocomputing 631: 129644 (2025)
[c22]Qirui Zhou, Shaohui Peng, Weiqiang Xiong, Haixin Chen, Yuanbo Wen, Haochen Li, Ling Li, Qi Guo, Yongwei Zhao, Ke Gao, Ruizhi Chen, Yanjun Wu, Zhao Chen, Yunji Chen:
QiMeng-Attention: SOTA Attention Operator is generated by SOTA Attention Algorithm. ACL (Findings) 2025: 8491-8505
[c21]Haochen Li, Rui Zhang, Hantao Yao, Xin Zhang, Yifan Hao, Xinkai Song, Shaohui Peng, Yongwei Zhao, Chen Zhao, Yanjun Wu, Ling Li:
SEEN-DA: SEmantic ENtropy guided Domain-aware Attention for Domain Adaptive Object Detection. CVPR 2025: 25465-25475
[c20]Xuzhi Zhang, Shaohui Peng, Qirui Zhou, Yuanbo Wen, Qi Guo, Ruizhi Chen, Xinguo Zhu, Weiqiang Xiong, Haixin Chen, Congying Ma, Ke Gao, Chen Zhao, Yanjun Wu, Yunji Chen, Ling Li:
QiMeng-TensorOp: One-Line Prompt is Enough for High-Performance Tensor Operator Generation with Hardware Primitives. IJCAI 2025: 7038-7046
[i24]Xuzhi Zhang, Shaohui Peng, Qirui Zhou, Yuanbo Wen, Qi Guo, Ruizhi Chen, Xinguo Zhu, Weiqiang Xiong, Haixin Chen, Congying Ma, Ke Gao, Chen Zhao, Yanjun Wu, Yunji Chen, Ling Li:
QiMeng-TensorOp: Automatically Generating High-Performance Tensor Operators with Hardware Primitives. CoRR abs/2505.06302 (2025)
[i23]Rui Zhang, Yuanbo Wen, Shuyao Cheng, Di Huang, Shaohui Peng, Jiaming Guo, Pengwei Jin, Jiacheng Zhao, Tianrui Ma, Yaoyu Zhu, Yifan Hao, Yongwei Zhao, Shengwen Liang, Ying Wang, Xing Hu, Zidong Du, Huimin Cui, Ling Li, Qi Guo, Yunji Chen:
QiMeng: Fully Automated Hardware and Software Design for Processor Chip. CoRR abs/2506.05007 (2025)
[i22]Qirui Zhou, Shaohui Peng, Weiqiang Xiong, Haixin Chen, Yuanbo Wen, Haochen Li, Ling Li, Qi Guo, Yongwei Zhao, Ke Gao, Ruizhi Chen, Yanjun Wu, Chen Zhao, Yunji Chen:
QiMeng-Attention: SOTA Attention Operator is generated by SOTA Attention Algorithm. CoRR abs/2506.12355 (2025)
[i21]Zikang Tian, Shaohui Peng, Du Huang, Jiaming Guo, Ruizhi Chen, Rui Zhang, Xishan Zhang, Yuxuan Guo, Zidong Du, Qi Guo, Ling Li, Yewen Pu, Xing Hu, Yunji Chen:
Code Driven Planning with Domain-Adaptive Critic. CoRR abs/2509.19077 (2025)
[i19]Xinguo Zhu, Shaohui Peng, Jiaming Guo, Yunji Chen, Qi Guo, Yuanbo Wen, Hang Qin, Ruizhi Chen, Qirui Zhou, Ke Gao, Yanjun Wu, Chen Zhao, Ling Li:
QiMeng-Kernel: Macro-Thinking Micro-Coding Paradigm for LLM-Based High-Performance GPU Kernel Generation. CoRR abs/2511.20100 (2025)
[i18]Jiaming Guo, Rui Zhang, Zerun Li, Yunkai Gao, Shaohui Peng, Siming Lan, Xing Hu, Zidong Du, Xishan Zhang, Ling Li:
Efficient Diffusion Planning with Temporal Diffusion. CoRR abs/2511.21054 (2025)
- 2024
[c18]Shaohui Peng, Xing Hu, Qi Yi, Rui Zhang, Jiaming Guo, Di Huang, Zikang Tian, Ruizhi Chen, Zidong Du, Qi Guo, Yunji Chen, Ling Li:
Hypothesis, Verification, and Induction: Grounding Large Language Models with Self-Driven Skill Learning. AAAI 2024: 14599-14607
[c17]Fan Wu, Rui Zhang, Qi Yi, Yunkai Gao, Jiaming Guo, Shaohui Peng, Siming Lan, Husheng Han, Yansong Pan, Kaizhao Yuan, Pengwei Jin, Ruizhi Chen, Yunji Chen, Ling Li:
OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning. AAAI 2024: 15897-15905
[c14]Haihan Gao, Rui Zhang, Qi Yi, Hantao Yao, Haochen Li, Jiaming Guo, Shaohui Peng, Yunkai Gao, QiCheng Wang, Xing Hu, Yuanbo Wen, Zihao Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Prompt-based Visual Alignment for Zero-shot Policy Transfer. ICML 2024: 14954-14968
[i17]Yunpu Zhao, Rui Zhang, Wenyi Li, Di Huang, Jiaming Guo, Shaohui Peng, Yifan Hao, Yuanbo Wen, Xing Hu, Zidong Du, Qi Guo, Ling Li, Yunji Chen:
Assessing and Understanding Creativity in Large Language Models. CoRR abs/2401.12491 (2024)
[i16]Yuxuan Guo, Shaohui Peng, Jiaming Guo, Di Huang, Xishan Zhang, Rui Zhang, Yifan Hao, Ling Li, Zikang Tian, Mingju Gao, Yutai Li, Yiming Gan, Shuai Liang, Zihao Zhang, Zidong Du, Qi Guo, Xing Hu, Yunji Chen:
Luban: Building Open-Ended Creative Agents via Autonomous Embodied Verification. CoRR abs/2405.15414 (2024)
[i15]Haihan Gao, Rui Zhang, Qi Yi, Hantao Yao, Haochen Li, Jiaming Guo, Shaohui Peng, Yunkai Gao, QiCheng Wang, Xing Hu, Yuanbo Wen, Zihao Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Prompt-based Visual Alignment for Zero-shot Policy Transfer. CoRR abs/2406.03250 (2024)
- 2023
[j2]Qi Yi, Rui Zhang, Shaohui Peng, Jiaming Guo, Xing Hu, Zidong Du, Qi Guo, Ruizhi Chen, Ling Li, Yunji Chen:
Learning controllable elements oriented representations for reinforcement learning. Neurocomputing 549: 126455 (2023)
[c13]Shaohui Peng, Xing Hu, Rui Zhang, Jiaming Guo, Qi Yi, Ruizhi Chen, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Conceptual Reinforcement Learning for Language-Conditioned Tasks. AAAI 2023: 9426-9434
[c11]Yunkai Gao, Rui Zhang, Jiaming Guo, Fan Wu, Qi Yi, Shaohui Peng, Siming Lan, Ruizhi Chen, Zidong Du, Xing Hu, Qi Guo, Ling Li, Yunji Chen:
Context Shift Reduction for Offline Meta-Reinforcement Learning. NeurIPS 2023
[c10]Jiaming Guo, Rui Zhang, Shaohui Peng, Qi Yi, Xing Hu, Ruizhi Chen, Zidong Du, Xishan Zhang, Ling Li, Qi Guo, Yunji Chen:
Efficient Symbolic Policy Learning with Differentiable Symbolic Expression. NeurIPS 2023
[c7]Siming Lan, Rui Zhang, Qi Yi, Jiaming Guo, Shaohui Peng, Yunkai Gao, Fan Wu, Ruizhi Chen, Zidong Du, Xing Hu, Xishan Zhang, Ling Li, Yunji Chen:
Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning. NeurIPS 2023
[c6]Zikang Tian, Ruizhi Chen, Xing Hu, Ling Li, Rui Zhang, Fan Wu, Shaohui Peng, Jiaming Guo, Zidong Du, Qi Guo, Yunji Chen:
Decompose a Task into Generalizable Subtasks in Multi-Agent Reinforcement Learning. NeurIPS 2023
[i12]Shaohui Peng, Xing Hu, Rui Zhang, Jiaming Guo, Qi Yi, Ruizhi Chen, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Conceptual Reinforcement Learning for Language-Conditioned Tasks. CoRR abs/2303.05069 (2023)
[i9]Shaohui Peng, Xing Hu, Qi Yi, Rui Zhang, Jiaming Guo, Di Huang, Zikang Tian, Ruizhi Chen, Zidong Du, Qi Guo, Yunji Chen, Ling Li:
Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning. CoRR abs/2309.01352 (2023)
[i8]Siming Lan, Rui Zhang, Qi Yi, Jiaming Guo, Shaohui Peng, Yunkai Gao, Fan Wu, Ruizhi Chen, Zidong Du, Xing Hu, Xishan Zhang, Ling Li, Yunji Chen:
Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning. CoRR abs/2311.01075 (2023)
[i7]Jiaming Guo, Rui Zhang, Shaohui Peng, Qi Yi, Xing Hu, Ruizhi Chen, Zidong Du, Xishan Zhang, Ling Li, Qi Guo, Yunji Chen:
Efficient Symbolic Policy Learning with Differentiable Symbolic Expression. CoRR abs/2311.02104 (2023)
[i6]Yunkai Gao, Rui Zhang, Jiaming Guo, Fan Wu, Qi Yi, Shaohui Peng, Siming Lan, Ruizhi Chen, Zidong Du, Xing Hu, Qi Guo, Ling Li, Yunji Chen:
Context Shift Reduction for Offline Meta-Reinforcement Learning. CoRR abs/2311.03695 (2023)
- 2022
[c5]Shaohui Peng, Xing Hu, Rui Zhang, Ke Tang, Jiaming Guo, Qi Yi, Ruizhi Chen, Xishan Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Causality-driven Hierarchical Structure Discovery for Reinforcement Learning. NeurIPS 2022
[i4]Shaohui Peng, Xing Hu, Rui Zhang, Ke Tang, Jiaming Guo, Qi Yi, Ruizhi Chen, Xishan Zhang, Zidong Du, Ling Li, Qi Guo, Yunji Chen:
Causality-driven Hierarchical Structure Discovery for Reinforcement Learning. CoRR abs/2210.06964 (2022)
- 2021
[i1]Ruizhi Chen, Xiaoyu Wu, Yansong Pan, Kaizhao Yuan, Ling Li, TianYun Ma, JiYuan Liang, Rui Zhang, Kai Wang, Chen Zhang, Shaohui Peng, Xishan Zhang, Zidong Du, Qi Guo, Yunji Chen:
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms. CoRR abs/2109.01768 (2021)

last updated on 2026-04-02 01:05 CEST by the dblp team
all metadata released as open data under CC0 1.0 license






