


Ruoyi Du
2020 – today
- 2026
[j12]Junhan Chen, Dongliang Chang, Yujun Tong, Ruoyi Du, Yingqing Wang, Zhanyu Ma, Yi-Zhe Song:
FCNet: Extracting undistorted images for fine-grained image classification. Neurocomputing 668: 132376 (2026)
- 2025
[j11]Yurong Guo, Ruoyi Du, Aneeshan Sain, Kongming Liang, Yuan Dong, Yi-Zhe Song, Zhanyu Ma:
Understanding Episode Hardness in Few-Shot Learning. IEEE Trans. Pattern Anal. Mach. Intell. 47(1): 616-633 (2025)
[j10]Tian Zhang, Kongming Liang, Ruoyi Du, Wei Chen, Zhanyu Ma:
Disentangling Before Composing: Learning Invariant Disentangled Features for Compositional Zero-Shot Learning. IEEE Trans. Pattern Anal. Mach. Intell. 47(2): 1132-1147 (2025)
[j9]Yuqi Yang, Dongliang Chang, Ruoyi Du, Yi-Zhe Song, Zhanyu Ma:
Adaptive Multi-Resolution Feature Fusion for Fine-Grained Visual Classification. IEEE Trans. Circuits Syst. Video Technol. 35(8): 8252-8264 (2025)
[j8]Kaixin Chen, Huiying Chang, Mengqiu Xu, Ruoyi Du, Ming Wu, Zhanyu Ma, Chuang Zhang:
Class-Customized Domain Adaptation: Unlock Each Customer-Specific Class With Single Annotation. IEEE Trans. Image Process. 34: 5527-5542 (2025)
[c17]Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xie, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, Tong He, Jingwen He, Junjun He, Yu Qiao, Hongsheng Li:
Lumina-T2X: Scalable Flow-based Large Diffusion Transformer for Flexible Resolution Generation. ICLR 2025
[i20]Jiayi Lei, Renrui Zhang, Xiangfei Hu, Weifeng Lin, Zhen Li, Wenjian Sun, Ruoyi Du, Le Zhuo, Zhongyu Li, Xinyue Li, Shitian Zhao, Ziyu Guo, Yiting Lu, Peng Gao, Hongsheng Li:
IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models. CoRR abs/2501.13920 (2025)
[i19]Qi Qin, Le Zhuo, Yi Xin, Ruoyi Du, Zhen Li, Bin Fu, Yiting Lu, Jiakang Yuan, Xinyue Li, Dongyang Liu, Xiangyang Zhu, Manyuan Zhang, Will Beddow, Erwann Millon, Victor Perez, Wenhai Wang, Conghui He, Bo Zhang, Xiaohong Liu, Hongsheng Li, Yu Qiao, Chang Xu, Peng Gao:
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework. CoRR abs/2503.21758 (2025)
[i18]Zhong-Yu Li, Ruoyi Du, Juncheng Yan, Le Zhuo, Zhen Li, Peng Gao, Zhanyu Ma, Ming-Ming Cheng:
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning. CoRR abs/2504.07960 (2025)
[i17]Yupeng Zhou, Zhen Li, Ziheng Ouyang, Yuming Chen, Ruoyi Du, Daquan Zhou, Bin Fu, Yihao Liu, Peng Gao, Ming-Ming Cheng, Qibin Hou:
OneVAE: Joint Discrete and Continuous Optimization Helps Discrete Video VAE Train Better. CoRR abs/2508.09857 (2025)
[i16]Dongyang Liu, Peng Gao, David Liu, Ruoyi Du, Zhen Li, Qilong Wu, Xin Jin, Sihan Cao, Shifeng Zhang, Hongsheng Li, Steven Hoi:
Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield. CoRR abs/2511.22677 (2025)
[i15]Z.-Image Team, Huanqia Cai, Sihan Cao, Ruoyi Du, Peng Gao, Steven C. H. Hoi, Zhaohui Hou, Shijie Huang, Dengyang Jiang, Xin Jin, Liangchen Li, Zhen Li, Zhong-Yu Li, David Liu, Dongyang Liu, Junhan Shi, Qilong Wu, Feng Yu, Chi Zhang, Shifeng Zhang, Shilin Zhou:
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer. CoRR abs/2511.22699 (2025)
- 2024
[j7]Ruoyi Du, Dongliang Chang, Zhanyu Ma, Kongming Liang, Yi-Zhe Song, Jun Guo:
Semi-Supervised Learning for FGVC With Out-of-Category Data. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 2658-2671 (2024)
[j6]Yiming Zhang, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma:
Generating Accurate and Diverse Audio Captions Through Variational Autoencoder Framework. IEEE Signal Process. Lett. 31: 2520-2524 (2024)
[c16]Ruoyi Du, Dongliang Chang, Timothy M. Hospedales, Yi-Zhe Song, Zhanyu Ma:
DemoFusion: Democratising High-Resolution Image Generation With No $$$. CVPR 2024: 6159-6168
[c15]Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Xiangyang Zhu, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Lirui Zhao, Si Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-Next : Making Lumina-T2X Stronger and Faster with Next-DiT. NeurIPS 2024
[i14]Peng Gao, Le Zhuo, Dongyang Liu, Ruoyi Du, Xu Luo, Longtian Qiu, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xi, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, Hongsheng Li:
Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers. CoRR abs/2405.05945 (2024)
[i13]Yiming Zhang, Xuenan Xu, Ruoyi Du, Haohe Liu, Yuan Dong, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma:
Zero-Shot Audio Captioning Using Soft and Hard Prompts. CoRR abs/2406.06295 (2024)
[i12]Le Zhuo, Ruoyi Du, Han Xiao, Yangguang Li, Dongyang Liu, Rongjie Huang, Wenze Liu, Lirui Zhao, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Xiangyang Zhu, Si Liu, Xiangyu Yue, Dingning Liu, Wanli Ouyang, Ziwei Liu, Yu Qiao, Hongsheng Li, Peng Gao:
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT. CoRR abs/2406.18583 (2024)
[i11]Ruoyi Du, Dongyang Liu, Le Zhuo, Qin Qi, Hongsheng Li, Zhanyu Ma, Peng Gao:
I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow. CoRR abs/2410.07536 (2024)
- 2023
[j5]Dongliang Chang, Kaiyue Pang, Ruoyi Du, Yujun Tong, Yi-Zhe Song, Zhanyu Ma, Jun Guo:
Making a Bird AI Expert Work for You and Me. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12068-12084 (2023)
[j4]Yiming Zhang, Hong Yu, Ruoyi Du, Zheng-Hua Tan, Wenwu Wang, Zhanyu Ma, Yuan Dong:
ACTUAL: Audio Captioning With Caption Feature Space Regularization. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2643-2657 (2023)
[c14]Dongliang Chang, Yujun Tong, Ruoyi Du, Timothy M. Hospedales, Yi-Zhe Song, Zhanyu Ma:
An Erudite Fine-Grained Visual Classification Model. CVPR 2023: 7268-7277
[c13]Ruoyi Du, Dongliang Chang, Kongming Liang, Timothy M. Hospedales, Yi-Zhe Song, Zhanyu Ma:
On-the-Fly Category Discovery. CVPR 2023: 11691-11700
[c12]Ruoyi Du, Wenqing Yu, Heqing Wang, Ting-En Lin, Dongliang Chang, Zhanyu Ma:
Multi-View Active Fine-Grained Visual Recognition. ICCV 2023: 1568-1578
[c11]Yurong Guo, Ruoyi Du, Yuan Dong, Timothy M. Hospedales, Yi-Zhe Song, Zhanyu Ma:
Task-aware Adaptive Learning for Cross-domain Few-shot Learning. ICCV 2023: 1590-1599
[c10]Yitao Chen, Ruoyi Du, Kongming Liang, Zhanyu Ma:
Self-Enhanced Training Framework for Referring Expression Grounding. ICIP 2023: 3060-3064
[c9]Jie Wang, Yixiao Zheng, Ruoyi Du, Yiming Zhang, Kongming Liang, Zhanyu Ma:
Plugging Stylized Controls in Open-Stylized Image Captioning. PRCV (1) 2023: 309-320
[c8]Zihan Yan, Ruoyi Du, Kongming Liang, Tao Wei, Wei Chen, Zhanyu Ma:
Image Generation Based Intra-class Variance Smoothing for Fine-Grained Visual Classification. PRCV (6) 2023: 447-459
[i10]Ruoyi Du, Dongliang Chang, Timothy M. Hospedales, Yi-Zhe Song, Zhanyu Ma:
DemoFusion: Democratising High-Resolution Image Generation With No $$$. CoRR abs/2311.16973 (2023)
- 2022
[j3]Ruoyi Du, Jiyang Xie, Zhanyu Ma, Dongliang Chang, Yi-Zhe Song, Jun Guo:
Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification. IEEE Trans. Pattern Anal. Mach. Intell. 44(12): 9521-9535 (2022)
[j2]Yurong Guo, Ruoyi Du, Xiaoxu Li, Jiyang Xie, Zhanyu Ma, Yuan Dong:
Learning Calibrated Class Centers for Few-Shot Classification by Pair-Wise Similarity. IEEE Trans. Image Process. 31: 4543-4555 (2022)
[c7]Tian Zhang, Kongming Liang, Ruoyi Du, Xian Sun, Zhanyu Ma, Jun Guo:
Learning Invariant Visual Representations for Compositional Zero-Shot Learning. ECCV (24) 2022: 339-355
[c6]Jingye Wang, Ruoyi Du, Dongliang Chang, Kongming Liang, Zhanyu Ma:
Domain Generalization via Frequency-domain-based Feature Disentanglement and Interaction. ACM Multimedia 2022: 4821-4829
[c5]Dongliang Chang, Junhan Chen, Xinran Wang, Ruoyi Du, Wenqing Yu, Yufan Liu, Yujun Tong, Kongming Liang, Yi-Zhe Song, Zhanyu Ma:
Complex Scenario-Oriented Fine-Grained Visual Classification Platform. MMSP 2022: 1
[c4]Yitao Chen, Shibo Nie, Mandan Guan, Jie Wang, Ruoyi Du, Dongliang Chang, Kongming Liang, Zhanyu Ma:
Multi-modal Human-machine Conversation System for Real Physical World. MMSP 2022: 1
[c3]Junhan Chen, Dongliang Chang, Jiyang Xie, Ruoyi Du, Zhanyu Ma:
Cross-Layer Feature based Multi-Granularity Visual Classification. VCIP 2022: 1-5
[i9]Jingye Wang, Ruoyi Du, Dongliang Chang, Zhanyu Ma:
Domain Generalization via Frequency-based Feature Disentanglement and Interaction. CoRR abs/2201.08029 (2022)
[i8]Yiming Zhang, Hong Yu, Ruoyi Du, Zhanyu Ma, Yuan Dong:
Caption Feature Space Regularization for Audio Captioning. CoRR abs/2204.08409 (2022)
[i7]Tian Zhang, Kongming Liang, Ruoyi Du, Xian Sun, Zhanyu Ma, Jun Guo:
Learning Invariant Visual Representations for Compositional Zero-Shot Learning. CoRR abs/2206.00415 (2022)
[i6]Ruoyi Du, Wenqing Yu, Heqing Wang, Dongliang Chang, Ting-En Lin, Yongbin Li, Zhanyu Ma:
Multi-View Active Fine-Grained Recognition. CoRR abs/2206.01153 (2022)
- 2021
[j1]Jiyang Xie, Yixiao Zheng, Ruoyi Du, Weiyu Xiong, Yufei Cao, Zhanyu Ma, Dongpu Cao, Jun Guo:
Deep Learning-Based Computer Vision for Surveillance in ITS: Evaluation of State-of-the-Art Methods. IEEE Trans. Veh. Technol. 70(4): 3027-3042 (2021)
[c2]Siqing Zhang, Ruoyi Du, Dongliang Chang, Zhanyu Ma, Jun Guo:
Knowledge Transfer Based Fine-Grained Visual Classification. ICME 2021: 1-6
[i5]Dongliang Chang, Yixiao Zheng, Zhanyu Ma, Ruoyi Du, Kongming Liang:
Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features. CoRR abs/2102.00367 (2021)
[i4]Dongliang Chang, Kaiyue Pang, Ruoyi Du, Zhanyu Ma, Yi-Zhe Song, Jun Guo:
Making a Bird AI Expert Work for You and Me. CoRR abs/2112.02747 (2021)
[i3]Ruoyi Du, Dongliang Chang, Zhanyu Ma, Yi-Zhe Song, Jun Guo:
Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data. CoRR abs/2112.02825 (2021)
- 2020
[c1]Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Zhanyu Ma, Yi-Zhe Song, Jun Guo:
Fine-Grained Visual Classification via Progressive Multi-granularity Training of Jigsaw Patches. ECCV (20) 2020: 153-168
[i2]Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Yi-Zhe Song, Zhanyu Ma, Jun Guo:
Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches. CoRR abs/2003.03836 (2020)
[i1]Siqing Zhang, Ruoyi Du, Dongliang Chang, Zhanyu Ma, Jun Guo:
Knowledge Transfer Based Fine-grained Visual Classification. CoRR abs/2012.11389 (2020)
last updated on 2026-01-16 23:55 CET by the dblp team
all metadata released as open data under CC0 1.0 license