Ruoyi's HomePage

About Me

(I am currently seeking job opportunities in China.)

I am a PhD candidate (since 2021) at the Pattern Recognition and Intelligent System Laboratory, School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing (100876), China. I am supervised by Prof. Zhanyu Ma and co-supervised by Prof. Yi-Zhe Song

My research interests are in:

Visual Content Generation 🎨
Fine-Grained Visual Recognition 🦅

Highlights

Peng Gao*, Le Zhuo*, Dongyang Liu*, Ruoyi Du*, Xu Luo*, Longtian Qiu*, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xie, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, and Hongsheng Li, Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers, Technical Report.
arXiv Code
Ruoyi Du, Dongyang Liu, Le Zhuo, Qi Qin, Hongsheng Li, Zhanyu Ma, and Peng Gao, I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow, Technical Report.
arXiv Code
Le Zhuo*, Ruoyi Du*, Han Xiao*, Yangguang Li*, Dongyang Liu*, Rongjie Huang*, Wenze Liu*, Xiangyang Zhu, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Lirui Zhao, Si Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao, Hongsheng Li, and Peng Gao, Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT, in Proceedings of Conference on Neural Information Processing Systems (NeurIPS), 2024.
arXiv Code
Ruoyi Du, Dongliang Chang, Timothy Hospedales, Yi-Zhe Song, and Zhanyu Ma, DemoFusion: Democratising High-Resolution Image Generation With No $$$, in Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognitions (CVPR), 2024.
Page arXiv Code
Ruoyi Du, Dongliang Chang, Zhanyu Ma, Kongming Liang, Yi-Zhe Song, and Jun Guo, Semi-Supervised FGVC with Out-of-Category Data, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Accepted.
DOI Code
Ruoyi Du, Wenqing Yu, Heqing Wang, Ting-En Lin, Dongliang Chang, and Zhanyu Ma, Multi-view Active Fine-grained Visual Recognition, in Proceedings of IEEE/CVF Conference on Computer Vision (ICCV), 2023.
Ruoyi Du, Dongliang Chang, Kongming Liang, Timothy Hospedales, Yi-Zhe Song, and Zhanyu Ma, On-the-fly Category Discovery, in Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognitions (CVPR), 2023.
Paper Code
Ruoyi Du, Jiyang Xie, Zhanyu Ma, Dongliang Chang, Yi-Zhe Song, and Jun Guo, Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 44, no. 12, pp. 9521-9535, 2022.
DOI Code
Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Zhanyu Ma, Yi-Zhe Song, and Jun Guo, Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches, in Proceedings of European Conference on Computer Vision (ECCV), 2020.
Arxiv Code

Publications

Le Zhuo*, Ruoyi Du*, Han Xiao*, Yangguang Li*, Dongyang Liu*, Rongjie Huang*, Wenze Liu*, Xiangyang Zhu, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Lirui Zhao, Si Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao, Hongsheng Li, and PengGao, Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT, in Proceedings of Conference on Neural Information Processing Systems (NeurIPS), 2024. arXiv Code
Tian Zhang, Kongming Liang, Ruoyi Du, Wei Chen, and Zhanyu Ma, Disentangling before Composing: Learning Invariant Disentangled Features for Compositional Zero-Shot Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), accepted. Code
Yurong Guo, Ruoyi Du, Aneeshan Sain, Kongming Liang, Yuan Dong, Yi-Zhe Song, and Zhanyu Ma, Understanding Episode Hardness in Few-shot Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), accepted. arXiv Code
Ruoyi Du, Dongliang Chang, Timothy Hospedales, Yi-Zhe Song and Zhanyu Ma, DemoFusion: Democratising High-Resolution Image Generation With No $$$, in Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognitions (CVPR), 2024. Page arXiv Code
Ruoyi Du, Dongliang Chang, Zhanyu Ma, Kongming Liang, Yi-Zhe Song, and Jun Guo, Semi-Supervised FGVC with Out-of-Category Data, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Accepted. DOI Code
Ruoyi Du, Wenqing Yu, Heqing Wang, Ting-En Lin, Dongliang Chang, and Zhanyu Ma, Multi-view Active Fine-grained Visual Recognition, in Proceedings of IEEE/CVF Conference on Computer Vision (ICCV), 2023.
Yurong Guo, Ruoyi Du, Yuan Dong, Timothy Hospedales, Yi-Zhe Song and Zhanyu Ma, Task-aware Adaptive Learning for Cross-domain Few-shot Learning, in Proceedings of IEEE/CVF Conference on Computer Vision (ICCV), 2023.
Dongliang Chang, Kaiyue Pang, Ruoyi Du, Yujun Tong, Yi-Zhe Song, Zhanyu Ma, and Jun Guo, Making a Bird AI Expert Work for You and Me, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), accepted. arXiv Code
Ruoyi Du, Dongliang Chang, Kongming Liang, Timothy Hospedales, Yi-Zhe Song and Zhanyu Ma, On-the-fly Category Discovery, in Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognitions (CVPR), 2023. Paper Code
Dongliang Chang, Yujun Tong, Ruoyi Du, Timothy Hospedales, Yi-Zhe Song and Zhanyu Ma, An Erudite Fine-Grained Visual Classification Model, in Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognitions (CVPR), 2023. Paper Code
Tian Zhang, Kongming Liang, Ruoyi Du, Xian Sun, Zhanyu Ma, and Jun Guo, Learning Invariant Visual Representations for Compositional Zero-Shot Learning, in Proceedings of European Conference on Computer Vision (ECCV), 2022.
Jingye Wang, Ruoyi Du, Dongliang Chang, Kongming Liang, and Zhanyu Ma, Domain Generalization via Frequency-domain-based Feature Disentanglement and Interaction, in Proceedings of ACM International Conference on Multimedia (ACM MM), 2022.
Yurong Guo, Ruoyi Du, Xiaoxu Li, Jiyang Xie, Zhanyu Ma, and Yuan Dong, Learning Calibrated Class Centers for Few-shot Classification by Pair-wise Similarity, IEEE Transactions on Image Processing (TIP), vol. 31, pp. 4543-4555, 2022. DOI Code
Ruoyi Du, Jiyang Xie, Zhanyu Ma, Dongliang Chang, Yi-Zhe Song, and Jun Guo, Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 44, no. 12, pp. 9521-9535, 2022. DOI Code
Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Zhanyu Ma, Yi-Zhe Song, and Jun Guo, Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches, in Proceedings of European Conference on Computer Vision (ECCV), 2020. Arxiv Code

Service

Review for T-PAMI, IJCV, T-IP, T-CSVT, T-NNLS, CVPR, ICCV, ECCV, ICASSP, etc.

Ruoyi Du (杜若一)

About Me

Highlights

Publications

Service