avatar

Ruoyi Du (杜若一)

Take your time to rise.

About Me

(I am currently seeking job opportunities in China.)

I am a PhD candidate (since 2021) at the Pattern Recognition and Intelligent System Laboratory, School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing (100876), China. I am supervised by Prof. Zhanyu Ma and co-supervised by Prof. Yi-Zhe Song

My research interests are in:

  • Visual Content Generation 🎨
  • Fine-Grained Visual Recognition 🦅

Highlights

  • lumina-t2x Illustration
    Peng Gao*, Le Zhuo*, Dongyang Liu*, Ruoyi Du*, Xu Luo*, Longtian Qiu*, Yuhang Zhang, Chen Lin, Rongjie Huang, Shijie Geng, Renrui Zhang, Junlin Xie, Wenqi Shao, Zhengkai Jiang, Tianshuo Yang, Weicai Ye, He Tong, Jingwen He, Yu Qiao, and Hongsheng Li, Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers, Technical Report.
  • imax Illustration
    Ruoyi Du, Dongyang Liu, Le Zhuo, Qi Qin, Hongsheng Li, Zhanyu Ma, and Peng Gao, I-Max: Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers with Projected Flow, Technical Report.
  • lumina-next Illustration
    Le Zhuo*, Ruoyi Du*, Han Xiao*, Yangguang Li*, Dongyang Liu*, Rongjie Huang*, Wenze Liu*, Xiangyang Zhu, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Lirui Zhao, Si Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao, Hongsheng Li, and Peng Gao, Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT, in Proceedings of Conference on Neural Information Processing Systems (NeurIPS), 2024.
  • DemoFusion Illustration
    Ruoyi Du, Dongliang Chang, Timothy Hospedales, Yi-Zhe Song, and Zhanyu Ma, DemoFusion: Democratising High-Resolution Image Generation With No $$$, in Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognitions (CVPR), 2024.
  • FGVC Illustration
    Ruoyi Du, Dongliang Chang, Zhanyu Ma, Kongming Liang, Yi-Zhe Song, and Jun Guo, Semi-Supervised FGVC with Out-of-Category Data, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Accepted.
  • Multi-view Active FGVC
    Ruoyi Du, Wenqing Yu, Heqing Wang, Ting-En Lin, Dongliang Chang, and Zhanyu Ma, Multi-view Active Fine-grained Visual Recognition, in Proceedings of IEEE/CVF Conference on Computer Vision (ICCV), 2023.
  • On-the-fly Category Discovery
    Ruoyi Du, Dongliang Chang, Kongming Liang, Timothy Hospedales, Yi-Zhe Song, and Zhanyu Ma, On-the-fly Category Discovery, in Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognitions (CVPR), 2023.
  • Progressive Learning for FGVC
    Ruoyi Du, Jiyang Xie, Zhanyu Ma, Dongliang Chang, Yi-Zhe Song, and Jun Guo, Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 44, no. 12, pp. 9521-9535, 2022.
  • PMG Training of Jigsaw Patches
    Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Zhanyu Ma, Yi-Zhe Song, and Jun Guo, Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches, in Proceedings of European Conference on Computer Vision (ECCV), 2020.

Publications

  • Le Zhuo*, Ruoyi Du*, Han Xiao*, Yangguang Li*, Dongyang Liu*, Rongjie Huang*, Wenze Liu*, Xiangyang Zhu, Fu-Yun Wang, Zhanyu Ma, Xu Luo, Zehan Wang, Kaipeng Zhang, Lirui Zhao, Si Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao, Hongsheng Li, and PengGao, Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT, in Proceedings of Conference on Neural Information Processing Systems (NeurIPS), 2024. arXiv Code
  • Tian Zhang, Kongming Liang, Ruoyi Du, Wei Chen, and Zhanyu Ma, Disentangling before Composing: Learning Invariant Disentangled Features for Compositional Zero-Shot Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), accepted. Code
  • Yurong Guo, Ruoyi Du, Aneeshan Sain, Kongming Liang, Yuan Dong, Yi-Zhe Song, and Zhanyu Ma, Understanding Episode Hardness in Few-shot Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), accepted. arXiv Code
  • Ruoyi Du, Dongliang Chang, Timothy Hospedales, Yi-Zhe Song and Zhanyu Ma, DemoFusion: Democratising High-Resolution Image Generation With No $$$, in Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognitions (CVPR), 2024. Page arXiv Code
  • Ruoyi Du, Dongliang Chang, Zhanyu Ma, Kongming Liang, Yi-Zhe Song, and Jun Guo, Semi-Supervised FGVC with Out-of-Category Data, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Accepted. DOI Code
  • Ruoyi Du, Wenqing Yu, Heqing Wang, Ting-En Lin, Dongliang Chang, and Zhanyu Ma, Multi-view Active Fine-grained Visual Recognition, in Proceedings of IEEE/CVF Conference on Computer Vision (ICCV), 2023.
  • Yurong Guo, Ruoyi Du, Yuan Dong, Timothy Hospedales, Yi-Zhe Song and Zhanyu Ma, Task-aware Adaptive Learning for Cross-domain Few-shot Learning, in Proceedings of IEEE/CVF Conference on Computer Vision (ICCV), 2023.
  • Dongliang Chang, Kaiyue Pang, Ruoyi Du, Yujun Tong, Yi-Zhe Song, Zhanyu Ma, and Jun Guo, Making a Bird AI Expert Work for You and Me, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), accepted. arXiv Code
  • Ruoyi Du, Dongliang Chang, Kongming Liang, Timothy Hospedales, Yi-Zhe Song and Zhanyu Ma, On-the-fly Category Discovery, in Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognitions (CVPR), 2023. Paper Code
  • Dongliang Chang, Yujun Tong, Ruoyi Du, Timothy Hospedales, Yi-Zhe Song and Zhanyu Ma, An Erudite Fine-Grained Visual Classification Model, in Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognitions (CVPR), 2023. Paper Code
  • Tian Zhang, Kongming Liang, Ruoyi Du, Xian Sun, Zhanyu Ma, and Jun Guo, Learning Invariant Visual Representations for Compositional Zero-Shot Learning, in Proceedings of European Conference on Computer Vision (ECCV), 2022.
  • Jingye Wang, Ruoyi Du, Dongliang Chang, Kongming Liang, and Zhanyu Ma, Domain Generalization via Frequency-domain-based Feature Disentanglement and Interaction, in Proceedings of ACM International Conference on Multimedia (ACM MM), 2022.
  • Yurong Guo, Ruoyi Du, Xiaoxu Li, Jiyang Xie, Zhanyu Ma, and Yuan Dong, Learning Calibrated Class Centers for Few-shot Classification by Pair-wise Similarity, IEEE Transactions on Image Processing (TIP), vol. 31, pp. 4543-4555, 2022. DOI Code
  • Ruoyi Du, Jiyang Xie, Zhanyu Ma, Dongliang Chang, Yi-Zhe Song, and Jun Guo, Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 44, no. 12, pp. 9521-9535, 2022. DOI Code
  • Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Zhanyu Ma, Yi-Zhe Song, and Jun Guo, Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches, in Proceedings of European Conference on Computer Vision (ECCV), 2020. Arxiv Code

Service

Review for T-PAMI, IJCV, T-IP, T-CSVT, T-NNLS, CVPR, ICCV, ECCV, ICASSP, etc.