I am currently pursuing a master’s degree at the School of Computer Science and Technology, East China Normal University, under the supervision of Associate Professor Lei Chen. I completed my undergraduate studies at Tianjin Polytechnic University, in the School of Software with a minor in the School of Economics and Management. I also interned for one year with the GMAI team at Shanghai Artificial Intelligence Laboratory, with Junjun He as my team leader and Jin Ye as my mentor. I am now an intern at INTSIG, working on image security algorithms.
My research areas include:
- Medical imaging
- Multimodal large models in healthcare
- Generative models
🎓 Education
- 2022.09 - 2025.06, East China Normal University, Shanghai, China.
- 2018.09 - 2022.06, Tianjin Polytechnic University, Tianjin, China.
📝 Publications
MICCAI 2024

- Wang G*, Ye J*, Cheng J, et al. SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation[J]. arXiv preprint arXiv:2407.04938, 2024. Accepted by MICCAI 2024. [PDF]
NeurIPS 2024

- Chen P*, Ye J*, Wang G*, et al. GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI[J]. arXiv preprint arXiv:2408.03361, 2024. Accepted by NeurIPS 2024. [PDF] [Homepage]
CVPR 2025

- Chen Y*, Wang G*, Ji Y*, et al. SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding[J]. arXiv preprint arXiv:2410.11761, 2024. Accepted by CVPR 2025. [PDF] [Homepage]
CVPR 2025

- Cheng, J., Fu, B., Ye, J., Wang, G., Li, T., Wang, H., … & He, J. (2024). Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline. arXiv preprint arXiv:2411.12814. Accepted by CVPR 2025. [PDF] [Homepage]
XXXX 2025

- Li, W., Hu, M., Wang, G., Liu, L., Zhou, K., … & He, J. (2024). Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model. Submitted to XXXX 2025.
XXXX 2025

- Tianbin Li, Yanzhou Su, Wei Li, Bin Fu, Zhe Chen, Ziyan Huang, Guoan Wang, et al. GMAI-VL & GMAI-VL-15M: Towards General Medical AI with a Large Vision-Language Model and a Comprehensive Multimodal Dataset. 2024. Submitted to XXXX 2025. [PDF]
PRCV 2023

- Chen, J., Cheng, J., Jiang, L., Yin, P., Wang, G., & Zhu, M. (2023, October). PRFNet: Progressive Region Focusing Network for Polyp Segmentation. In Chinese Conference on Pattern Recognition and Computer Vision (PRCV) (pp. 394-406). Singapore: Springer Nature Singapore. Accepted by PRCV 2023. [PDF]
ICME 2023

- Huai, T., Yang, S., Zhang, J., Wang, G., Yu, X., Ma, T., & He, L. (2023, July). SQT: Debiased Visual Question Answering via Shuffling Question Types. In 2023 IEEE International Conference on Multimedia and Expo (ICME) (pp. 600-605). IEEE. Accepted by ICME 2023. [PDF]
🏅 Honors and Awards
- National Scholarship
- Tianjin Municipal People’s Government Scholarship
- President’s first scholarship
- H Award in the American Collegiate Mathematical Contest in Modeling
- Third Prize in the Blue Bridge Cup National Competition, C/C++ Group
- Second Prize in the Asia-Pacific University Students Mathematical Contest in Modeling
💻 Internships
- 2025.02 - Present, INTSIG.
- 2024.09 - 2024.11, Shanghai Artificial Intelligence Laboratory.
- 2024.07 - 2024.08, Nomura Information Technology Shanghai Co., Ltd (NTSH).
- 2023.12 - 2024.06, Shanghai Artificial Intelligence Laboratory.
- 2023.09 - 2023.11, Ubiquant Investment (Beijing) Corp.
- 2023.02 - 2023.06, iLambda Corp.