I am currently a Eng.D. candidate at the College of Biomedical Engineering, Fudan University (复旦大学生物医学工程与技术创新学院). My research focuses on AI-driven medical image analysis, including mainstream tasks such as segmentation, classification, anomaly detection, and image translation. Recently, my research interests have shifted toward multimodal ultrasound/photoacoustic imaging.
I graduated from College of Computer Science and Artificial Intelligence, Southwest Minzu University (西南民族大学计算机科学与人工智能学院) with a bachelor’s degree and from the College of Computer Science and Mathematics, Fujian University of Technology (福建理工大学计算机科学与数学学院) with a master’s degree, advised by Professor Zuoyong Li (李佐勇).
Currently, I am conducting my studies at the Intelligent Medical Ultrasound Laboratory (IMU Lab) of the Academy for Engineering & Technology (工程与应用技术研究院), Fudan University. Under the supervision of Professor Xin Liu (刘欣), my research focuses on deep learning-based ultrasound imaging. I have published 5+ papers at the top international journoal.
🔥 News
- 2025.02: 🎉 A paper is accepted by PR
- 2024.10: 🎉 A paper is accepted by IEEE TCSVT
- 2024.09: 🎉 A paper is accepted by ESWA
📝 Publications
🎙 Image Classification

SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification
Zuoyong Li, Qinghua Lin, Haoyi Fan, Tiesong Zhao, David Zhang
- We propose a Video Cross-set Augmentation Module (VCAM) by incorporating high-confidence unlabeled samples into the augmentation queue and generating various pseudo-label samples through interpolation, to mitigate the sampling experience mismatch while expanding the training data.
- We propose a Super Augmentation Block (SAB) that adds random mask and Gaussian noise to frames based on historical losses to re-augment high-confidence samples. SAB allows for the re-utilization of these strongly augmented samples for better consistency regularization..
- We propose a multi-class accident video dataset called Express Center Accidents 9 (ECA9). ECA9 comprises nine typical accidents in hub-level express processing centers, and provides video-level labels and frame-level anomaly labels. To the best of our knowledge, ECA9 is the first surveillance video dataset for industrial accident scenes.

FireMatch: A semi-supervised video fire detection network based on consistency and distribution alignment
Qinghua Lin, Zuoyong Li, Kun Zeng, Haoyi Fan, Wei Li, Xiaoguang Zhou
- To fully leverage unlabeled data, we extend consistency regularization with a self-adaptive pseudo-label to the video classification field. Generating enough high-quality pseudo-label data for training helps the model achieve accurate fire video classification.
- For addressing the problem of imbalanced labeled and unlabeled data leading to mismatched sampling experiences, we propose video cross-set sample augmentation combined with adversarial distribution alignment to generate additional labeled samples and alleviate this bias.
- We conduct extensive experiments and ablation studies on public datasets and compare our method with state-of-the-art semi supervised methods. The experimental results demonstrate the effectiveness of the proposed method.

Leukocyte classification using relative-relationship-guided contrastive learning
Zuoyong Li, Qinghua Lin, Jiawei Wu, Taotao Lai, Rongteng Wu, David Zhang
- We introduce the heuristic-guided strategy of contrastive learning, which provides positive and negative samples based on relative distance knowledge. This strategy addresses the bottleneck of contrast views in contrast learning resulting from different classes being sensitive to different data expansions.
- We propose a Relative-Relationship-Guided Contrastive Learning Representation (ReCLR) framework for the leukocyte classification, which introduces the prior distance knowledge to mine positive pairs with the adversarial relative relationship and negative pairs with entropy constraint.
- We conduct extensive experiments and compare ReCLR with several state-of-the-art methods on real leukocyte datasets. The results show that our method achieves better classification performance in different evaluation protocols, including linear evaluation, domain transfer, and finetuning, which shows the effectiveness of proposed method.
🧑🎨 Generative Model

WtNGAN: Unpaired image translation from white light images to narrow-band images
Qinghua Lin, Zuoyong Li, Kun Zeng, Jie Wen, Yuting Zhang, Jian Chen
- We utilize contrastive learning and structural consistency constraints to facilitate detailed translation from white light images to narrow-band light images while preserving the structure of the generated images.
- For the first time, we introduce Vision Mamba to the field of unpaired image translation. The generator based on Vision Mamba enhances the detailed representation of gastroscopic white light images to narrow-band light images by establishing long-range dependencies.
- We conduct extensive experiments on a self-built gastroscopic dataset and the BraTS2021 dataset, and the experimental results demonstrate that the proposed method outperforms the current state-of-the-art unpaired image translation methods.
🎙 Image Segmentation

DeepCrackAT: An effective crack segmentation framework based on learning multi-scale crack features
Qinghua Lin, Wei Li, Xiangpan Zheng, Haoyi Fan, Zuoyong Li
- We propose a deep crack convolutional network, named DeepCrackAT, which utilizes an Attention mechanism to fuse multi-scale features from convolution and Tokenized MLP for crack segmentation.
- For irregularly distributed cracks, we use the hybrid dilated convolutions to increase the receptive field of convolutional operations and capture more crack features. Additionally, the proposed method employs a tokenized multilayer perceptron to project high-dimensional crack features into a low dimension space, enhancing the network’s ability of noise resistance.
- We introduce the convolutional block attention module to construct an attentional skip-layer fusion block for multi-scale feature fusion. This helps to enhance the network’s perception of the critical crack region and alleviate the problem of information loss in thick crack segmentation.
Others
TIM 2025
KAC-Unet: A Medical Image Segmentation with the Adaptive Group Strategy and Kolmogorov-Arnold Network, Shiying Lin, Rong Hu, Zuoyong Li, Qinghua Lin, et al.ACMM MM 2025
Gradient-Aware Revitalization of Non-Effective Samples in Medical Image Segmentation, Shiying Lin, Rong Hu, Zuoyong Li, Qinghua Lin, et al.计算机工程与应用 2025
多维层次语义蒸馏引导下的深度学习模型压缩方法, 陈碧霞, 林清华, et al.
🎖 Honors and Awards
- 2025.06 Graduate Academic Excellence First-Class Scholarship of Fujian University of Technology (Top 1%)
- 2025.06 Outstanding Postgraduate Graduates of Fujian University of Technology (Ranked First)
- 2023.12 Graduate Academic Excellence Special Scholarship of Fujian University of Technology (Top 1%)
- 2022.12 First-Class Freshman Scholarship of Fujian University of Technology (Top 3%)
📖 Educations
- 2025.09 - Now, Doctoral Candidate in Engineering, College of Biomedical Engineering, Fudan University, Shanghai.
- 2025.01 - 2025.06, Visiting Students, Fudan University, Shanghai.
- 2022.09 - 2025.06, Master, School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou.
- 2017.09 - 2021.06, Undergraduate, College of Computer Science and Engineering, Southwest Minzu University, Chengdu.