Research

AI-enabled Algorithms in Computational Pathology

Deep learning-based Whole Slide Image (WSI) analysis shows significant potential for automating pathological image diagnosis and intelligent analysis. However, WSIs differ significantly from natural images in size, ranging from 100 million to 10 billion pixels, which prevents the direct application of deep learning models developed for natural images to WSIs. A common approach is to divide WSIs into many non-overlapping small patches for processing, but providing fine-grained annotations for these patches is prohibitively expensive (a WSI can typically produce tens of thousands of patches), rendering patch-based supervised methods impractical. As a result, weakly supervised learning methods based on Multiple Instance Learning (MIL) have become prevalent.

Current challenges in applying MIL to computational pathology include finer-grained classification and localization, multi-scale information fusion, patch and slide-level distribution modeling, elimination of redundancy and ambiguity, and avoiding overfitting, among others.

Our research aims to address these challenges by proposing new, efficient AI algorithms to achieve more accurate, efficient, and intelligent WSI automatic diagnosis and analysis. It is worth mentioning that with the rise of various powerful foundation models, we are actively exploring how to better utilize these models to address various issues in computational pathology.

Weakly/Semi-Supervised Learning Algorithms

Foundation Model-Driven Few/Zero-Shot WSI Classification

Multimodal Medical Information Fusion and Data Mining

The integration of diverse data modalities—such as images, text, and tabular data—into a unified analytical framework has the potential to significantly enhance medical diagnosis and patient care. Multimodal medical diagnosis leverages the strengths of each data type, providing a comprehensive understanding of patient health and disease progression. While Whole Slide Images (WSIs) offer detailed morphological insights into patient status, cancer progression and responses to treatment are influenced by a multitude of factors. Therefore, incorporating additional modalities, such as radiology images, textual medical reports, and omics data, is crucial for improved outcome prediction.

The primary challenges in this field include efficient multimodal feature extraction, alignment and fusion strategies, multimodal decision-making, and handling missing modalities. These challenges necessitate the development of sophisticated AI algorithms to achieve seamless integration and robust analysis of multimodal data.

Our research focuses on addressing these challenges by proposing novel, efficient AI algorithms. We aim to enhance the accuracy and efficiency of multimodal diagnostic systems through advanced feature extraction and fusion techniques. Additionally, we are actively exploring the application of multimodal foundation models to tackle these challenges, leveraging their vast pre-trained knowledge to improve multimodal data integration and analysis.

Data mining plays a crucial role in extracting valuable insights from multimodal medical data. Our research also emphasizes active learning strategies in medical imaging to optimize data utilization and improve model performance.

Multimodal Medical Diagnosis (Image, Text, Tabular Data)

Algorithm-enabled Clinical Research

We are also deeply committed to applying AI algorithms to specific clinical tasks to better address urgent clinical needs. With over two years of experience as research assistant in the pathology department of large comprehensive hospitals, I have progressively applied developed algorithms to areas including gynecology, urology, neuroendocrinology, gastroenterology, head and neck tumors, and rare diseases. We have validated these algorithms with clinical data totaling over 10,000 cases, covering tasks such as tumor diagnosis, prognosis, treatment response prediction, and gene mutation prediction, among others. Concurrently with our research publications, we are also involved in the relevant industrialization process and have applied for multiple related patents.