👋 Welcome!

I am a senior at Shanghai University of Medicine & Health Sciences, finishing my B.S. in Data Science and Big Data Technology under the guidance of Dr. Geng Zhu. I am passionate about leveraging data science to solve real-world healthcare challenges.

To bridge my computational skills with a commitment to human well-being, I proactively pursued micro-majors in “Health and Social Care” and “AI in Precision Medicine.” These programs, covering topics from genetic diagnostics to chronic disease care, built a solid foundation for me to connect code with clinical applications.

I am currently translating theory into practice through a long-term internship at Wanda Information Co., Ltd., which is expected to continue until June 2026. Co-mentored by Professor Menghan Hu of East China Normal University, We are tackling a core, real-world industry challenge: how to effectively deploy state-of-the-art Large Language Models on vast and heterogeneous datasets to streamline complex workflows and enhance operational efficiency. This experience is providing me with deep insights into the complexities and immense potential of applying cutting-edge algorithms to diverse business scenarios.

Looking ahead to my Ph.D. studies, my research vision is clear and focused: to develop human-centered and empathetic AI technologies for medicine.

I aim to fuse multimodal data—such as medical imaging, physiological signals, clinical text, and genomics—with machine learning and deep learning algorithms to investigate underlying disease mechanisms and build more accurate diagnostic tools. I am particularly excited by the potential of combining Natural Language Processing (NLP), Large Language Models (LLMs), and Reinforcement Learning (RL) to create interactive platforms. These platforms would not only aid in diagnosis but also provide patients with dynamic guidance on rehabilitation, medication, and lifestyle interventions, serving as a trusted digital health companion. Beyond this, I am also keenly interested in exploring related frontiers such as Brain-Computer Interface (BCI), advanced prosthetics, AR/VR, Human-Computer Interaction (HCI), Robotics and multimodal generative models etc., and I am excited by the prospect of exploring their intersections in my future research.

🔥 News

  • 2025.09: One paper on Research Progress on Gamma Rhythm Stimulation for Alzheimer’s Disease Treatment was accepted by Modern Instruments & Mediccal Treatment!
  • 2024.08: One patent has been published.
  • 2024.08: One patent has been published.

📝 Research

Decoding Schizophrenia with EEG & ERP
sym

Decoding Schizophrenia: Superior Discriminative Power of Resting-State EEG over Event-related Potentials in a Comparative Machine Learning Framework

[Project Leader] This work establishes that for machine learning-based diagnosis, resting-state EEG provides a more powerful and reliable basis for classifying schizophrenia than brain activity recorded during cognitive tasks (ERPs).

Content
  • EEG/ERP Data Preprocessing: Systematically processed resting-state EEG and N-back task ERP signals using EEGLAB, implementing a full pipeline of filtering, artifact rejection, and normalization to maximize signal integrity and ensure high-quality data for analysis.

  • Frequency-Domain Biomarker Identification: Employed Fourier Transform to extract power and energy spectrum features, converting complex time-series data into a clear frequency-domain representation to uncover potential biomarkers of schizophrenia.

  • Interpretable Machine Learning for Diagnosis: Developed diagnostic models using interpretable algorithms (SVM, GBDT, KNN, Random Forest). Leveraged the Fisher-score algorithm for feature selection to pinpoint the most discriminative neural patterns across different frequency bands and brain regions.

  • Key Findings: Demonstrated that GBDT model achieved superior performance in classifying resting-state EEG, while SVM was optimal for task-based ERP data. This highlights a state-dependent optimal model choice. Identified the occipital lobe during visual processing tasks as a critical region containing highly discriminative features, suggesting its key role in the pathophysiology of schizophrenia.

Decoding Age-Related Schizophrenia
sym

Age-Related Neural Signatures of Schizophrenia: A Cross-Cohort Deep Learning Analysis of EEG Functional Connectivity

[Project Leader] This research investigates age-related neural signatures of schizophrenia by conducting a comparative analysis of adult and adolescent EEG data, leveraging advanced brain network modeling and deep learning.

Content
  • Cross-Cohort & Cross-Age Comparative Framework: Initiated a novel comparative study between a privately collected adult schizophrenia EEG dataset (Shanghai) and a public adolescent dataset (Moscow), establishing a framework to test the generalizability of neural biomarkers across different age groups and populations.

  • Functional Brain Network Construction & Analysis: Constructed functional brain networks using Power Spectral Density (PSD) as node features and Phase Lag Index (PLI) / Phase Locking Value (PLV) as edge weights. This approach enabled a direct comparison of functional connectivity patterns between the two distinct patient cohorts.

  • Graph-Based Deep Learning for Neuropathology: Applied Graph Convolutional Network to model the brain's functional connectome, successfully identifying complex, non-linear neuropathological patterns that distinguish the adult and adolescent schizophrenia cohorts.

  • Developing a Superior Hybrid Diagnostic Model: Designed and benchmarked multiple deep learning architectures for diagnostic classification. The proposed LSTM-Transformer model demonstrated superior performance and robustness, outperforming other hybrid models across both datasets, providing a powerful solution for cross-cohort diagnosis.

AI-Assisted Clinical Diagnosis
sym sym

An Interactive Platform for AI-Assisted Clinical Diagnosis

[Sole Developer] This website demonstrates the translation of foundational AI research into a tangible clinical tool. I independently conceived and developed an interactive web-based platform designed to bridge the gap between computational models and real-world clinical practice.

Content
  • A Holistic, LLM-Enhanced Diagnostic Framework: Drawing on my industry experience with Large Language Models (LLMs), I engineered a novel platform that integrates three critical data sources: objective EEG-based diagnostic predictions, standardized clinical assessment scales, and a dynamic, interactive patient consultation module.

  • Rapid Prototyping and Research-to-Practice Translation: I demonstrated strong self-learning and execution capabilities by mastering the Streamlit framework from the ground up to build and deploy a functional prototype in just two weeks. This platform is specifically designed to create a seamless workflow between computational research and hands-on clinical assessment.

  • Domain-Specific Language Model Customization: To power the platform’s consultation module, I built a specialized, lightweight language model. By leveraging Retrieval-Augmented Generation (RAG), the model was fine-tuned on a curated corpus of medical literature and clinical reports to provide context-aware and evidence-based responses for clinical inquiries.

Word Vector Models
sym

Uncovering Semantic Structures in Stroke Research: A Comparative Analysis of Word Vector Models

[Project Leader] This project leverages natural language processing and unsupervised learning to systematically analyze a large corpus of medical literature, providing a data-driven evaluation of word embedding techniques for a specialized domain.

Content
  • Domain-Specific Text Processing Pipeline: Engineered a specialized text processing pipeline to handle the unique challenges of medical literature. By training a word segmentation model on manually annotated samples, the system achieves high accuracy in identifying professional medical terms within a large-scale corpus of stroke research abstracts.

  • Head-to-Head Comparison of Word Embedding Models: Conducted a direct comparative analysis of two seminal word embedding algorithms, GloVe and Word2Vec. Both models were trained on the processed corpus to create distinct vector-space representations of the stroke research domain, allowing for a rigorous evaluation of their ability to capture semantic relationships.

  • Quantitative and Qualitative Performance Evaluation:Employed unsupervised learning (cluster analysis) as a downstream task to provide an evidence-based evaluation of each vectorization model. By comparing the quality of the resulting thematic clusters, this work offers a clear conclusion on which algorithm is superior for uncovering meaningful, latent structures within specialized scientific literature.

Interventions for DCD
sym

Interventions for Developmental Coordination Disorder (DCD)

[Project Leader] This study provides a comprehensive analysis of the research landscape for interventions targeting DCD in children, combining a wide-ranging bibliometric review with a detailed meta-analysis of intervention effectiveness.

Content
  • Systematic Mapping of the Global Research Landscape: Conducted a large-scale bibliometric analysis to identify key research trends, geographical patterns, and influential institutions and authors in the field of DCD intervention. The study highlights Canada, particularly McMaster University, as a central hub for DCD research.

  • Evidence-based Evaluation of Intervention Efficacy: Performed a meta-analysis to quantitatively assess the effectiveness of various motor interventions using the standardized Movement Assessment Battery for Children-2 (MABC-2).

  • Identification of Optimal Intervention Strategies: The findings demonstrate that different interventions yield distinct benefits. Notably, multi-component exercises are most effective for improving balance, while task-oriented and skill-based activities show significant advantages for enhancing manual dexterity, aiming, and catching.

  • Guidance for Future Clinical Practice and Research: By integrating bibliometric trends with quantitative efficacy data, this research identifies critical gaps in the literature and proposes a clear, data-driven framework for designing and implementing targeted DCD management strategies.

EEG-Based Fatigue Prediction
sym

“PLEISURE” - An On-Chip, Real-Time System for EEG-Based Fatigue Prediction

[Core Member] This work encompasses the end-to-end development of an intelligent IoT system for real-time fatigue prediction, from algorithm design and on-chip hardware implementation to advanced deep learning analysis.

Content
  • FPGA-Based Real-Time Algorithm Implementation: A fatigue detection algorithm was designed and implemented on an FPGA. This on-chip processing enables real-time signal analysis at the edge for a wearable predictive warning system.

  • Non-Linear Feature Engineering: A feature extraction pipeline was developed to quantify fatigue from raw EEG signals. The process involves computing non-linear dynamic metrics, including Sample Entropy and Approximate Entropy, to characterize changes in brain activity associated with physiological fatigue.

  • Dual-Approach Deep Learning for Classification: Two deep learning models were architected and benchmarked for fatigue classification. A CNN-LSTM fusion model was applied to capture spatio-temporal dependencies in EEG signals, while GNN was used to model the brain's functional connectivity.

🥇 Honors and Awards

📖 Education

  • 2022.09 - 2026.06, Bachelor’s Degree in Data Science and Big Data Technology, Shanghai University of Medicine & Health Sciences, Shanghai, China

💻 Internship

Joint Training
sym

Wanda Information Co., Ltd.

Co-mentored by Professor Menghan Hu of East China Normal University, We are tackling a core, real-world industry challenge: how to effectively deploy state-of-the-art Large Language Models on vast and heterogeneous datasets to streamline complex workflows and enhance operational efficiency. This work is expected to continue until June 2026.