About Me

Yiming Shen is a Ph.D. Candidate in Applied & Computational Mathematics at the University of Massachusetts Boston, conducting research under Prof. David Degras-Valabregue.

Specializing in Statistical Modeling and Healthcare Data Science, my work bridges the gap between rigorous mathematical derivation and actionable clinical insights. I focus on developing high-dimensional data analysis methods for Brain-Computer Interfaces (BCI) and Medical Imaging.

I am actively seeking Data Scientist roles in the Pharmaceutical and Biotechnology industries.

Technical Expertise

Statistical Learning & Modeling
High-dimensional Tensor Analysis, Bayesian Inference, Mixed-Effects Models, Survival Analysis.
Machine Learning & BCI
Deep Learning for EEG/MRI, Domain Adaptation, Riemannian Geometry, Feature Engineering (CSP, Log-variance).
Computational Tools
R (Expert, Package Development), Python (scikit-learn, PyTorch), SQL, LaTeX, Git.

Selected Achievements

Kaggle Silver Medal (2024) | HMS - Harmful Brain Activity Classification

  • Developed a robust pipeline to detect seizures in complex EEG data, achieving a Top 5% ranking globally.
  • View Project Detail

R Package Developer | DA4BCI

  • Authored Data Augmentation for BCI, an R package optimizing classification accuracy for motor imagery tasks using geometric data generation.
  • View Repository

Research Framework

My research integrates High-Dimensional Tensor Analysis with Riemannian Manifold Geometry to solve the fundamental challenge of neurophysiological non-stationarity in Brain-Computer Interfaces (BCI). This framework is structured around three theoretical pillars:

High-Dimensional Tensor MCCA
Developed computational methods for Tensor Multiple Canonical Correlation Analysis (TMCCA) capable of handling datasets of arbitrary and mixed orders. This work establishes a statistical framework for integrating multi-view neuroimaging data.
Geometric Decomposition of Neural Drift
Analyzed EEG covariance evolution on the Riemannian manifold of Symmetric Positive Definite (SPD) matrices. By projecting representations onto a common unit-trace SPD manifold, I isolate geometric structural distortions from signal magnitude changes to diagnose performance decay.
Confidence-Gated Adaptive Decoding
Reframed BCI adaptation as a statistically grounded decision problem. Using Confidence-Interval (CI) Gating and Non-Inferiority Testing, this framework selects optimal adaptation pipelines (e.g., Bridge-Domain vs. Minimum-Distance) based on measured neurophysiological divergence.
Performance Landscape of BCI Adaptation Strategies
Figure: A diagnostic map illustrating the trade-off between Adaptation Potential (y-axis) and Absolute Accuracy/Stability (x-axis) across different decoding pipelines (BDP, MAP, MMP). My research identifies stable regimes for robust BCI deployment.