Hong Kong Baptist University (HKBU) Research Cluster on Data Analytics and Artificial Intelligence in X

A Temporal Tensor Factorization Framework for Phenotyping and Dynamic Patient Representation Learning Using Multi-Modal EHR Data
Principal Investigator: Dr. William K. CHEUNG (Department of Computer Science)

Leveraging electronic health records (EHR) data for healthcare predictive analytics has received growing attention in recent years. High-throughput phenotyping is one of the analytics tasks in which machine learning algorithms are used to derive phenotypes (sets of clinical conditions) from EHR data to characterize patients with different diseases. In this project, we propose a deep tensor factorization framework for inferring highly interpretable phenotypes and dynamic patient representations from multi-modal EHR data. At its core, the proposed framework contains a temporal tensor model that captures, as part of model learning, (a) the interactions among structured information (such as diagnoses, medications, and lab tests), (b) the underlying phenotypes (as tensor factors), and (c) the temporal evolution of the phenotype portion (the dynamic representation). Because the temporal evolution of a patient's health condition is complex in nature, deep models such as recurrent neural networks and neural Hawkes processes can be integrated to regularize the dynamic representations. In addition, the proposed framework can be integrated with a deep network architecture that learns to extract features from physiological time series, such as vital signs and ECG waveforms, so that the associated predictive analytics tasks can be carried out in a patient-specific manner.
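To make the idea of "phenotypes as tensor factors" concrete, the following is a minimal illustrative sketch and not the project's actual model. It assumes a static three-way count tensor (patients × diagnoses × medications) and fits a non-negative CP factorization with standard multiplicative updates; each rank-one component can then be read as a candidate phenotype. The function names (khatri_rao, unfold, reconstruct, nonneg_cp), the tensor sizes, and the synthetic data are all hypothetical, and the temporal evolution and deep regularizers described above are deliberately left out.

```python
import numpy as np


def khatri_rao(a, b):
    """Column-wise Kronecker product: (I x R), (J x R) -> (I*J x R)."""
    rank = a.shape[1]
    return np.einsum("ir,jr->ijr", a, b).reshape(-1, rank)


def unfold(tensor, mode):
    """Mode-n unfolding: bring axis `mode` to the front and flatten the rest."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)


def reconstruct(factors):
    """Rebuild the three-way tensor from its CP factor matrices."""
    a, b, c = factors
    return np.einsum("ir,jr,kr->ijk", a, b, c)


def nonneg_cp(tensor, rank, n_iter=200, seed=0, eps=1e-9):
    """Non-negative CP factorization of a 3-way tensor (multiplicative updates).

    Illustrative assumption: a static tensor with no temporal mode.
    """
    rng = np.random.default_rng(seed)
    factors = [rng.random((dim, rank)) + 0.1 for dim in tensor.shape]
    for _ in range(n_iter):
        for mode in range(3):
            # Khatri-Rao product of the two factor matrices not being updated,
            # ordered to match the column layout produced by `unfold`.
            others = [factors[m] for m in range(3) if m != mode]
            kr = khatri_rao(others[0], others[1])
            numer = unfold(tensor, mode) @ kr
            denom = factors[mode] @ (kr.T @ kr) + eps
            # Multiplicative update keeps every entry non-negative.
            factors[mode] *= numer / denom
    return factors


# Synthetic example (hypothetical sizes): 30 patients, 8 diagnoses, 6 medications.
rng = np.random.default_rng(42)
truth = [rng.random((dim, 3)) for dim in (30, 8, 6)]
observed = reconstruct(truth)

patients, diagnoses, medications = nonneg_cp(observed, rank=3)
rel_error = np.linalg.norm(observed - reconstruct([patients, diagnoses, medications]))
rel_error /= np.linalg.norm(observed)
print(f"relative reconstruction error: {rel_error:.4f}")
```

In this sketch, column r of the diagnosis and medication factor matrices together describes one candidate phenotype, and row i of the patient factor matrix gives patient i's loading on each phenotype. A temporal extension along the lines described above would let those patient loadings vary over time.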


  • To develop a temporal tensor factorization framework for jointly learning the hidden interactions among clinical events, phenotypes, and dynamic patient representations from structured EHR data.
  • To extract diagnosis codes from progress notes and integrate them into the temporal tensor factorization framework for learning diagnosis-associated phenotypes.
  • To develop methodologies for handling multiple time scales in EHR data, together with the corresponding deep models for regularizing the dynamic patient representations.
  • To integrate the proposed framework with a deep time series model to enable phenotype-aware physiological time series modeling for adaptive real-time predictive analytics.

Grant Support:

This project is supported by the General Research Fund (GRF), Research Grants Council (RGC), Hong Kong SAR, China (Project 12201219).

For further information on this research topic, please contact Dr. William K. CHEUNG.