Research

Statistics (Method and Theory)

2023+

Surrogate-Assisted Federated Learning of High-Dimensional Electronic Health Record Data.
Liu, Y.#, Liu, M.#, Guo, Z. and Cai, T., 2023+. In submission. [Arxiv]

Improve Efficiency of Doubly Robust Estimator when Propensity Score is Misspecified.
Lv, L.** and Liu, M., 2023+. Under major revision. [Arxiv]

Augmented Transfer Regression Learning with Semi-non-parametric Nuisance Models.
Liu, M., Zhang, Y., Liao, K. and Cai, T., 2023. Journal of Machine Learning Research (forthcoming). [Arxiv]

Maxway CRT: Improving the Robustness of Model-X Inference.
Li, S.# and Liu, M.#, 2023. Journal of the Royal Statistical Society Series B (Statistical Methodology) (forthcoming). [Arxiv] [Code]

Assessing Heterogeneous Risk of Type II Diabetes Associated with Statin Usage: Evidence from Electronic Health Record Data.
Guo, X.#, Wei, W.#, Liu, M., Cai, T., Wu, C. and Wang, J., 2023. Journal of the American Statistical Association (forthcoming). [Arxiv] [Journal]

2022

Semi-supervised Triply Robust Inductive Transfer Learning.
Cai, T.*, Li, M.* and Liu, M.*, 2022+. Under revision. [Arxiv]

Doubly Robust Augmented Model Accuracy Transfer Inference with High Dimensional Features.
Zhou, D.#, Liu, M.#, Li, M. and Cai, T., 2022+. Under revision. [Arxiv]

Efficient Estimation and Evaluation of Prediction Rules in Semi-Supervised Settings under Stratified Sampling.
Gronsbell, J.#, Liu, M.#, Tian, L. and Cai, T., 2022. Journal of the Royal Statistical Society Series B (Statistical Methodology). [Arxiv] [Code] [Journal]

Prior Adaptive Semi-supervised Learning with Application to EHR Phenotyping.
Zhang, Y.#, Liu, M.#, Neykov, M. and Cai, T., 2022. Journal of Machine Learning Research. [Arxiv] [Code] [Journal]

Fast and Powerful Conditional Randomization Testing via Distillation.
Liu, M., Katsevich, E., Janson, L. and Ramdas, A., 2022. Biometrika. [Arxiv] [Code] [Journal]

Individual Data Protected Integrative Regression Analysis of High-Dimensional Heterogeneous Data.
Cai, T.*, Liu, M.* and Xia, Y.*, 2022. Journal of the American Statistical Association. [Arxiv] [Code] [Journal]

2021

Double/Debiased Machine Learning for Logistic Partially Linear Model.
Liu, M.*, Zhang, Y.* and Zhou, D.*, 2021. The Econometrics Journal. [Arxiv] [Code] [Journal]

Integrative High Dimensional Multiple Testing with Heterogeneity under Data Sharing Constraints.
Liu, M., Xia, Y., Cho, K. and Cai, T., 2021. Journal of Machine Learning Research. [Arxiv] [Journal]

Before 2021

Joint Models for Time-to-Event Data and Longitudinal Biomarkers of High Dimension.
Liu, M., Sun, J., Herazo-Maya, J.D., Kaminski, N. and Zhao, H., 2019. Statistics in Biosciences. [Journal]

Modeling Individualized Coefficient Alpha to Measure Quality of Test Score Data.
Liu, M., Hu, M. and Zhou, X., 2018. Statistics in Medicine. [Journal]

Biomedical Informatics

Differential Associations of Interleukin 6 Receptor Variant Across Genetic Ancestries and Implications for Targeted Therapies.
Wang, X.#, Liu, M.#, et al., 2022+. In submission. [MedRxiv]

Weakly Semi-supervised Phenotyping Using Electronic Health Records.
Nogues, I., Wen, J., Lin, Y., Liu, M., Tedeschi, S., Geva, A., Cai, T. and Hong, C., 2022. Journal of Biomedical Informatics. [Journal]

Clinical Knowledge Extraction via Sparse Embedding Regression (KESER) with Multi-Center Large Scale Electronic Health Record Data.
Hong, C., et al., 2021. Nature Portfolio Journal (npj) Digital Medicine. [MedRxiv] [Webpage] [Journal]
(I am listed as the third author).

A High-Throughput Phenotyping Algorithm Is Portable from Adult to Pediatric Populations.
Geva, A., Liu, M., Panickan, V., Avillach, P., Cai, T.# and Mandl, K.#, 2021. Journal of the American Medical Informatics Association. [Journal]

SCRIBE: a new approach to dropout imputation and batch effects correction for single-cell RNA-seq data.
Zhang, Y., Liang, K., Liu, M., Li, Y., Ge, H. and Zhao, H., 2019. Machine Learning in Computational Biology. [BioRxiv]

Collaborative Research

Hospitalizations Associated With Mental Health Conditions Among Adolescents in the US and France During the COVID-19 Pandemic.
Gutiérrez-Sacristán, A., et al., 2022. JAMA Network Open. [Journal]

Changes in laboratory value improvement and mortality rates over the course of the pandemic: an international retrospective cohort study of hospitalised patients infected with SARS-CoV-2.
Hong, C., et al., 2022. BMJ Open. [Journal]

International comparisons of laboratory values from the 4CE collaborative to predict COVID-19 mortality.
Weber, G., et al., 2022. Nature Portfolio Journal (npj) Digital Medicine. [MedRxiv] [Journal]

National Trends in Disease Activity for COVID-19 among Children in the US.
Bourgeois, F., Hutch, M., Liu, M., Avillach, P. and Luo, Y., 2021. Frontiers in Pediatrics. [Journal]

International Analysis of Electronic Health Records of Children and Youth Hospitalized With COVID-19 Infection in 6 Countries.
Bourgeois, F., et al., 2021. JAMA Network Open. [Journal]
(I am listed as the fourth author).

International changes in COVID-19 clinical trajectories across 315 hospitals and 6 countries: retrospective cohort study.
Weber, G., et al., 2021. Journal of Medical Internet Research. [Journal]

  • *: The authors are listed in alphabetical order.
  • #: The authors contributed equally to this work.
  • **: Student authors under my supervision.