Research
Research Interests
My research interests encompass high-dimensional data analysis and machine learning.
- High dimensional data analysis: multiple testing, replicable inference and dimension reduction, with applications in genotype data.
- Machine Learning: diffusion models, transformer and synthetic data.
Publications and Preprints
- Wang, P., Lyu. P, Peddada, S., and Cao, H. (2025). Statistical analysis of correlated expression data from high throughput experiments.
Genetics, iyaf060. - Bell, T.N., Kusi-Appiah, A.E., Tocci, V., Lyu. P, Zhu, L., Zhu, F., Van Winkle, D., Cao, H., Singh, M.S. and Lenhert, S. (2024). Scalable lipid droplet microarray fabrication, validation, and screening.
Plos one, 19(7). - Lyu, P., Li, Y., Wen, X., and Cao, H. (2023). JUMP: replicability analysis of high-throughput experiments with applications to spatial transcriptomic studies.
Bioinformatics, 39(6). - Lyu, P., Zhang, X., and Cao, H. (2025+). Replicability analysis of high dimensional data accounting for dependence. [arXiv].
- Lyu. P, Bell, T., Lenhert S., and Cao, H. (2025+). Sample size calculation in cell culture - how many cells should I count?