Research Grants:
- PI, a Subaward of NIH R01GM140476
- Co-PI, NSF DMS-2210687
- PI, NSF IIS-1546087.
- PI, NSF SES-1533956.
Research interests: methods, theory, and applications in statistics and data science:
- Empirical likelihood
- Longitudinal and dependent data analysis
- High-dimensional data analysis
- Financial statistics and econometrics
- Sampling statistics and analysis of missing data
- Nonparametric and semiparametric statistical methods
Publications:
2024
- Tong, P., Chen, S.X., and Tang, C.Y. (2024). Multivariate calibrations with auxiliary variables. Statistica Sinica. To appear.
- Chang, J., Hu, Q., Liu, C., and Tang, C.Y. (2024). Optimal covariance matrix estimation for high-dimensional noise in high-frequency data. Journal of Econometrics. 239, 105329.
- Jiang, B., Liu, C., and Tang, C.Y. (2024). Dynamic covariance matrix estimation and portfolio analysis with high-frequency data. Journal of Financial Econometrics. 22, 461-491.
- Chen, D., Li, C., Tang, C.Y., and Yan, J. (2024). The leverage effect puzzle under semi-nonparametric stochastic volatility models. Journal of Business & Economic Statistics. 42, 548-562.
- Tang, C.Y. (2024). A model specification test for semiparametric nonignorable missing data modeling. Econometrics and Statistics. 30, 124-132.
- Hu, J., Chen, Y., Leng, C., and Tang, C.Y. (2024). Applied regression analysis of correlations for correlated Data. Annals of Applied Statistics. 18,184-198.
- Duan, R., Liang, C. J., Shaw, P., Tang, C. Y., and Chen, Y. (2024). Testing the missing at random assumption in generalized linear models in the presence of instrumental variables. Scandinavian Journal of Statistics. 51, 334-354.
2023
- Jing, N., Fang, E.X., and Tang, C.Y. (2023). Robust matrix estimations meet Frank-Wolf algorithms. Machine Learning. 112, 2723-2760.
- Zhang, W., Li, Y., Chen, Y., and Tang, C.Y. (2023). Parsimonious Gaussian copula modelling through constrained Cholesky decomposition for data with temporal dependence. Scientia Sinica Mathematica. 53, 777-790.
- Guo, X., Chen, Y., and Tang, C.Y. (2023). Information criteria for latent factor models: a study on factor pervasiveness and adaptivity. Journal of Econometrics. 233, 237-250.
2022
- Sarkar, S.K. and Tang, C.Y. (2022). Adjusting the Benjamini-Hochberg method for controlling the false discovery rate in knockoff assisted variable selection. Biometrika. 109, 1149-1155.
- Sun, N. and Tang, C.Y. (2022). Testing high-dimensional covariance matrices with random projections and corrected likelihood ratio. Statistics and Its Interface. 15, 449-461.
- Tong, P., Chen, S.X., and Tang, C.Y. (2022). Detecting and evaluating dust-events in north China with ground air quality data. Earth and Space Science. 9, e2021EA001849.
- Yin, Z., Tong, J., Chen, Y., Hubbard, R.A., and Tang, C.Y. (2022). A cost-effective chart review sampling design to account for phenotyping Error in EHR data. Journal of the American Medical Informatics Association. 29, 52-61.
2021
- Ye, Z., Li, X., and Tang, C.Y. (2021). Nonparametric inference for superposed renewal processes with applications in parametric inferences. Bernoulli. 27, 2804-2826.
- Guo, X. and Tang, C.Y. (2021). Specification tests for covariance structures in high-dimensional statistical models. Biometrika. 108, 335-351.
- Chang, J., Chen, S. X., Tang, C.Y., and Wu, T.T. (2021). High-dimensional empirical likelihood inference. Biometrika. 108, 127-147.
2020
- Bruce, S.A., Tang, C.Y., Hall, M. H., and Krafty, R.T. (2020). Empirical frequency band analysis of nonstationary time series. Journal of the American Statistical Association, Theory and Methods. 115, 1933-1945.
- Tang, C.Y., Fang, E.X., and Dong, Y. (2020). High-dimensional interactions detection with sparse principal hessian matrix. Journal of Machine Learning Research. 21(19) 125.
- Tang, C.Y., Fan, Y., and Kong, Y. (2020). Precision matrix estimation by inverse principal orthogonal decomposition. Communications in Mathematical Research. 36 68-92.
2019
- Tang, C.Y., Zhang, W., and Leng, C. (2019). Discrete longitudinal data modeling with a mean-correlation regression approach. Statistica Sinica. 29, 853-876.
2018
- Yuan, M., Tang, C.Y., Hong, Y., and Yang, J. (2018). Disentangling and assessing uncertainties in multiperiod corporate default risk predictions. Annals of Applied Statistics. 12, 2587-2617.
- Chang, J., Tang, C.Y., and Wu, T.T. (2018). A new scope of penalized empirical likelihood with high-dimensional estimating equations. Annals of Statistics. 46, 3185-3216.
- Chang, J.,Guo, J., and Tang, C.Y. (2018). Peter Hall’s contribution to empirical likelihood. Statistica Sinica. 28,2375-2387.
- Dong, Y., Xia, Q., Tang, C.Y., and Li, Z. (2018). On Sufficient Dimension Reduction with Missing Responses through Estimating Equations. Computational Statistics and Data Analysis. 126,67-77.
- Chang, J., Delaigle, A., Hall, P., and Tang, C.Y. (2018). A frequency domain analysis of the error distribution from noisy high-frequency data. Biometrika. 105,353-369.
2016
- Chang, J.,Tang, C.Y. and Wu, Y. (2016). Local independence feature screening for nonparametric and semiparametric models by marginal empirical likelihood. Annals of Statistics.44,515-539.
2015
- Wu, T.T., Li,G. and Tang, C.Y. (2015). Empirical likelihood and variable selection for censored linear regression. Scandinavian Journal of Statistics. 42,798-812.
- Zhang, W., Leng, C. and Tang, C.Y. (2015). A joint modeling approach for longitudinal studies. Journal of the Royal Statistical Society, Series B. 77,219-238.
2014
- Liu, C. and Tang, C.Y. (2014). A quasi-maximum likelihood approach for integrated covariance matrix estimation with high frequency data. Journal of Econometrics. 180,217-232.
- Tang, C.Y. and Wu, T.T. (2014). Nested coordinate descent algorithms for empirical likelihood. Journal of Statistical Computation and Simulation. 84,1917-1930.
2013
- Chang, J., Tang, C.Y. and Wu, Y. (2013). Marginal empirical likelihood and sure independence screening. Annals of Statistics. 41, 2132-2148.
- Liu, C. and Tang, C.Y. (2013). A state space model approach to integrated covariance matrix estimation with high frequency data. Statistics and Its Interface.6, 463-475.
- Tang, C.Y. and Fan, Y. (2013). Discussion of “Large covariance estimation by thresholding principal orthogonal complements”. Journal of the Royal Statistical Society, Series B. 75, 671.
- Fan, Y. and Tang, C.Y. (2013). Tuning parameter selection in high dimensional penalized likelihood. Journal of the Royal Statistical Society, Series B. 75, 531-552.
- Chen, S.X., Qin, J. and Tang, C.Y. (2013). Mann-Whitney test with adjustments to pre-treatment variables for missing values and observational study. Journal of the Royal Statistical Society, Series B. 75, 81-102.
2012
- Tang, C.Y. and Qin, Y. (2012). An efficient empirical likelihood approach for estimating equations with missing data. Biometrika. 99, 1001-1007.
- Leng, C. and Tang, C.Y. (2012). Sparse matrix graphical models. Journal of the American Statistical Association. 107, 1187-1200.
- Leng, C. and Tang, C.Y. (2012). Penalized empirical likelihood and growing dimensional general estimating equations. Biometrika. 99, 703-716.
- Tang, C.Y. and Leng, C. (2012). An empirical likelihood approach to quantile regression with auxiliary information. Statistics and Probability Letters. 82, 29-36.
2011
- Tang, C.Y. and Leng, C. (2011). Empirical likelihood and quantile regression in longitudinal data analysis. Biometrika. 98, 1001-1006.
- Chen, S.X. and Tang, C.Y. (2011). Nonparametric regression with discrete covariates and missing values. Statistics and Its Interface. 4, 463-474.
- Chen, S.X. and Tang, C.Y. (2011). Properties of census dual system population size estimators. International Statistical Review. 79, 336-361.
- Leng, C. and Tang, C.Y. (2011). Improving variance function estimation in semiparametric longitudinal data analysis. The Canadian Journal of Statistics. 39, 656-670.
2010
- Tang, C.Y. and Leng, C. (2010). Penalized high dimensional empirical likelihood. Biometrika. 97, 905-920.
- Chen, S.X., Tang, C.Y. and Mule, V. T. (2010). Local post-stratification in dual system accuracy and coverage evaluation for the US Census. Journal of the American Statistical Association, Applications and Case Studies. 105, 105-119.
2009 and earlier
- Tang, C.Y. and Chen, S.X. (2009). Parameter estimation and bias correction for diffusion processes. Journal of Econometrics. 149, 65-81.
- Chen, S.X., Gao, J. and Tang, C.Y. (2008). A test for model specification of diffusion processes. Annals of Statistics. 36, 167-198.
- Chen, S.X. and Tang, C.Y. (2005). Nonparametric inference of value at risk for dependent financial returns. Journal of Financial Econometrics. 3, 227-255.