Research Profile

Dabao Zhang

 

 

My current research focuses on developing variable selection methods for high-dimensional data based on a new penalization framework, and on applying them to whole-genome association studies, whole-genome animal/plant selection, eQTL mapping, and the analysis of gene-gene and gene-environment interactions.

Statistical Methodologies

Bayesian Analysis, Empirical Likelihood Approach, Generalized Thresholding, Graphical Models, Multivariate Extreme Values, Multivariate Statistics, Variable Selection for Large p Small n Data

 

1.    D. Zhang, Y. Lin and M. Zhang (2009). Penalized orthogonal-components regression for large p small n data. Electronic Journal of Statistics, 3: 781-796.

2.    M. Zhang, D. Zhang and M.T. Wells (2009). Generalized thresholding estimators for high-dimensional location parameters. Accepted by Statistica Sinica.

3.    D. Zhang, M. T. Wells and L. Peng (2008). Nonparametric estimation of the dependence function for a multivariate extreme value distribution. Journal of Multivariate Analysis, 99: 577-588.

4.    N.-H. Chan, L. Peng and D. Zhang (2007). Empirical-likelihood-based confidence intervals for conditional variance in heteroscedastic regression models. Accepted by Econometric Theory.

5.    D. Zhang, M. T. Wells, B. W. Turnbull, D. Sparrow and P. A. Cassano (2005). Hierarchical Graphical Models: An Application to Pulmonary Function and Cholesterol Levels in the Normative Aging Study. Journal of the American Statistical Association, 100: 719-727.

6.    D. Zhang, S. He and Z. Xie (1993). Outlier Detection and Intervention for ARIMA(p,d,0). Proceedings of First Asian Conference on Statistical Computation.

 

Statistical Genetics and Bioinformatics

Analysis of Gene Expression Data, Analysis of Mass Spectrometry Data, Comparative Proteomics/Metabolomics Study, Quantitative Trait Loci Mapping, Whole-Genome Association Study

 

 

1.    Y. Lin, M. Zhang, L. Wang, V. Pungpapong, J.C. Fleet, and D. Zhang (2009). Simultaneous genome-wide association studies of anti-CCP in rheumatoid arthritis using penalized orthogonal-components regression. Accepted by BMC Proceedings.

2.    M. Zhang, Y. Lin, L. Wang, V. Pungpapong, J.C. Fleet, and D. Zhang (2009). Case-control genome-wide association study of rheumatoid arthritis from GAW16 using POCRE-LDA. Accepted by BMC Proceedings.

3.    N. Liu, D. Zhang, and H. Zhao (2009). Genotyping error detection in samples of unrelated individuals without replicate genotyping. Human Heredity, 67: 154-162 (DOI: 10.1159/000181153).

4.    D. Zhang, X. Huang, F.E. Regnier, and M. Zhang (2008). Two-dimensional correlation optimized warping algorithm for aligning GC×GC-MS data. Analytical Chemistry, 80 (8): 2664-2671.

5.    M. Zhang, D. Zhang, and M. T. Wells (2008). Variable selection with large p small n regression models: mapping QTL with epistasis. BMC Bioinformatics, 9:251.

6.    D. Zhang and M. Zhang (2007). Bayesian profiling of molecular signatures to predict event times. Theoretical Biology & Medical Modelling, 4:3, doi:10.1186/1742-4682-4-3.

7.    D. Zhang, M. Zhang, and M. T. Wells (2006). Multiplicative Background Correction for Spotted Microarrays to Improve Reproducibility. Genetical Research, 87: 195-206.

8.    M. Zhang, K. L. Montooth, M. T. Wells, A. G. Clark and D. Zhang (2005). Mapping Multiple Quantitative Trait Loci by Bayesian Classification. Genetics, 169: 2305-2318.

9.    D. Zhang, M. T. Wells, C. D. Smart, and W. E. Fry (2005). Bayesian Normalization and Inference for Differential Gene Expression Data. Journal of Computational Biology, 12: 391-406.

10.  Complex Traits Consortium (2004). The Collaborative Cross: A Community Resource for the Genetic Analysis of Complex Traits. Nature Genetics, 36: 1133-1137.

 

 

Applied Statistics

Analysis of Diverse Biomedical Data

1.    T.R. Mhyre, R. Loy, P.N. Tariot, L.A. Profenno, K.A. Maguire-Zeiss, D. Zhang, P.D. Coleman and H.J. Federoff (2008). Proteomic analysis of peripheral leukocytes in Alzheimer's disease patients treated with divalproex sodium. Neurobiology of Aging, 29: 1631-1643.

2.    S. W. Perry, J. P. Norman, A. Litzburg, D. Zhang, S. Dewhurst and H. A. Gelbard (2005). HIV-1 Transactivator of Transcription Protein Induces Mitochondrial Hyperpolarization and Synaptic Stress Leading to Apoptosis. Journal of Immunology, 174: 4333-4344.

3.    M. Zhang, X. Wang, D. Zhang, G. Xu, H. Dong, Y. Yu and J. Han (2004). Orphanin FQ Antagonizes the Inhibition of Ca2+ Currents Induced by Mu-opioid Receptors. Journal of Molecular Neuroscience, 25: 21-27.

 

Statistical Packages

All packages are developed in MATLAB. All copyrights are retained by Dabao Zhang unless stated otherwise. The packages are free to use for academic purposes with proper citation. Please contact me about any bugs or application issues.

·         POCRE: Implements the penalized orthogonal-components regression (POCRE) algorithm proposed in Zhang, Lin and Zhang (2009).

·         2DCOW: Implements the two-dimensional correlation optimized warping algorithm proposed in Zhang, Huang, Regnier and Zhang (2008).

·         MicroBayes: Implements the approach proposed in Zhang, Wells, Smart, and Fry (2005).

·         GEBCauchy: Implements the generalized empirical Bayes thresholding with Cauchy priors proposed in Zhang, Zhang and Wells (2009).

·         GEBLaplace: Implements the generalized empirical Bayes thresholding with Laplace priors, which is developed in a paper in preparation (see Zhang, Zhang and Wells, 2009 for GEBT).

·         QTLBayes: Implements the Bayesian approach for QTL mapping proposed in Zhang, Montooth, Wells, Clark and Zhang (2005) and extended in Zhang, Zhang, and Wells (2008) and another paper in preparation.

·         SemMix: Implements the EM algorithm for mixed graphical models described in Zhang, Wells, Turnbull, Sparrow and Cassano (2005).