Research Profile

Dabao Zhang

 

 

My current research focuses on developing variable selection methods for high-dimensional data based on a new penalization framework, and on applying them to whole-genome association studies, whole-genome animal/plant selection, eQTL mapping, and the analysis of gene-gene and gene-environment interactions.

Statistical Methodologies

Bayesian Analysis, Empirical Likelihood Approach, Graphical Models, Multivariate Extreme Values, Multivariate Statistics, Shrinkage Estimators, Variable Selection for Large p Small n Data

D. Zhang, Y. Lin and M. Zhang (2009). Penalized orthogonal-components regression for large p small n data. Accepted by Electronic Journal of Statistics (see an earlier version: arXiv:0811.4167v3 [stat.ME]).

 

M. Zhang, D. Zhang and M.T. Wells (2009). Generalized thresholding estimators for high-dimensional location parameters. Accepted by Statistica Sinica.

D. Zhang, M. T. Wells and L. Peng (2008). Nonparametric estimation of the dependence function for a multivariate extreme value distribution. Journal of Multivariate Analysis, 99: 577-588.

N.-H. Chan, L. Peng and D. Zhang (2007). Empirical-likelihood-based confidence intervals for conditional variance in heteroscedastic regression models. Accepted by Econometric Theory.

D. Zhang, M. T. Wells, B. W. Turnbull, D. Sparrow and P. A. Cassano (2005). Hierarchical Graphical Models: An Application to Pulmonary Function and Cholesterol Levels in the Normative Aging Study. Journal of the American Statistical Association, 100: 719-727.

D. Zhang, S. He and Z. Xie (1993). Outlier Detection and Intervention for ARIMA(p,d,0). Proceedings of the First Asian Conference on Statistical Computation.

Statistics in Bioinformatics

Analysis of Gene Expression Data, Analysis of Mass Spectrometry Data, Comparative Proteomics/Metabolomics Study, Quantitative Trait Loci Mapping, Whole-Genome Association Study

 

Y. Lin, M. Zhang, L. Wang, V. Pungpapong, J.C. Fleet, and D. Zhang (2009). Simultaneous genome-wide association studies of anti-CCP in rheumatoid arthritis using penalized orthogonal-components regression. Accepted by BMC Proceedings.

 

M. Zhang, Y. Lin, L. Wang, V. Pungpapong, J.C. Fleet, and D. Zhang (2009). Case-control genome-wide association study of rheumatoid arthritis from GAW16 using POCRE-LDA. Accepted by BMC Proceedings.

 

N. Liu, D. Zhang, and H. Zhao (2009). Genotyping error detection in samples of unrelated individuals without replicate genotyping. Human Heredity, 67: 154-162 (DOI: 10.1159/000181153).

D. Zhang, X. Huang, F.E. Regnier, and M. Zhang (2008). Two-dimensional correlation optimized warping algorithm for aligning GCxGC-MS data. Analytical Chemistry, 80 (8): 2664-2671.

M. Zhang, D. Zhang, and M. T. Wells (2008). Variable selection with large p small n regression models: mapping QTL with epistasis. BMC Bioinformatics, 9:251.

D. Zhang and M. Zhang (2007). Bayesian profiling of molecular signatures to predict event times. Theoretical Biology & Medical Modelling, 4:3, doi:10.1186/1742-4682-4-3.

D. Zhang, M. Zhang, and M. T. Wells (2006). Multiplicative Background Correction for Spotted Microarrays to Improve Reproducibility. Genetical Research, 87: 195-206.

M. Zhang, K. L. Montooth, M. T. Wells, A. G. Clark and D. Zhang (2005). Mapping Multiple Quantitative Trait Loci by Bayesian Classification. Genetics, 169: 2305-2318.

D. Zhang, M. T. Wells, C. D. Smart, and W. E. Fry (2005). Bayesian Normalization and Inference for Differential Gene Expression Data. Journal of Computational Biology, 12: 391-406.

Complex Traits Consortium (2004). The Collaborative Cross: A Community Resource for the Genetic Analysis of Complex Traits. Nature Genetics, 36: 1133-1137.

Applied Statistics

Analysis of Diverse Biomedical Data

 

T.R. Mhyre, R. Loy, P.N. Tariot, L.A. Profenno, K.A. Maguire-Zeiss, D. Zhang, P.D. Coleman and H.J. Federoff (2008). Proteomic analysis of peripheral leukocytes in Alzheimer's disease patients treated with divalproex sodium. Neurobiology of Aging, 29: 1631-1643.

S. W. Perry, J. P. Norman, A. Litzburg, D. Zhang, S. Dewhurst and H. A. Gelbard (2005). HIV-1 Transactivator of Transcription Protein Induces Mitochondrial Hyperpolarization and Synaptic Stress Leading to Apoptosis. Journal of Immunology, 174: 4333-4344.

M. Zhang, X. Wang, D. Zhang, G. Xu, H. Dong, Y. Yu and J. Han (2004). Orphanin FQ Antagonizes the Inhibition of Ca2+ Currents Induced by Mu-opioid Receptors. Journal of Molecular Neuroscience, 25: 21-27.

Statistical Packages

All packages are developed in MATLAB. Copyrights are retained by Dabao Zhang and/or Min Zhang unless stated otherwise. The packages are free to use for academic purposes with proper citation. Please contact me about any bugs or application issues.

 

POCRE: Implements the penalized orthogonal-components regression (POCRE) algorithm proposed in Zhang, Lin and Zhang (2009).

2DCOW: Implements the two-dimensional correlation optimized warping algorithm proposed in Zhang, Huang, Regnier and Zhang (2008).

MapMSDiff: Developed for comparative proteomic/metabolomic studies using one-dimensional mass spectrometry data; the method is described in a paper in preparation. We may extend it to multi-dimensional mass spectrometry data in the future.

MicroBayes: Implements the approach proposed in Zhang, Wells, Smart, and Fry (2005).

GEBCauchy: Implements the generalized empirical Bayes thresholding with Cauchy priors proposed in Zhang, Zhang and Wells (2009).

GEBLaplace: Implements the generalized empirical Bayes thresholding (GEBT) with Laplace priors, which is developed in a paper in preparation (see Zhang, Zhang and Wells, 2009, for GEBT).

QTLBayes: Implements the Bayesian approach for QTL mapping proposed in Zhang, Montooth, Wells, Clark and Zhang (2005), which is extended in Zhang, Zhang, and Wells (2008) and in another paper in preparation.

SemMix: Implements the EM algorithm for mixed graphical models as described in Zhang, Wells, Turnbull, Sparrow and Cassano (2005).

Talks

 

 

VIGRE (Oct. 24, 2007): Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution

ICSA (June 24, 2009): Two-Dimensional Correlation Optimized Warping Algorithm for Aligning GCxGC-MS Data

 
