Results 1 - 10
of
42
Adaptive Multi-Task Lasso: with application to eQTL detection
"... To understand the relationship between genomic variations among population and complex diseases, it is essential to detect eQTLs which are associated with phenotypic effects. However, detecting eQTLs remains a challenge due to complex underlying mechanisms and the very large number of genetic loci i ..."
Abstract
-
Cited by 5 (1 self)
- Add to MetaCart
To understand the relationship between genomic variations among population and complex diseases, it is essential to detect eQTLs which are associated with phenotypic effects. However, detecting eQTLs remains a challenge due to complex underlying mechanisms and the very large number of genetic loci involved compared to the number of samples. Thus, to address the problem, it is desirable to take advantage of the structure of the data and prior information about genomic locations such as conservation scores and transcription factor binding sites. In this paper, we propose a novel regularized regression approach for detecting eQTLs which takes into account related traits simultaneously while incorporating many regulatory features. We first present a Bayesian network for a multi-task learning problem that includes priors on SNPs, making it possible to estimate the significance of each covariate adaptively. Then we find the maximum a posteriori (MAP) estimation of regression coefficients and estimate weights of covariates jointly. This optimization procedure is efficient since it can be achieved by using a projected gradient descent and a coordinate descent procedure iteratively. Experimental results on simulated and real yeast datasets confirm that our model outperforms previous methods for finding eQTLs. 1
ARTICLE Genome-wide Association Study of Smoking Initiation and Current Smoking
"... For the identification of genes associated with smoking initiation and current smoking, genome-wide association analyses were carried out in 3497 subjects. Significant genes that replicated in three independent samples (n 405, 5810, and 1648) were visualized into a biologically meaningful network s ..."
Abstract
-
Cited by 3 (3 self)
- Add to MetaCart
For the identification of genes associated with smoking initiation and current smoking, genome-wide association analyses were carried out in 3497 subjects. Significant genes that replicated in three independent samples (n 405, 5810, and 1648) were visualized into a biologically meaningful network showing cellular location and direct interaction of their proteins. Several interesting groups of proteins stood out, including glutamate receptors (e.g., GRIN2B, GRIN2A, GRIK2, GRM8), proteins involved in tyrosine kinase receptor signaling (e.g., NTRK2, GRB14), transporters (e.g., SLC1A2, SLC9A9) and cell-adhesion molecules (e.g., CDH23). We conclude that a networkbased genome-wide association approach can identify genes influencing smoking behavior.
REPORT Sequence Variants in Three Loci Influence Monocyte Counts and Erythrocyte
"... Blood cells participate in vital physiological processes, and their numbers are tightly regulated so that homeostasis is maintained. Disruption of key regulatory mechanisms underlies many blood-related Mendelian diseases but also contributes to more common disorders, including atherosclerosis. We se ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
Blood cells participate in vital physiological processes, and their numbers are tightly regulated so that homeostasis is maintained. Disruption of key regulatory mechanisms underlies many blood-related Mendelian diseases but also contributes to more common disorders, including atherosclerosis. We searched for quantitative trait loci (QTL) for hematology traits through a whole-genome association study, because these could provide new insights into both hemopoeitic and disease mechanisms. We tested 1.8 million variants for association with 13 hematology traits measured in 6015 individuals from the Australian and Dutch populations. These traits included hemoglobin composition, platelet counts, and red blood cell and white blood cell indices. We identified three regions of strong association that, to our knowledge, have not been previously reported in the literature. The first was located in an intergenic region of chromosome 9q31 near LPAR1, explaining 1.5 % of the variation in monocyte counts (best SNP rs7023923, p 8.9 3 10 14). The second locus was located on chromosome 6p21 and associated with mean cell erythrocyte volume (rs12661667, p 1.2 3 10 9, 0.7 % variance explained) in a region that spanned five genes, including CCND3, a member of the D-cyclin gene family that is involved in hematopoietic stem cell expansion. The third region was also associated with erythrocyte volume and was located in an intergenic region on chromosome 6q24 (rs592423, p 5.3 3 10 9, 0.6 % variance explained). All three loci replicated in an independent panel of 1543 individuals (p values 0.001, 9.9 3 10 5, and 7 3 10 5, respectively). The identification of these QTL provides new opportunities for furthering our understanding of the mechanisms regulating hemopoietic cell fate. Blood cells play an integral role in vital physiological
Phylogenetic dependency networks: Inferring patterns of adaptation in HIV
, 2009
"... This is to certify that I have examined this copy of a doctoral dissertation by ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
This is to certify that I have examined this copy of a doctoral dissertation by
Binomial Mixture Model-based Association Tests under Genetic Heterogeneity
, 2009
"... Most of the existing association tests for population-based case-control studies are based on comparing the mean genotype scores between the case and control groups, which may not be efficient under genetic heterogeneity. Given that most common diseases are genetically heterogeneous, caused by mutat ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Most of the existing association tests for population-based case-control studies are based on comparing the mean genotype scores between the case and control groups, which may not be efficient under genetic heterogeneity. Given that most common diseases are genetically heterogeneous, caused by mutations in multiple loci, it may be beneficial to fully account for genetic heterogeneity in an association test. Here we first propose a binomial mixture model for such a purpose and develop a corresponding mixture likelihood ratio test (MLRT) for a single locus. We also consider two methods to combine single-locus-based MLRTs across multiple loci in linkage disequilibrium to boost power when causal SNPs are not genotyped. We show with a wide spectrum of numerical examples that under genetic heterogeneity the proposed tests are more powerful than some commonly used association tests.
Use of Genome-Wide Expression Data to Mine the ‘‘Gray Zone’ ’ of GWA Studies Leads to Novel Candidate Obesity Genes
"... To get beyond the ‘‘low-hanging fruits’ ’ so far identified by genome-wide association (GWA) studies, new methods must be developed in order to discover the numerous remaining genes that estimates of heritability indicate should be contributing to complex human phenotypes, such as obesity. Here we d ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
To get beyond the ‘‘low-hanging fruits’ ’ so far identified by genome-wide association (GWA) studies, new methods must be developed in order to discover the numerous remaining genes that estimates of heritability indicate should be contributing to complex human phenotypes, such as obesity. Here we describe a novel integrative method for complex disease gene identification utilizing both genome-wide transcript profiling of adipose tissue samples and consequent analysis of genome-wide association data generated in large SNP scans. We infer causality of genes with obesity by employing a unique set of monozygotic twin pairs
A Genomewide Association Study of Nicotine and Alcohol Dependence in Australian and Dutch Populations
"... * These authors contributed equally to the work reported. These authors contributed equally to the work reported. Persistent tobacco use and excessive alcohol consumption are major public health concerns worldwide. Both alcohol and nicotine dependence (AD, ND) are genetically influenced complex diso ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
* These authors contributed equally to the work reported. These authors contributed equally to the work reported. Persistent tobacco use and excessive alcohol consumption are major public health concerns worldwide. Both alcohol and nicotine dependence (AD, ND) are genetically influenced complex disorders that exhibit a high degree of comorbidity. To identify gene variants contributing to one or both of these addictions, we first conducted a pooling-based genomewide association study (GWAS) in an Australian population, using Illumina Infinium 1M arrays. Allele frequency differences were compared between pooled DNA from case
BOOST: A Fast Approach to Detecting Gene-Gene Interactions in Genome-wide Case-Control Studies
, 2010
"... Gene-gene interactions have long been recognized to be fundamentally important for understanding genetic causes of complex disease traits. At present, identifying gene-gene interactions from genome-wide case-control studies is computationally and methodologically challenging. In this paper, we intro ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Gene-gene interactions have long been recognized to be fundamentally important for understanding genetic causes of complex disease traits. At present, identifying gene-gene interactions from genome-wide case-control studies is computationally and methodologically challenging. In this paper, we introduce a simple but powerful method, named ‘‘BOolean Operation-based Screening and Testing’’ (BOOST). For the discovery of unknown gene-gene interactions that underlie complex diseases, BOOST allows examination of all pairwise interactions in genome-wide case-control studies in a remarkably fast manner. We have carried out interaction analyses on seven data sets from the Wellcome Trust Case Control Consortium (WTCCC). Each analysis took less than 60 hr to completely evaluate all pairs of roughly 360,000 SNPs on a standard 3.0 GHz desktop with 4G memory running the Windows XP system. The interaction patterns identified from the type 1 diabetes data set display significant difference from those identified from the rheumatoid arthritis data set, although both data sets share a very similar hit region in the WTCCC report. BOOST has also identified some disease-associated interactions between genes in the major histocompatibility complex region in the type 1 diabetes data set. We believe that our method can serve as a computationally and statistically useful tool in the coming era of large-scale interaction mapping in genome-wide case-control studies.
Brain Expressed microRNAs Implicated in Schizophrenia
"... Background. Protein encoding genes have long been the major targets for research in schizophrenia genetics. However, with the identification of regulatory microRNAs (miRNAs) as important in brain development and function, miRNAs genes have emerged as candidates for schizophrenia-associated genetic f ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Background. Protein encoding genes have long been the major targets for research in schizophrenia genetics. However, with the identification of regulatory microRNAs (miRNAs) as important in brain development and function, miRNAs genes have emerged as candidates for schizophrenia-associated genetic factors. Indeed, the growing understanding of the regulatory properties and pleiotropic effects that miRNA have on molecular and cellular mechanisms, suggests that alterations in the interactions between miRNAs and their mRNA targets may contribute to phenotypic variation. Methodology/Principal Findings. We have studied the association between schizophrenia and genetic variants of miRNA genes associated with brainexpression using a case-control study design on three Scandinavian samples. Eighteen known SNPs within or near brainexpressed miRNAs in three samples (Danish, Swedish and Norwegian: 420/163/257 schizophrenia patients and 1006/177/293 control subjects), were analyzed. Subsequently, joint analysis of the three samples was performed on SNPs showing marginal association. Two SNPs rs17578796 and rs1700 in hsa-mir-206 (mir-206) and hsa-mit-198 (mir-198) showed nominal significant allelic association to schizophrenia in the Danish and Norwegian sample respectively (P = 0.0021 & p = 0.038), of which only
Department of Forensic Molecular Biology,
, 2009
"... A genome-wide association study of northwestern Europeans involves the C-type natriuretic peptide signaling pathway in the etiology of human height variation ..."
Abstract
- Add to MetaCart
A genome-wide association study of northwestern Europeans involves the C-type natriuretic peptide signaling pathway in the etiology of human height variation

