Results 11 - 20
of
529
WebGestalt: an integrated system for exploring gene sets in various biological contexts
- Nucleic Acids Res
, 2005
"... High-throughput technologies have led to the rapid generation of large-scale datasets about genes and gene products. These technologies have also shifted our research focus from ‘single genes ’ to ‘gene sets’. We have developed a web-based integrated data mining system, WebGestalt ..."
Abstract
-
Cited by 28 (2 self)
- Add to MetaCart
High-throughput technologies have led to the rapid generation of large-scale datasets about genes and gene products. These technologies have also shifted our research focus from ‘single genes ’ to ‘gene sets’. We have developed a web-based integrated data mining system, WebGestalt
SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data
- Nucleic Acids Res
, 2003
"... The explosion in the number of functional genomic datasets generated with tools such as DNA microarrays has created a critical need for resources that facilitate the interpretation of large-scale biological data. SOURCE is a web-based database that brings together information from a broad range of r ..."
Abstract
-
Cited by 26 (0 self)
- Add to MetaCart
The explosion in the number of functional genomic datasets generated with tools such as DNA microarrays has created a critical need for resources that facilitate the interpretation of large-scale biological data. SOURCE is a web-based database that brings together information from a broad range of resources, and provides it in manner particularly useful for genome-scale analyses. SOURCE’s GeneReports include aliases, chromosomal location, functional descriptions, GeneOntology annotations, gene expression data, and links to external databases. We curate published microarray gene expression datasets and allow users to rapidly identify sets of co-regulated genes across a variety of tissues and a large number of conditions using a simple and intuitive interface. SOURCE provides content both in gene and cDNA clone-centric pages, and thus simplifies analysis of datasets generated using cDNA microarrays. SOURCE is continuously updated and contains the most recent and accurate information available for human, mouse, and rat genes. By allowing dynamic linking to individual gene or clone reports, SOURCE facilitates browsing of large genomic datasets. Finally, SOURCEs batch interface allows rapid extraction of data for thousands of genes or clones at once and thus facilitates statistical analyses such as assessing the enrichment of functional attributes within clusters of genes. SOURCE is available at
A Literature-Based Method For Assessing The Functional Coherence Of A Gene Group
, 2003
"... Motivation: Many experimental and algorithmic approaches in biology generate groups of genes that need to be examined for related functional properties. For example, gene expression profiles are frequently organized into clusters of genes that may share functional properties. We evaluate a method, n ..."
Abstract
-
Cited by 25 (2 self)
- Add to MetaCart
Motivation: Many experimental and algorithmic approaches in biology generate groups of genes that need to be examined for related functional properties. For example, gene expression profiles are frequently organized into clusters of genes that may share functional properties. We evaluate a method, neighbor divergence per gene (NDPG), that uses scientific literature to assess whether a group of genes are functionally related. The method requires only a corpus of documents and an index connecting the documents to genes.
The subsystems approach to genome annotation and its use in the project to annotate 1000 genomes
- Nucleic Acids Res
, 2005
"... ..."
Integrative database analysis in structural genomics
- Nat Struct Biol 2000, Suppl:960–963
"... Abstract (2 sentences) An important aspect of structural genomics is connecting coordinate data with whole-genome information related to phylogenetic occurrence, protein function, gene expression, and protein-protein interactions. Integrative database analysis can highlight certain folds and structu ..."
Abstract
-
Cited by 24 (9 self)
- Add to MetaCart
Abstract (2 sentences) An important aspect of structural genomics is connecting coordinate data with whole-genome information related to phylogenetic occurrence, protein function, gene expression, and protein-protein interactions. Integrative database analysis can highlight certain folds and structural features that stand out against the general population of proteins in particular ways. Individual bits of genomic data need to be put in a context to be meaningful. For instance, the isolated fact that yeast gene YBR191w is expressed at a level of 65 copies per cell in GeneChip experiments is, by itself, meaningless. However, if one can connect this measurement to those of other genes and an overall functional classification, one can determine that this gene codes for a ribosomal protein and that ribosomal proteins have amongst the highest levels of expression in yeast. The same logic applies to structure. Coordinates by themselves just specify shape and are not of intrinsic biological value, unless they can be related to other information. In the past, for "singlemolecule" experiments, formal integration was unnecessary; one got the whole picture through reading the literature. However, this is impossible for all ~18,000 proteins in the worm. Thus,
Assessing the limits of genomic data integration for predicting protein networks. Genome Res
, 2005
"... All genomic feature data used in this study can be downloaded at ..."
Abstract
-
Cited by 23 (2 self)
- Add to MetaCart
All genomic feature data used in this study can be downloaded at
UniqueProt: creating representative protein sequence sets
- Nucleic Acids Res
, 2003
"... UniqueProt is a practical and easy to use web service designed to create representative, unbiased data sets of protein sequences. The largest possible representative sets are found through a simple greedy algorithm using the HSSP-value to establish sequence similarity. UniqueProt is not a real clust ..."
Abstract
-
Cited by 22 (6 self)
- Add to MetaCart
UniqueProt is a practical and easy to use web service designed to create representative, unbiased data sets of protein sequences. The largest possible representative sets are found through a simple greedy algorithm using the HSSP-value to establish sequence similarity. UniqueProt is not a real clustering program in the sense that the ‘representatives’ are not at the centres of well-defined clusters since the definition of such clusters is problem-specific. Overall, UniqueProt is a reasonable fast solution for bias in data sets. The service is accessible at
Functional interpretation of microarray experiments
- OMICS
, 2006
"... Over the past few years, due to the popularisation of high-throughput methodologies such as DNA microarrays, the possibility of obtaining experimental data has increased significantly. Nevertheless, the interpretation of the results, which involves translating these data into useful biological knowl ..."
Abstract
-
Cited by 22 (11 self)
- Add to MetaCart
Over the past few years, due to the popularisation of high-throughput methodologies such as DNA microarrays, the possibility of obtaining experimental data has increased significantly. Nevertheless, the interpretation of the results, which involves translating these data into useful biological knowledge, still remains a challenge. The methods and strategies used for this interpretation are in continuous evolution and new proposals are constantly arising. Initially, a two-step approach was used in which genes of interest were initially selected, based on thresholds that consider only experimental values, and then in a second, independent step the enrichment of these genes in biologically relevant terms, was analysed. For different reasons, these methods are relatively poor in terms of performance and a new generation of procedures, which draw inspiration from systems biology criteria, are currently under development. Such procedures, aim to directly test the behaviour of blocks of functionally related genes, instead of focusing on single genes.
Harmonic Measure
- New Mathematical Monographs. Cambridge
, 2005
"... semantic web tools to integrate experimental ..."
Automatic extraction of gene/protein biological functions from biomedical text
- Bioinformatics
, 2005
"... MOTIVATION: With the rapid advancement of biomedical science and the development of high-throughput analysis methods, the extraction of various types of information from biomedical text has become critical. Since automatic functional annotations of genes are quite useful for interpreting large amoun ..."
Abstract
-
Cited by 18 (0 self)
- Add to MetaCart
MOTIVATION: With the rapid advancement of biomedical science and the development of high-throughput analysis methods, the extraction of various types of information from biomedical text has become critical. Since automatic functional annotations of genes are quite useful for interpreting large amounts of high-throughput data efficiently, the demand for automatic extraction of information related to gene functions from text has been increasing. RESULTS: We have developed a method for automatically extracting the biological process functions of genes/protein/families based on Gene Ontology (GO) from text using a shallow parser and sentence structure analysis techniques. When the gene/protein/family names and their functions are described in ACTOR (doer of action) and OBJECT (receiver of action) relationships, the corresponding GO-IDs are assigned to the genes/proteins/families. The gene/protein/family names are recognized using the gene/protein/family name dictionaries developed by our group. To achieve wide recognition of the gene/protein/family functions, we semi-automatically gather

