DE eng

Search in the Catalogues and Directories

Hits 1 – 1 of 1

1
Combining evidence, biomedical literature and statistical dependence: new insights for functional annotation of gene sets.
In: ISSN: 1471-2105 ; BMC Bioinformatics ; https://hal.archives-ouvertes.fr/hal-00094484 ; BMC Bioinformatics, BioMed Central, 2006, 7, pp.241. ⟨10.1186/1471-2105-7-241⟩ (2006)
Abstract: 18 p. ; International audience ; BACKGROUND: Large-scale genomic studies based on transcriptome technologies provide clusters of genes that need to be functionally annotated. The Gene Ontology (GO) implements a controlled vocabulary organised into three hierarchies: cellular components, molecular functions and biological processes. This terminology allows a coherent and consistent description of the knowledge about gene functions. The GO terms related to genes come primarily from semi-automatic annotations made by trained biologists (annotation based on evidence) or text-mining of the published scientific literature (literature profiling). RESULTS: We report an original functional annotation method based on a combination of evidence and literature that overcomes the weaknesses and the limitations of each approach. It relies on the Gene Ontology Annotation database (GOA Human) and the PubGene biomedical literature index. We support these annotations with statistically associated GO terms and retrieve associative relations across the three GO hierarchies to emphasise the major pathways involved by a gene cluster. Both annotation methods and associative relations were quantitatively evaluated with a reference set of 7397 genes and a multi-cluster study of 14 clusters. We also validated the biological appropriateness of our hybrid method with the annotation of a single gene (cdc2) and that of a down-regulated cluster of 37 genes identified by a transcriptome study of an in vitro enterocyte differentiation model (CaCo-2 cells). CONCLUSION: The combination of both approaches is more informative than either separate approach: literature mining can enrich an annotation based only on evidence. Text-mining of the literature can also find valuable associated MEDLINE references that confirm the relevance of the annotation. Eventually, GO terms networks can be built with associative relations in order to highlight cooperative and competitive pathways and their connected molecular functions.
Keyword: [INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]; [SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]; [SDV.GEN.GH]Life Sciences [q-bio]/Genetics/Human genetics; Controlled; Genetic; MESH: Algorithms; MESH: Artificial Intelligence; MESH: Documentation; MESH: Evidence-Based Medicine; MESH: Humans; MESH: MEDLINE; MESH: Models; MESH: Multigene Family; MESH: Natural Language Processing; MESH: Proteins; MESH: Terminology as Topic; MESH: Vocabulary; Statistical
URL: https://hal.archives-ouvertes.fr/hal-00094484/file/1471-2105-7-241.pdf
https://hal.archives-ouvertes.fr/hal-00094484/document
https://doi.org/10.1186/1471-2105-7-241
https://hal.archives-ouvertes.fr/hal-00094484
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern