Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5 6 7 8 9...89

Hits 81 – 100 of 1.772

81	Grammatically Coded Corpus Of Spoken Lithuanian: Methodology And Development ...
	L. Kamandulytė-Merfeldienė. - : Zenodo, 2017
	BASE
	Show details

82	Grammatically Coded Corpus Of Spoken Lithuanian: Methodology And Development ...
	L. Kamandulytė-Merfeldienė. - : Zenodo, 2017
	BASE
	Show details

83	Biomedical word sense disambiguation with word embeddings
	Antunes, Rui; Matos, Sérgio. - : Springer, 2017
	BASE
	Show details

84	Crowdsourcing lexical semantic judgements from bilingual dictionary users
	Fothergill, Richard James. - 2017
	BASE
	Show details

85	Concept and entity grounding using indirect supervision
	Tsai, Chen-Tse. - 2017
	BASE
	Show details

86	Desambiguación Verbal Automática: un estudio sobre el rendimiento de la información semántica argumental ; Verb Sense Disambiguation: a study about the performance of argumental semantic information
	Priego García, José; Castellón Masalles, Irene. - : Sociedad Española para el Procesamiento del Lenguaje Natural, 2017
	BASE
	Show details

87	Knowledge-driven entity recognition and disambiguation in biomedical text
	Siu, Amy. - 2017
	Abstract: Entity recognition and disambiguation (ERD) for the biomedical domain are notoriously difficult problems due to the variety of entities and their often long names in many variations. Existing works focus heavily on the molecular level in two ways. First, they target scientific literature as the input text genre. Second, they target single, highly specialized entity types such as chemicals, genes, and proteins. However, a wealth of biomedical information is also buried in the vast universe of Web content. In order to fully utilize all the information available, there is a need to tap into Web content as an additional input. Moreover, there is a need to cater for other entity types such as symptoms and risk factors since Web content focuses on consumer health. The goal of this thesis is to investigate ERD methods that are applicable to all entity types in scientific literature as well as Web content. In addition, we focus on under-explored aspects of the biomedical ERD problems -- scalability, long noun phrases, and out-of-knowledge base (OOKB) entities. This thesis makes four main contributions, all of which leverage knowledge in UMLS (Unified Medical Language System), the largest and most authoritative knowledge base (KB) of the biomedical domain. The first contribution is a fast dictionary lookup method for entity recognition that maximizes throughput while balancing the loss of precision and recall. The second contribution is a semantic type classification method targeting common words in long noun phrases. We develop a custom set of semantic types to capture word usages; besides biomedical usage, these types also cope with non-biomedical usage and the case of generic, non-informative usage. The third contribution is a fast heuristics method for entity disambiguation in MEDLINE abstracts, again maximizing throughput but this time maintaining accuracy. The fourth contribution is a corpus-driven entity disambiguation method that addresses OOKB entities. The method first captures the entities expressed in a corpus as latent representations that comprise in-KB and OOKB entities alike before performing entity disambiguation. ; Die Erkennung und Disambiguierung von Entitäten für den biomedizinischen Bereich stellen, wegen der vielfältigen Arten von biomedizinischen Entitäten sowie deren oft langen und variantenreichen Namen, große Herausforderungen dar. Vorhergehende Arbeiten konzentrieren sich in zweierlei Hinsicht fast ausschließlich auf molekulare Entitäten. Erstens fokussieren sie sich auf wissenschaftliche Publikationen als Genre der Eingabetexte. Zweitens fokussieren sie sich auf einzelne, sehr spezialisierte Entitätstypen wie Chemikalien, Gene und Proteine. Allerdings bietet das Internet neben diesen Quellen eine Vielzahl an Inhalten biomedizinischen Wissens, das vernachlässigt wird. Um alle verfügbaren Informationen auszunutzen besteht der Bedarf weitere Internet-Inhalte als zusätzliche Quellen zu erschließen. Außerdem ist es auch erforderlich andere Entitätstypen wie Symptome und Risikofaktoren in Betracht zu ziehen, da diese für zahlreiche Inhalte im Internet, wie zum Beispiel Verbraucherinformationen im Gesundheitssektor, relevant sind. Das Ziel dieser Dissertation ist es, Methoden zur Erkennung und Disambiguierung von Entitäten zu erforschen, die alle Entitätstypen in Betracht ziehen und sowohl auf wissenschaftliche Publikationen als auch auf andere Internet-Inhalte anwendbar sind. Darüber hinaus setzen wir Schwerpunkte auf oft vernachlässigte Aspekte der biomedizinischen Erkennung und Disambiguierung von Entitäten, nämlich Skalierbarkeit, lange Nominalphrasen und fehlende Entitäten in einer Wissensbank. In dieser Hinsicht leistet diese Dissertation vier Hauptbeiträge, denen allen das Wissen von UMLS (Unified Medical Language System), der größten und wichtigsten Wissensbank im biomedizinischen Bereich, zu Grunde liegt. Der erste Beitrag ist eine schnelle Methode zur Erkennung von Entitäten mittels Lexikonabgleich, welche den Durchsatz maximiert und gleichzeitig den Verlust in Genauigkeit und Trefferquote (precision and recall) balanciert. Der zweite Beitrag ist eine Methode zur Klassifizierung der semantischen Typen von Nomen, die sich auf gebräuchliche Nomen von langen Nominalphrasen richtet und auf einer selbstentwickelten Sammlung von semantischen Typen beruht, die die Verwendung der Nomen erfasst. Neben biomedizinischen können diese Typen auch nicht-biomedizinische und allgemeine, informationsarme Verwendungen behandeln. Der dritte Beitrag ist eine schnelle Heuristikmethode zur Disambiguierung von Entitäten in MEDLINE Kurzfassungen, welche den Durchsatz maximiert, aber auch die Genauigkeit erhält. Der vierte Beitrag ist eine korpusgetriebene Methode zur Disambiguierung von Entitäten, die speziell fehlende Entitäten in einer Wissensbank behandelt. Die Methode wandelt erst die Entitäten, die in einem Textkorpus ausgedrückt aber nicht notwendigerweise in einer Wissensbank sind, in latente Darstellungen um und führt anschließend die Disambiguierung durch.
	Keyword: biomedical text mining; Biomedizin; ddc:004; entity disambiguation; entity recognition; Sprachverarbeitung; Wissensbasis
	URL: https://doi.org/10.22028/D291-26790 http://nbn-resolving.org/urn:nbn:de:bsz:291-scidok-69580
	BASE
	Hide details

88	Evaluation of word embedding vector averaging functions for biomedical word sense disambiguation
	Antunes, Rui; Matos, Sérgio. - : UA Editora, 2017
	BASE
	Show details

89	Unsupervised, knowledge-free, and interpretable word sense disambiguation
	Panchenko, A.; Marten, F.; Ruppert, E....
	In: EMNLP 2017 - Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Proceedings (2017)
	BASE
	Show details

90	Domain-specific Named Entity Disambiguation in Historical Memoirs
	Marco, Rovera; Federico, Nanni; Simone Paolo Ponzetto,. - : CEUR, 2017. : country:DEU, 2017. : place:Aachen, 2017
	BASE
	Show details

91	Dalla Word Sense Disambiguation alla sintassi: il problema dell'articolo partitivo in italiano
	Cipolla, E.; Mirto, I.. - : country:IT, 2017
	BASE
	Show details

92	Uso de representaciones vectoriales de las palabras para la detección de dobles sentidos (puns)
	Carrasco Gómez, Pascual Andrés. - : Universitat Politècnica de València, 2017
	BASE
	Show details

93	Approches d'analyse distributionnelle pour améliorer la désambiguïsation sémantique
	Billami, Mokhtar Boumedyen; Gala, Núria
	In: Journées internationales d'Analyse statistique des Données Textuelles (JADT) ; https://hal.archives-ouvertes.fr/hal-01477502 ; Journées internationales d'Analyse statistique des Données Textuelles (JADT), Jun 2016, Nice, France ; https://jadt2016.sciencesconf.org/ (2016)
	BASE
	Show details

94	Ambiguity Diagnosis for Terms in Digital Humanities
	Daille, Béatrice; Jacquey, Evelyne; Lejeune, Gaël...
	In: Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-01423650 ; Language Resources and Evaluation Conference, May 2016, Portorož, Slovenia (2016)
	BASE
	Show details

95	СИНТАКСИЧЕСКАЯ МНОГОЗНАЧНОСТЬ И НЕОДНОЗНАЧНОСТЬ В ПЕРСПЕКТИВЕ МАШИННОГО ПЕРЕВОДА
	КОЗЕРЕНКО ЕЛЕНА БОРИСОВНА. - : Федеральное государственное бюджетное образовательное учреждение высшего образования «Московский педагогический государственный университет», 2016
	BASE
	Show details

96	Building Semantic Trees from XML Documents
	Tekli, Joe; Charbel, Nathalie; Chbeir, Richard
	In: ISSN: 1570-8268 ; Journal of Web Semantics ; https://hal-univ-pau.archives-ouvertes.fr/hal-02079156 ; Journal of Web Semantics, Elsevier, 2016, 37-38, pp.1-24. ⟨10.1016/J.WEBSEM.2016.03.002⟩ (2016)
	BASE
	Show details

97	Semantic Interoperability of Multilingual Lexical Resources in Lexical Linked Data ; Interopérabilité Sémantique Multi-lingue des Ressources Lexicales en Données Liées Ouvertes
	Tchechmedjiev, Andon. - : HAL CCSD, 2016
	In: https://tel.archives-ouvertes.fr/tel-01681358 ; Informatique et langage [cs.CL]. Université Grenoble Alpes, 2016. Français. ⟨NNT : 2016GREAM067⟩ (2016)
	BASE
	Show details

98	Using Domain Ontologies for Classification and Semantic Interpretation of Documents
	Iltache, Samia; Comparot, Catherine; Si-Mohammed, Malik...
	In: Proceedings of ALLDATA 2016 ; International Workshop on Knowledge Extraction and Semantic Annotation (KESA 2016) in ALLDATA 2016 : 2nd International Conference on Big Data, Small Data, Linked Data and Open Data ; https://hal.archives-ouvertes.fr/hal-01535945 ; International Workshop on Knowledge Extraction and Semantic Annotation (KESA 2016) in ALLDATA 2016 : 2nd International Conference on Big Data, Small Data, Linked Data and Open Data, Feb 2016, Lisbon, Portugal. pp. 76-81 (2016)
	BASE
	Show details

99	Automatic processing of Tunisian dialect: construction of linguistic resources ; TRAITEMENT AUTOMATIQUE DU DIALECTE TUNISIEN : CONSTRUCTION DE RESSOURCES LINGUISTIQUES
	Zribi, Inès. - : HAL CCSD, 2016
	In: https://hal.archives-ouvertes.fr/tel-02869866 ; Informatique et langage [cs.CL]. Université de Sfax (Tunisie), 2016. Français (2016)
	BASE
	Show details

100	Prosodic disambiguation and attachment height
	Grillo, N.; turco, giuseppina
	In: Speech Prosody 8 ; https://halshs.archives-ouvertes.fr/halshs-01422841 ; Speech Prosody 8, May 2016, Boston, United States. pp.1176-1180 (2016)
	BASE
	Show details

Page: 1 2 3 4 5 6 7 8 9...89

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern