1 | A Sentence Meaning Based Alignment Method for Parallel Text Corpora Preparation ... | BASE
2 | News Across Languages - Cross-Lingual Document Similarity and Event Tracking ... | BASE
3 | Building Subject-aligned Comparable Corpora and Mining it for Truly Parallel Sentence Pairs ... | BASE
5 | Leveraging Textual Features for Best Answer Prediction in Community-based Question Answering ... | BASE
6 | Applying deep learning techniques on medical corpora from the World Wide Web: a prototypical system and evaluation ... | BASE
Abstract:
BACKGROUND: The amount of biomedical literature is rapidly growing and it is becoming increasingly difficult to keep manually curated knowledge bases and ontologies up-to-date. In this study we applied the word2vec deep learning toolkit to medical corpora to test its potential for identifying relationships from unstructured text. We evaluated the efficiency of word2vec in identifying properties of pharmaceuticals based on mid-sized, unstructured medical text corpora available on the web. Properties included relationships to diseases ('may treat') or physiological processes ('has physiological effect'). We compared the relationships identified by word2vec with manually curated information from the National Drug File - Reference Terminology (NDF-RT) ontology as a gold standard. RESULTS: Our results revealed a maximum accuracy of 49.28%, which suggests a limited ability of word2vec to capture linguistic regularities on the collected medical corpora compared with other published results. We were able to document ...
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences; Information Retrieval cs.IR; Machine Learning cs.LG; Neural and Evolutionary Computing cs.NE
URL: https://arxiv.org/abs/1502.03682 https://dx.doi.org/10.48550/arxiv.1502.03682
7 | Editorial for the First Workshop on Mining Scientific Papers: Computational Linguistics and Bibliometrics ... | BASE
9 | A fully data-driven method to identify (correlated) changes in diachronic corpora ... | BASE
10 | Color Aesthetics and Social Networks in Complete Tang Poems: Explorations and Discoveries ... | BASE
11 | Aspect-based Opinion Summarization with Convolutional Neural Networks ... | BASE
12 | amLite: Amharic Transliteration Using Key Map Dictionary ... | BASE
13 | Approaches for Sentiment Analysis on Twitter: A State-of-Art study ... | BASE
14 | Selecting Relevant Web Trained Concepts for Automated Event Retrieval ... | BASE
15 | Better Summarization Evaluation with Word Embeddings for ROUGE ... | BASE
16 | Extending a Single-Document Summarizer to Multi-Document: a Hierarchical Approach ... | BASE
17 | Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology ... | BASE
18 | Towards Evaluation of Cultural-scale Claims in Light of Topic Model Sampling Effects ... | BASE
19 | Twitter Sentiment Analysis: Lexicon Method, Machine Learning Method and Their Combination ... | BASE