6 |
Media and milieus for complex numbers: An experiment with Maple based text
|
|
|
|
In: CERME 9 - Ninth Congress of the European Society for Research in Mathematics Education, Charles University in Prague, Faculty of Education; ERME, Feb 2015, Prague, Czech Republic, pp. 2131-2137 (2015) ; https://hal.archives-ouvertes.fr/hal-01288593
|
|
BASE
|
|
10 |
An evaluation of POS tagging for tweets using HMM modeling
|
|
|
|
Abstract:
Recently there has been increased demand for natural language processing tools that work well on unstructured and noisy texts such as Twitter messages. It has been shown that tools developed for structured texts do not work well on unstructured texts, and therefore need considerable customization and re-training to achieve the same accuracy on them. This paper presents the results of testing an HMM (Hidden Markov Model) based POS (Part-Of-Speech) tagger customized for unstructured texts. The tagger was trained on Twitter messages from existing publicly available data and customized for abbreviations and named entities common in Tweets. We evaluated the tagger first by training and testing on the same source corpus, and then did cross-validation testing by training on one Twitter corpus and testing on a different Twitter corpus. We also ran similar experiments on the same datasets using a state-of-the-art CRF (Conditional Random Field) based POS tagger customized for Tweets. The results show that the CRF-based POS tagger from GATE performed slightly better than the HMM model at the token level; at the sentence level, however, the performances were approximately the same. An even more intriguing result was that, in the cross-validation experiments, the results of both taggers deteriorated by approximately 25% at the token level and a massive 80% at the sentence level. This suggests vast differences between the two Tweet corpora used and emphasizes the importance of recall values for NLP systems. A detailed analysis of this deterioration is presented, and the trained HMM model, together with the data, has been made available for research purposes.
|
|
Keyword:
HMM POS Tagger; Machine Learning; POS Tagging; Social Media; Twitter
|
|
URL: http://hdl.handle.net/10292/8450
|
|
BASE
|
|
11 |
Decoding the Islamic State: Islamic State Hostage Videos and State Formation
|
|
|
|
In: Senior Projects Spring 2015 (2015)
|
|
BASE
|
|
12 |
Machine Intelligence for Health Information: Capturing Concepts & Trends in Social Media via Query Expansion
|
|
|
|
In: Machine Intelligence for Health Information: Capturing Concepts & Trends in Social Media via Query Expansion (2015)
|
|
BASE
|
|
13 |
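Multimodale Kommunikation im Social Web : Forschungsansätze und Analysen zu Text-Bild-Relationen [Multimodal communication on the social web: research approaches and analyses of text-image relations]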
|
|
Siever, Christina Margrit (author). - Frankfurt am Main ; Bern ; Bruxelles ; New York ; Oxford ; Warszawa ; Wien : Peter Lang Edition, 2015
|
|
IDS Mannheim
|
|
14 |
Корпоративный микроблогинг в обучении немецкому языку для специальных целей [Corporate microblogging in teaching German for specific purposes]
|
|
Morozova, Maya Andreevna. - : Federal State Autonomous Educational Institution of Higher Professional Education "Northern (Arctic) Federal University named after M.V. Lomonosov", 2015
|
|
BASE
|
|
16 |
МЕТАЛИНГВИСТИЧЕСКИЕ ОСОБЕННОСТИ МЕДИАШУМА В СОЦИАЛЬНЫХ МЕДИА [Metalinguistic features of media noise in social media]
|
|
|
|
BASE
|
|
17 |
КОНСТРУИРОВАНИЕ МЕЖНАЦИОНАЛЬНЫХ ОТНОШЕНИЙ В СМИ: СПЕЦИФИКА РЕПРЕЗЕНТАЦИЙ [Constructing interethnic relations in the mass media: specifics of representations]
|
|
Dubrovskaya, T.V.; Kozhemyakin, E.A. - : Federal State Autonomous Educational Institution of Higher Professional Education "Belgorod State National Research University", 2015
|
|
BASE
|
|
19 |
The meaning and making of union delegate networks
|
|
|
|
In: Peetz, D, Murray, G, Muurlink, O & May, M 2015, 'The meaning and making of union delegate networks', The Economic and Labour Relations Review, vol. 26, no. 4, pp. 596-613, http://dx.doi.org/10.1177/1035304615614717 (2015)
|
|
BASE
|
|
20 |
Minority language Twitter: part-of-speech tagging and analysis of Irish Tweets
|
|
|
|
In: Lynn, Teresa, Scannell, Kevin and Maguire, Eimear (2015) Minority language Twitter: part-of-speech tagging and analysis of Irish Tweets. In: ACL 2015 Workshop on Noisy User-generated Text 2015 (W-NUT), 31 July 2015, Beijing, China. (2015)
|
|
BASE
|
|