Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5...64

Hits 1 – 20 of 1.272

1	VivesDebate: A New Annotated Multilingual Corpus of Argumentation in a Debate Tournament ...
	Ruiz-Dolz, Ramon; Nofre, Montserrat; Taulé, Mariona. - : Zenodo, 2022
	BASE
	Show details

2	VivesDebate: A New Annotated Multilingual Corpus of Argumentation in a Debate Tournament ...
	Ruiz-Dolz, Ramon; Nofre, Montserrat; Taulé, Mariona. - : Zenodo, 2022
	BASE
	Show details

3	DanFEVER: claim verification dataset for Danish ...
	Nørregaard, Jeppe; Derczynski, Leon. - : figshare, 2022
	BASE
	Show details

4	VivesDebate: A New Annotated Multilingual Corpus of Argumentation in a Debate Tournament ...
	Ruiz-Dolz, Ramon; Nofre, Montserrat; Taulé, Mariona. - : Zenodo, 2022
	BASE
	Show details

5	DanFEVER: claim verification dataset for Danish ...
	Nørregaard, Jeppe; Derczynski, Leon. - : figshare, 2022
	BASE
	Show details

6	A Corpus-Based Sentence Classifier for Entity–Relationship Modelling
	Sabrina Šuman; Sanja Čandrlić; Alen Jakupović
	In: Electronics; Volume 11; Issue 6; Pages: 889 (2022)
	BASE
	Show details

7	Text Mining from Free Unstructured Text: An Experiment of Time Series Retrieval for Volcano Monitoring
	Margherita Berardi; Luigi Santamaria Amato; Francesca Cigna; Deodato Tapete; Mario Siciliani de Cumis
	In: Applied Sciences; Volume 12; Issue 7; Pages: 3503 (2022)
	BASE
	Show details

8	Transformer-Based Abstractive Summarization for Reddit and Twitter: Single Posts vs. Comment Pools in Three Languages
	Ivan S. Blekanov; Nikita Tarasov; Svetlana S. Bodrunova
	In: Future Internet; Volume 14; Issue 3; Pages: 69 (2022)
	BASE
	Show details

9	Using Conceptual Recurrence and Consistency Metrics for Topic Segmentation in Debate
	Jaejong Ho; Hyoji Ha; Seok-Won Lee; Kyungwon Lee
	In: Applied Sciences; Volume 12; Issue 6; Pages: 2952 (2022)
	BASE
	Show details

10	Capability Language Processing (CLP): Classification and Ranking of Manufacturing Suppliers Based on Unstructured Capability Data
	Zandbiglari, Kimia. - 2022
	BASE
	Show details

11	StaResGRU-CNN with CMedLMs: a stacked residual GRU-CNN with pre-trained biomedical language models for predictive intelligence
	Ni, Pin; Li, Gangmin; Hung, Patrick C.K.. - : Elsevier Ltd, 2022
	BASE
	Show details

12	Machine Learning approaches for Topic and Sentiment Analysis in multilingual opinions and low-resource languages: From English to Guarani
	Agüero Torales, Marvin Matías. - : Universidad de Granada, 2022
	Abstract: This dissertation has focused on the study of machine learning techniques for sentiment analysis and topic modeling in texts from social media. It puts a special emphasis on approaches and methods for handling low-resource languages, i.e., languages lacking large monolingual or parallel corpora and/or manually elaborated linguistic resources sufficient for building Natural Language Processing (NLP) applications; and the implementation of these approaches and methods to multilingual scenarios, such code-switching (i.e., alternating between two or more languages or varieties of language in a phrase or word). First, we presented a data science workflow to perform machine learning models for social media texts written in low-resource languages, even if these suffer code-switching. The workflow proposed is able to handle different difficulties for the purpose at hand (such as, for example, web text collection, dealing with unbalanced classes, or implementing crosslingual models). In the following, we described how to build machine learning models to perform topic modeling with large data coming from social media with short texts written in Spanish, as well as a number of sentiment analysis related tasks for Guarani (a South American indigenous language) and Jopara (i.e., Guarani-Spanish mixture), namely polarity classification, emotion recognition, humor detection, and offensive and toxic language identification. Emphasis was also placed on noisy and short texts coming from social media. Experiments with the corpora created and the evaluation of the machine learning models built, show the robustness of the approaches and methods proposed in this dissertation, in monolingual, multilingual, and code-switching settings. The contributions presented in this dissertation may be useful both for the Spanishspeaking community and the Guarani-speaking community. There are many use cases in different areas and disciplines that can benefit from the insights created by the approaches we presented in this thesis. Therefore, there are a number of possible applications for the democratization of low-resource languages, such as the ability to perform less biased monitoring of social networks in multilingual environments or the capacity to extract automatically the knowledge available in non-dominant languages. ; Esta tesis se ha centrado en el estudio de técnicas de aprendizaje automático para el análisis de sentimientos y el modelado de temas en textos procedentes de medios sociales. Se ha puesto un énfasis especial en los enfoques y métodos para el manejo de idiomas conocidos como low-resource, es decir, lenguas que carecen de grandes corpus monolingües o paralelos y/o de recursos lingüísticos elaborados manualmente suficientes para construir aplicaciones de Procesamiento del Lenguaje Natural (PLN); y en la aplicación de estos enfoques y métodos en escenarios multilingües, como el code-switching (es decir, alternar dos o más lenguas o variedades lingüísticas en una frase o palabra). Por un lado, introdujimos un flujo de trabajo de ciencia de datos para llevar a cabo modelos de aprendizaje automático para textos provenientes de medios sociales y escritos en idiomas low-resource, incluso si estos presentan code-switching. Este flujo de trabajo es capaz de lidiar con diferentes dificultades presentes en este ámbito (como, por ejemplo, la colección de texto en la web, el tratamiento de clases con ejemplos desequilibrados o la implementación de modelos multilingües). Por otra parte, describimos cómo construir modelos de aprendizaje automático para realizar modelado de temas con datos masivos provenientes de medios sociales, con textos cortos en español, así como una serie de tareas para el análisis de sentimientos en guaraní (una lengua indígena sudamericana) y jopara (es decir, la mezcla del guaraní con el español), a saber, la clasificación de polaridad, el reconocimiento de emociones, la detección de humor y la identificación de lenguaje ofensivo y toxico, también con énfasis en los textos cortos y gramaticalmente pobres provenientes de las redes sociales. Los experimentos con los corpus creados y la evaluación de los modelos de aprendizaje automático construidos, muestran la robustez de los enfoques y métodos propuestos en esta tesis, tanto en entornos monolingües y multilingües, como de code-switching. Las aportaciones presentadas en esta tesis pueden ser útiles tanto para la comunidad hispanohablante como para la comunidad guaraní-hablante. Hay muchos casos de uso en diferentes áreas y disciplinas que pueden beneficiarse de las ideas creadas por los enfoques que proponemos aquí. Por lo tanto, existen una serie de posibles aplicaciones para la democratización de las lenguas de bajo recursos, como la capacidad de realizar un seguimiento menos sesgado de las redes sociales en entornos multilingües o la capacidad de extraer automáticamente el conocimiento disponible en los idiomas no dominantes. ; Tesis Univ. Granada. ; Partially funded by Barcelona Supercomputing Center (BSC) through the Spanish Plan for advancement of Language Technologies ‘Plan TL’ and the Secretar´ıa de Estado de Digitalizaci ´on e Inteligencia Artificial (SEDIA).
	Keyword: Aprendizaje automático (Inteligencia artificial); Aprendizaje profundo; Code-switching; deep learning; Idiomas con escasez de recursos; Low-resource languages; Machine learning; Minería de textos; Natural language processing; Procesamiento de lenguaje natural; Text mining
	URL: http://hdl.handle.net/10481/72863
	BASE
	Hide details

13	Dynamics of prescriptivism and lexical borrowings in Contemporary French
	Zsombok, Gyula. - 2022
	BASE
	Show details

14	CorpusExplorer ; Eine Software zur korpuspragmatischen Analyse
	Rüdiger, Jan Oliver. - 2022
	BASE
	Show details

15	Deceptive Opinions Detection Using New Proposed Arabic Semantic Features
	Ziani, Amel; Azizi, Nabiha; Schwab, Didier...
	In: ISSN: 1877-0509 ; EISSN: 1877-0509 ; Procedia Computer Science ; https://hal.archives-ouvertes.fr/hal-03299022 ; Procedia Computer Science, Elsevier, 2021, 189, pp.29 - 36. ⟨10.1016/j.procs.2021.05.067⟩ (2021)
	BASE
	Show details

16	An Approach Utilizing Linguistic Features for Fake News Detection
	Kasseropoulos, Dimitrios,; Tjortjis, Christos
	In: IFIP Advances in Information and Communication Technology ; 17th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI) ; https://hal.inria.fr/hal-03287679 ; 17th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), Jun 2021, Hersonissos, Crete, Greece. pp.646-658, ⟨10.1007/978-3-030-79150-6_51⟩ (2021)
	BASE
	Show details

17	Sentiment Analysis of Arabic Documents
	Rahab, Hichem; Djoudi, Mahieddine; Zitouni, Abdelhafid
	In: Natural Language Processing for Global and Local Business ; https://hal.archives-ouvertes.fr/hal-03124729 ; Fatih Pinarbasi; M. Nurdan Taskiran. Natural Language Processing for Global and Local Business, pp.307-331, 2021, 9781799842408. ⟨10.4018/978-1-7998-4240-8.ch013⟩ ; https://www.igi-global.com/ (2021)
	BASE
	Show details

18	The Role of Inferences in Opinion Mining : Applications to Chinese Social Media ; Le rôle des inférences pour la fouille d'opinion : applications aux réseaux sociaux en langue chinoise
	Yan, Liyun. - : HAL CCSD, 2021
	In: https://tel.archives-ouvertes.fr/tel-03469568 ; Linguistique. Institut National des Langues et Civilisations Orientales- INALCO PARIS - LANGUES O', 2021. Français. ⟨NNT : 2021INAL0016⟩ (2021)
	BASE
	Show details

19	The Machine in the Garden of Meter and Rythm
	Beaudouin, Valérie
	In: Plotting Poetry. On Mechanically-Enhanced Reading ; https://hal.telecom-paris.fr/hal-03255491 ; Bories, Anne-Sophie ; Purnelle, Gérald ; Marchal, Hugues. Plotting Poetry. On Mechanically-Enhanced Reading, Presses universitaires de Liège, 2021, 978-2-87562-280-8 (2021)
	BASE
	Show details

20	On Multi-domain Sentence Level Sentiment Analysis for Roman Urdu ...
	Mehmood, Khawar. - : UNSW Sydney, 2021
	BASE
	Show details

Page: 1 2 3 4 5...64

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern