DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...51
Hits 1 – 20 of 1.020

1
Source Code for Youtube dataset processing ...
TURENNE, Nicolas. - : Zenodo, 2022
BASE
Show details
2
Source Code for Youtube dataset processing ...
TURENNE, Nicolas. - : Zenodo, 2022
BASE
Show details
3
A Quantum Language-Inspired Tree Structural Text Representation for Semantic Analysis
In: Mathematics; Volume 10; Issue 6; Pages: 914 (2022)
BASE
Show details
4
An Unsupervised Approach to Structuring and Analyzing Repetitive Semantic Structures in Free Text of Electronic Medical Records
In: Journal of Personalized Medicine; Volume 12; Issue 1; Pages: 25 (2022)
BASE
Show details
5
Text Mining from Free Unstructured Text: An Experiment of Time Series Retrieval for Volcano Monitoring
In: Applied Sciences; Volume 12; Issue 7; Pages: 3503 (2022)
BASE
Show details
6
Detection of Chinese Deceptive Reviews Based on Pre-Trained Language Model
In: Applied Sciences; Volume 12; Issue 7; Pages: 3338 (2022)
BASE
Show details
7
Automated Customer Complaint Processing for Water Utilities Based on Natural Language Processing—Case Study of a Dutch Water Utility
In: Water; Volume 14; Issue 4; Pages: 674 (2022)
BASE
Show details
8
MLLP-VRAIN Spanish ASR Systems for the Albayzín-RTVE 2020 Speech-to-Text Challenge: Extension
In: Applied Sciences; Volume 12; Issue 2; Pages: 804 (2022)
BASE
Show details
9
Transformer-Based Abstractive Summarization for Reddit and Twitter: Single Posts vs. Comment Pools in Three Languages
In: Future Internet; Volume 14; Issue 3; Pages: 69 (2022)
BASE
Show details
10
Using Conceptual Recurrence and Consistency Metrics for Topic Segmentation in Debate
In: Applied Sciences; Volume 12; Issue 6; Pages: 2952 (2022)
BASE
Show details
11
Connecting Text Classification with Image Classification: A New Preprocessing Method for Implicit Sentiment Text Classification
In: Sensors; Volume 22; Issue 5; Pages: 1899 (2022)
BASE
Show details
12
Predicting Institution Outcomes for Inter Partes Review (IPR) Proceedings at the United States Patent Trial & Appeal Board by Deep Learning of Patent Owner Preliminary Response Briefs
In: Applied Sciences; Volume 12; Issue 7; Pages: 3656 (2022)
BASE
Show details
13
Analysis of the Effects of Lockdown on Staff and Students at Universities in Spain and Colombia Using Natural Language Processing Techniques
In: International Journal of Environmental Research and Public Health; Volume 19; Issue 9; Pages: 5705 (2022)
BASE
Show details
14
FedQAS: Privacy-Aware Machine Reading Comprehension with Federated Learning
In: Applied Sciences; Volume 12; Issue 6; Pages: 3130 (2022)
BASE
Show details
15
A Dynamic Attention and Multi-Strategy-Matching Neural Network Based on Bert for Chinese Rice-Related Answer Selection
In: Agriculture; Volume 12; Issue 2; Pages: 176 (2022)
BASE
Show details
16
Correcting Diacritics and Typos with a ByT5 Transformer Model
In: Applied Sciences; Volume 12; Issue 5; Pages: 2636 (2022)
Abstract: Due to the fast pace of life and online communications and the prevalence of English and the QWERTY keyboard, people tend to forgo using diacritics, make typographical errors (typos) when typing in other languages. Restoring diacritics and correcting spelling is important for proper language use and the disambiguation of texts for both humans and downstream algorithms. However, both of these problems are typically addressed separately: the state-of-the-art diacritics restoration methods do not tolerate other typos, but classical spellcheckers also cannot deal adequately with all the diacritics missing.In this work, we tackle both problems at once by employing the newly-developed universal ByT5 byte-level seq2seq transformer model that requires no language-specific model structures. For a comparison, we perform diacritics restoration on benchmark datasets of 12 languages, with the addition of Lithuanian. The experimental investigation proves that our approach is able to achieve results (>98%) comparable to the previous state-of-the-art, despite being trained less and on fewer data. Our approach is also able to restore diacritics in words not seen during training with >76% accuracy. Our simultaneous diacritics restoration and typos correction approach reaches >94% alpha-word accuracy on the 13 languages. It has no direct competitors and strongly outperforms classical spell-checking or dictionary-based approaches. We also demonstrate all the accuracies to further improve with more training. Taken together, this shows the great real-world application potential of our suggested methods to more data, languages, and error classes.
Keyword: ByT5; diacritics restoration; natural language processing; QWERTY; transformer models; typo correction
URL: https://doi.org/10.3390/app12052636
BASE
Hide details
17
The Competitive Advantage of the Indian and Korean Film Industries: An Empirical Analysis Using Natural Language Processing Methods
In: Applied Sciences; Volume 12; Issue 9; Pages: 4592 (2022)
BASE
Show details
18
eHealth Engagement on Facebook during COVID-19: Simplistic Computational Data Analysis
In: International Journal of Environmental Research and Public Health; Volume 19; Issue 8; Pages: 4615 (2022)
BASE
Show details
19
Cross-Lingual Transfer Learning for Arabic Task-Oriented Dialogue Systems Using Multilingual Transformer Model mT5
In: Mathematics; Volume 10; Issue 5; Pages: 746 (2022)
BASE
Show details
20
Measuring Gender Bias in Contextualized Embeddings
In: Computer Sciences & Mathematics Forum; Volume 3; Issue 1; Pages: 3 (2022)
BASE
Show details

Page: 1 2 3 4 5...51

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1.020
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern