1 |
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
|
|
|
|
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Capability Language Processing (CLP): Classification and Ranking of Manufacturing Suppliers Based on Unstructured Capability Data
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Usage-Based Contact Linguistics : Effects of Frequency and Similarity in Language Contact
|
|
|
|
BASE
|
|
Show details
|
|
6 |
TSM: Measuring the Enticement of Honeyfiles with Natural Language Processing
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Automatic Detection of Plagiarism in Writing
|
|
|
|
In: Studies in Applied Linguistics & TESOL, Vol 21, Iss 2 (2022) (2022)
|
|
Abstract:
This paper reports on preliminary steps to create an external plagiarism detection tool. I used the PAN-PC-11 data sets and extracted tf-idf scores of text documents and cosine similarity measures between source and suspicious documents to find text overlap. The model was able to successfully create vectors and measure the similarity metrics. However, the algorithm was not extended further to automatically retrieve related documents to follow on the pipeline (converting texts to n-grams for detailed analysis and revealing the best match as a source of plagiarism and evaluating the accuracy of the model). The model produced a matrix of cosine similarity for all the documents, which I used to manually retrieve documents and check for overlap using online tools. While extending the algorithm based on the suggested pipeline would allow for a more accurate evaluation of the model, manual comparison of sample documents provided some validity of the model developed for the present study.
|
|
Keyword:
cosine similarity; English language; LB5-3640; PE1-3729; plagiarism detection; similarity metrics; Theory and practice of education
|
|
URL: https://doaj.org/article/cbbb75eb45944064831019c46041c062
|
|
BASE
|
|
Hide details
|
|
8 |
Automatic Detection of Plagiarism in Writing
|
|
|
|
In: Studies in Applied Linguistics & TESOL, Vol 21, Iss 2 (2022) (2022)
|
|
BASE
|
|
Show details
|
|
9 |
Easy-to-use combination of POS and BERT model for domain-specific and misspelled terms
|
|
|
|
In: NL4IA Workshop Proceedings ; https://hal.archives-ouvertes.fr/hal-03474696 ; NL4IA Workshop Proceedings, Nov 2021, Milan, Italy (2021)
|
|
BASE
|
|
Show details
|
|
10 |
Investigating the impact of preprocessing on document embedding: an empirical comparison
|
|
|
|
In: ISSN: 1759-1163 ; EISSN: 1759-1171 ; International Journal of Data Mining, Modelling and Management ; https://hal.inrae.fr/hal-03574696 ; International Journal of Data Mining, Modelling and Management, Inderscience, 2021, 13 (4), pp.351-363 (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Avoiding gender ambiguous pronouns in French
|
|
|
|
In: ISSN: 0010-0277 ; EISSN: 1873-7838 ; Cognition ; https://hal.archives-ouvertes.fr/hal-03374279 ; Cognition, Elsevier, 2021 (2021)
|
|
BASE
|
|
Show details
|
|
13 |
Korpuslinguistik in der Rechtswissenschaft. Eine webbasierte Analyseplattform für EuGH-Entscheidungen ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Effects of similarity on speakers’ production of referring expressions ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
The Influence of Cross-Linguistic Similarity and Language Background on Writing to Dictation
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Architecture design of a reinforcement environment for learning sign languages
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Usage-Based Contact Linguistics : Effects of Frequency and Similarity in Language Contact
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Effects of similarity on speakers’ production of referring expressions
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Semantic Oppositeness for Inconsistency and Disagreement Detection in Natural Language
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Costs and Benefits of Native Language Similarity for Non-native Word Learning
|
|
|
|
BASE
|
|
Show details
|
|
|
|