1 |
Between History and Natural Language Processing: Study, Enrichment and Online Publication of French Parliamentary Debates of the Early Third Republic (1881-1899)
|
|
|
|
In: ParlaCLARIN III at LREC2022 - Workshop on Creating, Enriching and Using Parliamentary Corpora ; https://hal.archives-ouvertes.fr/hal-03623351 ; ParlaCLARIN III at LREC2022 - Workshop on Creating, Enriching and Using Parliamentary Corpora, Jun 2022, Marseille, France ; https://www.clarin.eu/ParlaCLARIN-III (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Chinese-Uyghur Bilingual Lexicon Extraction Based on Weak Supervision
|
|
|
|
In: Information; Volume 13; Issue 4; Pages: 175 (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Investigating the Efficient Use of Word Embedding with Neural-Topic Models for Interpretable Topics from Short Texts
|
|
|
|
In: Sensors; Volume 22; Issue 3; Pages: 852 (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Analysis of the Effects of Lockdown on Staff and Students at Universities in Spain and Colombia Using Natural Language Processing Techniques
|
|
|
|
In: International Journal of Environmental Research and Public Health; Volume 19; Issue 9; Pages: 5705 (2022)
|
|
BASE
|
|
Show details
|
|
5 |
An Enhanced Neural Word Embedding Model for Transfer Learning
|
|
|
|
In: Applied Sciences; Volume 12; Issue 6; Pages: 2848 (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Deep Sentiment Analysis Using CNN-LSTM Architecture of English and Roman Urdu Text Shared in Social Media
|
|
|
|
In: Applied Sciences; Volume 12; Issue 5; Pages: 2694 (2022)
|
|
BASE
|
|
Show details
|
|
7 |
Predicting Academic Performance: Analysis of Students’ Mental Health Condition from Social Media Interactions
|
|
|
|
In: Behavioral Sciences; Volume 12; Issue 4; Pages: 87 (2022)
|
|
BASE
|
|
Show details
|
|
8 |
Vec2Dynamics: A Temporal Word Embedding Approach to Exploring the Dynamics of Scientific Keywords—Machine Learning as a Case Study
|
|
|
|
In: Big Data and Cognitive Computing; Volume 6; Issue 1; Pages: 21 (2022)
|
|
BASE
|
|
Show details
|
|
9 |
Methods, Models and Tools for Improving the Quality of Textual Annotations
|
|
|
|
In: Modelling; Volume 3; Issue 2; Pages: 224-242 (2022)
|
|
BASE
|
|
Show details
|
|
10 |
Creating multi-scripts sentiment analysis lexicons for Algerian, Moroccan and Tunisian dialects
|
|
|
|
In: 7th International Conference on Data Mining (DTMN 2021) Computer Science Conference Proceedings in Computer Science & Information Technology (CS & IT) ; https://hal.archives-ouvertes.fr/hal-03308111 ; 7th International Conference on Data Mining (DTMN 2021) Computer Science Conference Proceedings in Computer Science & Information Technology (CS & IT), Sep 2021, Copenhagen, Denmark (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Bilingual English-German word embedding models for scientific text ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Bilingual English-German word embedding models for scientific text ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
以《Cofacts 真的假的》資料庫為基礎建立中文科學假訊息之探勘模型 ; Text Mining Model for Detecting Chinese Fake Scientific Messages based on Cofacts Open Data
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Text ranking based on semantic meaning of sentences ; Textrankning baserad på semantisk betydelse hos meningar
|
|
|
|
Abstract:
Finding a suitable candidate to client match is an important part of consultant companies work. It takes a lot of time and effort for the recruiters at the company to read possibly hundreds of resumes to find a suitable candidate. Natural language processing is capable of performing a ranking task where the goal is to rank the resumes with the most suitable candidates ranked the highest. This ensures that the recruiters are only required to look at the top ranked resumes and can quickly get candidates out in the field. Former research has used methods that count specific keywords in resumes and can make decisions on whether a candidate has an experience or not. The main goal of this thesis is to use the semantic meaning of the text in the resumes to get a deeper understanding of a candidate’s level of experience. It also evaluates if the model is possible to run on-device and if the database can contain a mix of English and Swedish resumes. An algorithm was created that uses the word embedding model DistilRoBERTa that is capable of capturing the semantic meaning of text. The algorithm was evaluated by generating job descriptions from the resumes by creating a summary of each resume. The run time, memory usage and the ranking the wanted candidate achieved was documented and used to analyze the results. When the candidate who was used to generate the job description is ranked in the top 10 the classification was considered to be correct. The accuracy was calculated using this method and an accuracy of 68.3% was achieved. The results show that the algorithm is capable of ranking resumes. The algorithm is able to rank both Swedish and English resumes with an accuracy of 67.7% for Swedish resumes and 74.7% for English. The run time was fast enough at an average of 578 ms but the memory usage was too large to make it possible to use the algorithm on-device. In conclusion the semantic meaning of resumes can be used to rank resumes and possible future work would be to combine this method with a method that counts keywords to research if the accuracy would increase. ; Att hitta en lämplig kandidat till kundmatchning är en viktig del av ett konsultföretags arbete. Det tar mycket tid och ansträngning för rekryterare på företaget att läsa eventuellt hundratals CV:n för att hitta en lämplig kandidat. Det finns språkteknologiska metoder för att rangordna CV:n med de mest lämpliga kandidaterna rankade högst. Detta säkerställer att rekryterare endast behöver titta på de topprankade CV:erna och snabbt kan få kandidater ut i fältet. Tidigare forskning har använt metoder som räknar specifika nyckelord i ett CV och är kapabla att avgöra om en kandidat har specifika erfarenheter. Huvudmålet med denna avhandling är att använda den semantiska innebörden av texten iCV:n för att få en djupare förståelse för en kandidats erfarenhetsnivå. Den utvärderar också om modellen kan köras på mobila enheter och om algoritmen kan rangordna CV:n oberoende av om CV:erna är på svenska eller engelska. En algoritm skapades som använder ordinbäddningsmodellen DistilRoBERTa som är kapabel att fånga textens semantiska betydelse. Algoritmen utvärderades genom att generera jobbeskrivningar från CV:n genom att skapa en sammanfattning av varje CV. Körtiden, minnesanvändningen och rankningen som den önskade kandidaten fick dokumenterades och användes för att analysera resultatet. När den kandidat som användes för att generera jobbeskrivningen rankades i topp 10 ansågs klassificeringen vara korrekt. Noggrannheten beräknades med denna metod och en noggrannhet på 68,3 % uppnåddes. Resultaten visar att algoritmen kan rangordna CV:n. Algoritmen kan rangordna både svenska och engelska CV:n med en noggrannhet på 67,7 % för svenska och 74,7 % för engelska. Körtiden var i genomsnitt 578 ms vilket skulle möjliggöra att algoritmen kan köras på mobila enheter men minnesanvändningen var för stor. Sammanfattningsvis kan den semantiska betydelsen av CV:n användas för att rangordna CV:n och ett eventuellt framtida arbete är att kombinera denna metod med en metod som räknar nyckelord för att undersöka hur noggrannheten skulle påverkas.
|
|
Keyword:
Computer Sciences; CV rankning; Datavetenskap (datalogi); Natural language processing; Ordinbäddning; Resume Ranking; Semantic meaning; Semantisk betydelse; Språkteknologi; Word Embedding
|
|
URL: http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-300442
|
|
BASE
|
|
Hide details
|
|
19 |
Efficient Estimate of Low-Frequency Words’ Embeddings Based on the Dictionary: A Case Study on Chinese
|
|
|
|
In: Applied Sciences ; Volume 11 ; Issue 22 (2021)
|
|
BASE
|
|
Show details
|
|
20 |
Acoustic Word Embeddings for End-to-End Speech Synthesis
|
|
|
|
In: Applied Sciences ; Volume 11 ; Issue 19 (2021)
|
|
BASE
|
|
Show details
|
|
|
|