Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5...8

Hits 1 – 20 of 158

1	Between History and Natural Language Processing: Study, Enrichment and Online Publication of French Parliamentary Debates of the Early Third Republic (1881-1899)
	Puren, Marie; Bourgeois, Nicolas; Pellet, Aurélien...
	In: ParlaCLARIN III at LREC2022 - Workshop on Creating, Enriching and Using Parliamentary Corpora ; https://hal.archives-ouvertes.fr/hal-03623351 ; ParlaCLARIN III at LREC2022 - Workshop on Creating, Enriching and Using Parliamentary Corpora, Jun 2022, Marseille, France ; https://www.clarin.eu/ParlaCLARIN-III (2022)
	BASE
	Show details

2	Chinese-Uyghur Bilingual Lexicon Extraction Based on Weak Supervision
	Anwar Aysa; Mijit Ablimit; Hankiz Yilahun; Askar Hamdulla
	In: Information; Volume 13; Issue 4; Pages: 175 (2022)
	BASE
	Show details

3	Investigating the Efficient Use of Word Embedding with Neural-Topic Models for Interpretable Topics from Short Texts
	Riki Murakami; Basabi Chakraborty
	In: Sensors; Volume 22; Issue 3; Pages: 852 (2022)
	BASE
	Show details

4	Analysis of the Effects of Lockdown on Staff and Students at Universities in Spain and Colombia Using Natural Language Processing Techniques
	Mario Jojoa; Begonya Garcia-Zapirain; Marino J. Gonzalez; Bernardo Perez-Villa; Elena Urizar; Sara Ponce; Maria Tobar
	In: International Journal of Environmental Research and Public Health; Volume 19; Issue 9; Pages: 5705 (2022)
	BASE
	Show details

5	An Enhanced Neural Word Embedding Model for Transfer Learning
	Md. Kowsher; Md. Shohanur Islam Sobuj; Md. Fahim Shahriar; Nusrat Jahan Prottasha; Mohammad Shamsul Arefin; Pranab Kumar Dhar; Takeshi Koshiba
	In: Applied Sciences; Volume 12; Issue 6; Pages: 2848 (2022)
	BASE
	Show details

6	Deep Sentiment Analysis Using CNN-LSTM Architecture of English and Roman Urdu Text Shared in Social Media
	Lal Khan; Ammar Amjad; Kanwar Muhammad Afaq; Hsien-Tsung Chang
	In: Applied Sciences; Volume 12; Issue 5; Pages: 2694 (2022)
	BASE
	Show details

7	Predicting Academic Performance: Analysis of Students’ Mental Health Condition from Social Media Interactions
	Md. Saddam Hossain Mukta; Salekul Islam; Swakkhar Shatabda; Mohammed Eunus Ali; Akib Zaman
	In: Behavioral Sciences; Volume 12; Issue 4; Pages: 87 (2022)
	BASE
	Show details

8	Vec2Dynamics: A Temporal Word Embedding Approach to Exploring the Dynamics of Scientific Keywords—Machine Learning as a Case Study
	Amna Dridi; Mohamed Medhat Gaber; Raja Muhammad Atif Azad; Jagdev Bhogal
	In: Big Data and Cognitive Computing; Volume 6; Issue 1; Pages: 21 (2022)
	BASE
	Show details

9	Methods, Models and Tools for Improving the Quality of Textual Annotations
	Maria Teresa Artese; Isabella Gagliardi
	In: Modelling; Volume 3; Issue 2; Pages: 224-242 (2022)
	BASE
	Show details

10	Creating multi-scripts sentiment analysis lexicons for Algerian, Moroccan and Tunisian dialects
	Abidi, Karima; Smaïli, Kamel
	In: 7th International Conference on Data Mining (DTMN 2021) Computer Science Conference Proceedings in Computer Science & Information Technology (CS & IT) ; https://hal.archives-ouvertes.fr/hal-03308111 ; 7th International Conference on Data Mining (DTMN 2021) Computer Science Conference Proceedings in Computer Science & Information Technology (CS & IT), Sep 2021, Copenhagen, Denmark (2021)
	BASE
	Show details

11	Bilingual English-German word embedding models for scientific text ...
	Donner, Paul. - : Zenodo, 2021
	BASE
	Show details

12	Bilingual English-German word embedding models for scientific text ...
	Donner, Paul. - : Zenodo, 2021
	BASE
	Show details

13	以《Cofacts 真的假的》資料庫為基礎建立中文科學假訊息之探勘模型 ; Text Mining Model for Detecting Chinese Fake Scientific Messages based on Cofacts Open Data
	許博竣; XU, BO-JUN. - 2021
	BASE
	Show details

14	Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions ...
	Yitagesu, Sofonias; Zhang, Xiaowang; Feng, Zhiyong. - : Zenodo, 2021
	BASE
	Show details

15	Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions ...
	Yitagesu, Sofonias; Zhang, Xiaowang; Feng, Zhiyong. - : Zenodo, 2021
	BASE
	Show details

16	WELFake dataset for fake news detection in text data ...
	Verma, Pawan Kumar; Agrawal, Prateek; Prodan, Radu. - : Zenodo, 2021
	BASE
	Show details

17	WELFake dataset for fake news detection in text data ...
	Verma, Pawan Kumar; Agrawal, Prateek; Prodan, Radu. - : Zenodo, 2021
	BASE
	Show details

18	Text ranking based on semantic meaning of sentences ; Textrankning baserad på semantisk betydelse hos meningar
	Stigeborn, Olivia. - : KTH, Skolan för elektroteknik och datavetenskap (EECS), 2021
	Abstract: Finding a suitable candidate to client match is an important part of consultant companies work. It takes a lot of time and effort for the recruiters at the company to read possibly hundreds of resumes to find a suitable candidate. Natural language processing is capable of performing a ranking task where the goal is to rank the resumes with the most suitable candidates ranked the highest. This ensures that the recruiters are only required to look at the top ranked resumes and can quickly get candidates out in the field. Former research has used methods that count specific keywords in resumes and can make decisions on whether a candidate has an experience or not. The main goal of this thesis is to use the semantic meaning of the text in the resumes to get a deeper understanding of a candidate’s level of experience. It also evaluates if the model is possible to run on-device and if the database can contain a mix of English and Swedish resumes. An algorithm was created that uses the word embedding model DistilRoBERTa that is capable of capturing the semantic meaning of text. The algorithm was evaluated by generating job descriptions from the resumes by creating a summary of each resume. The run time, memory usage and the ranking the wanted candidate achieved was documented and used to analyze the results. When the candidate who was used to generate the job description is ranked in the top 10 the classification was considered to be correct. The accuracy was calculated using this method and an accuracy of 68.3% was achieved. The results show that the algorithm is capable of ranking resumes. The algorithm is able to rank both Swedish and English resumes with an accuracy of 67.7% for Swedish resumes and 74.7% for English. The run time was fast enough at an average of 578 ms but the memory usage was too large to make it possible to use the algorithm on-device. In conclusion the semantic meaning of resumes can be used to rank resumes and possible future work would be to combine this method with a method that counts keywords to research if the accuracy would increase. ; Att hitta en lämplig kandidat till kundmatchning är en viktig del av ett konsultföretags arbete. Det tar mycket tid och ansträngning för rekryterare på företaget att läsa eventuellt hundratals CV:n för att hitta en lämplig kandidat. Det finns språkteknologiska metoder för att rangordna CV:n med de mest lämpliga kandidaterna rankade högst. Detta säkerställer att rekryterare endast behöver titta på de topprankade CV:erna och snabbt kan få kandidater ut i fältet. Tidigare forskning har använt metoder som räknar specifika nyckelord i ett CV och är kapabla att avgöra om en kandidat har specifika erfarenheter. Huvudmålet med denna avhandling är att använda den semantiska innebörden av texten iCV:n för att få en djupare förståelse för en kandidats erfarenhetsnivå. Den utvärderar också om modellen kan köras på mobila enheter och om algoritmen kan rangordna CV:n oberoende av om CV:erna är på svenska eller engelska. En algoritm skapades som använder ordinbäddningsmodellen DistilRoBERTa som är kapabel att fånga textens semantiska betydelse. Algoritmen utvärderades genom att generera jobbeskrivningar från CV:n genom att skapa en sammanfattning av varje CV. Körtiden, minnesanvändningen och rankningen som den önskade kandidaten fick dokumenterades och användes för att analysera resultatet. När den kandidat som användes för att generera jobbeskrivningen rankades i topp 10 ansågs klassificeringen vara korrekt. Noggrannheten beräknades med denna metod och en noggrannhet på 68,3 % uppnåddes. Resultaten visar att algoritmen kan rangordna CV:n. Algoritmen kan rangordna både svenska och engelska CV:n med en noggrannhet på 67,7 % för svenska och 74,7 % för engelska. Körtiden var i genomsnitt 578 ms vilket skulle möjliggöra att algoritmen kan köras på mobila enheter men minnesanvändningen var för stor. Sammanfattningsvis kan den semantiska betydelsen av CV:n användas för att rangordna CV:n och ett eventuellt framtida arbete är att kombinera denna metod med en metod som räknar nyckelord för att undersöka hur noggrannheten skulle påverkas.
	Keyword: Computer Sciences; CV rankning; Datavetenskap (datalogi); Natural language processing; Ordinbäddning; Resume Ranking; Semantic meaning; Semantisk betydelse; Språkteknologi; Word Embedding
	URL: http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-300442
	BASE
	Hide details

19	Efficient Estimate of Low-Frequency Words’ Embeddings Based on the Dictionary: A Case Study on Chinese
	Xianwen Liao; Yongzhong Huang; Changfu Wei...
	In: Applied Sciences ; Volume 11 ; Issue 22 (2021)
	BASE
	Show details

20	Acoustic Word Embeddings for End-to-End Speech Synthesis
	Feiyu Shen; Chenpeng Du; Kai Yu
	In: Applied Sciences ; Volume 11 ; Issue 19 (2021)
	BASE
	Show details

Page: 1 2 3 4 5...8

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern