


Hits 1 – 20 of 158

1	Between History and Natural Language Processing: Study, Enrichment and Online Publication of French Parliamentary Debates of the Early Third Republic (1881-1899)
	Puren, Marie; Bourgeois, Nicolas; Pellet, Aurélien; Vernus, Pierre
	In: ParlaCLARIN III at LREC2022 - Workshop on Creating, Enriching and Using Parliamentary Corpora, Jun 2022, Marseille, France ; https://hal.archives-ouvertes.fr/hal-03623351 ; https://www.clarin.eu/ParlaCLARIN-III (2022)
	Abstract: We present the AGODA (Analyse sémantique et Graphes relationnels pour l'Ouverture des Débats à l'Assemblée nationale) project, which aims to create a platform for consulting and exploring the digitised French parliamentary debates (1881-1940) available in the digital library of the National Library of France. The project brings together historians and NLP specialists, since parliamentary debates are an essential source both for the history of contemporary France and for linguistics. It therefore aims to produce a corpus of texts that can be easily exploited with computational methods and that respects the TEI standard. Historical parliamentary debates are also an excellent case study for developing and applying tools for publishing and exploring large historical corpora. In this paper, we present the steps necessary to produce such a corpus. We detail the processing and publication chain for these documents, with particular attention to the problems of extracting text from digitised images. We also introduce the first analyses that we have carried out on this corpus with "bag-of-words" techniques that are not too sensitive to OCR quality (namely topic modelling and word embedding).
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-CY]Computer Science [cs]/Computers and Society [cs.CY]; [INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SHS.HIST]Humanities and Social Sciences/History; France; OCR; Parliamentary debates; Third Republic; Topic modelling; Word embedding; XML-TEI
	URL: https://hal.archives-ouvertes.fr/hal-03623351/document https://hal.archives-ouvertes.fr/hal-03623351 https://hal.archives-ouvertes.fr/hal-03623351/file/puren_bourgeois_pellet_vernus_agoda2022.pdf

2	Chinese-Uyghur Bilingual Lexicon Extraction Based on Weak Supervision
	Anwar Aysa; Mijit Ablimit; Hankiz Yilahun; Askar Hamdulla
	In: Information; Volume 13; Issue 4; Pages: 175 (2022)

3	Investigating the Efficient Use of Word Embedding with Neural-Topic Models for Interpretable Topics from Short Texts
	Riki Murakami; Basabi Chakraborty
	In: Sensors; Volume 22; Issue 3; Pages: 852 (2022)

4	Analysis of the Effects of Lockdown on Staff and Students at Universities in Spain and Colombia Using Natural Language Processing Techniques
	Mario Jojoa; Begonya Garcia-Zapirain; Marino J. Gonzalez; Bernardo Perez-Villa; Elena Urizar; Sara Ponce; Maria Tobar
	In: International Journal of Environmental Research and Public Health; Volume 19; Issue 9; Pages: 5705 (2022)

5	An Enhanced Neural Word Embedding Model for Transfer Learning
	Md. Kowsher; Md. Shohanur Islam Sobuj; Md. Fahim Shahriar; Nusrat Jahan Prottasha; Mohammad Shamsul Arefin; Pranab Kumar Dhar; Takeshi Koshiba
	In: Applied Sciences; Volume 12; Issue 6; Pages: 2848 (2022)

6	Deep Sentiment Analysis Using CNN-LSTM Architecture of English and Roman Urdu Text Shared in Social Media
	Lal Khan; Ammar Amjad; Kanwar Muhammad Afaq; Hsien-Tsung Chang
	In: Applied Sciences; Volume 12; Issue 5; Pages: 2694 (2022)

7	Predicting Academic Performance: Analysis of Students’ Mental Health Condition from Social Media Interactions
	Md. Saddam Hossain Mukta; Salekul Islam; Swakkhar Shatabda; Mohammed Eunus Ali; Akib Zaman
	In: Behavioral Sciences; Volume 12; Issue 4; Pages: 87 (2022)

8	Vec2Dynamics: A Temporal Word Embedding Approach to Exploring the Dynamics of Scientific Keywords—Machine Learning as a Case Study
	Amna Dridi; Mohamed Medhat Gaber; Raja Muhammad Atif Azad; Jagdev Bhogal
	In: Big Data and Cognitive Computing; Volume 6; Issue 1; Pages: 21 (2022)

9	Methods, Models and Tools for Improving the Quality of Textual Annotations
	Maria Teresa Artese; Isabella Gagliardi
	In: Modelling; Volume 3; Issue 2; Pages: 224-242 (2022)

10	Creating multi-scripts sentiment analysis lexicons for Algerian, Moroccan and Tunisian dialects
	Abidi, Karima; Smaïli, Kamel
	In: 7th International Conference on Data Mining (DTMN 2021) Computer Science Conference Proceedings in Computer Science & Information Technology (CS & IT), Sep 2021, Copenhagen, Denmark ; https://hal.archives-ouvertes.fr/hal-03308111 (2021)

11	Bilingual English-German word embedding models for scientific text ...
	Donner, Paul. - : Zenodo, 2021

12	Bilingual English-German word embedding models for scientific text ...
	Donner, Paul. - : Zenodo, 2021

13	以《Cofacts 真的假的》資料庫為基礎建立中文科學假訊息之探勘模型 ; Text Mining Model for Detecting Chinese Fake Scientific Messages based on Cofacts Open Data
	許博竣; XU, BO-JUN. - 2021

14	Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions ...
	Yitagesu, Sofonias; Zhang, Xiaowang; Feng, Zhiyong. - : Zenodo, 2021

15	Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions ...
	Yitagesu, Sofonias; Zhang, Xiaowang; Feng, Zhiyong. - : Zenodo, 2021

16	WELFake dataset for fake news detection in text data ...
	Verma, Pawan Kumar; Agrawal, Prateek; Prodan, Radu. - : Zenodo, 2021

17	WELFake dataset for fake news detection in text data ...
	Verma, Pawan Kumar; Agrawal, Prateek; Prodan, Radu. - : Zenodo, 2021

18	Text ranking based on semantic meaning of sentences ; Textrankning baserad på semantisk betydelse hos meningar
	Stigeborn, Olivia. - : KTH, Skolan för elektroteknik och datavetenskap (EECS), 2021

19	Efficient Estimate of Low-Frequency Words’ Embeddings Based on the Dictionary: A Case Study on Chinese
	Xianwen Liao; Yongzhong Huang; Changfu Wei...
	In: Applied Sciences; Volume 11; Issue 22 (2021)

20	Acoustic Word Embeddings for End-to-End Speech Synthesis
	Feiyu Shen; Chenpeng Du; Kai Yu
	In: Applied Sciences; Volume 11; Issue 19 (2021)

