Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year:
  - 2022 (8)
  - 2021 (8)
  - 2020 (13)
  - 2019 (6)
  - 2018 (5)
  - 2017 (3)
  - 2016 (3)
  - 2015 (6)
  - 2014 (6)
  - 2013 (2)
  - more
- Medium
- Type
- BLLDB-Access:
  - free (78)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4

Hits 1 – 20 of 78

1	Simplification of literary and scientific texts to improve reading fluency and comprehension in beginning readers of French
	Javourey-Drevet, Ludivine; Dufau, Stéphane; François, Thomas...
	In: ISSN: 0142-7164 ; EISSN: 1469-1817 ; Applied Psycholinguistics ; https://hal-amu.archives-ouvertes.fr/hal-03549026 ; Applied Psycholinguistics, Cambridge University Press (CUP), 2022, pp.1-28. ⟨10.1017/S014271642100062X⟩ (2022)
	BASE
	Show details

2	Vikidia En/Fr bilingual dataset for Automatic Readability Assessment ...
	Lee, Justin; Vajjala, Sowmya. - : Zenodo, 2022
	BASE
	Show details

3	Vikidia En/Fr bilingual dataset for Automatic Readability Assessment ...
	Lee, Justin; Vajjala, Sowmya. - : Zenodo, 2022
	BASE
	Show details

4	Lexica corpus (v2.0) ...
	Hewett, Freya; Stede, Manfred. - : Zenodo, 2022
	BASE
	Show details

5	Lexica corpus (v2.0) ...
	Hewett, Freya; Stede, Manfred. - : Zenodo, 2022
	BASE
	Show details

6	LeiKo ...
	Jablotschkin, Sarah; Zinsmeister, Heike. - : Zenodo, 2022
	BASE
	Show details

7	LeiKo ...
	Jablotschkin, Sarah; Zinsmeister, Heike. - : Zenodo, 2022
	BASE
	Show details

8	Predicting lexical complexity in English texts: the Complex 2.0 dataset
	Shardlow, Matthew; Evans, Richard; Zampieri, Marcos. - : Springer, 2022
	BASE
	Show details

9	Simplification automatique de textes biomédicaux en français : les données précises de petite taille aident
	Cardon, Rémi; Grabar, Natalia
	In: Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale ; TALN - Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-03509735 ; TALN - Traitement Automatique des Langues Naturelles, Jul 2021, Lille, France (2021)
	BASE
	Show details

10	Automatic simplification of technical and specialized texts ; Simplification automatique de textes techniques et spécialisés
	Cardon, Rémi. - : HAL CCSD, 2021
	In: https://hal.archives-ouvertes.fr/tel-03343769 ; Informatique et langage [cs.CL]. Université de Lille, 2021. Français. ⟨NNT : 2021LILUH007⟩ (2021)
	BASE
	Show details

11	Automatic text simplification of specialized and technical texts ; Simplification automatique de textes techniques et spécialisés
	Cardon, Rémi. - : HAL CCSD, 2021
	In: https://hal.archives-ouvertes.fr/tel-03343769 ; Informatique et langage [cs.CL]. Université de Lille, 2021. Français (2021)
	Abstract: Automatic text simplification is a subdomain of natural language processing (NLP). It aims at processing texts that are difficult to read for a given audience in order to make them more accessible. Our goal consists in automatically simplifying medical texts. We present our whole work on that question, that goes from data collection and analysis to automatic simplification experiments. We begin with the process of collecting a comparable corpus of biomedical texts. The corpus is made of document pairs that deal with the same subject : one is written for a specialist audience and the other is written for non specialists. The corpus contains three types of texts : drug information, medical literature reviews and encyclopedia articles. Once the documents are collected, we annotate a subset of the corpus and analyze the linguistic transformations that occur during simplification.From the comparable corpus, we build a method to extract a parallel corpus, a corpus that contains sentence pairs where the sentences have the same meaning but differ by their degree of difficulty. This type of corpus represents the basic material for automatic simplification methods. Our parallel sentences extraction method is made of two steps : (1) prefiltering the pairs that are candidate for alignment using syntactic heuristics and (2) using a binary classifier to distinguish sentences that have the same meaning. We evaluate various classifiers as well as the impact of the data imbalance on the results. In order to promote the parallel corpus, also create a corpus of sentence pairs that are annotated according to their degree of semantic similarity, with scores ranging from 0 (no similarity) to 5 (same meaning). Both corpora are available for research.Finally, we present a series of experiments for the automatic simplification of biomedical french texts. Indeed, we use a neural method that comes from automatic translation. We use several resources: the parallel medical corpus that we built, the parallel general language corpus that we automatically translated from English to French and a lexicon that matches medical terms with terms or paraphrases that are more accessible. We describe the experimental protocol and evaluate the results in two manners, quantitatively and qualitatively. The results are similar to the state of the art in general language simplification and show that the resulting simplifications can be exploited as part of a computer aided simplification task. ; La simplification automatique de textes est un domaine du traitement automatique des langues (TAL) qui vise à traiter des textes difficiles à lire pour un public donné de façon à les rendre plus accessibles. Notre objectif consiste à simplifier automatiquement les textes médicaux et de santé. Nous présentons l’ensemble de notre travail sur cette question, qui va de la collecte et analyse de corpus jusqu’aux expériences en simplification automatique.Nous commençons par la collecte d’un corpus comparable de textes médicaux. Ce corpus est constitué de couples de documents qui traitent du même sujet : l’un s’adressant à un public spécialiste et l’autre à un public néophyte. Le corpus contient trois types de textes : des informations sur les médicaments, des revues systématiques de littérature médicale et des articles encyclopédiques. Une fois les documents collectés, nous annotons un sous-ensemble de ces documents et analysons les transformations linguistiques qui y sont mises en œuvre lors de la simplification.À partir du corpus comparable, nous mettons en place une méthode pour en extraire un corpus parallèle, c’est-à-dire un corpus comprenant des couples de phrases qui ont le même sens mais diffèrent par leur degré de difficulté. Ce type de corpus représente le matériau principal pour les méthodes de simplification automatique. Notre méthoded’extraction de phrases parallèles comporte deux étapes : (1) le préfiltrage de paires de phrases candidates à l’alignement selon des heuristiques syntaxiques et (2) la classification binaire permettant de distinguer les phrases en relation de simplification. Nous évaluons différents classifieurs ainsi que l’influence du déséquilibre des donnéessur les performances. Afin de valoriser ce corpus parallèle, nous créons également un corpus de paires de phrases annotées selon leur similarité sémantique, avec des scores allant de 0 (sémantique indépendante) à 5 (même sémantique). Les deux corpus sont disponibles pour la recherche. Enfin, nous présentons une série d’expériences en simplification automatique de textes médicaux en français. Ainsi, nous mettons à l’œuvre une méthode neuronale issue de la traduction automatique. Nous utilisons plusieurs ressources : le corpus parallèle médical construit par nous, le corpus parallèle de langue générale automatiquement traduit par nous de l’anglais vers le français ainsi qu’un lexique qui apparie des termes médicaux avec des termes ou paraphrases accessibles au grand public. Nous décrivons le protocole expérimental et menons une évaluation en deux volets, quantitatif et qualitatif. Les résultats sont comparables à l’état de l’art de la simplification en langue générale et montrent que les simplifications produites peuvent être exploitées dans le cadre d’une tâche de simplification assistée par ordinateur.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; Automatic text simplification; Biomedical texts; Corpora and resources; Corpus et ressources; Natural language processing; Simplification automatique de textes; Textes biomédicaux; Traitement automatique de la langue naturelle
	URL: https://hal.archives-ouvertes.fr/tel-03343769/document https://hal.archives-ouvertes.fr/tel-03343769 https://hal.archives-ouvertes.fr/tel-03343769/file/these_RCardon.pdf
	BASE
	Hide details

12	Overview of SimpleText 2021 - CLEF Workshop on Text Simplification for Scientific Information Access
	Ermakova, Liana; Bellot, Patrice; Braslavski, Pavel...
	In: Experimental IR Meets Multilinguality, Multimodality, and Interaction: 12th International Conference of the CLEF Association, CLEF 2021, Virtual Event, September 21–24, 2021, Proceedings ; ISBN: 978-3-030-85251-1 ; 12th Conference and Labs of the Evaluation Forum (CLEF 2021) ; https://hal.archives-ouvertes.fr/hal-03637807 ; 12th Conference and Labs of the Evaluation Forum (CLEF 2021), Sep 2021, Bucharest, Romania. pp.432-449, ⟨10.1007/978-3-030-85251-1_27⟩ ; http://clef2021.clef-initiative.eu/ (2021)
	BASE
	Show details

13	Lexica corpus ...
	Hewett, Freya; Stede, Manfred. - : Zenodo, 2021
	BASE
	Show details

14	A Multilingual Systematic Review on the Use of Easy Language in Educational Settings
	Casalegno, Elisa; Rodriguez Vazquez, Silvia
	In: 7th International IATIS Conference (2021) (2021)
	BASE
	Show details

15	Judicial Sentences and Textual and Terminological Accessibility ; Sentenças Judiciais e Acessibilidade Textual e Terminológica
	Motta, Ester
	In: Domínios de Lingu@gem; Vol 15 No 3 (2021): Athematic issue; 761-813 ; Domínios de Lingu@gem; v. 15 n. 3 (2021): Número atemático; 761-813 ; 1980-5799 (2021)
	BASE
	Show details

16	Textos de divulgação sobre depressão : uma análise de definições inteligíveis com o aporte da linguística de corpus
	Berwanger, Laura Pinto. - 2021
	BASE
	Show details

17	Parallel sentence alignment from biomedical comparable corpora
	Cardon, Rémi; Grabar, Natalia
	In: Studies in Health Technology and Informatics ; https://hal.archives-ouvertes.fr/hal-03095183 ; Studies in Health Technology and Informatics, 270, pp.362-366, 2020, ⟨10.3233/SHTI200183⟩ (2020)
	BASE
	Show details

18	Identifying Abstract and Concrete Words in French to Better Address Reading Difficulties
	Goriachun, Daria; Gala, Núria
	In: Workshop Tools and Resources to Empower People with Reading Difficulties (READI) at International conference on Language Resources and Evaluation (LREC 2020) ; https://hal.archives-ouvertes.fr/hal-02562128 ; Workshop Tools and Resources to Empower People with Reading Difficulties (READI) at International conference on Language Resources and Evaluation (LREC 2020), May 2020, Marseille, France. pp.33-40 (2020)
	BASE
	Show details

19	Alector: A Parallel Corpus of Simplified French Texts with Alignments of Misreadings by Poor and Dyslexic Readers
	Gala, Núria; Tack, Anaïs; Javourey-Drevet, Ludivine...
	In: Language Resources and Evaluation for Language Technologies (LREC) ; https://hal.archives-ouvertes.fr/hal-02503986 ; Language Resources and Evaluation for Language Technologies (LREC), May 2020, Marseille, France (2020)
	BASE
	Show details

20	Controllable Sentence Simplification
	Martin, Louis; Villemonte de La Clergerie, Éric; Sagot, Benoît...
	In: LREC 2020 - 12th Language Resources and Evaluation Conference ; https://hal.inria.fr/hal-02678214 ; LREC 2020 - 12th Language Resources and Evaluation Conference, May 2020, Marseille, France ; http://www.lrec-conf.org/proceedings/lrec2020/index.html (2020)
	BASE
	Show details

Page: 1 2 3 4

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern