1 |
Syntactic and Semantic Impact of Prepositions in Machine Translation : An Empirical Study of French-English Translation of Prepositions ‘à’, ‘de’ and ‘en’
|
|
|
|
In: Human Language Technology. Challenges for Computer Science and Linguistics 8th Language and Technology Conference, LTC 2017, Poznań, Poland, November 17–19, 2017, Revised Selected Papers ; 8th Language and Technology Conference (LTC 2017) ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-03091307 ; Human Language Technology. Challenges for Computer Science and Linguistics 8th Language and Technology Conference, LTC 2017, Poznań, Poland, November 17–19, 2017, Revised Selected Papers, 12598, pp.273-287, 2020, Lecture Notes in Computer Science, 978-3-030-66526-5. ⟨10.1007/978-3-030-66527-2_20⟩ (2020)
|
|
BASE
|
|
Show details
|
|
2 |
An Empirical Study for a Machine Aided Translation of French Prepositions 'à', 'de' and 'en' into English
|
|
|
|
In: 8th Language and Technology Conference ; LTC: Language and Technology Conference ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-01645154 ; LTC: Language and Technology Conference, Nov 2017, Poznan, Poland ; http://ltc.amu.edu.pl/a2017/ (2017)
|
|
BASE
|
|
Show details
|
|
3 |
How can catchy titles be generated without loss of informativeness?
|
|
|
|
In: ISSN: 0957-4174 ; Expert Systems with Applications ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00856372 ; Expert Systems with Applications, Elsevier, 2014, 41 (4), pp.1051-1062. ⟨10.1016/j.eswa.2013.07.102⟩ (2014)
|
|
BASE
|
|
Show details
|
|
4 |
Le résumé et le titrage automatique partagent-ils les mêmes objectifs ?
|
|
|
|
In: ISSN: 1638-5748 ; EISSN: 1638-573X ; CORELA - COgnition, REprésentation, LAngage ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00835944 ; CORELA - COgnition, REprésentation, LAngage, CERLICO-Cercle Linguistique du Centre et de l'Ouest (France), 2013, 11, pp.14 (2013)
|
|
BASE
|
|
Show details
|
|
5 |
GenDesc: A Partial Generalization of Linguistic Features For Text Classification
|
|
|
|
In: 18th International Conference on Applications of Natural Language to Information Systems ; NLDB: Natural Language Processing and Information Systems ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00823476 ; NLDB: Natural Language Processing and Information Systems, Jun 2013, Salford, United Kingdom. pp.343-348, ⟨10.1007/978-3-642-38824-8_35⟩ (2013)
|
|
BASE
|
|
Show details
|
|
6 |
Let the System Learn a Game: How Can FCA Optimize a Cognitive Memory Structure
|
|
|
|
In: ECAI'2012: European Conference on Artificial Intelligence ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00764071 ; ECAI'2012: European Conference on Artificial Intelligence, Aug 2012, Montpellier, France. pp.001-009 ; http://www2.lirmm.fr/ecai2012/index.php?option=com_content&view=article&id=80&Itemid=84 (2012)
|
|
BASE
|
|
Show details
|
|
7 |
Aligning through divergence
|
|
|
|
In: SNLP-AOS'2011: Joint International Symposium on Natural Language Processing and Agricultural Ontology Service ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00839340 ; SNLP-AOS'2011: Joint International Symposium on Natural Language Processing and Agricultural Ontology Service, Feb 2012, Phuket, Thailand. pp.150-159 (2012)
|
|
BASE
|
|
Show details
|
|
8 |
Author manuscript, published in "Computational Linguistics Applications, France (2011)" Two Memory-Based Methods for Phrase
|
|
|
|
In: http://hal.inria.fr/docs/00/76/40/92/PDF/CLA2011.pdf (2012)
|
|
BASE
|
|
Show details
|
|
9 |
Just Title It! (by an Online Application)
|
|
|
|
In: EACL'2012: European chapter of the Association for Computational Linguistics (Démonstration) ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00706648 ; EACL'2012: European chapter of the Association for Computational Linguistics (Démonstration), Apr 2012, Avignon, France. pp.31-34 ; http://eacl2012.org/home/index.html (2012)
|
|
BASE
|
|
Show details
|
|
10 |
How to Title Electronic Documents Using Text Mining Techniques
|
|
|
|
In: ISSN: 2150-7988 ; International Journal of Computer Information Systems and Industrial Management Applications ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00687096 ; International Journal of Computer Information Systems and Industrial Management Applications, Machine Intelligence Research Labs (MIR Labs), 2012, 4, pp.562-569 (2012)
|
|
BASE
|
|
Show details
|
|
11 |
NOMIT: Automatic Titling by Nominalizing
|
|
|
|
In: NAACL'2012: North American Chapter of the Association for Computational Linguistics: Human Language Technologies ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00706647 ; NAACL'2012: North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jun 2012, Montréal, Canada. pp.274-283 (2012)
|
|
BASE
|
|
Show details
|
|
12 |
Using Alignment to detect associated multiword expressions in bilingual corpora. Session 6 - Translation and Natural Language Processing
|
|
|
|
In: Actes du colloque Tralogy : Anticiper les technologies pour la traduction ; Tralogy I. Métiers et technologies de la traduction : quelles convergences pour l'avenir ? ; https://hal.archives-ouvertes.fr/hal-02496037 ; Tralogy I. Métiers et technologies de la traduction : quelles convergences pour l'avenir ?, Mar 2011, Paris, France. 13p (2011)
|
|
Abstract:
International audience ; Translating multiword expressions from a language to another needs to recognize them as such. Bilingual multiword expressions are an issue when they are not the exact word-to-word translation of each other. The following examples are provided for a French-English translation task: (1) Phrasal verbs such as « to call in on » becoming « rendre visite », (2) « sorry to hear that », that a human translator translates into the simple 'désolé que" (3) most of adverbial locutions like « such as », equivalent to « de telle façon que », or « de manière à », etc.Thus, Machine Translation (MT) either requires a rich multiword bilingual database, or tends to create or enrich a first set of associated multiword expressions. Most of the time, existing resources are incomplete, and an interesting way to enhance covering is to provide a tool detecting 'associable' multiword expressions in parallel corpora. The latter are sets of texts that are translations of each others. There is an extensive literature in alignment techniques trying to link sentences from a text 2 SESSION 6-TRADUCTION ET TRAITEMENT AUTOMATIQUE DES LANGUES (TAL) USING ALIGNMENT TO DETECT ASSOCIATED MULTIWORD EXPRESSIONS IN BILINGUAL CORPORA-JOHAN SEGURA / VIOLAINE PRINCE in a language, named the source, to one or many in the other language, seen as target. Sentence alignment is the basic preliminary task that underlies all others, more fine-grained. Word-to-word alignment has largely been dealt with by statistical systems. Multiword expressions have a granularity that lies between word and sentence. They are mostly phrasal, and sometimes with a rather strong syntactical and lexical divergence. With the improvement of parsers, alignment methods using syntax have emerged. Syntax allows the translation task, among others, to focus on relevant phrase fragments and to link multiword units together. For instance, Ozdowska's AliBi system is based on dependencies structures. The Groves', Hearnes' and Way's system uses syntactic trees with internal node alignments. Bilingual terminology, consisting in recognizing equivalent groups of words, also relies on syntax to extract patterns, such as Noun-Verb, Adjective-Noun, Prepositional Noun Phrase, etc.(e.g. Claveau, 2009). Most of these multiword expressions could be reduced down to collocations. A collocation is a multiword expression, naturally translated with quite strong constraints (e.g., « to show respect »-» faire preuve de respect »). Seretan's method [Seretan, V. (1999)] recognizes numerous equivalent pairs of collocations throught bilingual alignments which POS-tags are equivalents or close (even with distant words). But it only retrieves two-words collocations. Thus, there is a need for systems that might detect longer collocations, and more divergent ones. The method proposed in this article is an alignment process between pairs of sentences, strongly based on syntax. It relies on is a rule-based system combining partial alignments from a database through a non-iterative graph-theory based process. Multiword expressions patterns built on examples help providing alignments with a good coverage, which in turn detect new multiword expressions, and enrich the database. The article sketches the state-of-the art in alignment, focusing on syntactic oriented systems, describes the designed system as well a corpus run experiment with promising results.
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Tralogy I
|
|
URL: https://hal.archives-ouvertes.fr/hal-02496037/document https://hal.archives-ouvertes.fr/hal-02496037/file/Tralogy_S6_A3_Prince.pdf https://hal.archives-ouvertes.fr/hal-02496037
|
|
BASE
|
|
Hide details
|
|
13 |
Alignment Memories: A Useful Tool to Handle Phrase Alignment Bottleneck
|
|
|
|
In: CLA'2011: Computational Linguistics-Applications Conference ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00764092 ; CLA'2011: Computational Linguistics-Applications Conference, Oct 2011, Jachranka, Poland. pp.61-67 ; http://www.wikicfp.com/cfp/servlet/event.showcfp?eventid=15440©ownerid=834 (2011)
|
|
BASE
|
|
Show details
|
|
14 |
Two Memory-Based Methods for Phrase Alignment
|
|
|
|
In: 5th Language and Technology Conference ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00838090 ; 5th Language and Technology Conference, Nov 2011, Poland. pp.101-110 ; http://www.ltc.amu.edu.pl/ (2011)
|
|
BASE
|
|
Show details
|
|
15 |
Automatic Titling of Articles Using Position and Statistical Information
|
|
|
|
In: RANLP'11: Recent Advances in Natural Language Processing ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00637975 ; RANLP'11: Recent Advances in Natural Language Processing, Dec 2011, Hissar, Bulgaria. pp.727-732 ; http://lml.bas.bg/ranlp2011/start3.php (2011)
|
|
BASE
|
|
Show details
|
|
16 |
Demo: Text Titling Application
|
|
|
|
In: Personal Semantic Data (EKAW'10 Demonstration) ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00563912 ; Personal Semantic Data (EKAW'10 Demonstration), Portugal. pp.N/A ; http://semanticweb.org/wiki/Personal_Semantic_Data (2011)
|
|
BASE
|
|
Show details
|
|
17 |
Automatic Titling of Electronic Documents with Noun Phrase Extraction
|
|
|
|
In: SOCPAR'10: SOft Computing and PAttern Recognition ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00563903 ; SOCPAR'10: SOft Computing and PAttern Recognition, France. pp.168-171 ; http://www.socpar.org/ (2011)
|
|
BASE
|
|
Show details
|
|
18 |
Managing Personal Information by Automatic Titling of E-mails
|
|
|
|
In: Personal Semantic Data (EKAW'10 Workshop) ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00563889 ; Personal Semantic Data (EKAW'10 Workshop), Lisbon, Portugal ; http://semanticweb.org/wiki/Personal_Semantic_Data (2011)
|
|
BASE
|
|
Show details
|
|
19 |
CORPUS and WEB: Two Allies in Building and Automatically Expanding Conceptual Classes
|
|
|
|
In: ISSN: 0350-5596 ; Informatica ; https://hal-lirmm.ccsd.cnrs.fr/lirmm-00534315 ; Informatica, Slovene Society Informatika, Ljubljana, 2010, 34 (3), pp.279-286 ; www.lirmm.fr/TAL (2010)
|
|
BASE
|
|
Show details
|
|
20 |
Compositional Semantic Vector Models with a Simplified Quantum Logic
|
|
|
|
In: https://hal-lirmm.ccsd.cnrs.fr/lirmm-00479633 ; RR-10014, 2010, pp.0010026 (2010)
|
|
BASE
|
|
Show details
|
|
|
|