21 |
Word Co-occurrence Counts Prediction for Bilingual Terminology Extraction from Comparable Corpora.
|
|
|
|
In: 6th International Joint Conference on Natural Language Processing. IJCNLP 2013. ; https://hal.archives-ouvertes.fr/hal-00949198 ; 6th International Joint Conference on Natural Language Processing. IJCNLP 2013., Oct 2013, Nagoya, Japan. pp.12 (2013)
|
|
BASE
|
|
Show details
|
|
22 |
Lexicon-Grammar, a method of linguistic description ; Le lexique-grammaire, une méthode de description linguistique ; O Léxico-Gramática, um método de descrição linguística
|
|
|
|
In: https://hal-upec-upem.archives-ouvertes.fr/hal-00823401 ; 2013, pp.1-125 (2013)
|
|
BASE
|
|
Show details
|
|
23 |
Multilingual compound splitting combining language dependent and independent features
|
|
|
|
In: Dialogue ; https://hal.archives-ouvertes.fr/hal-00920323 ; Dialogue, May 2013, Moscou, Russia. pp.455-463 (2013)
|
|
BASE
|
|
Show details
|
|
24 |
Coherence and Cohesion for the Assessment of Text Readability
|
|
|
|
In: Proceedings of 10th International Workshop on Natural Language Processing and Cognitive Science (NLPCS 2013) ; https://hal.archives-ouvertes.fr/hal-00860796 ; Proceedings of 10th International Workshop on Natural Language Processing and Cognitive Science (NLPCS 2013), Oct 2013, Marseille, France. pp.11-19 (2013)
|
|
BASE
|
|
Show details
|
|
25 |
Assisting the Translation of SNOMED CT into French.
|
|
|
|
In: ISSN: 0926-9630 ; EISSN: 1879-8365 ; Studies in Health Technology and Informatics ; https://hal.inria.fr/hal-00854309 ; Studies in Health Technology and Informatics, IOS Press, 2013, 192, pp.47-51 (2013)
|
|
BASE
|
|
Show details
|
|
26 |
الوزن والجذر في الصرف غير الإشتقاقي: جموع التكسير في اللغة العربية ; Pattern-and-root inflectional morphology: the Arabic broken plural
|
|
|
|
In: ISSN: 0388-0001 ; Language Sciences ; https://hal.archives-ouvertes.fr/hal-00831338 ; Language Sciences, Elsevier, 2013, 40, pp.221-250. ⟨10.1016/j.langsci.2013.06.002⟩ (2013)
|
|
Abstract:
International audience ; نقدم نموذجًا مفصّلاً لتوصيف جموع التكسير مرتكزاً على أولوية الوزن على الجذر. ويستخلص النموذج صيغة جمع التكسير مستنداً على أحرف المفرد وصيغته. وقد تمّ تنفيذه وإختباره في ترميز 3200 مدخل معجمي. وقد أولينا اهتماماً خاصًا بإدارة الموارد اللغوية والمعاجم من اجل تسهيل عملية التوصيف لتصبح أكثر ملائمةً للخبراء في اللغّة العربيّة.يستند النموذج على المفاهيم التقليدية للوزن والجذر. وبالمقارنة مع الصرف التقليدي، فإنه يُبعد الصرف الإشتقاقي من هذا التوصيف. كما في القواميس العربية التقليدية، ويتمحور القاموس على مداخله المعجمية القابلة للتحديث، وهي إملائياً مشكولة كلياً. في نموذجنا، يتعرّف نظام التحليل الصرفي آلياً على جموع التكسير في النص مباشرةً معتمداً على قاموس أشكال مصرّفة بالكامل ودون قواعد مورفوفنولوجية أو إملائية. يعتمد تصنيف صيغ جموع التكسير مبادئ سهلة، منتظمة ومفصّلة. تم تبسيط ترميز أوزان المفرد للصوائت القصيرة (v) والطويلة (vv) دون تحديد نوعها كضمة أو فتحة، أو كسرة. تم ترميز التبدّلات المورفوفنولوجية للجذر والتغيرات الإملائية لأحرف العلة والهمزة بشكل مستقل عن صيغة الوزن و بشكل مباشر، أي دون ردّ الجذر إلى أصله ودون قواعد مورفوفونولوجية.تم تصنيف صيغ جموع التكسير تَراتُبِياً وفقاً: لوزن الجمع، فوزن المفرد، فأحرف العلّة. تقتصر صيَغ جموع التكسير الرباعية على 3 أوزان متفرعة إلى 70 صنفاً، وصِيَغ التكسير الثلاثية علـى 22 وزناً متفرعة إلى 90 صنفاً. هذه الأصناف الـ 160، تصبح 300 عندما نأخذ في الاعتبار التغيرات الإعرابية والإملائية في صيغ المفرد. ; We present a substantially implemented model of description of the inflectional morphology of Arabic nouns, with special attention to the management of dictionaries and other language resources by Arabic-speaking linguists. Our model includes broken plurals (BPs), i.e. plurals formed by modifying the stem. It is based on the traditional notions of root and pattern of Semitic morphology. However, as compared to traditional Arabic morphology, it keeps the formal description of inflection separate from that of derivation and semantics. As traditional Arabic dictionaries, the updatable dictionary is structured in lexical entries for lemmas, and the reference spelling is fully diacritized. In our model, morphological analysis of Arabic text is performed directly with a dictionary of words and without morphophonological rules. Our taxonomy for noun inflection is simple, orderly and detailed. We simplify the taxonomy of singular patterns by specifying vowel quantity as v or vv, and ignoring vowel quality. Root alternations and orthographical variations are encoded independently from patterns and in a factual way, without deep roots or morphophonological or orthographical rules. Nouns with a triliteral BP are classified according to 22 patterns subdivided into 90 classes, and nouns with a quadriliteral BP according to 3 patterns subdivided into 70 classes. These 160 classes become 300 inflectional classes when we take into account inflectional variations that affect only the singular. We provide a straightforward encoding scheme that we applied to 3 200 entries of BP nouns.
|
|
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; arabe; arabe standard; Arabic; flexion; inflexion; lexicon; lexique; modern standard Arabic; nom; noun; plural; pluriel
|
|
URL: https://hal.archives-ouvertes.fr/hal-00831338/document https://doi.org/10.1016/j.langsci.2013.06.002 https://hal.archives-ouvertes.fr/hal-00831338/file/Prim-final.pdf https://hal.archives-ouvertes.fr/hal-00831338
|
|
BASE
|
|
Hide details
|
|
27 |
XMG : eXtensible MetaGrammar
|
|
|
|
In: ISSN: 0891-2017 ; EISSN: 1530-9312 ; Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-00768224 ; Computational Linguistics, Massachusetts Institute of Technology Press (MIT Press), 2013, 39 (3), pp.591-629 (2013)
|
|
BASE
|
|
Show details
|
|
28 |
Detecting salient events in large corpora by a combination of NLP and data mining techniques (poster)
|
|
|
|
In: Supplementary Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2013) ; https://hal.archives-ouvertes.fr/hal-01023926 ; Supplementary Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2013), Mar 2013, Samos, Greece (2013)
|
|
BASE
|
|
Show details
|
|
29 |
Defining a verb taxonomy by a decision tree
|
|
|
|
In: Autour des verbes. Constructions et interprétations ; https://hal.archives-ouvertes.fr/hal-00860386 ; Kozué Ogata. Autour des verbes. Constructions et interprétations, John Benjamins, pp.87-108, 2013, Lingvisticae Investigationes Supplementa, 978 90 272 3139 0 (2013)
|
|
BASE
|
|
Show details
|
|
30 |
Dialogar é preciso ; : Linguística para processamento de línguas
|
|
|
|
In: https://hal-upec-upem.archives-ouvertes.fr/hal-00804609 ; 1, PPGEL/UFES, pp.268, 2013 (2013)
|
|
BASE
|
|
Show details
|
|
31 |
Approches à base de fréquences pour la simplification lexicale
|
|
|
|
In: TALN-RECITAL 2013 ; https://hal.archives-ouvertes.fr/hal-00838354 ; TALN-RECITAL 2013, Jun 2013, Les Sables d'Olonne, France. pp.493-506 (2013)
|
|
BASE
|
|
Show details
|
|
32 |
The Effects of Factorizing Root and Pattern Mapping in Bidirectional Tunisian - Standard Arabic Machine Translation
|
|
|
|
In: MT Summit 2013 ; https://hal.archives-ouvertes.fr/hal-00908761 ; MT Summit 2013, Sep 2013, France. pas d'édition papier (2013)
|
|
BASE
|
|
Show details
|
|
33 |
Word Spotting and Regular Expression Detection in Handwritten Documents
|
|
|
|
In: ICDAR ; https://hal.archives-ouvertes.fr/hal-00905535 ; ICDAR, 2013, United States. pp.516-520 (2013)
|
|
BASE
|
|
Show details
|
|
34 |
Stratégies discriminantes pour intégrer la reconnaissance des mots composés dans un analyseur syntaxique en constituants
|
|
|
|
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal-upec-upem.archives-ouvertes.fr/hal-00846888 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2013, 54 (1), pp.47-70 (2013)
|
|
BASE
|
|
Show details
|
|
35 |
Modèle d'analyse morpho-syntaxique adaptatif au web usages : ré-indexation sociale dans une norme syntagmatique
|
|
|
|
In: Colloque International CNPLET/MEN-LABORATOIRE (Alégrie) & Laboratoire PARAGRAPHE Paris8 (France ; https://hal.inria.fr/hal-00927183 ; Colloque International CNPLET/MEN-LABORATOIRE (Alégrie) & Laboratoire PARAGRAPHE Paris8 (France, Le C.N.P.L.E.T et le C.R.S.T.D.L.A (Algérie), en partenariat avec le Laboratoire Paragraphe des Universités (Paris 8 et Cergy - Pontoise, France), Nov 2013, Ghardaïa, Algérie ; http://www.paragraphe.univ-paris8.fr/colloque_international/amenagement_lexical_terminologie_traductionnelle/ (2013)
|
|
BASE
|
|
Show details
|
|
36 |
LIPN-CORE: Semantic Text Similarity using n-grams, WordNet, Syntactic Analysis, ESA and Information Retrieval based Features
|
|
|
|
In: Second Joint Conference on Lexical and Computational Semantics ; https://hal.archives-ouvertes.fr/hal-00825054 ; Second Joint Conference on Lexical and Computational Semantics, Jun 2013, Atlanta, United States. pp.63 (2013)
|
|
BASE
|
|
Show details
|
|
37 |
Combining Compound Recognition and PCFG-LA Parsing with Word Lattices and Conditional Random Fields
|
|
|
|
In: ISSN: 1550-4875 ; ACM - Transactions on Speech and Language Processing ; https://hal-upec-upem.archives-ouvertes.fr/hal-00841574 ; ACM - Transactions on Speech and Language Processing, Association for Computing Machinery, 2013, 10 (3), pp.8.1-8.24. ⟨10.1145/2483969.2483970⟩ (2013)
|
|
BASE
|
|
Show details
|
|
38 |
Descrição do verbo cortar para processamento automático de linguagem natural
|
|
|
|
In: Dialogar é preciso. Linguística para processamento de línguas ; https://hal-upec-upem.archives-ouvertes.fr/hal-00804811 ; Laporte, Éric ; Smarsaro, Aucione ; Vale, Oto. Dialogar é preciso. Linguística para processamento de línguas, PPGEL/UFES, pp.165-176, 2013, 978-85-8087-104-3 (2013)
|
|
BASE
|
|
Show details
|
|
39 |
How ontology based information retrieval systems may benefit from lexical text analysis
|
|
|
|
In: New Trends of Research in Ontologies and Lexical Resources ; https://hal.archives-ouvertes.fr/hal-00797143 ; Oltramari, Alessandro; Vossen, Piek; Qin, Lu; Hovy, Eduard. New Trends of Research in Ontologies and Lexical Resources, 15, Springer, pp.209-230, 2013, Theory and Applications of Natural Language Processing, 978-3-642-31781-1 (2013)
|
|
BASE
|
|
Show details
|
|
40 |
Typology of verbal constructions in Brazilian Portuguese. A proposal of classification of the verb 'dar' "give" ; Tipologia das construções verbais em PB: uma proposta de classificação do verbo dar
|
|
|
|
In: ISSN: 0103-2178 ; Caligrama: Revista de Estudos Românicos ; https://hal.archives-ouvertes.fr/hal-03587230 ; Caligrama: Revista de Estudos Românicos, 2013, 18 (2), pp.105-130. ⟨10.17851/2238-3824.18.2.105-130⟩ ; http://www.periodicos.letras.ufmg.br/index.php/caligrama/article/download/5039/4810 (2013)
|
|
BASE
|
|
Show details
|
|
|
|