DE eng

Search in the Catalogues and Directories

Hits 1 – 18 of 18

1
Diccionari electrònic bilingüe català>anglés de locucions referencials idiomàtiques de somatismes
Escolano Marín, Xènia. - : Universidad de Alicante, 2021
Abstract: The ultimate aim of this research is to design a Catalan>English bilingual electronic dictionary of somatic idioms. To do so, we undertake a semasiological characterisation of somatic idioms to determine their semantic value and morphosyntactic combinatorial possibilities going from language to abstraction, identifying the differences in conceptualisation between Catalan and English and their equivalence relationship on the basis of their occurrences in corpora. The methodology used for the description of these somatisms is based on lexicogrammar but in a bottom-up manner. The theory of lexicogrammar (M. Gross 1975, 1981) holds that every elementary phrase is constituted by at least one first order predicate that introduces its arguments, represented by nouns or phrases. For example, in the phrase 'Luc admire le courage de Léa', we can find the verbal predicate 'admirer' with its arguments 'Luc' and 'Léa', that are morphologically equivalent to 'Luc est admiratif (pour + devant) le courage de Léa', with an adjective predicate, and 'Luc a de l'admiration pour le courage de Léa', with a nominal predicate, since there is no change in the structure of arguments (Le Pesant & Mathieu-Colas 1998: 8). This entails a systematic description of the syntactic and semantic properties of verbs, predicate nouns and adjectives in the form of tables that represent a class of lexical elements –characterised in terms of syntactic-semantic features (such as human, animal, plant, concrete, abstract, locative and temporal)–which correspond to a certain syntactic category with a series of common distributional and transformational properties. However, this description is insufficient for the complete automatic treatment of language; this is why in 1992 G. Gross adds to lexicogrammar the notion of object classes: semantic groups defined by the syntactic relations they maintain with one or more classes of verbs, called appropriate predicates (Gross 2012: 101). This exhaustive description of the language has made it possible to consider certain linguistic phenomena like fixation for the automatic treatment of language. Considering that computerised corpora play a very prominent role in usage-based linguistics, a trend that considers grammar and use to be closely interrelated, lexicogrammar can represent a theoretical model particularly suitable for the detection and analysis of phraseological units (PhUs) in corpora. In particular, we focus on the description of the 50 most frequent idioms in Catalan containing one of the five most common anthropomorphic somatic lexemes in all languages: hand, head, heart, eye and ear (Mellado 2004). To identify which are these 50 most frequent idioms (10 per each lexeme), we retrieve all the occurrences for each of the five mentioned lexemes from the Corpus Textual Informatitzat de la Llengua Catalana (CTILC) (~ 52M words with texts from 1833-1988). Then we carry out a semi-automatic extraction of the different combinations of the most frequent bigrams (candidates for idioms of the dictionary) for each somatic lexeme using the software Metaconcor. For example, for mà (hand), we obtain: a mà (at hand), a mà (by hand), entre mans (on [one’s] hands), de mà en mà (from hand to hand), mà d’obra (labour force), mà dura (firm hand), picar de mans (to clap [one’s hands]), lligar de peus i mans (to tie [sb’s] hands and feet), a mans plenes (liberally) and rentar-se’n les mans (to wash [one’s] hands [of]). We also consider the most frequent verbal and nominal co-occurrences of each bigram. Once we have translated the Catalan idioms into English, we apply this step to the English equivalents based on their occurrences in the British National Corpus (BNC) (~ 100M words with texts from 1985-1994). In order to design the dictionary, we undertake a syntactic-semantic description of the Catalan idioms and their equivalents in English –indicating its argument structures and semantic values according to the object class to which we ascribe them (the one referring to the values of )–, based on the occurrences of these units offered by the CTILC for Catalan and the BNC for English. This analysis is expressed through a tagging recognised by automatic language processing systems, based on the one used by the Laboratoire de Linguistique Informatique (LLI) –current LDI (Lexiques, Dictionnaires, Informatique) from the Paris 13 University–, whose dictionaries follow the system of the Laboratoire d’Automatique Documentaire et Linguistique (LADL) (Paris 7) –founded by M. Gross in 1968– and incorporate the notion of G. Gross’ object classes (1992). Having identified the most frequent idioms and their English equivalents, as well as their distributional (syntagmatic relations) and transformational possibilities (paradigmatic relations), we give account of their semantic relations (for both languages): conceptual variations (polysemy), intersynonymic variants (relations of synonymy), their eventual relations of antonymy and hyperonymy and their paradigmatic variants (according to their occurrences in the corpora used). All this information is registered in the dictionary in the form of four coordinated files with fields of different nature (cf. 2): two files containing the arguments of the idioms (one file for Catalan and another file for English) and two files containing the predicates (i.e. the idioms as entries) (one file for Catalan and another one for English). Considering the relevance object classes may have in fixed expressions, our dictionary mostly contains idioms –predicates– that combine invariable (argument human nouns) and variable (anthropomorphic somatic lexemes) elements –arguments–: for example, to wash hands (of)/C:/G:v/N0:/N1:/N3: /Ca:rentar-se les mans (d’) , where N0 refers to the subject (a human), and N1 to the first complement (direct object) in the argument structure of this idiom with the somatic lexeme (so) hand, which in this case has a semantic value of refusal of responsibility (elre) (in other cases this lexeme will be in idioms which may evoke, among others, proximity or facility [at hand], manual labour [by hand], activity [on (one’s) hands], severity [firm hand], itinerancy [from hand to hand], human resources [labour force], approval, enthusiasm or attention [to clap (one’s hands)], immobility or repression [to tie (sb’s) hands and feet] and abundance [liberally], which will be reflected in the argument file). N3 is the third complement, which here corresponds to a concrete inanimate object ( )(e.g. “I’m entirely opposed to the er (pause) the idea that they should wash their hands of their (pause) er, obligations er, the nineteen sixty eight act (pause) as” [BNC]). In this case there is no N2 complement, which usually refers to an indirect object. Ca indicates the Catalan equivalent of the idiom. Opting for an electronic phraseological dictionary with a semasiological approach implies, on the one hand, finding the PhUs contained in it more easily, since instead of departing from specific concepts, it departs from the semantic values of the units (idioms) grouped in a linguistically well-defined object class. On the other, it offers the advantage of being suitable for Natural Language Processing, as it has a coding recognised by automatic language processing systems, incorporating numerous fields with information of morphological, syntactic-semantic (the most common distributional and transformational properties) and diasystematic nature (if applicable). It also contains a specific field referred to translations into other languages (bilingual or multilingual). Therefore, these repertoires have a dual function: decoding and encoding information. In fact, one of the main advantages offered by semasiological electronic dictionaries over more traditional ones is that they present in different entries each of the argument structures of a predicate, which allows it to be monosemised. Thus, each usage of a predicate is conceived as a lexical unit to which a description is assigned, a sine qua non for machine translation. This is particularly relevant to hybrid systems, which combine statistics with syntax and semantics (e.g. Systran and Sadaw). The eventual product derived from this thesis, as well as the linguistic data (methodology, examples, tagging and files), can be considered unprecedented. Until now, lexicogrammar had been implemented based on linguistic intuition, which could make the creation of object classes somehow biased by the linguist’s idiolect. Using a syntactic-semantic tagging based on the above-mentioned methodology enables precision and objectivity, starting from real language instances (occurrences of these idioms in corpora). All in all, the usability and replicability of the linguistic data provided in the research may offer a wide range of possibilities, since this tagging could be implemented into linguistic analysis tools (e.g. in the form of tags in the “part of speech” label if idioms are used) and the model could be replicated to expressions from other lexico-phraseological fields, different types of PhUs and other languages. ; La present tesi doctoral ha rebut el finançament del contracte d’investigació predoctoral Ayudas para la Formación de Profesorado Universitario (Ref. FPU17/0032) concedit pel Ministerio de Educación, Cultura y Deporte d’Espanya (actual Ministerio de Ciencia, Innovación y Universidades). El projecte de tesi s’ha inscrit en el si de l’Institut Superior d’Investigació Cooperativa IVITRA [ISIC-IVITRA] (Programa per a la Constitució i Acreditació d’Instituts Superiors d’Investigació Cooperativa d’Excel∙lència de la Generalitat Valenciana, Ref. ISIC/012/042) i s’ha desenvolupat en el marc dels projectes, xarxes i grups de recerca següents: «Variación y cambio lingüístico en catalán. Una aproximación diacrónica según la Lingüística de Corpus» (MICINUN, Ref. PGC2018-099399-B-100371); (IEC, Ref. PRO2018-S04-MARTINES); Grup d’Investigació VIGROB-125 de la UA; Xarxa de recerca en innovació en docència universitària «Lingüística de Corpus i Mediterrània intercultural: investigació educativa per a l’aplicació de la Lingüística de Corpus en entorns multilingües diacrònics. Aplicacions del Metacorpus CIMTAC» (Institut de Ciències de l’Educació de la UA, Ref. 4581-2018), i Grup d’Investigació en Tecnologia Educativa en Història de la Cultura, Diacronia lingüística i Traducció (Universitat d’Alacant, Ref. GITE-09009-UA).
Keyword: Corpus-Based Methodology; Electronic Dictionary; Filología Catalana; Idioms; Interlinguistic Equivalents; Lexicogrammar; Phraseology; Somatisms; Syntactic-Semantic Tagging
URL: http://hdl.handle.net/10045/122159
BASE
Hide details
2
Variació i canvi lingüístic en llengua catalana. La Lingüística de Corpus com a eina per a la investigació educativa en entorns multilingües
Martines, Vicent; Antolí Martínez, Jordi M.; Sánchez-López, Elena. - : Universitat d’Alacant. Institut de Ciències de l’Educació, 2020
BASE
Show details
3
Laughing one's head off in Spanish subtitles: a corpus-based study on diatopic variation and its consequences for translation
In: Fraseología, Diatopía Y Traducción / Phraseology, Diatopic Variation and Translation (2018)
BASE
Show details
4
Imágenes e innovación educativa en Historia Cultural. El tópico de Leandro y Hero, de Joan Rois de Corella, en su tradición iconográfica
Martines, Vicent. - : Octaedro, 2016
BASE
Show details
5
28è Curs de Sociolingüística de la Nucia: "Les llengües viatgen: Migracions i Multilingüisme" ...
BASE
Show details
6
"Ut pictura poiesis": text i imatge, mimesi i realitat quant a «Tirant lo Blanch»
In: Caplletra. Revista Internacional de Filologia; Núm. 43 (2007); 213-242 (2014)
BASE
Show details
7
Materials per a l'anàlisi multilingüe i contrastiva de locucions, col·locacions i fraseologia. Un escandall quant a Ausiàs March i «Tirant lo Blanch»
In: Caplletra. Revista Internacional de Filologia; Núm. 40 (2006); p. 153-190 (2014)
BASE
Show details
8
«Traductologia i discurs». Introducció al monogràfic
In: Caplletra. Revista Internacional de Filologia; Núm. 40 (2006); p. 95-98 (2014)
BASE
Show details
9
Tragedia de Caldesa. Tragedia razonando un caso afortunado que con una dama le sucedió
Martines, Vicent. - : University of California at Santa Barbara, 2013
BASE
Show details
10
El ars de traducir en Joan Roís de Corella, los límites de la difusión de sus obras y las claves para una Tragèdia de Caldesa políglota
Martines, Vicent. - : University of California at Santa Barbara, 2013
BASE
Show details
11
Pour une histoire comparée des traductions : traductions des classiques, traductions du latin, traductions des langues romaines du Moyen Âge et de la première modernité
Martines, Vicent (Hrsg.); Courcelles, Dominique de (Hrsg.). - Paris : École des Chartes, 2012
BLLDB
UB Frankfurt Linguistik
Show details
12
Aspectos de la poliédrica influencia de los clásicos: nuevas imágenes clave de la influencia de san Vicente Ferrer en Italia
Martines, Vicent. - : University of California, Santa Barbara. Department of Spanish and Portuguese, 2012
BASE
Show details
13
IVITRA- International Virtual Institute of Translation: Multilingual Text Processing Applications
BASE
Show details
14
Clàssics valencians multilingües ; Clásicos valencianos multilingües ; Multilingual Valencian classics ; Mehrsprachige Valenzianische klassiker ; Classiques valenciens multilingues
BASE
Show details
15
Materials per a l'anàlisi multilingüe i contrastiva de locucions, col·locacions i fraseologia. Un escandall quant a Ausiàs March i «Tirant lo Blanch»
In: Caplletra. Revista Internacional de Filologia.; Caplletra 40 (Primavera 2006); 153-190 ; Caplletra. Revista Internacional de Filologia; Caplletra 40 (Primavera 2006); 153-190 ; 2386-7159 ; 0214-8188 (2006)
BASE
Show details
16
Locucions verbals i intel·lecció de clàssics romànics medievals en les seues traduccions: locucions i col·locacions multilingües
Martines, Vicent. - : Universidad de Alicante. Departamento de Filología Española, Lingüística General y Teoría de la Literatura, 2004
BASE
Show details
17
Un món de llengües : actes de les V - IX jornades de sociolingüistica de la Nucia (Marina Baixa)
Martines, Josep (Hrsg.); Martines, Vicent (Hrsg.); Cano, Maria Antònia (Hrsg.). - Alcoi : Marfil, 1998
BLLDB
UB Frankfurt Linguistik
Show details
18
El «Tirant lo Blanch» i les seues traduccions anteriors al segle XIX: aspectes de recepció i de traducció
In: Caplletra. Revista Internacional de Filologia; Núm. 23 (1997); p. 91-122 (1997)
BASE
Show details

Catalogues
2
0
0
0
0
0
0
Bibliographies
2
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
16
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern