Hits 21 – 40 of 148

21	Deep Fusion of Multiple Term-Similarity Measures For Biomedical Passage Retrieval
	Montes Gomez, Manuel; Rosso, Paolo; Rosso-Mateus, Andrés. - : IOS Press, 2020
	BASE

22	A Twitter Political Corpus of the 2019 10N Spanish Election
	Rosso, Paolo; Ponzetto, Simone Paolo; Sánchez-Junquera, Javier. - : Springer, 2020
	BASE

23	Do Linguistic Features Help Deep Learning? The Case of Aggressiveness in Mexican Tweets
	Frenda, Simona; Banerjee, Somnath; Rosso, Paolo. - : Instituto Politecnico Nacional/Centro de Investigacion en Computacion, 2020
	BASE

24	Multimodal Fake News Detection with Textual, Visual and Semantic Information
	Giachanou, Anastasia; Zhang, Guobiao; Rosso, Paolo. - : Springer, 2020
	BASE

25	An Emotional Analysis of False Information in Social Media and News Articles
	Rangel, Francisco; Rosso, Paolo; Ghanem, Bilal Hisham Hasan. - : Association for Computing Machinery, 2020
	BASE

26	Irony Detection in Twitter with Imbalanced Class Distributions
	Herrera, Francisco; Rosso, Paolo; Hernandez-Farias, Delia Irazu. - : IOS Press, 2020
	BASE

27	#Brexit: Leave or Remain? The Role of User's Community and Diachronic Evolution on Stance Detection
	Lai, Mirko; Patti, Viviana; Ruffo, Giancarlo. - : IOS Press, 2020
	BASE

28	MSIR@FIRE: A Comprehensive Report from 2013 to 2016
	Banerjee, Somnath; Choudhury, Monojit; Das, Amitava. - : Springer, 2020
	BASE

29	Fine-Grained Analysis of Language Varieties and Demographics
	Rangel, Francisco; Rosso, Paolo; Zaghouani, Wajdi; Charfi, Anis. - : Cambridge University Press, 2020
	Abstract: The rise of social media empowers people to interact and communicate with anyone anywhere in the world. The possibility of being anonymous avoids censorship and enables freedom of expression. Nevertheless, this anonymity might lead to cybersecurity issues, such as opinion spam, sexual harassment, incitement to hatred or even terrorism propaganda. In such cases, there is a need to know more about the anonymous users and this could be useful in several domains beyond security and forensics such as marketing, for example. In this paper, we focus on a fine-grained analysis of language varieties while considering also the authors' demographics. We present a Low-Dimensionality Statistical Embedding method to represent text documents. We compared the performance of this method with the best performing teams in the Author Profiling task at PAN 2017. We obtained an average accuracy of 92.08% versus 91.84% for the best performing team at PAN 2017. We also analyse the relationship of the language variety identification with the authors' gender. Furthermore, we applied our proposed method to a more fine-grained annotated corpus of Arabic varieties covering 22 Arab countries and obtained an overall accuracy of 88.89%. We have also investigated the effect of the authors' age and gender on the identification of the different Arabic varieties, as well as the effect of the corpus size on the performance of our method. ; This publication was made possible by NPRP grant 9-175-1-033 from the Qatar National Research Fund (a member of Qatar Foundation). The statements made herein are solely the responsibility of the authors. ; Rangel, F.; Rosso, P.; Zaghouani, W.; Charfi, A. (2020). Fine-Grained Analysis of Language Varieties and Demographics. Natural Language Engineering. 26(6):641-661. https://doi.org/10.1017/S1351324920000108
	Keyword: Age; Arabic; Author profiling; Cybersecurity; Demographics; Gender; Language variety identification; Computer Languages and Systems
	URL: http://hdl.handle.net/10251/166834 https://doi.org/10.1017/S1351324920000108
	BASE

30	Multilingual Stance Detection in Social Media Political Debates
	Lai, Mirko; Cignarella, Alessandra Teresa; Hernandez-Farias, Delia Irazu. - : Elsevier, 2020
	BASE

31	Fake Opinion Detection: How Similar are Crowdsourced Datasets to Real Data?
	Fornaciari, Tommaso; Cagnina, Leticia; Rosso, Paolo. - : Springer-Verlag, 2020
	BASE

32	FacTweet: Profiling Fake News Twitter Accounts
	Rosso, Paolo; Ponzetto, Simone Paolo; Ghanem, Bilal Hisham Hasan. - : Springer, 2020
	BASE

33	Overview of PAN 2020: Authorship Verification, Celebrity Profiling, Profiling Fake News Spreaders on Twitter, and Style Change Detection
	Bevendorff, Janek; Ghanem, Bilal Hisham Hasan; Giachanou, Anastasia. - : Springer, 2020
	BASE

34	The Role of Personality and Linguistic Patterns in Discriminating Between Fake News Spreaders and Fact Checkers
	Giachanou, Anastasia; Ríssola, Esteban A.; Ghanem, Bilal. - : Springer, 2020
	BASE

35	Scalable and Language-Independent Embedding-based Approach for Plagiarism Detection Considering Obfuscation Type: No Training Phase
	Gharavi, Erfaneh; Veisi, Hadi; Rosso, Paolo. - : Springer-Verlag, 2020
	BASE

36	Introduction to the Special Section on Computational Modeling and Understanding of Emotions in Conflictual Social Interactions
	Rosso, Paolo; Clavel, Chloé; Damiano, Rossana. - : Association for Computing Machinery, 2020
	BASE

37	On the use of character n-grams as the only intrinsic evidence of plagiarism [<Journal>]
	Bensalem, Imene [Author]; Rosso, Paolo [Author]; Chikhi, Salim [Author]
	DNB Subject Category Language

38	Classifier combination approach for question classification for Bengali question answering system [<Journal>]
	Banerjee, Somnath [Author]; Naskar, Sudip Kumar [Author]; Rosso, Paolo [Author].
	DNB Subject Category Language

39	Stance polarity in political debates: A diachronic perspective of network homophily and conversations on Twitter
	Lai, Mirko; Tambuscio, Marcella; Patti, Viviana. - 2019
	BASE

40	IDAT@FIRE2019: Overview of the Track on Irony Detection in Arabic Tweets
	Ghanem, Bilal; Karoui, Jihen; Benamara, Farah. - : CEUR-WS.org, 2019
	BASE
