DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6
Hits 1 – 20 of 118

1
Goldilocks: Just-Right Tuning of BERT for Technology-Assisted Review ...
Abstract: Technology-assisted review (TAR) refers to iterative active learning workflows for document review in high recall retrieval (HRR) tasks. TAR research and most commercial TAR software have applied linear models such as logistic regression to lexical features. Transformer-based models with supervised tuning are known to improve effectiveness on many text classification tasks, suggesting their use in TAR. We indeed find that the pre-trained BERT model reduces review cost by 10% to 15% in TAR workflows simulated on the RCV1-v2 newswire collection. In contrast, we likewise determined that linear models outperform BERT for simulated legal discovery topics on the Jeb Bush e-mail collection. This suggests the match between transformer pre-training corpora and the task domain is of greater significance than generally appreciated. Additionally, we show that just-right language model fine-tuning on the task collection before starting active learning is critical. Too little or too much fine-tuning hinders performance, ... : 6 pages, 1 figure, accepted at ECIR 2022 ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences; Information Retrieval cs.IR
URL: https://dx.doi.org/10.48550/arxiv.2105.01044
https://arxiv.org/abs/2105.01044
BASE
Hide details
2
On-demand HIV pre-exposure prophylaxis: what are the healthcare workers barriers to delivery in a culturally and linguistically diverse population?
Power, Melissa; Lewis, David A; Richardson, Daniel. - : BMJ Publishing Group Ltd, 2020
BASE
Show details
3
Copyright and the reuse of translation as data
In: Moorkens, Joss orcid:0000-0003-0766-0071 and Lewis, David orcid:0000-0002-3503-4644 (2019) Copyright and the reuse of translation as data. In: O'Hagan, Minako, (ed.) The Routledge Handbook of Translation and Technology. Routledge Translation Handbooks . Routledge, Abingdon, UK, pp. 469-481. ISBN 9781138232846 (2019)
BASE
Show details
4
Copyright and the re-use of translation as data
Lewis, David. - 2019
BASE
Show details
5
Translation of Markovnikov’s Magistr Khimii Dissertation: A Progress Report
BASE
Show details
6
Translation resources and translator disempowerment
In: Moorkens, Joss orcid:0000-0003-0766-0071 , Lewis, David orcid:0000-0002-3503-4644 , Reijers, Wessel orcid:0000-0003-2505-1587 , Vanmassenhove, Eva orcid:0000-0003-1162-820X and Way, Andy orcid:0000-0001-5736-5930 (2016) Translation resources and translator disempowerment. In: Tenth International Conference on Language Resources and Evaluation (LREC 2016), 24 - 28 May 2016, Portorož, Slovenia. ISBN 978-2-9517408-9-1 (2016)
BASE
Show details
7
An empirical study of segment prioritization for incrementally retrained post-editing-based SMT
In: Du, Jinhua orcid:0000-0002-3267-4881 , Ankit, Srivastava, Way, Andy orcid:0000-0001-5736-5930 , Maldonado Guerra, Alfredo orcid:0000-0001-8426-5249 and Lewis, David orcid:0000-0002-3503-4644 (2015) An empirical study of segment prioritization for incrementally retrained post-editing-based SMT. In: The Fifteenth MT Summit Conference, 30 Oct-3 Nov 2015, Miami, FL, USA. (2015)
BASE
Show details
8
Developments in Whistleblowing Research 2015
Lewis, David; Vandekerckhove, Wim. - : International Whistleblowing Research Network, 2015
BASE
Show details
9
A Brief Survey of Multimedia Annotation Localisation on the Web of Linked Data ; 3rd Workshop on Linked Data in Linguistics: Multilingual Knowledge Resources and Natural Language Processing
LEWIS, DAVID. - : ELRA, 2014
BASE
Show details
10
Internationalization Tag Set (ITS) Version 2.0
BASE
Show details
11
The key to protection: civil and employment law remedies
Lewis, David; Devine, Tom; Harpur, Paul. - : Edward Elgar Publishing, 2014
BASE
Show details
12
Generalizing ITS as an Interoperable Annotation Technique for Global Intelligent Content
BASE
Show details
13
Linking localisation and language resources ; Linked Data in Linguistics
Lawless, Seamus; Lewis, David. - : Springer-Verlag LNCS, 2012
BASE
Show details
14
A Semantic Model for Integrated Content Management, Localisation and Language Technology Processing ; 2nd Workshop on the Multilingual Semantic Web
BASE
Show details
15
Truth in fiction
In: Arguing about language (New York, 2010), p. 561-572
MPI für Psycholinguistik
Show details
16
Languages and language
In: Arguing about language (New York, 2010), p. 225-245
MPI für Psycholinguistik
Show details
17
Languages and language
In: Semantics (London, 2009), p. 142-167
MPI für Psycholinguistik
18
Score keeping in a language game
In: Pragmatics, thought, and some contemporary issues (London, 2009), p. 33-50
MPI für Psycholinguistik
19
Web service integration for next generation localisation
In: Lewis, David orcid:0000-0002-3503-4644 , Curran, Stephen, Feeney, Kevin, Etzioni, Zohar, Keeney, John, Way, Andy orcid:0000-0001-5736-5930 and Schäler, Reinhard (2009) Web service integration for next generation localisation. In: SETQA-NLP 2009 - Workshop on Software Engineering, Testing, and Quality Assurance for Natural Language Processing, 5 June 2009, Boulder, Colorado, USA. (2009)
BASE
Show details
20
Convention : A Philosophical Study
Lewis, David [Verfasser]. - New York, NY : John Wiley & Sons, 2008
DNB Subject Category Language
Show details

Page: 1 2 3 4 5 6

Catalogues
8
6
2
0
1
1
1
Bibliographies
34
0
0
3
4
0
0
0
40
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
21
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern