Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Erdmann, Alexander | - |
dc.contributor.author | Wrisley, David Joseph | - |
dc.contributor.author | Brown, Christopher | - |
dc.contributor.author | Cohen-Bodénès, Sophie | - |
dc.contributor.author | Elsner, Micha | - |
dc.contributor.author | Feng, Yukun | - |
dc.contributor.author | Joseph, Brian | - |
dc.contributor.author | Joyeux-Prunel, Béatrice | - |
dc.contributor.author | de Marneffe, Marie-Catherine | - |
dc.date.accessioned | 2019-09-07T22:52:03Z | - |
dc.date.available | 2019-09-07T22:52:03Z | - |
dc.date.issued | 2019 | - |
dc.identifier.citation | Erdmann, A. et al. (2019) Practical, Efficient, and Customizable Active Learning for Named Entity Recognition in the Digital Humanities. Proceedings of NAACL-HLT 2019, pages 2223–2234 Minneapolis, Minnesota, June 2 - June 7, 2019. | en |
dc.identifier.uri | https://www.aclweb.org/anthology/N19-1231 | - |
dc.identifier.uri | http://hdl.handle.net/2451/60381 | - |
dc.description.abstract | Scholars in inter-disciplinary fields like the Digital Humanities are increasingly interested in semantic annotation of specialized corpora. Yet, under-resourced languages, imperfect or noisily structured data, and user-specific classification tasks make it difficult to meet their needs using off-the-shelf models. Manual annotation of large corpora from scratch, meanwhile, can be prohibitively expensive. Thus, we propose an active learning solution for named entity recognition, attempting to maximize a custom model’s improvement per additional unit of manual annotation. Our system robustly handles any domain or user-defined label set and requires no external resources, enabling quality named entity recognition for Humanities corpora where such resources are not available. Evaluating on typologically disparate languages and datasets, we reduce required annotation by 20-60% and greatly outperform a competitive active learning baseline. | en |
dc.description.sponsorship | New York University–Paris Sciences Lettres Global Alliance grant; National Endowment for the Humanities grant, award HAA-256078-17; Computational Approaches to Modeling Language lab at New York University Abu Dhabi | en |
dc.language.iso | en_US | en |
dc.publisher | Association for Computational Linguistics | en |
dc.subject | digital humanities | en |
dc.subject | named entity recognition | en |
dc.subject | active learning | en |
dc.subject | machine learning | en |
dc.title | Practical, Efficient, and Customizable Active Learning for Named Entity Recognition in the Digital Humanities | en |
dc.type | Article | en |
Appears in Collections: | David Wrisley's Collection |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
N19-1231.pdf | NAACL2019 | 1.2 MB | Adobe PDF | View/Open |
Items in FDA are protected by copyright, with all rights reserved, unless otherwise indicated.