Investigating BERT’s Knowledge of Language: Five Analysis Methods with NPIs

Warstadt, Alex; Bowman, Samuel R.; et al.

doi:10.18653/v1/D19-1286

Full metadata record

DC Field	Value	Language
dc.contributor.author	Warstadt, Alex	-
dc.contributor.author	Bowman, Samuel R.	-
dc.contributor.author	et al.	-
dc.date.accessioned	2020-01-08T19:35:07Z	-
dc.date.available	2020-01-08T19:35:07Z	-
dc.date.issued	2019-11	-
dc.identifier.citation	Anthology ID: D19-1286 Volume: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) Month: November Year: 2019 Address: Hong Kong, China Venues: EMNLP \| IJCNLP SIG: SIGDAT Publisher: Association for Computational Linguistics Note: Pages: 2877–2887 URL: https://www.aclweb.org/anthology/D19-1286 DOI: 10.18653/v1/D19-1286	en
dc.identifier.uri	http://hdl.handle.net/2451/60991	-
dc.description.abstract	Though state-of-the-art sentence representation models can perform tasks requiring significant knowledge of grammar, it is an open question how best to evaluate their grammatical knowledge. We explore five experimental methods inspired by prior work evaluating pretrained sentence representation models. We use a single linguistic phenomenon, negative polarity item (NPI) licensing, as a case study for our experiments. NPIs like any are grammatical only if they appear in a licensing environment like negation (Sue doesn’t have any cats vs. *Sue has any cats). This phenomenon is challenging because of the variety of NPI licensing environments that exist. We introduce an artificially generated dataset that manipulates key features of NPI licensing for the experiments. We find that BERT has significant knowledge of these features, but its success varies widely across different experimental methods. We conclude that a variety of methods is necessary to reveal all relevant aspects of a model’s grammatical knowledge in a given domain.	en
dc.description.sponsorship	This project was a joint effort by the participants in the Spring 2019 NYU Linguistics seminar course Linguistic Knowledge in Reusable Sentence Encoders. We are grateful to the department for making this seminar possible. This material is based upon work supported by the National Science Foundation under Grant No. 1850208. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation. This project has also benefited from financial support to SB by Samsung Research under the project Improving Deep Learning using Latent Structure and from the donation of a Titan V GPU by NVIDIA Corporation.	en
dc.language.iso	en_US	en
dc.publisher	Association for Computational Lingusitics	en
dc.relation.ispartofseries	ACL Anthology;D19-1286	-
dc.subject	computational linguistics, natural language processing	en
dc.title	Investigating BERT’s Knowledge of Language: Five Analysis Methods with NPIs	en
dc.type	Article	en
dc.identifier.DOI	10.18653/v1/D19-1286	-
Appears in Collections:	Machine Learning for Language Lab

Files in This Item:

File	Description	Size	Format
data_generation_code.zip	Python 3 code for data generation.	226.92 MB	Unknown	View/Open
modeling_code.zip	Python 3 modeling code based on the jiant package. See scripts/bert_npi for instructions.	58.17 MB	Unknown	View/Open
npi_data_all_environments.tsv	Experimental data.	70.95 MB	Unknown	View/Open
paper.pdf	Research paper published at EMNLP 2019.	1.27 MB	Adobe PDF	View/Open
paper_appendix.pdf	Appendix/supplement to research paper published at EMNLP 2019.	1.93 MB	Adobe PDF	View/Open

Show simple item record