Skip navigation

Machine Learning for Language Lab

Collection's Items: 1 to 15 of 15

Issue DateTitleAuthor(s)
Jul-2020BLiMP: The Benchmark of Linguistic Minimal Pairs for English (Electronic Resources)Warstadt, Samuel R.; Parrish, Alicia; Liu, Haokun; Mohananey, Anhad; Peng, Wei; Wang, Sheng-Fu; Bowman, Samuel R.
2019CoLA: The Corpus of Linguistic Acceptability (with added annotations)Warstadt, Alex; Singh, Amanpreet; Bowman, Samuel R.
2021Comparing Test Sets with Item Response TheoryClara Vania; Samuel R. Bowman
Nov-2023Data for "Debate Helps Supervise Unreliable Experts"Michael, Julian; Rein, David; Bowman, Samuel; et al.
Jun-2023Data for "Inverse Scaling: When Bigger Isn't Better"McKenzie, Ian; Bowman, Samuel R.; Perez, Ethan
2021Does Putting a Linguist in the Loop Improve NLU Data Collection?Alicia Parrish; Samuel R. Bowman
Nov-2023GPQA: A Graduate-Level Google-Proof Q&A BenchmarkRein, David; Bowman, Samuel; et al.
Nov-2019Investigating BERT’s Knowledge of Language: Five Analysis Methods with NPIsWarstadt, Alex; Bowman, Samuel R.; et al.
2020The Mixed Signals Generalization SetWarstadt, Alex; Zhang, Yian; Li, Haau-Sing; Bowman, Samuel R.
2018The Multi-Genre NLI CorpusWilliams, Adina; Nangia, Nikita; Bowman, Samuel R.
2023Pretraining Language Models with Human PreferencesTomasz Korbak; Samuel R. Bowman; Ethan Perez
2023(QA)^2: Question Answering with Questionable AssumptionsSamuel R. Bowman; Phu Mon Htut; Najoung Kim
2022QuALITY: Question Answering with Long Input Texts, Yes!Richard Yuanzhe Pang; Samuel R. Bowman
Jun-2015The SNLI CorpusBowman, Samuel R.; Angeli, Gabor; Potts, Christopher; Manning, Christopher D.
2022SQuALITY: Building a Long-Document Summarization Dataset the Hard WayAlex Wang; Samuel R. Bowman