David Wadden

I'm a research scientist on the AllenNLP and Semantic Scholar teams at the Allen Institute for AI. I'm interested in NLP and machine learning, often motivated by applications in science and health.

Previously, I was a PhD student at the University of Washington, advised by Hanna Hajishirzi.

More details in my CV.


This list may be out of date; please see my Google Scholar or Semantic Scholar for an up-to-date list.

  • Entity-Centric Query Refinement
    David Wadden, Nikita Gupta, Kenton Lee, Kristina Toutanova
    AKBC, 2022
    Best Paper Honorable Mention
    Article   Code  

  • SciFact-Open: Towards open-domain scientific claim verification
    David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Iz Beltagy, Lucy Lu Wang, Hannaneh Hajishirzi
    EMNLP Findings, 2022

  • MultiVerS: Improving scientific claim verification with weak supervision and full-document context
    David Wadden, Kyle Lo, Lucy Lu Wang, Arman Cohan, Iz Beltagy, Hannaneh Hajishirzi
    NAACL Findings, 2022
    Article   Code  

  • Generating Scientific Claims for Zero-Shot Scientific Fact Checking
    Dustin Wright, David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Isabelle Augenstein, Lucy Lu Wang
    ACL, 2022

  • The Effect of Moderation on Online Mental Health Conversations
    David Wadden, Tal August, Qisheng Li, Tim Althoff
    ICWSM, 2021
    Outstanding study design paper

  • Scientific Language Models for Biomedical Knowledge Base Completion: An Empirical Study
    Rahul Nadkarni, David Wadden, Iz Beltagy, Noah A. Smith, Hannaneh Hajishirzi, Tom Hope
    AKBC, 2021

  • Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
    Aida Amini, Tom Hope, David Wadden, Madeleine van Zuylen, Eric Horvitz, Roy Schwartz, Hannaneh Hajishirzi
    NAACL, 2021

  • Overview and Insights from the SciVer Shared Task on Scientific Claim Verification
    David Wadden, Kyle Lo
    Scholarly Document Processing Workshop @ NAACL, 2021

  • Fact or Fiction: Verifying Scientific Claims
    David Wadden, Shanchuan Lin, Kyle Lo, Lucy Lu Wang, Madeleine van Zuylen, Arman Cohan, Hannaneh Hajishirzi
    EMNLP, 2020
    Article   Code   Demo  

  • Entity, Relation, and Event Extraction with Contextualized Span Representations
    David Wadden, Ulme Wennberg, Yi Luan, Hannaneh Hajishirzi
    EMNLP, 2019
    Article   Code  

  • A General Framework for Information Extraction using Dynamic Span Graphs
    Yi Luan, David Wadden, Luheng He, Amy Shah, Mari Ostendorf, Hannaneh Hajishirzi
    NAACL, 2019

Older Publications (Comp Bio)

  • Stratification of amyotrophic lateral sclerosis patients: a crowdsourcing approach
    The ALS Stratification Consortium.
    Scientific Reports, 2019

  • The GCTx format and cmap{Py, R, M, J} packages: resources for the optimized storage and integrated traversal of dense matrices of annotated dense matrices
    Oana M Enache, David L Lahr, Ted E Natoli, Lev Litichevskiy, David Wadden, Corey Flynn, Joshua Z Gould, Jacob K Asiedu, Rajiv Narayan, Aravind Subramanian.
    Bioinformatics, 2018

  • A Next Generation Connectivity Map: L1000 Platform And The First 1,000,000 Profiles
    Aravind Subramanian et al.
    Cell, 2017

  • Evaluation of RNAi and CRISPR technologies by large-scale gene expression profiling in the Connectivity Map
    Ian Smith, Peyton Greenside, Ted Natoli, David L. Lahr, David Wadden, Itay Tirosh, Rajiv Narayan, David E. Root, Todd R. Golub, Aravind Subramanian, John G. Doench.
    PLoS Biology, 2017