Analyzing and interpreting neural networks for NLP

Revealing the content of the neural black box: workshop on the analysis and interpretation of neural networks for Natural Language Processing.

This project is maintained by blackboxnlp

BlackboxNLP 2021

The fourth edition of BlackboxNLP is collocated with EMNLP 2021.

Important dates

Best paper award

The winner of the Best Paper Award is the paper ``On the Limits of Minimal Pairs in Contrastive Evaluation’’ by Jannis Vamvas and Rico Sennrich.

Programme

San Francisco (UTC-8) | Punta Cana (UTC-4) | London (UTC) | Beijing (UTC+8)

22:00 - 22:15 | 02:00 - 02:15 | 06:00 - 06:15 | 14:00 - 14:15 Opening remarks. Zoom.
22:15 - 23:00 | 02:15 - 03:00 | 06:15 - 07:00 | 14:15 - 15:00 Keynote 1 (w/ Q&A): Willem Zuidema. Zoom.
23:00 - 23:15 | 03:00 - 03:15 | 07:00 - 07:15 | 15:00 - 15:15 Break.
23:15 - 00:00 | 03:15 - 04:00 | 07:15 - 08:00 | 15:15 - 16:00 Oral presentation session 1 (w/ Q&A). Zoom.
00:00 - 00:30 | 04:00 - 04:30 | 08:00 - 08:30 | 16:00 - 16:30 Break.
00:30 - 02:00 | 04:30 - 06:00 | 08:30 - 10:00 | 16:30 - 18:00 Poster session 1. Gather.town.
02:00 - 02:15 | 06:00 - 06:15 | 10:00 - 10:15 | 18:00 - 18:15 Break.
02:15 - 03:00 | 06:15 - 07:00 | 10:15 - 11:00 | 18:15 - 19:00 Oral presentation session 2 (w/ Q&A). Zoom.
03:00 - 03:30 | 07:00 - 07:30 | 11:00 - 11:30 | 19:00 - 19:30 Break.
03:30 - 04:00 | 07:30 - 08:00 | 11:30 - 12:00 | 19:30 - 20:00 Keynote 2 (w/o Q&A): Ana Marasović. Zoom.
04:00 - 04:30 | 08:00 - 08:30 | 12:00 - 12:30 | 20:00 - 20:30 Keynote 3 (w/o Q&A): Sara Hooker. Zoom.
04:30 - 04:45 | 08:30 - 08:45 | 12:30 - 12:45 | 20:30 - 20:45 Closing – virtual program. Zoom.
Break. Hybrid program starts (On-site sessions in room Bavaro 2).
05:00 - 05:15 | 09:00 - 09:15 | 13:00 - 13:15 | 21:00 - 21:15 Opening remarks and best paper award. On-site & Zoom.
05:15 - 06:00 | 09:15 - 10:00 | 13:15 - 14:00 | 21:15 - 22:00 Keynote 4 (w/ Q&A): Willem Zuidema. On-site & Zoom.
06:00 - 06:30 | 10:00 - 10:30 | 14:00 - 14:30 | 22:00 - 22:30 Oral presentation session 3 (w/ Q&A). On-site & Zoom.
06:30 - 07:00 | 10:30 - 11:00 | 14:30 - 15:00 | 22:30 - 23:00 Coffee break.
07:00 - 08:00 | 11:00 - 12:00 | 15:00 - 16:00 | 23:00 - 00:00 Poster session 2. Gather.town.
08:00 - 09:00 | 12:00 - 13:00 | 16:00 - 17:00 | 00:00 - 01:00 Lunch break.
09:00 - 09:45 | 13:00 - 13:45 | 17:00 - 17:45 | 01:00 - 01:45 Keynote 5 (w/ Q&A): Sara Hooker. On-site & Zoom.
09:45 - 10:15 | 13:45 - 14:15 | 17:45 - 18:15 | 01:45 - 02:15 Oral presentation session 4 (w/ Q&A). On-site & Zoom.
10:15 - 10:45 | 14:15 - 14:45 | 18:15 - 18:45 | 02:15 - 02:45 Coffee break.
10:45 - 12:15 | 14:45 - 16:15 | 18:45 - 20:15 | 02:45 - 04:15 Poster session 3. On-site & Gather.town.
12:15 - 10:45 | 16:15 - 14:45 | 20:15 - 18:45 | 04:15 - 02:45 Coffee break.
12:45 - 13:15 | 16:45 - 17:15 | 20:45 - 21:15 | 04:45 - 05:15 Oral presentation session 5 (w/ Q&A). On-site & Zoom.
13:15 - 14:00 | 17:15 - 18:00 | 21:15 - 22:00 | 05:15 - 06:00 Keynote 6 (w/ Q&A): Ana Marasović. On-site & Zoom.
14:00 - 14:15 | 18:00 - 18:15 | 22:00 - 22:15 | 06:00 - 06:15 Closing remarks. On-site & Zoom.

The programme is fully outlined here.

Workshop description

Neural networks have rapidly become a central component in NLP systems in the last few years. The improvement in accuracy and performance brought by the introduction of neural networks has typically come at the cost of our understanding of the system: How do we assess what the representations and computations are that the network learns? The goal of this workshop is to bring together people who are attempting to peek inside the neural network black box, taking inspiration from machine learning, psychology, linguistics, and neuroscience. The topics of the workshop will include, but are not limited to:

BlackboxNLP 2021 is the fourth BlackboxNLP workshop. The programme and proceedings of the previous editions, which were held at EMNLP 2018, ACL 2019 and EMNLP 2020, can be found here, here and here.

The official call for papers is available here.

Paper submission

We accept two types of papers

Both papers and abstracts should follow the official EMNLP 2020 style guidelines and should be submitted via softconf:

https://www.softconf.com/emnlp2021/BlackboxNLP

Accepted submissions will be presented at the workshop: most as posters, some as oral presentations (determined by the program committee).

Dual submissions and preprints

Dual submissions with the main conference are allowed, but authors must declare dual submission by entering the paper’s main conference submission id. The reviews for the submission for the main conference will be automatically forwarded to the workshop and taken into consideration when your paper is evaluated. Authors of dual-submission papers accepted to the main conference should retract them from the workshop by September 20.

Papers posted to preprint servers such as arxiv can be submitted without any restrictions on when they were posted.

Camera-ready information

Authors of accepted archival papers should upload the final version of their paper to the submission system by the camera-ready deadline. Authors may use one extra page to address reviewer comments, for a total of nine pages.

Invited speakers

Sara Hooker

Sara Hooker is a research scientist at Google Brain working on training models that go beyond test-set accuracy to fulfill multiple desiderata. Her research interests gravitate towards interpretability, model compression and fairness. She is a founding organizer of the cross-institutional Trustworthy ML Initiative, a forum and seminar series dedicated to trustworthy machine learning research. Her current work centers on building tools that help human-in-the-loop audits of model behavior.

Ana Marasović

Ana Marasović is a postdoctoral researcher at the Allen Institute for AI (AllenNLP Team) and at the University of Washington (Noah’s ARK). Her research interests span natural language processing, explainable AI, and multimodality. She is currently focused on developing and evaluating models that provide readable explanations of their decision process for tasks requiring advanced reasoning abilities. She received her Ph.D. in the Heidelberg University NLP Group where she worked on learning with limited labeled data for discourse-oriented tasks.

Willem Zuidema

Willem Zuidema is associate professor of computational linguistics and cognitive science at the Institute for Logic, Language and Computation, University of Amsterdam. His lab works on deep learning models for NLP, with a focus on interpretability, bias, cognitive and neural relevance, and the relation between language and music. Zuidema and his students were early contributors to deep learning models in NLP, with work on neural parsing (from 2008), tree-shaped neurals networks (from 2012), and diagnostic classification/probing (from 2016). Recent work includes the integration of formal logic and deep learning, representational stability analysis, contextual decomposition and knowledge distillation.

Organizers

Jasmijn Bastings

Jasmijn Bastings (bastings[-at-]google.com) is a researcher at Google Amsterdam, having joined Google in Berlin late 2019. She holds a PhD from ILLC, University of Amsterdam, on the topic of Interpretable and Linguistically-informed Deep Learning for NLP. Recently, Jasmijn has been focusing on explainability, fairness and robustness within natural language processing. She authored two BlackboxNLP papers (2018, 2020) on generalisation and saliency methods, as well as an ACL paper (2019) on interpretable neural predictions using differentiable binary variables.

Yonatan Belinkov

Yonatan Belinkov (belinkov@technion.ac.il) is an assistant professor at the Henry and Marilyn Taub Faculty of Computer Science at the Technion. He has previosuly been a Postdoctoral Fellowat the Harvard School of Engineering and Applied Sciences (SEAS) and the MIT Computer Scienceand Artificial Intelligence Laboratory (CSAIL). His recent research focuses on interpretability androbustness of neural network models of language. His research has been published at leading NLPand ML venues.
His PhD dissertation at MIT analyzed internal language representations in deeplearning models. He has been awarded the Harvard Mind, Brain, and Behavior Postdoctoral Fellowship and the Azrieli Early Career Faculty Fellowship. He co-organised the second and third editions of BlackboxNLP and the first and second machine translation robustness tasks at WMT.

Dieuwke Hupkes

Dieuwke Hupkes (dieuwkehupkes@fb.com) is a Research scientist at Facebook AI Research, and the scientific manager of the Amsterdam unit of the ELLIS society. The main focus of her research is understanding how neural networks can understand and learn the structures that occur in natural language. Developing methods to interpret and interact with neural networks has therefore been an important area of focus in her research. She authored several articles directly relevant to the workshop, two of them published in a top AI journal (Journal of Artificial Intelligence), and she is co-organizing a workshop on compositionality, neural networks, and the brain, held at the Lorentz Center in the summer of 2019.

Emmanuel Dupoux

Emmanuel Dupoux (emmanuel.dupoux@gmail.com) is full professor at the Ecole des Hautes Etudesen Sciences Sociales (EHESS), and directs the Cognitive Machine Learning team at the Ecole NormaleSupérieure (ENS) in Paris and INRIA. Since 2018, he has been a part-time research scientist atFacebook AI Research. His research mixes developmental science, cognitive neuroscience, and machinelearning, with a focus on the reverse engineering of infant language and cognitive development using unsupervised or weakly supervised learning.
He has directed the CNRS Laboratoire de SciencesCognitives et Psycholinguistique for 10 years. He is the recipient of an Advanced ERC grant, theorganiser of the Zero Resource Speech Challenge (2015, 2017, 2019, 2020), the Intuitive Physics Benchmark (2019) and led in 2017 a Jelinek Summer Workshop at CMU on multimodal speechlearning

Yuval Pinter

Yuval Pinter (me@yuvalpinter.com) is a Senior Lecturer in the Computer Science department at Ben-Gurion University. He authored three papers on the topic of NLP neural model interpretation, looking into attention modules and character-level LSTMs. He co-organised the TREC Live QA competition for its three years of existence (2015–2017) including administering the real-time challenge, and served as publicity and social media co-chair at NAACL 2019.

Hassan Sajjad

Hassan Sajjad (hsajjad@hbku.edu.qa) is a research scientist at the Arabic Language Technologies group, Qatar Computing Research Institute - HBKU. His recent research focuses on developing methods to analyze and interpret neural network models both at the representation-level and at the individual neuron-level. His work on the analysis of deep models is recognized at various prestigious research venues such as ACL, NAACL, ICLR, and AAAI.

Mario Giulianelli

Mario Giulianelli (m.giulianelli@uva.nl) is a PhD student at the University of Amsterdam. He investigates whether neural networks can be employed as computational models of language learning and use, and works on proposing interpretable and controllable neural architectures which more explicitly emulate the processes underlying human language cognition. He authored two articles on investigating the propensity of language models to capture syntactic and semantic phenomena, presented at EMNLP and ACL. At the first edition of BlackBoxNLP, he won the Best Paper Award with his work on probing and improving an LSTM’s ability to track number agreement information.

Anti-Harassment Policy

BlackboxNLP 2021 adheres to the ACL Anti-Harassment Policy.