BlackboxNLP 2025

Call for Papers

Overview

BlackboxNLP 2025 invites the submission of archival and non-archival papers featuring original and unpublished research on interpreting and explaining NLP models by taking inspiration from fields such as machine learning, psychology, linguistics, and neuroscience. We hope the workshop will serve as an interdisciplinary meetup that allows for cross-collaboration.

The topics of the workshop include, but are not limited to:

Mechanistic interpretability, reverse engineering approaches to understanding particular properties of neural models.
Understanding how language models use context by measuring their context-mixing processes (e.g., their token-to-token interactions)
Scaling up analysis methods for large language models (LLMs)
Probing methods for testing whether models have acquired or represent certain linguistic properties.
Adapting and applying analysis techniques from other disciplines (e.g., neuroscience or computer vision).
Examining model performance on simplified or formal languages.
Proposing modifications to neural architectures that increase their interpretability.
Open-source tools for analysis, visualization, or explanation to democratize access to interpretability techniques in NLP.
Explanation methods such as saliency, attribution, free-text explanations, or explanations with structured properties.
Evaluation of explanation methods: how do we know the explanation is faithful to the model?
Evaluation of techniques for steering LLM output behavior
Uncovering the reasoning processes of LLMs
Understanding under the hood of memorization in LLMs
Insights into LLM Failures
Opinion pieces about the state of interpretability in NLP.

Paper Submission Information

We will accept direct submissions or ARR commitments through OpenReview. All submissions should use the *ACL template and formatting requirements, following the official EMNLP style guidelines. Archival paper must be fully anonymized.

Submission Types

Archival papers of up to 8 pages + references. These are papers reporting on completed, original, and unpublished research. Papers shorter than this maximum are also welcome. An optional appendix may appear after the references in the same pdf file. Accepted papers are expected to be presented at the workshop and will be published in the workshop proceedings of the ACL Anthology, meaning they cannot be published elsewhere. They should report on obtained results rather than intended work. These papers will undergo double-blind peer-review, and should thus be anonymized.
Non-archival extended abstracts of 2 pages + references. These may report on work in progress or may be cross-submissions that have already appeared (or are scheduled to appear) in another venue. These submissions are non-archival and will not be included in the proceedings. The selection will not be based on a double-blind review and thus submissions of this type need not be anonymized.

Accepted submissions for both tracks will be presented at the workshop: most as posters, some as oral presentations (determined by the program committee).

Important dates

August 15th, 2025 - Direct paper submission deadline (through OpenReview).
September 5th, 2025 - Commitment deadline for ARR papers (through OpenReview).
September 15th, 2025 - Notification of acceptance.
October 3rd, 2025 - Camera ready deadline.
November 9th, 2025 - Workshop date.

Dual Submissions and Preprints

Dual submissions are allowed for the archival track, but please check the dual submissions policy for the other venue that you are dual-submitting to. Papers posted to preprint servers such as arXiv can be submitted without any restrictions on when they were posted.

Camera-ready information

Authors of accepted archival papers should upload the final version of their paper to the submission system by the camera-ready deadline. Authors may use one extra page to address reviewer comments, for a total of nine pages + references. Broader Impacts/Ethics and Limitations sections are optional and can be included on a 10th page.

Contact

Please contact the organizers at blackboxnlp@googlegroups.com for any questions.

Anti-Harassment Policy

BlackboxNLP 2025 adheres to the ACL Anti-Harassment Policy.