Recent papers library. Parsed and updated from my own Mendeley library.
Updated: 2/4/2026
- Random Paper
- Paper List
Random paper for the day
A regularized framework for sparse and structured neural attention
Authors: Vlad Niculae and Mathieu Blondel
Year: 2017
Abstract: Modern neural networks are often augmented with an attention mechanism, which tells the network where to focus within the input. We propose in this paper a new framework for sparse and structured attention, building upon a smoothed max operator. We show that the gradient of this operator defines a mapping from real values to probabilities, suitable as an attention mechanism. Our framework includes softmax and a slight generalization of the recently-proposed sparsemax as special cases. However, we also show how our framework can incorporate modern structured penalties, resulting in more interpretable attention mechanisms, that focus on entire segments or groups of an input. We derive efficient algorithms to compute the forward and backward passes of our attention mechanisms, enabling their use in a neural network trained with backpropagation. To showcase their potential as a drop-in replacement for existing ones, we evaluate our attention mechanisms on three large-scale tasks: textual entailment, machine translation, and sentence summarization. Our attention mechanisms improve interpretability without sacrificing performance; notably, on textual entailment and summarization, we outperform the standard attention mechanisms based on softmax and sparsemax.
Critical Reading Questions
1. What is the main research question or hypothesis, and what are the key findings?
This helps you understand the core purpose of the study and its primary results.
2. How sound is the methodology, and do the conclusions logically follow from the data?
This assesses the reliability of the research and the validity of its conclusions.
3. What are the implications and limitations of the study, and how does it contribute to the broader field?
This helps you contextualize the research and understand its significance and potential applications.