This is a list of useful papers and resources, each with a short explanation
Building Neural Nets with Other Neural Nets
- HyperNetworks - explores the idea of using a small network, known as a hypernetwork, to generate the weights for a larger network.
- AdaNet: Adaptive Structural Learning of Artificial Neural Networks - an approach that simultaneously and adaptively learns both the structure of the network and its weights.
- A Roadmap towards Machine Intelligence - proposes some fundamental properties that intelligent machines should have, focusing in particular on communication and learning.
- Neural Architecture Search with Reinforcement Learning - a broad search over available model architectures using an LSTM controller. The result is a new conv-net architecture and a new RNN cell.
- Punctuation Prediction for Unsegmented Transcript Based on Word Vector - a CNN for inserting punctuation into text.
- DelugeNets: Deep Networks with Massive and Flexible Cross-layer Information Inflows
- Progressive Neural Networks - the progressive networks approach is immune to forgetting and can leverage prior knowledge via lateral connections to previously learned features.
- Reward Augmented Maximum Likelihood for Neural Structured Prediction (short summary) - presents a simple and computationally efficient approach to incorporating task reward into a maximum likelihood framework. It establishes a connection between the log-likelihood and regularized expected reward objectives, showing that at zero temperature they are approximately equivalent in the vicinity of the optimal solution.
- Highway Networks - a list of papers, code, etc.
- Learning to Learn by Gradient Descent by Gradient Descent - setting the parameters of one network using another network.
- NIPS 2016 Tutorial: Generative Adversarial Networks
- Gumbel-Softmax in Categorical Variational Autoencoders - blog post; Categorical Reparameterization with Gumbel-Softmax is the original paper.
- A Few Useful Things to Know about Machine Learning - from the cs231n course.
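The hypernetwork idea from the first item above (a small network emitting the weights of a larger one) can be sketched in a few lines. This is a minimal illustration, not the paper's architecture: the sizes and the single linear hypernetwork are assumptions made up for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes: the hypernetwork maps a small embedding z to the
# weight matrix of a larger "main" layer (in_dim x out_dim).
z_dim, in_dim, out_dim = 4, 8, 16

# Hypernetwork: here just a single linear map from z to the flattened
# main-layer weights (real hypernetworks can be deeper).
H = rng.normal(scale=0.1, size=(z_dim, in_dim * out_dim))

def main_layer(x, z):
    """Apply the main layer whose weights are generated from embedding z."""
    W = (z @ H).reshape(in_dim, out_dim)  # weights produced by the hypernetwork
    return np.tanh(x @ W)

x = rng.normal(size=(2, in_dim))  # a batch of 2 inputs
z = rng.normal(size=(z_dim,))     # layer embedding fed to the hypernetwork
y = main_layer(x, z)
print(y.shape)  # (2, 16)
```

The point of the design is that only `H` and the per-layer embeddings `z` are trained, which can be far fewer parameters than storing every layer's weights directly.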
Papers regarding efficient backprop and neural network training
- Why Momentum Really Works
- Stochastic Gradient Descent Tricks (Leon Bottou)
- Efficient BackProp (Yann LeCun)
- Practical Recommendations for Gradient-Based Training of Deep Architectures (Yoshua Bengio)
- FitNets: Hints for Thin Deep Nets
- Sobolev Training for Neural Networks - training a network to match not only the target function's values but also its derivatives.
One Shot Learning
- Matching Networks for One Shot Learning - one-shot learning from Google DeepMind, evaluated on ImageNet.
- The More You Know: Using Knowledge Graphs for Image Classification
- DeepBench - benchmarking deep learning operations on different hardware.
- e-Lab Video Data Set(s) - an object tracking dataset.
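The core of the one-shot matching idea above can be sketched as nearest-neighbor classification over an embedded support set with one example per class. The embeddings and labels here are made up for illustration; real matching networks learn the embedding functions end to end.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two embedding vectors."""
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

# Hypothetical support set: one pre-computed embedding per class (one-shot).
support = {
    "cat": np.array([1.0, 0.1, 0.0]),
    "dog": np.array([0.0, 1.0, 0.2]),
}

def classify(query):
    """Label a query embedding by its most similar support example."""
    return max(support, key=lambda label: cosine(query, support[label]))

pred = classify(np.array([0.9, 0.2, 0.1]))
print(pred)  # "cat" - the query is closest to the cat example
```

Matching networks replace this hard nearest-neighbor rule with a soft attention over the support set, but the similarity-based labeling shown here is the underlying mechanism.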