Ari Holtzman's repositories
adaptive-softmax
Implements an efficient softmax approximation as described in the paper "Efficient softmax approximation for GPUs" (http://arxiv.org/abs/1609.04309)
adaptive-span
Adaptive Attention Span in Transformers
awd-lstm-lm
LSTM and QRNN Language Model Toolkit for PyTorch
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Language:PythonBSD-3-Clause000
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.