Dzmitry Bahdanau's repositories
attention-lvcsr
End-to-End Attention-Based Large Vocabulary Speech Recognition
actor-critic-public
The source code for "An Actor Critic Algorithm for Structured Prediction"
systematic-generalization-sqoop
Code for "Systematic Generalization: What Is Required and Can It Be Learned"
baby-ai-game
Prototype of a game where a reinforcement learning agent is trained through natural language instructions
kaldi-python
Python wrappers for Kaldi data
blocks-benchmarks
Speed benchmarks for blocks
blocks-examples
Examples and scripts using Blocks
blocks-extras
A collection of extensions to the Blocks framework
gym-minigrid
Minimalistic gridworld environment for OpenAI Gym
ift6266h16
My course work for IFT6266h16
picklable_itertools
itertools. But picklable.
prototypical-networks
Code for the NIPS 2017 Paper "Prototypical Networks for Few-shot Learning"
pytorch-a2c-ppo
A recurrent, multi-process and readable PyTorch implementation of the deep reinforcement algorithms A2C and PPO
rizar.github.io
My academic website
three-file-tic-tac-toe
A simplified starting template for the tic-tac-toe example from the react tutorial
trl
Train transformer language models with reinforcement learning.