Ankit Gupta (ag1988)

Company: IBM Research

Home Page: https://sites.google.com/view/ag1988

Ankit Gupta's repositories

injecting_numeracy

The accompanying code for "Injecting Numerical Reasoning Skills into Language Models" (Mor Geva*, Ankit Gupta* and Jonathan Berant, ACL 2020).

dss

Sequence Modeling with Structured State Spaces

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 59 | Issues: 1 | Issues: 0

top_k_attention

The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonathan Berant. SustaiNLP 2021).

Language: Python | Stargazers: 57 | Issues: 2 | Issues: 0
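
As a rough illustration of the idea the title names, the sketch below attends from a single query to only the k highest-scoring keys. It is a minimal pure-Python sketch, not the repository's implementation; the function name `top_k_attention` and its argument layout are assumptions made for this example.

```python
import math


def top_k_attention(query, keys, values, k):
    """Attend from one query vector to only the k highest-scoring keys."""
    # Scaled dot-product score between the query and every key.
    scale = math.sqrt(len(query))
    scores = [sum(q * x for q, x in zip(query, key)) / scale for key in keys]
    # Indices of the k largest scores; every other key is ignored.
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    # Softmax restricted to the selected scores (shifted by the max for stability).
    m = max(scores[i] for i in top)
    weights = {i: math.exp(scores[i] - m) for i in top}
    total = sum(weights.values())
    # Weighted sum of the selected value vectors.
    dim = len(values[0])
    return [sum(weights[i] / total * values[i][d] for i in top) for d in range(dim)]
```

With `k` equal to the number of keys this reduces to ordinary softmax attention; smaller `k` means fewer value vectors contribute to each output.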

dlr

The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonathan Berant).

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 19 | Issues: 3 | Issues: 0

gmat

The accompanying code for the paper "GMAT: Global Memory Augmentation for Transformers" (Ankit Gupta and Jonathan Berant).

AI_Bot_for_playing_PacMan

A deep convolutional neural net (CNN) for playing the well-known arcade game ‘Ms. Pac-Man’, trained using Q-learning (reinforcement learning). Game data is obtained using ‘OpenAI Gym’. The neural net significantly outperforms all the uniform-random strategies we tried. We do not inject any prior knowledge into the Q-nets. For instance, we do not explicitly use the distance from the ghosts: the system only has access to the game frames and has to learn everything by itself.

Language: Jupyter Notebook | Stargazers: 4 | Issues: 1 | Issues: 0
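
For readers unfamiliar with Q-learning, the update rule can be sketched in tabular form. This is only an illustration of the rule: the repository trains a CNN on game frames rather than a lookup table, and the function name `q_update`, the state and action labels, and the `alpha`/`gamma` defaults here are all assumptions.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.99, done=False):
    """Apply one tabular Q-learning update and return the new Q(state, action)."""
    # Bootstrapped target: reward plus the discounted value of the best next action.
    best_next = 0.0 if done else max(q[(next_state, a)] for a in actions)
    target = reward + gamma * best_next
    # Move the current estimate a fraction alpha toward the target.
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


# Hypothetical two-action example; unseen (state, action) pairs default to 0.0.
q = defaultdict(float)
actions = ["left", "right"]
q_update(q, "s0", "right", reward=1.0, next_state="s1", actions=actions)
```

A deep Q-network replaces the table `q` with a neural net and turns the same target into a regression loss.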

English_Vietnamese_Translator

A recurrent neural net (RNN) based Neural Machine Translator for translating English to Vietnamese built using an Encoder-Decoder model. Training was done using a small-scale parallel corpus of TED talks.

Language: Jupyter Notebook | Stargazers: 3 | Issues: 1 | Issues: 0

Sentiment_Analysis

A deep recurrent neural network (RNN) based classifier trained on the IMDb reviews dataset for detecting positive/negative sentiment in text. Text is encoded using trainable word embeddings, and the classifier reaches 90% test accuracy. It can then be used to determine the sentiment score of Twitter data collected using the Twitter API.

Language: Jupyter Notebook | Stargazers: 1 | Issues: 1 | Issues: 0

Creative_Text_Generator

A deep recurrent neural net (RNN) for both character-level and word-level language modeling that, given any starting context, generates creative text depending on the training data. Training was done using the OANC corpus. Even with probabilistic sampling of character sequences, the RNN almost always outputs valid dictionary words that form meaningful sentences.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0
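
The probabilistic sampling mentioned above can be sketched as drawing the next character index from a softmax over the model's output scores. This is a minimal sketch under assumptions: the name `sample_next` and the `temperature` parameter are illustrative and not taken from the repository.

```python
import math
import random


def sample_next(logits, temperature=1.0, rng=random):
    """Sample an index from softmax(logits / temperature)."""
    # Lower temperatures sharpen the distribution; higher ones flatten it.
    scaled = [x / temperature for x in logits]
    # Subtract the max before exponentiating for numerical stability.
    m = max(scaled)
    weights = [math.exp(x - m) for x in scaled]
    # random.choices normalizes the weights internally.
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]
```

Generation then repeats this step, feeding each sampled character back into the RNN as the next input.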

Snapchat_Filter

A Snapchat-like application based on the standard pipeline of 1) Face Detection using the Histogram of Oriented Gradients (HOG) descriptor, 2) Facial Landmark Estimation using the Kazemi-Sullivan method, and finally 3) adding overlays/lenses (like funny glasses, etc.) to the frames using the previously estimated landmarks.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

value_aware_attn

The accompanying code for "Value-aware Approximate Attention" (Ankit Gupta and Jonathan Berant, EMNLP 2021).

Stargazers: 0 | Issues: 1 | Issues: 0