Faris Alasmary's repositories

psu-language-modeling-session

The code of the "Language Models and Their Applications" session

Language: Jupyter Notebook | License: MIT | Stargazers: 9 | Issues: 1 | Issues: 0

psu-sentiment-analysis-session

PSU Sentiment Analysis Session Code

Language: Jupyter Notebook | License: MIT | Stargazers: 4 | Issues: 1 | Issues: 0

sbvqa2.0

The official implementation of the paper: SBVQA 2.0: Robust End-to-End Speech-Based Visual Question Answering for Open-Ended Questions

Language: Python | License: MIT | Stargazers: 4 | Issues: 2 | Issues: 0

shieldrnn

The implementation of ShieldRNN

Language: Python | License: MIT | Stargazers: 3 | Issues: 1 | Issues: 0

adversarial-machine-learning-example

Train a CNN model on the MNIST dataset and use it to craft an adversarial example that fools the model

Language: Jupyter Notebook | License: MIT | Stargazers: 2 | Issues: 1 | Issues: 0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

CLIP-ViL

[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ctcdecode

PyTorch CTC Decoder bindings

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

CTDNN

MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

DeepFilterNet

Noise suppression using deep filtering

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Face-Transformer

Face Transformer for Recognition

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

kaldi-serve

Server framework for Kaldi ASR Toolkit

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Listen-Attend-and-Spell

PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

NeMo

NeMo: a toolkit for conversational AI

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pydub

Manipulate audio with a simple and easy high level interface

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

recurrent-memory-transformer-pytorch

Implementation of Recurrent Memory Transformer, NeurIPS 2022 paper, in PyTorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

RegionCLIP

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

sequitur-g2p

This is a GitHub repository of the abandonware Sequitur G2P by Bisani & Ney

License: GPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Speech-Transformer

PyTorch re-implementation of Speech-Transformer

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

train-transformer-xl-huggingface

This repo contains a notebook that illustrates how to train Transformer-XL using the 🤗 Transformers library

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

transformer

PyTorch Implementation of "Attention Is All You Need"

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

vinvl-visualbackbone

Original VinVL visual backbone with simplified APIs to easily extract features, boxes, and object detections in a few lines of Python code.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

VQA-AttReg

This is an official PyTorch implementation of “Answer Questions with Right Image Regions: A Visual Attention Regularization Approach” (https://arxiv.org/abs/2102.01916).

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

VQVAE-Pytorch

This repo implements VQVAE on MNIST as well as on a colored version of the MNIST images. It also implements a simple LSTM for generating sample numbers using the encoder outputs of the trained VQVAE.

Stargazers: 0 | Issues: 0 | Issues: 0