Vikas Raunak (vyraun)

vyraun

Geek Repo

Company:Microsoft

Location:Redmond

Home Page:https://vyraun.github.io/

Github PK Tool:Github PK Tool

Vikas Raunak's starred repositories

VideoX

VideoX: a collection of video cross-modal models

Language:PythonLicense:NOASSERTIONStargazers:934Issues:0Issues:0

deep-explanation-penalization

Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" https://arxiv.org/abs/1909.13584

Language:Jupyter NotebookLicense:MITStargazers:123Issues:0Issues:0

Class-balanced-loss-pytorch

Pytorch implementation of the paper "Class-Balanced Loss Based on Effective Number of Samples"

Language:PythonLicense:MITStargazers:771Issues:0Issues:0

ViP

Video Platform for Action Recognition and Object Detection in Pytorch

Language:PythonLicense:MITStargazers:221Issues:0Issues:0

Video-Grounding-from-Text

Source code for "Weakly-Supervised Video Object Grounding from Text by Loss Weighting and Object Interaction"

Language:PythonLicense:NOASSERTIONStargazers:43Issues:0Issues:0

Audiovisual-Synthesis

Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders

Language:PythonStargazers:120Issues:0Issues:0

dual_encoding

[CVPR2019] Dual Encoding for Zero-Example Video Retrieval

Language:PythonLicense:Apache-2.0Stargazers:155Issues:0Issues:0

UVR-NMT

Neural Machine Translation with universal Visual Representation (ICLR 2020)

Language:PythonStargazers:87Issues:0Issues:0

pytorch-video-feature-extractor

A repository for extract CNN features from videos using pytorch

Language:PythonLicense:MITStargazers:68Issues:0Issues:0

CBP

Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction"

Language:PythonStargazers:59Issues:0Issues:0

UVC

Joint-task Self-supervised Learning for Temporal Correspondence (NeurIPS 2019)

Language:PythonLicense:MITStargazers:176Issues:0Issues:0

Greedy_InfoMax

Code for the paper: Putting An End to End-to-End: Gradient-Isolated Learning of Representations

Language:PythonLicense:MITStargazers:282Issues:0Issues:0

focal_loss_pytorch

A PyTorch Implementation of Focal Loss.

Language:PythonLicense:MITStargazers:937Issues:0Issues:0

dspn

[NeurIPS 2019] Deep Set Prediction Networks

Language:PythonLicense:MITStargazers:100Issues:0Issues:0

nvidia-htop

A tool for enriching the output of nvidia-smi.

Language:PythonLicense:BSD-3-ClauseStargazers:512Issues:0Issues:0

SCDM

Code for the paper: Semantic Conditioned Dynamic Modulation for Temporal Sentence Grounding in Videos

Language:PythonStargazers:68Issues:0Issues:0

chazutsu

The tool to make NLP datasets ready to use

Language:PythonLicense:Apache-2.0Stargazers:244Issues:0Issues:0

Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Language:Jupyter NotebookLicense:MITStargazers:3860Issues:0Issues:0

keita

My personal toolkit for PyTorch development.

Language:PythonStargazers:128Issues:0Issues:0

VL-BERT

Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".

Language:Jupyter NotebookLicense:MITStargazers:734Issues:0Issues:0

pytorch-metric-learning

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Language:PythonLicense:MITStargazers:5798Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonLicense:Apache-2.0Stargazers:2111Issues:0Issues:0

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

License:MITStargazers:1214Issues:0Issues:0

amazon-transcribe-research-pseudolikelihood

DEPRECATED. Use the MLM Scoring library instead: https://github.com/awslabs/mlm-scoring

Stargazers:9Issues:0Issues:0
Language:PythonStargazers:118Issues:0Issues:0

curriculum

Competence-based Curriculum Learning

Language:ScalaStargazers:9Issues:0Issues:0

MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) levels desired.

Language:HTMLLicense:MITStargazers:445Issues:0Issues:0

epitran

A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)

Language:PythonLicense:MITStargazers:587Issues:0Issues:0

pytorch-NetVlad

Pytorch implementation of NetVlad including training on Pittsburgh.

Language:PythonStargazers:398Issues:0Issues:0

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonLicense:Apache-2.0Stargazers:6302Issues:0Issues:0