Pankaj Singh's repositories
cs224n-gpu-that-talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
text-detection-ctpn
Text detection based mainly on the CTPN (Connectionist Text Proposal Network) model in TensorFlow, including ID card detection
api-guidelines
Microsoft REST API Guidelines
autoEdit_2
Fast text-based video editing: a Node/Electron OS X desktop app with a Backbone front end.
awesome-deep-learning
A curated list of awesome Deep Learning tutorials, projects and communities.
awesome-tensorflow
TensorFlow - A curated list of dedicated resources http://tensorflow.org
casmacat-home-edition
CASMACAT Home Edition (installation and admin)
cf-sdk-python
Python examples using requests for the IBM Research Cognitive Fashion APIs.
EffectiveTensorflow
TensorFlow tutorials and best practices.
engineering-blogs
A curated list of engineering blogs
kaldi
This is now the official location of the Kaldi project.
ocropy
Python-based tools for document analysis and OCR
ocrorot
Rotation and skew detection using DL.
oTranscribe
A free & open tool for transcribing audio interviews
pipeline
PipelineAI: Real-Time Enterprise AI Platform
PRM
Weakly Supervised Instance Segmentation using Class Peak Response, in CVPR 2018 (Spotlight)
Public_Speaking
A repository of resources about public speaking, specifically in the context of software development and IT conferences.
pyarmor
A tool used to obfuscate Python scripts
pytorch-coviar
Compressed Video Action Recognition
sampleRNN_ICLR2017
SampleRNN: An Unconditional End-to-End Neural Audio Generation Model
styleguide
Style guides for Google-originated open-source projects
vid2vid
PyTorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.
VITON
Code and dataset for paper "VITON: An Image-based Virtual Try-on Network"
wer_are_we
An attempt at tracking the state of the art and recent results (bibliography) in speech recognition.