Andrew Sofie's starred repositories
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
stable-diffusion
A latent text-to-image diffusion model
whiteboard
Lightweight collaborative Whiteboard / Sketchboard
this-word-does-not-exist
This Word Does Not Exist
ToolChanger
STPs / STLs / DXFs / PDFs
OpenSeq2Seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
legacy-v1-python-example
Example script (supported) to help you integrate with our SaaS v1 API
noizeus_corpora
Speech corpora for the speech recognition evaluation system
StoryTelling
A neural-network-based StoryTeller that outputs a short story from an input image
SPADE-Tensorflow
Simple TensorFlow implementation of "Semantic Image Synthesis with Spatially-Adaptive Normalization" a.k.a. GauGAN, SPADE (CVPR 2019 Oral)
pytorch_GAN_zoo
A mix of GAN implementations including progressive growing
SceneGraphParser
A Python toolkit for parsing natural-language captions into scene graphs (symbolic representations).
DEXTR-PyTorch
Deep Extreme Cut http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr
Phonetisaurus
Phonetisaurus G2P
neural_renderer
A PyTorch port of the Neural 3D Mesh Renderer
WER-in-python
This program calculates the word error rate of an ASR hypothesis and prints the aligned result.
free-spoken-digit-dataset
A free audio dataset of spoken digits. An audio version of MNIST.
text-to-ssml
Converts your text to AWS Polly's SSML.