Andrew Sofie (andrewsofie)

Location: San Francisco, CA

Andrew Sofie's starred repositories

snd

Sales & Dungeons — Thermal Printer as D&D / TTRPG Utility

Language: TypeScript | License: MIT | Stargazers: 477 | Issues: 0

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language: Jupyter Notebook | License: MIT | Stargazers: 5728 | Issues: 0

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 67745 | Issues: 0

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook | License: MIT | Stargazers: 92999 | Issues: 0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language: Python | License: MIT | Stargazers: 9252 | Issues: 0

whiteboard

Lightweight collaborative Whiteboard / Sketchboard

Language: JavaScript | License: MIT | Stargazers: 714 | Issues: 0

dendry

Tools to create and build interactive fiction.

Language: JavaScript | License: MIT | Stargazers: 20 | Issues: 0

erato

A poetry evaluation framework

Language: Jupyter Notebook | Stargazers: 6 | Issues: 0

ucity

The open-source city-building game for Game Boy Color.

Language: Assembly | Stargazers: 427 | Issues: 0

MemGPT

Letta (fka MemGPT) is a framework for creating stateful LLM services.

Language: Python | License: Apache-2.0 | Stargazers: 11872 | Issues: 0

this-word-does-not-exist

This Word Does Not Exist

Language: Python | License: MIT | Stargazers: 1020 | Issues: 0

mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6885 | Issues: 0

flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 888 | Issues: 0

ToolChanger

STPs / STLs / DXFs / PDFs

License: GPL-3.0 | Stargazers: 301 | Issues: 0

OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Language: Python | License: Apache-2.0 | Stargazers: 1543 | Issues: 0

legacy-v1-python-example

Example script (supported) to help you integrate with our SaaS v1 API

Language: Python | Stargazers: 14 | Issues: 0

noizeus_corpora

Speech corpora for the speech recognition evaluation system

Stargazers: 17 | Issues: 0

StoryTelling

A neural network based StoryTeller that outputs a short story from an input image

Language: Python | Stargazers: 13 | Issues: 0

SPADE-Tensorflow

Simple Tensorflow implementation of "Semantic Image Synthesis with Spatially-Adaptive Normalization" a.k.a. GauGAN, SPADE (CVPR 2019 Oral)

Language: Python | License: MIT | Stargazers: 365 | Issues: 0

pytorch_GAN_zoo

A mix of GAN implementations including progressive growing

Language: Python | License: BSD-3-Clause | Stargazers: 1607 | Issues: 0

TTS

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Language: Jupyter Notebook | License: MPL-2.0 | Stargazers: 9263 | Issues: 0

SceneGraphParser

A python toolkit for parsing captions (in natural language) into scene graphs (as symbolic representations).

Language: Python | License: MIT | Stargazers: 538 | Issues: 0

planercnn

PlaneRCNN detects and reconstructs piece-wise planar surfaces from a single RGB image

Language: Python | License: NOASSERTION | Stargazers: 554 | Issues: 0

DEXTR-PyTorch

Deep Extreme Cut http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr

Language: Python | License: GPL-3.0 | Stargazers: 844 | Issues: 0

Phonetisaurus

Phonetisaurus G2P

Language: Shell | License: BSD-3-Clause | Stargazers: 446 | Issues: 0

neural_renderer

A PyTorch port of the Neural 3D Mesh Renderer

Language: Python | License: NOASSERTION | Stargazers: 1129 | Issues: 0

voca

This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.

Language: Python | Stargazers: 1142 | Issues: 0

WER-in-python

This program calculates the word error rate of a hypothesis in ASR and prints the aligned result.

Language: Python | License: MIT | Stargazers: 152 | Issues: 0

free-spoken-digit-dataset

A free audio dataset of spoken digits. An audio version of MNIST.

Language: Python | Stargazers: 618 | Issues: 0

text-to-ssml

Converts your text to AWS Polly's SSML.

Language: Rust | License: MIT | Stargazers: 11 | Issues: 0