mhussar

Hitlab Studios's repositories

ICON

[CVPR'22] ICON: Implicit Clothed humans Obtained from Normals

Language:PythonNOASSERTION100

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT100

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT000

Auto-Photoshop-StableDiffusion-Plugin

A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using Automatic1111-sd-webui as a backend.

Language:JavaScriptMIT000

ComfyUI

A powerful and modular stable diffusion GUI with a graph/nodes interface.

Language:PythonGPL-3.0000

ComfyUI_examples

Examples of ComfyUI workflows

Language:HTML000

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

MIT000

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

MIT000

DWPose

Official implementation of the paper "Effective Whole-body Pose Estimation with Two-stages Distillation"

Apache-2.0000

facefusion

Next generation face swapper and enhancer

000

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

Apache-2.0000

iff-chatgpt-api-tutorial

Basic set up in python for ChatGPT API

Language:PythonMIT010

LivePortrait

Bring portraits to life!

MIT000

mhussar.github.io

R&D

Language:JavaScript010

mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

Language:PythonApache-2.0000

NuGetForUnity

A NuGet Package Manager for Unity

MIT000

obsidian-dataview

A high-performance data index and query language over Markdown files, for https://obsidian.md/.

Language:TypeScriptMIT000

open-interpreter

OpenAI's Code Interpreter in your terminal, running locally

MIT000

OpenAI-API-dotnet

An unofficial C#/.NET SDK for accessing the OpenAI GPT-3 API

Language:C#NOASSERTION000

OpenAI-Unity

An unofficial OpenAI Unity Package that aims to help you use OpenAI API directly in Unity Game engine.

Language:C#MIT000

privateGPT

Interact privately with your documents using the power of GPT, 100% privately, no data leaks

Language:PythonApache-2.0000

rembg

Rembg is a tool to remove images background

Language:PythonMIT000

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

MIT000

roop

one-click face swap

GPL-3.0000

sd-webui-panorama-viewer

Sends rendered SD_auto1111 images quickly to this panorama (hdri, equirectangular) viewer

GPL-3.0000

sd-webui-txt-img-to-3d-model

A custom extension for sd-webui that allow you to generate 3D model from txt or image, basing on OpenAI Shap-E.

Language:PythonAGPL-3.0000

StableDiffusion-CheatSheet

A list of StableDiffusion styles and some notes for offline use. Pure HTML, CSS and a bit of JS.

Language:HTMLMIT000

Thin-Plate-Spline-Motion-Model

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Language:Jupyter NotebookMIT000

WarpFusion

Language:DockerfileNOASSERTION000

whisper_real_time

Real time transcription with OpenAI Whisper.

000