Matthew's repositories
AudioSep
Official implementation of "Separate Anything You Describe"
BLoRA
Batched LoRAs
buck2
Build system, successor to Buck
CogVLM
a state-of-the-art-level open visual language model | multimodal pretrained model
exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
cog-whisperspeech
Cog wrapper for collabora/WhisperSpeech
course-builder
🍄 experimental platform for building Badass Courses
FastSAM
Fast Segment Anything
glish
map all words to single-syllable version
gptme
A fancy CLI to interact with LLMs in a Chat-style interface, with additional capabilities like executing commands on the local machine.
InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
MotionDirector
MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
pips2
PIPs++
ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
requestly
🚀 Most Popular developer tool for frontend developers & QAs to debug web and mobile applications. Redirect URL (Switch Environments), Modify Headers, Mock APIs, Modify Response, Insert Scripts & Report Bugs with debugging sessions.
Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
simpleaichat
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
stable-ts
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
tldraw
a very good whiteboard
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.