Xelawk

Xelawk

Geek Repo

Location:Guangzhou

Github PK Tool:Github PK Tool

Xelawk's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:65148Issues:543Issues:0

Prompt-Engineering-Guide

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40128Issues:392Issues:1290

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32273Issues:273Issues:1068

Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

Language:TypeScriptLicense:Apache-2.0Stargazers:30190Issues:282Issues:3620

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29833Issues:190Issues:982

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:27613Issues:209Issues:212

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:23535Issues:252Issues:287

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:11320Issues:150Issues:811

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10580Issues:122Issues:207

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

one-key-hidpi

Enable macOS HiDPI and have a native setting.

clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

Language:PythonLicense:NOASSERTIONStargazers:6836Issues:37Issues:122

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4691Issues:60Issues:357

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language:PythonLicense:MITStargazers:2473Issues:50Issues:281

comfyui_controlnet_aux

ComfyUI's ControlNet Auxiliary Preprocessors

Language:PythonLicense:Apache-2.0Stargazers:1759Issues:15Issues:348

auto-subtitle

Automatically generate and overlay subtitles for any video.

Language:PythonLicense:MITStargazers:1346Issues:17Issues:64

HRNet-Facial-Landmark-Detection

This is an official implementation of facial landmark detection for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919

Language:PythonLicense:MITStargazers:1031Issues:31Issues:91

Wav2Lip-GFPGAN

High quality Lip sync

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:776Issues:16Issues:24

awesome-faceReenactment

papers about Face Reenactment/Talking Face Generation

ComfyUI-Stable-Video-Diffusion

ComfyUI nodes for Stable Video Diffusion

StableIdentity

🔥 StableIdentity: Inserting Anybody into Anywhere at First Sight

Language:PythonLicense:MITStargazers:244Issues:25Issues:7

sd-webui-loractl

An Automatic1111 extension for dynamically controlling the weights of LoRAs during image generation

Language:PythonLicense:MITStargazers:235Issues:9Issues:34

watermark-detection

Model for watermark classification implemented with PyTorch

Language:Jupyter NotebookStargazers:58Issues:1Issues:5
Language:PythonStargazers:20Issues:1Issues:0

KeyPosS

[ACM MM 2023] KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration

Language:PythonStargazers:9Issues:0Issues:0

blurnet

A CNN to detect blurry images.

Language:PythonLicense:MITStargazers:5Issues:0Issues:0