水镜扒奇 (kelingg)

kelingg

Geek Repo

Location:北京

Github PK Tool:Github PK Tool

水镜扒奇's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:69070Issues:574Issues:0

faceswap

Deepfakes Software For All

Language:PythonLicense:GPL-3.0Stargazers:52060Issues:1533Issues:861

Fooocus

Focus on prompting and generating

Language:PythonLicense:GPL-3.0Stargazers:40592Issues:315Issues:1507

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:33734Issues:206Issues:1241

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:PythonLicense:MITStargazers:23028Issues:510Issues:2476

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:16036Issues:109Issues:1051

ChatALL

Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, 讯飞星火, 文心一言 and more, discover the best answers

Language:JavaScriptLicense:Apache-2.0Stargazers:15134Issues:124Issues:545

first-order-model

This repository contains the source code for the paper First Order Motion Model for Image Animation

Language:Jupyter NotebookLicense:MITStargazers:14479Issues:353Issues:531

video-subtitle-extractor

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Language:PythonLicense:Apache-2.0Stargazers:5836Issues:45Issues:271

nanodet

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

Language:PythonLicense:Apache-2.0Stargazers:5707Issues:67Issues:462

Firefly

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language:PythonLicense:Apache-2.0Stargazers:5670Issues:66Issues:129

Background-Matting

Background Matting: The World is Your Green Screen

SimSwap

An arbitrary face-swapping framework on images and videos with one single trained model!

Language:PythonLicense:NOASSERTIONStargazers:4488Issues:80Issues:442

ECCV2022-RIFE

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Language:PythonLicense:MITStargazers:4403Issues:77Issues:329

video-subtitle-remover

基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Language:PythonLicense:Apache-2.0Stargazers:4082Issues:33Issues:84

FaceDetection-DSFD

腾讯优图高精度双分支人脸检测器

Language:PythonLicense:NOASSERTIONStargazers:2896Issues:106Issues:89

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language:PythonLicense:MITStargazers:2517Issues:51Issues:281

SadTalker-Video-Lip-Sync

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。

face2face-demo

pix2pix demo that learns from facial landmarks and translates this into a face

Language:PythonLicense:MITStargazers:1434Issues:73Issues:40

FaceRecognition-tensorflow

基于TensorFlow训练的人脸识别神经网络

LiveSpeechPortraits

Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)

Language:PythonLicense:MITStargazers:1197Issues:21Issues:92

best-chinese-prompt

AI中文提示词秘籍ChatGPT中文提示词秘籍(Prompt圣经)K-Render整理

DINet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language:PythonLicense:MITStargazers:888Issues:30Issues:96

Realistic-Neural-Talking-Head-Models

My implementation of Few-Shot Adversarial Learning of Realistic Neural Talking Head Models (Egor Zakharov et al.).

Language:PythonLicense:GPL-3.0Stargazers:829Issues:43Issues:72

face_segmentation

Deep face segmentation in extremely hard conditions

Language:C++License:Apache-2.0Stargazers:727Issues:44Issues:29

Video-Auto-Wipe

Erase specific content from the video that you don't wanna see

Language:PythonLicense:GPL-3.0Stargazers:267Issues:12Issues:5

StyleMask

Authors official PyTorch implementation of the "StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment" [FG 2023].