vincent's repositories
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
bark
🔊 Text-Prompted Generative Audio Model
bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
chatgpt-vscode
A VSCode extension that allows you to use ChatGPT
CocoaMarkdown
Markdown parsing and rendering for iOS and OS X
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
ijkplayer
Android/iOS video player based on FFmpeg n3.4, with MediaCodec, VideoToolbox support.
IntuneTest
IntuneTest
MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
my-first-static-web-app
Microsoft static-web-app test
PaddleGAN
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
relay_proxy_test
relay_proxy_test
SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
SimpleEdge
This is a simple browser prototype based on WKWebview, compatible with iPhone and iPad devices.
swift-markdown-ui
Display and customize Markdown text in SwiftUI
swift-markdownkit
A framework for parsing and transforming text in Markdown format written in Swift 5 for macOS, iOS, and Linux. The supported syntax is based on the CommonMark specification. The framework defines an abstract syntax for Markdown, provides a parser for parsing strings into abstract syntax trees,
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
UniCAVE
A Unity3D Plugin for Non-Head Mounted Virtual Reality Display Systems
UnityRenderStreaming
Streaming server for Unity
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
WebAV
Process audio and video data in the browser with WebCodecs. Video and audio tools built on WebCodecs + Canvas.
WebGLInput
IME for Unity WebGL
website-company
Company official website demo
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.