gaoxiaowei

followers

following

stars

vincent's repositories

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

MIT000

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookMIT000

bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

MIT000

bark-with-voice-clone

Language:PythonNOASSERTION000

chatgpt-vscode

A VSCode extension that allows you to use ChatGPT

000

CocoaMarkdown

Markdown parsing and rendering for iOS and OS X

Language:Objective-CMIT000

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Apache-2.0000

ijkplayer

Android/iOS video player based on FFmpeg n3.4, with MediaCodec, VideoToolbox support.

Language:CGPL-2.0000

IntuneTest

IntuneTest

Language:Objective-C000

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

NOASSERTION000

my-first-static-web-app

microsoft static-web-app test

Language:JavaScript000

my_resources

000

MyHeyGen

Language:Python000

PaddleGAN

PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.

Apache-2.0000

relay_proxy_test

relay_proxy_test

Language:JavaScript000

SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

MIT000

sambert

Language:Python000

SimpleEdge

This is a simple browser prototype based on WKWebview, compatible with iPhone and iPad devices.

Language:Objective-C000

swift-markdown-ui

Display and customize Markdown text in SwiftUI

MIT000

swift-markdownkit

A framework for parsing and transforming text in Markdown format written in Swift 5 for macOS, iOS, and Linux. The supported syntax is based on the CommonMark specification. The framework defines an abstract syntax for Markdown, provides a parser for parsing strings into abstract syntax trees,

Apache-2.0000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

MPL-2.0000

UniCAVE

A Unity3D Plugin for Non-Head Mounted Virtual Reality Display Systems

MIT000

UnityRenderStreaming

Streaming server for Unity

NOASSERTION000

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Apache-2.0000

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.

000

WebAV

基于 WebCodecs 在浏览器中处理音视频数据。Video and Audio tools built on WebCodecs + Canvas.

000

WebGLInput

IME for Unity WebGL

MIT000

website-company

公司官网demo

Language:HTML000

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

MIT000

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

MIT000