xunnew's repositories
backend-with-gpt-vits
Chat backend with GPT-3/ChatGPT and multilingual VITS, with support for multilingual speech input
DINet
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
emotional-vits
An emotion-controllable speech synthesis model based on VITS that requires no emotion annotations
ER-NeRF
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
freeswitch
FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile software implementation that runs on any commodity hardware. From a Raspberry PI to a multi-core server, FreeSWITCH can unlock the telecommunications potential of any device.
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
GGboy_deeplearning
Some deep learning code that I want to share with you.
gin-vue-admin
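A development foundation platform built with Vite + Vue 3 + Gin (supports mixing TS and JS), integrating JWT authentication, permission management, dynamic routing, components with controllable visibility, pagination wrappers, multi-point login interception, resource permissions, upload and download, a code generator, a form generator, and other essential development features.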
jitsi-meet
Jitsi Meet - Secure, Simple and Scalable Video Conferences that you use as a standalone app or embed in your web application.
langchaingo
LangChain for Go
libav
Libav github mirror, clone of git://git.libav.org/libav
Luckysheet
Luckysheet is an online spreadsheet like Excel that is powerful, simple to configure, and completely open source.
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
ocpp-go
Open Charge Point Protocol implementation in Go
pytorch-UNet
Build your own U-Net in PyTorch and train it on your own dataset.
qSIP
VoIP/SIP client (softphone)
qt-linphone
A SIP client based on Qt QML and the linphone SDK 4.4, mainly for experiencing a beautiful UI with C++
RAD-NeRF
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
so-vits-svc
SoftVC VITS Singing Voice Conversion
Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
vertojs
Typescript FreeSWITCH verto interface
vue_webrtc_demo
Vue.js WebRTC demo using a FreeSWITCH SIP server
Wav2Lip-GFPGAN
High-quality lip sync
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
whisper-websockets
WebSockets implementation with OpenAI Whisper for real-time speech recognition
whisper.cpp
Port of OpenAI's Whisper model in C/C++
whisper_real_time
Real-time transcription with OpenAI Whisper.