xunnew

xunnew's repositories

backend-with-gpt-vits

Chat backend with GPT-3/ChatGPT and multilingual VITS, with multilingual speech input support.

MIT

DINet

The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."

emotional-vits

An emotion-controllable speech synthesis model based on VITS that requires no emotion annotations.

MIT

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

MIT

freeswitch

FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile software implementation that runs on any commodity hardware. From a Raspberry PI to a multi-core server, FreeSWITCH can unlock the telecommunications potential of any device.

NOASSERTION

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

NOASSERTION

GGboy_deeplearning

Some deep learning code that I want to share with you.

MIT

gin-vue-admin

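A development base platform built on vite+vue3+gin (TS and JS can be mixed), integrating JWT authentication, permission management, dynamic routing, components with controllable visibility, pagination wrappers, multi-point login interception, resource permissions, upload and download, a code generator, a form generator, and other essential development features.

Apache-2.0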

jitsi-meet

Jitsi Meet - Secure, Simple and Scalable Video Conferences that you use as a standalone app or embed in your web application.

Apache-2.0

langchaingo

LangChain for Go

ISC

libav

Libav github mirror, clone of git://git.libav.org/libav

NOASSERTION

Luckysheet

Luckysheet is an online spreadsheet like excel that is powerful, simple to configure, and completely open source.

MIT

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

BSD-3-Clause

ocpp-go

Open Charge Point Protocol implementation in Go

Language: Go, MIT

pytorch-UNet

Build your own U-Net in PyTorch and train it on your own dataset.

NOASSERTION

qSIP

VoIP/SIP client (softphone)

qt-linphone

A SIP client based on Qt QML and the linphone SDK 4.4, built mainly to try out attractive UIs in C++.

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

MIT

so-vits-svc

SoftVC VITS Singing Voice Conversion

AGPL-3.0

Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

MIT

vertojs

TypeScript FreeSWITCH Verto interface

VITS_TextToSpeech

MIT

vue_webrtc_demo

Vue.js WebRTC demo using FreeSWITCH as the SIP server

MIT

Wav2Lip-GFPGAN

High-quality lip sync

wav2lip384

wav2lip_data_preprocessing

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

MIT

whisper-websockets

WebSocket implementation with OpenAI Whisper for real-time speech recognition

MIT

whisper.cpp

Port of OpenAI's Whisper model in C/C++

MIT

whisper_real_time

Real-time transcription with OpenAI Whisper.