博文 (Boxie5)

Boxie5

Geek Repo

Company:neo-wave

Location:Shanghai

Home Page:blog.xbowen.com

Github PK Tool:Github PK Tool

博文's starred repositories

pond

🔘 Minimalistic and High-performance goroutine worker pool written in Go

Language:GoLicense:MITStargazers:1384Issues:0Issues:0

numba

NumPy aware dynamic Python compiler using LLVM

Language:PythonLicense:BSD-2-ClauseStargazers:9650Issues:0Issues:0

ShadowsocksX-NG

Next Generation of ShadowsocksX

Language:SwiftLicense:GPL-3.0Stargazers:32270Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29397Issues:0Issues:0

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:7529Issues:0Issues:0

ChatLaw

ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型

License:AGPL-3.0Stargazers:6681Issues:0Issues:0

datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Language:PythonLicense:Apache-2.0Stargazers:4241Issues:0Issues:0

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language:PythonLicense:Apache-2.0Stargazers:18776Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:33873Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32087Issues:0Issues:0

spleeter

Deezer source separation library including pretrained models.

Language:PythonLicense:MITStargazers:25334Issues:0Issues:0

vocal-separate

an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems models 这是一个极简的人声和背景音乐分离工具,本地化网页操作,无需连接外网

Language:PythonLicense:GPL-3.0Stargazers:1146Issues:0Issues:0

safetensors

Simple, safe way to store and distribute tensors

Language:PythonLicense:Apache-2.0Stargazers:2628Issues:0Issues:0

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonLicense:MITStargazers:761Issues:0Issues:0

text-generation-webui

A Gradio web UI for Large Language Models.

Language:PythonLicense:AGPL-3.0Stargazers:38515Issues:0Issues:0

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:11790Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonLicense:NOASSERTIONStargazers:11256Issues:0Issues:0

GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Language:PythonLicense:NOASSERTIONStargazers:35149Issues:0Issues:0

BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.

Language:PythonLicense:Apache-2.0Stargazers:6514Issues:0Issues:0

AI-For-Beginners

12 Weeks, 24 Lessons, AI for All!

Language:Jupyter NotebookLicense:MITStargazers:33319Issues:0Issues:0

AISystem

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:9639Issues:0Issues:0

the-super-tiny-compiler

:snowman: Possibly the smallest compiler ever

Language:JavaScriptLicense:CC-BY-4.0Stargazers:27682Issues:0Issues:0

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language:PythonLicense:NOASSERTIONStargazers:9905Issues:0Issues:0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:6133Issues:0Issues:0

Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.

License:GPL-3.0Stargazers:8600Issues:0Issues:0

vanna

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Language:PythonLicense:MITStargazers:9859Issues:0Issues:0

self-operating-computer

A framework to enable multimodal models to operate a computer.

Language:PythonLicense:MITStargazers:8256Issues:0Issues:0

LLMLingua

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Language:PythonLicense:MITStargazers:4238Issues:0Issues:0

Voyager

An Open-Ended Embodied Agent with Large Language Models

Language:JavaScriptLicense:MITStargazers:5384Issues:0Issues:0

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language:Jupyter NotebookLicense:CC-BY-4.0Stargazers:28576Issues:0Issues:0