zty's starred repositories

bert

TensorFlow code and pre-trained models for BERT

Language: Python | License: Apache-2.0 | Stargazers: 37703 | Issues: 999 | Issues: 1142

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 32911 | Issues: 277 | Issues: 1089

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 31459 | Issues: 195 | Issues: 1118

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language: Python | License: MIT | Stargazers: 22563 | Issues: 509 | Issues: 2454

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 20485 | Issues: 200 | Issues: 371

triton

Development repository for the Triton language and compiler

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model). We hope the open-source community will contribute to this project.

Language: Python | License: MIT | Stargazers: 11161 | Issues: 160 | Issues: 263

QAnything

Question and Answer based on Anything.

Language: Python | License: Apache-2.0 | Stargazers: 11129 | Issues: 97 | Issues: 364

PhotoMaker

PhotoMaker [CVPR 2024]

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 9211 | Issues: 103 | Issues: 149

MaxKB

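🚀 A knowledge base question-answering system built on large language models (LLMs). Ready to use out of the box, model-neutral, and flexibly orchestrated, with support for quick embedding into third-party business systems.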

Language: Python | License: GPL-3.0 | Stargazers: 8996 | Issues: 67 | Issues: 572

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 7988 | Issues: 87 | Issues: 1735

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language: Python | License: Apache-2.0 | Stargazers: 6669 | Issues: 49 | Issues: 206

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language: Python | License: MIT | Stargazers: 6557 | Issues: 39 | Issues: 953

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pre-trained models, large models, multimodal models, and large language models

Language: Python | License: MIT | Stargazers: 4659 | Issues: 90 | Issues: 11

midjourney-proxy

A proxy for MidJourney's Discord channel that enables API-style calls for AI image generation

Language: Java | License: Apache-2.0 | Stargazers: 4641 | Issues: 41 | Issues: 0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Python | License: GPL-3.0 | Stargazers: 4176 | Issues: 38 | Issues: 414

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language: Python | License: Apache-2.0 | Stargazers: 4160 | Issues: 51 | Issues: 121

PySceneDetect

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Language: Python | License: BSD-3-Clause | Stargazers: 3076 | Issues: 69 | Issues: 311

yuzu

Nintendo Switch emulator (unofficial mirror fork)

Language: C++ | License: GPL-3.0 | Stargazers: 2661 | Issues: 27 | Issues: 20

sd-webui-deforum

Deforum extension for AUTOMATIC1111's Stable Diffusion webui

Language: Python | License: NOASSERTION | Stargazers: 2649 | Issues: 41 | Issues: 416

copymanga

A third-party app for CopyManga (拷贝漫画) that improves the reading and downloading experience

Language: Kotlin | License: GPL-3.0 | Stargazers: 2010 | Issues: 18 | Issues: 80

BCEmbedding

Netease Youdao's open-source embedding and reranker models for RAG products.

Language: Python | License: Apache-2.0 | Stargazers: 1310 | Issues: 9 | Issues: 76

hecate

Automagically generate thumbnails, animated GIFs, and summaries from videos

Language: C++ | License: Apache-2.0 | Stargazers: 477 | Issues: 25 | Issues: 26

TransNetV2

TransNet V2: Shot Boundary Detection Neural Network

Language: Python | License: MIT | Stargazers: 430 | Issues: 9 | Issues: 47

GBFR-ACT

A combat data tracking and analysis mod, with features such as DPS tracking

SunoSongsCreator

High-quality song generation by https://www.suno.ai/. Reverse-engineered API.

Language: Python | License: GPL-3.0 | Stargazers: 277 | Issues: 5 | Issues: 22

rag-omni

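A retrieval-augmented generation (RAG) example based on the BM25, BGE, and OpenAI Embedding retrieval algorithms, with support for OpenAI-style large model services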

whisper-onnx-tensorrt

ONNX and TensorRT implementation of Whisper

Language: Python | License: MIT | Stargazers: 54 | Issues: 4 | Issues: 0