alanlaye617

zty's starred repositories

bert

TensorFlow code and pre-trained models for BERT

Language: Python | Apache-2.0 | 37703 999 1142

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | MPL-2.0 | 32911 277 1089

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | MIT | 31459 195 1118

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language: Python | MIT | 22563 509 2454

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | MIT | 20485 200 371

triton

Development repository for the Triton language and compiler

Language: C++ | MIT | 12314 184 1367

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to this project.

Language: Python | MIT | 11161 160 263

QAnything

Question and Answer based on Anything.

Language: Python | Apache-2.0 | 11129 97 364

PhotoMaker

PhotoMaker [CVPR 2024]

Language: Jupyter Notebook | NOASSERTION | 9211 103 149

MaxKB

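🚀 A knowledge base question-answering system built on large language models (LLMs). Ready to use out of the box, model-neutral, and flexibly orchestrated, with support for quick embedding into third-party business systems.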

Language: Python | GPL-3.0 | 8996 67 572

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | Apache-2.0 | 7988 87 1735

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language: Python | Apache-2.0 | 6669 49 206

stable-diffusion-webui-forge

Language: Python | AGPL-3.0 | 6623 77 921

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language: Python | MIT | 6557 39 953

awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

Language: Python | MIT | 4659 90 11

midjourney-proxy

Proxies the MidJourney Discord channel to enable API-style calls for AI image generation

Language: Java | Apache-2.0 | 4641 410

TripoSR

Language: Python | MIT | 4201 47 92

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Python | GPL-3.0 | 4176 38 414

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language: Python | Apache-2.0 | 4160 51 121

PySceneDetect

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Language: Python | BSD-3-Clause | 3076 69 311

yuzu

Nintendo Switch emulator (unofficial mirror fork)

Language: C++ | GPL-3.0 | 2661 27 20

sd-webui-deforum

Deforum extension for AUTOMATIC1111's Stable Diffusion webui

Language: Python | NOASSERTION | 2649 41 416

copymanga

A third-party app for CopyManga (拷贝漫画) that improves the reading and downloading experience

Language: Kotlin | GPL-3.0 | 2010 18 80

BCEmbedding

Netease Youdao's open-source embedding and reranker models for RAG products.

Language: Python | Apache-2.0 | 1310 9 76

hecate

Automagically generate thumbnails, animated GIFs, and summaries from videos

Language: C++ | Apache-2.0 | 477 25 26

TransNetV2

TransNet V2: Shot Boundary Detection Neural Network

Language: Python | MIT | 430 9 47

GBFR-ACT

A combat data tracking and analytics mod, with features such as DPS tracking

Language: JavaScript | 338 7 37

SunoSongsCreator

High-quality song generation by https://www.suno.ai/. Reverse-engineered API.

Language: Python | GPL-3.0 | 277 5 22

rag-omni

A retrieval-augmented generation (RAG) example based on the BM25, BGE, and OpenAI Embedding retrieval algorithms, with support for OpenAI-style LLM services

Language: Python | 73 2 2

whisper-onnx-tensorrt

ONNX and TensorRT implementation of Whisper

Language: Python | MIT | 54 40