yao ayang's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 65179 | Issues: 545 | Issues: 0

llama.cpp

LLM inference in C/C++

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python | License: MIT | Stargazers: 52281 | Issues: 435 | Issues: 130

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 35184 | Issues: 355 | Issues: 306

bark

🔊 Text-Prompted Generative Audio Model

Language: Jupyter Notebook | License: MIT | Stargazers: 33977 | Issues: 316 | Issues: 423

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 27881 | Issues: 188 | Issues: 4400

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Language: C++ | License: Apache-2.0 | Stargazers: 26349 | Issues: 494 | Issues: 5036

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

spleeter

Deezer source separation library including pretrained models.

Language: Python | License: MIT | Stargazers: 25379 | Issues: 383 | Issues: 767

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Language: Shell | License: Apache-2.0 | Stargazers: 24619 | Issues: 310 | Issues: 245

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language: Python | License: BSD-3-Clause | Stargazers: 22081 | Issues: 397 | Issues: 636

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 20308 | Issues: 198 | Issues: 367

best-of-ml-python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

dvc

🦉 ML Experiments and Data Management with Git

Language: Python | License: Apache-2.0 | Stargazers: 13443 | Issues: 141 | Issues: 4654

faceai

An entry-level project for face, video, and text detection and recognition.

Language: Python | License: MIT | Stargazers: 10698 | Issues: 387 | Issues: 46

rt-thread

RT-Thread is an open source IoT real-time operating system (RTOS).

Language: C | License: Apache-2.0 | Stargazers: 10106 | Issues: 530 | Issues: 1387

paper2gui

Convert AI papers to GUIs, making it easy and convenient for everyone to use cutting-edge artificial intelligence technology.

Language: Jupyter Notebook | License: MIT | Stargazers: 10057 | Issues: 117 | Issues: 83

computervision-recipes

Best Practices, code samples, and documentation for Computer Vision.

Language: Jupyter Notebook | License: MIT | Stargazers: 9358 | Issues: 285 | Issues: 260

bloop

bloop is a fast code search engine written in Rust.

Language: Rust | License: Apache-2.0 | Stargazers: 9330 | Issues: 64 | Issues: 134

cat-catch

cat-catch (猫抓): a browser resource sniffing extension

Language: JavaScript | License: GPL-3.0 | Stargazers: 8220 | Issues: 53 | Issues: 381

ImageBind

ImageBind: One Embedding Space to Bind Them All

Language: Python | License: NOASSERTION | Stargazers: 8116 | Issues: 100 | Issues: 85

AI4Animation

Bringing Characters to Life with Computer Brains in Unity

fawkes

Fawkes, a privacy-preserving tool against facial recognition systems. More info at https://sandlab.cs.uchicago.edu/fawkes

Language: Python | License: BSD-3-Clause | Stargazers: 5169 | Issues: 114 | Issues: 160

competition-baseline

Knowledge, code, and approaches for data mining, computer vision, natural language processing, and recommender system competitions.

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 4139 | Issues: 86 | Issues: 24

python-miio

Python library & console tool for controlling Xiaomi smart appliances

Language: Python | License: GPL-3.0 | Stargazers: 3554 | Issues: 87 | Issues: 899

freeswitch

FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile software implementation that runs on any commodity hardware. From a Raspberry PI to a multi-core server, FreeSWITCH can unlock the telecommunications potential of any device.

Language: C | License: NOASSERTION | Stargazers: 3391 | Issues: 146 | Issues: 1360

DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Language: Python | License: Apache-2.0 | Stargazers: 2840 | Issues: 21 | Issues: 197

Real-Time-Rendering-4th-CN

Chinese translation of Real-Time Rendering, 4th Edition (RTR4).

VideoPipe

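A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star : ). **The next version of VideoPipe is under development; while remaining cross-platform and easy to get started with, its performance is expected to approach that of the official frameworks for various hardware platforms, such as DeepStream.**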

Language: C++ | License: Apache-2.0 | Stargazers: 1245 | Issues: 20 | Issues: 21

comfyui_segment_anything

Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.

Language: Python | License: Apache-2.0 | Stargazers: 589 | Issues: 6 | Issues: 54