zgq91's repositories
ai-edu
AI education materials for Chinese students, teachers and IT professionals.
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
anylabeling
Effortless AI-assisted data labeling with AI support from Segment Anything and YOLO!
build-your-own-x
Master programming by recreating your favorite technologies from scratch.
Catch2
A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)
cs-self-learning
计算机自学指南
DT
基于QT开发的组件式框架DT
hello-algo
《Hello 算法》一本动画图解、能运行、可提问的数据结构与算法入门书
FemtoDet
Official codes of ICCV2023 paper: <<FemtoDet: an object detection baseline for energy versus performance tradeoffs>>
gstreamer-example
Gstreamer开发教程。
ijkplayer
Android/iOS video player based on FFmpeg n3.4, with MediaCodec, VideoToolbox support.
langchain
⚡ Building applications with LLMs through composability ⚡
leedl-tutorial
《李宏毅深度学习教程》,PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
libsamplerate
An audio Sample Rate Conversion library
LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models
MobileSAM
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
ppl.nn
A primitive library for neural network
rtl8188gu
Driver for Linux RTL8188GU (RTL8710B) (VID:PID = 0x0BDA:0xB711)
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
simpleocv
Make a minimal OpenCV runable on any where, WIP
speechbrain
A PyTorch-based Speech Toolkit
srs
SRS is a simple, high-efficiency, real-time video server supporting RTMP, WebRTC, HLS, HTTP-FLV, SRT, MPEG-DASH, and GB28181.
TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Tube-Link
Universal Video Segmentaion For VSS, VPS and VIS (ICCV-2023)
VABlog
YUV/PCM/H264/H265/AAC/FFmpeg/Opengl. 这有丰富的音视频开发的学习资源、开发工具、优秀书籍、教程、面试题和开源项目,旨在帮助开发者和爱好者更好地学习、实践和工作。
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit