mediagarden's starred repositories
ClipCap-Chinese
基于ClipCap的看图说话Image Caption模型
mobile-ffmpeg
FFmpeg for Android, iOS and tvOS. Not maintained anymore. Superseded by FFmpegKit.
Wave-U-Net-for-Speech-Enhancement
Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.
MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
hello_driver
HelloWorld for Linux Device Driver
AudioClassification-Pytorch
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
vddswitcher
A Windows console tool that uses parsec-vdd to switch to a virtual display.
person_search_demo
利用YOLOv3结合行人重识别模型,实现行人的检测识别,查找特定行人
tspi-linux-sdk
【非立创官方版本】LCEDA Tai-Shang Pi Linux SDK.立创开发板泰山派Linux SDK.
uwe5621ds-aml
uwe5621ds driver for amlogic platform
alsa-examples
generic alsa samples.
sherpa-ncnn
Real-time speech recognition using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Raspberry Pi, VisionFive2, LicheePi4A etc.
sherpa-onnx
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift