quanminchaoren's starred repositories

cardboard

Open source Cardboard SDK and samples

Language:C++License:NOASSERTIONStargazers:1471Issues:0Issues:0

gvr-android-sdk

Google VR SDK for Android

License:NOASSERTIONStargazers:3278Issues:0Issues:0

resonance-audio

Resonance Audio Source Code

Language:C++License:Apache-2.0Stargazers:494Issues:0Issues:0

spatialaudio-unity

This repository provides plugins, tools and samples for integrating spatial audio and acoustics into your Unity 3D applications and games.

Language:C++License:MITStargazers:113Issues:0Issues:0

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:28454Issues:0Issues:0

gstreamer

GStreamer open-source multimedia framework

Language:CLicense:NOASSERTIONStargazers:2238Issues:0Issues:0

webrtc-web

Realtime communication with WebRTC

Language:JavaScriptLicense:Apache-2.0Stargazers:748Issues:0Issues:0

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Language:PythonLicense:NOASSERTIONStargazers:1617Issues:0Issues:0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:6586Issues:0Issues:0

spatial-media

Specifications and tools for 360º video and spatial audio.

Language:PythonLicense:NOASSERTIONStargazers:1804Issues:0Issues:0

omnitone

Spatial Audio Rendering on the web.

Language:JavaScriptLicense:Apache-2.0Stargazers:849Issues:0Issues:0

openalpr-android

Android Automatic License Plate Recognition library (http://www.openalpr.com) ported for android.

Language:JavaLicense:Apache-2.0Stargazers:770Issues:0Issues:0

aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Language:PythonLicense:NOASSERTIONStargazers:2040Issues:0Issues:0

ai-hub-models

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Language:PythonLicense:BSD-3-ClauseStargazers:367Issues:0Issues:0

v2rayA

A web GUI client of Project V which supports VMess, VLESS, SS, SSR, Trojan, Tuic and Juicity protocols. 🚀

Language:GoLicense:AGPL-3.0Stargazers:10497Issues:0Issues:0

whisper_android

Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android

Language:C++License:MITStargazers:171Issues:0Issues:0

ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Language:GoLicense:MITStargazers:82353Issues:0Issues:0

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language:C++License:Apache-2.0Stargazers:2638Issues:0Issues:0

ai-edge-torch

Supporting PyTorch models with the Google AI Edge TFLite runtime.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:230Issues:0Issues:0

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Language:C++License:Apache-2.0Stargazers:26362Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:65306Issues:0Issues:0

d2l-zh

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Language:PythonLicense:Apache-2.0Stargazers:59909Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:24952Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:62771Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++License:MITStargazers:33421Issues:0Issues:0

mnn-llm

llm deploy project based mnn.

Language:C++License:Apache-2.0Stargazers:1384Issues:0Issues:0