Jason Ni's starred repositories
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
cuda_programming
Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch
ffmpeg-video-player
An FFmpeg and SDL Tutorial.
web-audio-api-rs
A Rust implementation of the Web Audio API, for use in non-browser contexts
virtio-drivers
VirtIO guest drivers in Rust.
whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
stream-translator-gpt
A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.
mvisor-win-vgpu-driver
Implementation of OpenGL on windows guest virtual machine using Mesa/Virgl protocol.
yolov9-face-detection
Training YOLOv9 for face detection on the WIDER Face dataset
Voice-activity-detection-VAD-paper-and-code
Voice activity detection (VAD) paper and code(From 198*~ )and its classification.
demucs.cpp
C++17 port of Demucs v3 (hybrid) and v4 (hybrid transformer) models with ggml and Eigen3
sdl2_video_player
video player built with ffmpeg and SDL2
How-to-find-if-an-image-is-bright-or-dark
Input image is resized to 10x10 pixel, to reduce the computation, Convert it to LAB color space to access the luminous channel which is independent of colors, Normalize pixel values to be in range of 0 - 1. Compare the mean value of pixels with a threshold value.
pmpp__programming_massively_parallel_processors
Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (Third Edition)
HighPerformanceComputing
Class of High Performance Computing taken at U.T.P 2017
pyannote-onnx
PyAnnote Voice Activity Detection (ONNX version)