sliveysun's starred repositories

DiariZen

A toolkit for speaker diarization.

Language:Jupyter NotebookLicense:MITStargazers:140Issues:0Issues:0

weekly

科技爱好者周刊,每周五发布

Stargazers:47555Issues:0Issues:0

OmniSenseVoice

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Language:PythonLicense:Apache-2.0Stargazers:692Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:0Issues:0

SenseVoice

Multilingual Voice Understanding Model

Language:PythonLicense:NOASSERTIONStargazers:3356Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:12296Issues:0Issues:0

VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组

Language:PythonLicense:Apache-2.0Stargazers:5430Issues:0Issues:0

libsamplerate-js

Resample audio in node or browser using a web assembly port of libsamplerate.

Language:JavaScriptLicense:NOASSERTIONStargazers:32Issues:0Issues:0

voice-activity-detection-vad-realtime

Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)

Language:PythonStargazers:53Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:4331Issues:0Issues:0

llama4micro

A "large" language model running on a microcontroller

Language:C++License:MITStargazers:493Issues:0Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:6286Issues:0Issues:0

websocat

Command-line client for WebSockets, like netcat (or curl) for ws:// with advanced socat-like functions

Language:RustLicense:MITStargazers:7119Issues:0Issues:0

ESP32_Soundboard

Press button, play sound.

Language:CLicense:MPL-2.0Stargazers:22Issues:0Issues:0

Webspector

Web bases 8-64 channel spectrum analyzer for ESP32

Language:CLicense:GPL-3.0Stargazers:17Issues:0Issues:0

Adafruit-Audio-BFF-PCB

PCB files for the Adafruit Audio BFF

License:NOASSERTIONStargazers:2Issues:0Issues:0

WAVRecorder

Arduino Library for voice recording using Electret Microphones for ESP32, ESP8266 and Arduino Due.

Language:C++Stargazers:29Issues:0Issues:0

ADeus

An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.

Language:TypeScriptLicense:NOASSERTIONStargazers:2929Issues:0Issues:0

clapper

Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema

Language:TypeScriptLicense:GPL-3.0Stargazers:2060Issues:0Issues:0

omi

AI wearables

Language:CLicense:MITStargazers:3647Issues:0Issues:0

OpenGlass

Turn any glasses into AI-powered smart glasses

Language:CLicense:MITStargazers:3328Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20924Issues:0Issues:0

awesome-ai-music-generation

A curated compilation of AI-driven generative music resources and projects. Explore the blend of machine learning algorithms and musical creativity.

License:Apache-2.0Stargazers:181Issues:0Issues:0

quom

Quom generates a single header file from your for C/C++ sources. This is also known as amalgamation.

Language:PythonLicense:MITStargazers:160Issues:0Issues:0

goron

Yet another llvm based obfuscator

License:Apache-2.0Stargazers:595Issues:0Issues:0

obfuscator

ollvm,base on llvm-clang 5.0.2, 6.0.1 , 7.0.1,8.0,9.0,9.0.1,10.x,11.x,12.x,13.x,14.x,swift-llvm-clang 5.0,swift-llvm-clang 5.5

Stargazers:1083Issues:0Issues:0
Language:C++License:NOASSERTIONStargazers:237Issues:0Issues:0

Arkari

Yet another llvm based obfuscator based on goron.

Language:LLVMLicense:NOASSERTIONStargazers:378Issues:0Issues:0

esp-bin2elf

Converts a flash dump from an ESP8266 device into an ELF executable file for analysis and reverse engineering.

Language:PythonStargazers:15Issues:0Issues:0