milker90's starred repositories
screenshot-to-code
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
openWakeWord
An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.
search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
WhisperHallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
SimplePingHelper
How to perform a Ping in an iPhone app
NetSpeedMeasurer
网络实时测速 下载和上传的最大最小网速以及平均网速、以及实时速率
react-navigation
Routing and navigation for your React Native apps
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
mlx-examples
Examples in the MLX framework
mobile-aloha
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
pyannote-audio_speaker-diarization_cpp
C++ version of pyannote audio speaker diarizaiton pipeline
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
whisper.rn
React Native binding of whisper.cpp.
multilingual_kws
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
interactive-tutorials
Interactive Tutorials
Wake-UP-word-detection
Wake-up-word(WUW)system is an emerging development in recent times. Voice interaction with systems have made life ease and aids in multi-tasking. Apple, Google, Microsoft, Amazon have developed a custom wake-word engine, which are addressed by words such as ‘Hey Siri’. ‘Ok Google’, ‘Cortana’, ‘Alexa’. Our project focuses initially only detection and response to a customized wake-up command. The wake-up command used is “GOLUMOLU”. A wake-up-word detection system search for specific word and reads the word, where it rejects all other words, phrases and sounds. WUW system needs only less memory space, low computational cost and high precision. Artificial Neural Networks(ANN) have reduced the complexity, computational time, latency, thus the efficiency of system has improved. Deep learning has improved the efficiency of automatic speech recognition(SR), where wake word detection is a subset of SR but unlike keyword spotting and voice recognition. A deep learning RNN model is used for the training of the network. RNN are specifically used in case of temporal sequence data and has the ability to process data of different length but of same dimension. For training a model, labelled dataset is needed. We generated three forms of data: golumolu, negative and background. Such that, the model learns circumspectly and attentively detects when specific word found. To start communication with system, the wake word should be delivered. The main task of WUW detection system is to detect the speech, to identify WUW words among spoken words, to check whether the word spoken in altering context.
movie-remake
Generate remakes of movies using OpenAi
ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
english-note
从0开始学习英语语法
awesome-interview-questions
:octocat: A curated awesome list of lists of interview questions. Feel free to contribute! :mortar_board:
hello-algo
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing