HughLan1214's starred repositories
mixpanel-js-wrapper
A GitHub project created under the Mixpanel organization to store the Mixpanel JS wrapper
Dialogue-Topic-Segmenter
Improving Unsupervised Dialogue Topic Segmentation with Utterance-Pair Coherence Scoring
Speaker_Verification
TensorFlow implementation of "Generalized End-to-End Loss for Speaker Verification"
VoiceprintRecognition-Pytorch
This project implements several advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.
You-Only-Speak-Once
Deep Learning - one-shot learning for speaker recognition using filter banks
camerakit-js
Library for Web Camera API. Increase ease of use and compatibility in your next project
meetingsdk-react-sample
Use the Zoom Meeting SDK in React
flask-video-stream
Simple Python 3 script for webcam video streaming using Flask.
jpeg_camera
JpegCamera – JavaScript webcam image capture library
streamlit-webrtc
Real-time video and audio streams over the network, with Streamlit.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
SuperDialseg
Supervised Dialogue Segmentation
BERT-like-is-All-You-Need
The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Self-Supervised-Embedding-Fusion-Transformer
The code for our IEEE Access (2020) paper - Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion
mlx-examples
Examples in the MLX framework
sparrow-donut
Data extraction with Donut ML model
EMO-AffectNetModel
Dynamic and static models for real-time facial emotion recognition