HughLan1214

HughLan1214's starred repositories

Ignite

A static site generator for Swift developers.

Language: Swift | License: MIT | 159800

mixpanel-js-wrapper

A GitHub project created under the Mixpanel organization to store the Mixpanel JS wrapper

Language: JavaScript | 200

Dialogue-Topic-Segmenter

Improving Unsupervised Dialogue Topic Segmentation with Utterance-Pair Coherence Scoring

Language: Python | 5700

nlp

Language: Jupyter Notebook | License: MIT | 44000

SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Language: Python | License: MIT | 111900

Speaker_Verification

TensorFlow implementation of "Generalized End-to-End Loss for Speaker Verification"

Language: Python | License: MIT | 34900

VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. The project also supports the MelSpectrogram and Spectrogram data preprocessing methods.

Language: Python | License: Apache-2.0 | 72300

You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

Language: Jupyter Notebook | 14800

camerakit-js

Library for Web Camera API. Increase ease of use and compatibility in your next project

Language: TypeScript | License: MIT | 4000

meetingsdk-react-sample

Use the Zoom Meeting SDK in React

Language: JavaScript | License: NOASSERTION | 14900

flask-video-stream

Simple webcam video streaming Python 3 script using Flask.

Language: Python | License: MIT | 7000

jpeg_camera

JpegCamera – JavaScript webcam image capture library

Language: CoffeeScript | License: MIT | 36900

webcamjs

HTML5 Webcam Image Capture Library with Flash Fallback

Language: ActionScript | License: MIT | 249200

streamlit-webrtc

Real-time video and audio streams over the network, with Streamlit.

Language: Python | License: MIT | 132300

tutorial-streamlit-demo

Language: Python | License: MIT | 3400

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook | License: MIT | 571300

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | 6602200

SuperDialseg

Supervised Dialogue Segmentation

Language: Java | License: MIT | 500

DialogLM

Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."

Language: Python | License: MIT | 13500

BERT-like-is-All-You-Need

The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition

Language: Python | License: MIT | 11200

FAb-Net

PyTorch code for BMVC 2018 paper

Language: Jupyter Notebook | License: MIT | 8500

Self-Supervised-Embedding-Fusion-Transformer

The code for our IEEE Access (2020) paper "Multimodal Emotion Recognition with Transformer-Based Self Supervised Feature Fusion".

Language: Python | License: MIT | 10600

mlx-examples

Examples in the MLX framework

Language: Python | License: MIT | 569000

sparrow-donut

Data extraction with Donut ML model

Language: Python | License: Apache-2.0 | 4600

sparrow

Data processing with ML and LLM

Language: Python | License: GPL-3.0 | 256700

donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language: Python | License: MIT | 561900

hf-datasets

Language: Jupyter Notebook | 300

EMO-AffectNetModel

Dynamic and static models for real-time facial emotion recognition

Language: Jupyter Notebook | License: MIT | 7400

soxan

Wav2Vec for speech recognition, classification, and audio classification

Language: Jupyter Notebook | License: Apache-2.0 | 23900