Sam Dang (dangvansam)

User data from GitHub: https://github.com/dangvansam

Company: Techainer

Location: Ha Noi, Viet Nam

GitHub: @dangvansam

Sam Dang's repositories

viet-asr

VietASR - Vietnamese Automatic Speech Recognition

Language: Python | License: Apache-2.0 | Stargazers: 144 | Issues: 5 | Issues: 8

viet-tts

VietTTS: An Open-Source Vietnamese Text to Speech

Language: Python | License: Apache-2.0 | Stargazers: 66 | Issues: 6 | Issues: 7

pyannote-onnx

PyAnnote Voice Activity Detection (ONNX version)

Language: Jupyter Notebook | License: MIT | Stargazers: 18 | Issues: 0 | Issues: 0

phobert-text-classification

Vietnamese text classification using a pretrained model (PhoBERT)

text-detection-recognize-ctpn-tesseract

Text detection with CTPN and recognition with Tesseract

deepxi-flask-server

DeepXi with Flask Server

Language: Python | Stargazers: 5 | Issues: 1 | Issues: 0

website-ban-noi-that

Wooden furniture sales website - ASP.NET MVC + SQL Server

Language: CSS | Stargazers: 3 | Issues: 2 | Issues: 0

flask-img2wav

Image - Audio (Spectrogram) Translation, Flask Server, PHGI Phase Recovery From Spectrogram

Language: Python | Stargazers: 2 | Issues: 3 | Issues: 0

nvidia-nemo-jasper-quartznet-asr-vietnamese

Vietnamese speech recognition using the QuartzNet model (NVIDIA) + Flask demo

Language: Python | Stargazers: 1 | Issues: 1 | Issues: 0

invoice-data-extract

Data extraction from images (invoices) using deep learning

Language: HTML | Stargazers: 0 | Issues: 2 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

ABSADatasets

Public & Community-shared datasets for Aspect-based sentiment analysis and Text Classification

Language: HTML | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

CNWeb-ShopThoiTrang

Web Technology course project - Women's fashion website | C# (Entity Framework) + SQL Server

Language: CSS | Stargazers: 0 | Issues: 2 | Issues: 0
Stargazers: 0 | Issues: 2 | Issues: 0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language: TypeScript | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

DTrOCR

A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

fish-speech

Brand new TTS solution

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

flask-image2text

Detect text in images with EAST and recognize it with Tesseract

Language: C++ | Stargazers: 0 | Issues: 2 | Issues: 0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

ichigo

Local realtime voice AI

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

lanms-python3.6

Prebuilt lanms module for Python 3.6 + Windows

Language: C++ | Stargazers: 0 | Issues: 1 | Issues: 0

realtime-chat-flask-socketio

Realtime chat with Python Flask & SocketIO

Language: HTML | Stargazers: 0 | Issues: 1 | Issues: 0

SenseVoice

Multilingual Voice Understanding Model

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

speech-enhancement-flask

speech-enhancement-flask

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

ultravox

A fast multimodal LLM for real-time voice

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

whisperY

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python | License: BSD-2-Clause | Stargazers: 0 | Issues: 0 | Issues: 0