Austin Zhang (ZhangAustin)

ZhangAustin

Geek Repo

Company:Microsoft

Location:Redmond

Github PK Tool:Github PK Tool

Austin Zhang's repositories

cocktailparty

Multi-Modal Multi-Channel System and Corpus For Cocktail Party Problem

algo

Set up a personal VPN in the cloud

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:1Issues:0

ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0

asteroid

The PyTorch-based audio source separation toolkit for researchers || Pretrained models available

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

AugLy

A data augmentations library for audio, image, text, and video.

License:MITStargazers:0Issues:0Issues:0

BigCiDian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

Language:PythonStargazers:0Issues:1Issues:0

chinese_text_normalization

Chinese text normalization for speech processing

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

edex-ui

A cross-platform, customizable science fiction terminal emulator with advanced monitoring & touchscreen support.

Language:JavaScriptLicense:GPL-3.0Stargazers:0Issues:1Issues:0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Halide

a language for fast, portable data-parallel computation

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0

Hey-Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:0Issues:1Issues:0

jetson-inference

Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

Language:C++License:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

local-llms-analyse-finance

In this project, I explored how local LLMs can be used to label data and support analyses. Specifically, I used Llama2 model to automatically categorise my bank transaction data.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Megatron-LLM

distributed trainer for LLMs

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

MMdnn

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

netron

Visualizer for deep learning and machine learning models

Language:JavaScriptLicense:MITStargazers:0Issues:1Issues:0

Nuklear

A single-header ANSI C immediate mode cross-platform GUI library

Language:CStargazers:0Issues:1Issues:0

odas

ODAS: Open embeddeD Audition System

Language:CLicense:GPL-3.0Stargazers:0Issues:1Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, speaker embedding

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pyctcdecode

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pyflow

Fast, accurate and easy to run dense optical flow with python wrapper

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

PythonRobotics

Python sample codes for robotics algorithms.

Language:PythonLicense:MITStargazers:0Issues:3Issues:0

pytorch-struct

Fast, general, and tested differentiable structured prediction in PyTorch

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

rendezvous

Next generation videoconference system

Language:CLicense:GPL-3.0Stargazers:0Issues:1Issues:0

rnnt_decoder_cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

Language:CudaLicense:MITStargazers:0Issues:1Issues:0

SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Language:PythonStargazers:0Issues:2Issues:0

vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0