ZhangXin's starred repositories
soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
SpeechTokenizer
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
soundstorm-speechtokenizer
Implementation of SoundStorm built upon SpeechTokenizer.
Vote2Cap-DETR
[CVPR 2023] Vote2Cap-DETR and [T-PAMI 2024] Vote2Cap-DETR++; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning methods
SLMTokBench
SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"
LLM-Verified-Retrieval
Repo for Llatrieval
NumPytorch
A numpy based python package with pytorch-like interface for training a Convolution Neural Network.