Qiaolin Wang (qiaolinwang)

qiaolinwang

Geek Repo

Company:Wuhan University

Location:New York

Github PK Tool:Github PK Tool

Qiaolin Wang's starred repositories

LlamaVoice

LlamaVoice is a llama-based large voice generation model, providing inference and training ability.

Language:PythonStargazers:214Issues:0Issues:0

Summer2025-Internships

Collection of Summer 2025 tech internships!

Stargazers:34333Issues:0Issues:0

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:11707Issues:0Issues:0

SpeechGPT

SpeechGPT Series: Speech Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:1241Issues:0Issues:0

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:9594Issues:0Issues:0

llama-trl

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

Language:PythonLicense:Apache-2.0Stargazers:178Issues:0Issues:0

LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Language:PythonStargazers:355Issues:0Issues:0

PPO-for-Beginners

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

Language:PythonLicense:MITStargazers:739Issues:0Issues:0

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language:PythonStargazers:10120Issues:0Issues:0

libriheavy

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Language:PythonLicense:Apache-2.0Stargazers:172Issues:0Issues:0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7520Issues:0Issues:0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:28112Issues:0Issues:0

MELD

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation

Language:PythonLicense:GPL-3.0Stargazers:793Issues:0Issues:0
Language:PythonStargazers:5Issues:0Issues:0

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31847Issues:0Issues:0
Language:PythonStargazers:167Issues:0Issues:0

UniSA

UniSA: Unified Generative Framework for Sentiment Analysis

Language:PythonLicense:MITStargazers:45Issues:0Issues:0

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonLicense:BSD-3-ClauseStargazers:2732Issues:0Issues:0

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language:PythonLicense:NOASSERTIONStargazers:564Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:12002Issues:0Issues:0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License:MITStargazers:5929Issues:0Issues:0

awesome-emotion-recognition-in-conversations

A comprehensive reading list for Emotion Recognition in Conversations

Stargazers:252Issues:0Issues:0

WeChatMsg

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Language:PythonLicense:GPL-3.0Stargazers:33615Issues:0Issues:0

mimalloc

mimalloc is a compact general purpose allocator with excellent performance.

Language:CLicense:MITStargazers:10449Issues:0Issues:0

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonLicense:Apache-2.0Stargazers:7289Issues:0Issues:0

elevenlabs-python

The official Python API for ElevenLabs Text to Speech.

Language:PythonLicense:MITStargazers:2123Issues:0Issues:0

Diff-HierVC

Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

Language:PythonStargazers:189Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:140Issues:0Issues:0

SpeechMOS

Easy-to-Use Speech MOS predictors

Language:PythonLicense:MITStargazers:215Issues:0Issues:0

DTTNet-Pytorch

An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation

Language:PythonLicense:Apache-2.0Stargazers:74Issues:0Issues:0