Guan-Ting (Daniel) Lin (DanielLin94144)

DanielLin94144

Geek Repo

Company:National Taiwan University

Location:Taiwan

Home Page:https://daniellin94144.github.io/

Github PK Tool:Github PK Tool

Guan-Ting (Daniel) Lin's starred repositories

leetcode-master

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:25177Issues:207Issues:215

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21052Issues:179Issues:424

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:13972Issues:108Issues:309

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11076Issues:163Issues:240

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7295Issues:89Issues:114

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:2929Issues:47Issues:61

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language:PythonLicense:NOASSERTIONStargazers:2575Issues:37Issues:52

stable-audio-tools

Generative models for conditional audio generation

Language:PythonLicense:MITStargazers:2397Issues:42Issues:77

Awesome-Graph-LLM

A collection of AWESOME things about Graph-Related LLMs.

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonLicense:MITStargazers:1143Issues:57Issues:50

acl-style-files

Official style files for papers submitted to venues of the Association for Computational Linguistics

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Language:PythonLicense:MITStargazers:290Issues:26Issues:13

snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Language:PythonLicense:MITStargazers:258Issues:6Issues:16

CPED

CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | 中文个性情感对话数据集

Language:PythonLicense:Apache-2.0Stargazers:194Issues:4Issues:6

control-vc

This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"

Language:PythonLicense:NOASSERTIONStargazers:125Issues:9Issues:12

agc

Audiogen Codec

Language:PythonLicense:MITStargazers:107Issues:3Issues:1

EAT

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Language:PythonLicense:MITStargazers:94Issues:5Issues:5

DiscreteSpeechMetrics

Reference-aware automatic speech evaluation toolkit

Language:PythonLicense:MITStargazers:82Issues:4Issues:2

SpeechAgents

SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems

last

A JAX library for building lattice-based speech transducer models

Language:PythonLicense:Apache-2.0Stargazers:38Issues:7Issues:1

Interspeech2024_DiscreteSpeechChallenge

This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.

Spatial-AST

🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)

Language:PythonLicense:NOASSERTIONStargazers:26Issues:0Issues:0

PyToBI

A Toolkit for ToBI Labeling with Python Data Structures

Language:PythonLicense:GPL-3.0Stargazers:24Issues:2Issues:7

emphassess

This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses paper (de Seyssel et al., 2023).

Language:PythonLicense:NOASSERTIONStargazers:11Issues:5Issues:2
Language:PythonLicense:MITStargazers:4Issues:1Issues:0