姚睿銘 (RayminQAQ)

RayminQAQ

Geek Repo

Company:@NTUAI

Github PK Tool:Github PK Tool

姚睿銘's starred repositories

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonLicense:Apache-2.0Stargazers:3178Issues:0Issues:0

Python-100-Days-zh_TW

Python - 100天從新手到大師(繁體中文)

Language:HTMLStargazers:220Issues:0Issues:0

Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

Stargazers:4479Issues:0Issues:0

udlbook

Understanding Deep Learning - Simon J.D. Prince

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:5743Issues:0Issues:0

lm-format-enforcer

Enforce the output format (JSON Schema, Regex etc) of a language model

Language:PythonLicense:MITStargazers:1213Issues:0Issues:0

iTransformer

Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah

Language:PythonLicense:MITStargazers:1015Issues:0Issues:0

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookLicense:MITStargazers:8711Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18475Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonLicense:Apache-2.0Stargazers:2179Issues:0Issues:0

BERT-like-is-All-You-Need

The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion Recognition

Language:PythonLicense:MITStargazers:110Issues:0Issues:0

generative-fusion-decoding

Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition systems like ASR and OCR, improving performance and efficiency by enabling seamless fusion without requiring re-training.

Language:PythonLicense:Apache-2.0Stargazers:58Issues:0Issues:0

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:6807Issues:0Issues:0

ckiptagger

CKIP Neural Chinese Word Segmentation, POS Tagging, and NER

Language:PythonLicense:GPL-3.0Stargazers:1626Issues:0Issues:0

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++License:MITStargazers:7720Issues:0Issues:0

Machine-Learning-Collection

A resource for learning about Machine learning & Deep Learning

Language:PythonLicense:MITStargazers:7335Issues:0Issues:0

transformer

Transformer: PyTorch Implementation of "Attention Is All You Need"

Language:PythonStargazers:2572Issues:0Issues:0

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language:PythonLicense:MITStargazers:3686Issues:0Issues:0

awesome-kan

A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold Network field.

Stargazers:1899Issues:0Issues:0

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookLicense:MITStargazers:13962Issues:0Issues:0

Transformer-TTS

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

Language:PythonLicense:MITStargazers:649Issues:0Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:8178Issues:0Issues:0

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4337Issues:0Issues:0

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookLicense:MITStargazers:57835Issues:0Issues:0

DiffuseVAE

Official implementation of "DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents"

Language:PythonLicense:MITStargazers:332Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:26333Issues:0Issues:0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:23887Issues:0Issues:0

voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

Stargazers:1636Issues:0Issues:0

bilibot

A local chatbot fine-tuned by bilibili user comments.

Language:PythonLicense:Apache-2.0Stargazers:3025Issues:0Issues:0

youtube-music

YouTube Music Desktop App bundled with custom plugins (and built-in ad blocker / downloader)

Language:TypeScriptLicense:MITStargazers:7236Issues:0Issues:0