RayminQAQ

姚睿銘's starred repositories

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language:PythonApache-2.0317800

Python-100-Days-zh_TW

Python - 100天從新手到大師（繁體中文）

Language:HTML22000

Awesome-Transformer-Attention

An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites

447900

udlbook

Understanding Deep Learning - Simon J.D. Prince

Language:Jupyter NotebookNOASSERTION574300

lm-format-enforcer

Enforce the output format (JSON Schema, Regex etc) of a language model

Language:PythonMIT121300

iTransformer

Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah

Language:PythonMIT101500

Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Language:Jupyter NotebookMIT871100

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.01847500

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language:PythonApache-2.0217900

BERT-like-is-All-You-Need

The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion Recognition

Language:PythonMIT11000

Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition systems like ASR and OCR, improving performance and efficiency by enabling seamless fusion without requiring re-training.

Language:PythonApache-2.05800

fish-speech

Brand new TTS solution

Language:PythonNOASSERTION680700

ckiptagger

CKIP Neural Chinese Word Segmentation, POS Tagging, and NER

Language:PythonGPL-3.0162600

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++MIT772000

Snowballed_Hallucination

4400

Machine-Learning-Collection

A resource for learning about Machine learning & Deep Learning

Language:PythonMIT733500

transformer

Transformer: PyTorch Implementation of "Attention Is All You Need"

Language:Python257200

efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Language:PythonMIT368600

awesome-kan

A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold Network field.

189900

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT1396200

Transformer-TTS

A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"

Language:PythonMIT64900

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0817800

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT433700