Yizhou Lu (luyizhou4)

luyizhou4

Geek Repo

Company:Shanghai Jiao Tong University

Location:Shanghai

Github PK Tool:Github PK Tool

Yizhou Lu's starred repositories

dasheng

Official PyTorch code for Deep Audio-Signal Holistic Embeddings

Language:PythonLicense:Apache-2.0Stargazers:9Issues:0Issues:0

SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Language:PythonLicense:MITStargazers:414Issues:0Issues:0

Youku-mPLUG

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks

Language:PythonLicense:Apache-2.0Stargazers:272Issues:0Issues:0

patch_conv

Patch convolution to avoid large GPU memory usage of Conv2D

Language:PythonLicense:MITStargazers:71Issues:0Issues:0

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonLicense:MITStargazers:512Issues:0Issues:0

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5191Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:10850Issues:0Issues:0

Long-Context

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval capabilities with context expansion. We also include key experimental results and instructions for reproducing and building on them.

Language:PythonLicense:Apache-2.0Stargazers:570Issues:0Issues:0

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5388Issues:0Issues:0

llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

Stargazers:868Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20295Issues:0Issues:0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License:CC0-1.0Stargazers:16351Issues:0Issues:0

tango

A family of diffusion models for text-to-audio generation.

Language:PythonLicense:NOASSERTIONStargazers:966Issues:0Issues:0

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language:PythonLicense:NOASSERTIONStargazers:9909Issues:0Issues:0

ReinMax

Beyond Straight-Through

Language:PythonLicense:MITStargazers:80Issues:0Issues:0

Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

Stargazers:417Issues:0Issues:0

KeSpeech

The repo provides information about KeSpeech dataset.

License:NOASSERTIONStargazers:98Issues:0Issues:0

FateZero

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Language:Jupyter NotebookLicense:MITStargazers:1080Issues:0Issues:0

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

Stargazers:1873Issues:0Issues:0

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:5681Issues:0Issues:0

tiny-training

On-Device Training Under 256KB Memory [NeurIPS'22]

Language:PythonLicense:MITStargazers:420Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:5016Issues:0Issues:0

CoFiPruning

[ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408

Language:PythonLicense:MITStargazers:187Issues:0Issues:0

public_talks

Materials of public talks given By SJTU X-LANCE members

Stargazers:14Issues:0Issues:0

autocut

用文本编辑器剪视频

Language:PythonLicense:Apache-2.0Stargazers:6443Issues:0Issues:0

retraining-free-pruning

[NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers

Language:PythonStargazers:157Issues:0Issues:0

AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Language:PythonLicense:NOASSERTIONStargazers:509Issues:0Issues:0

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Language:Jupyter NotebookStargazers:546Issues:0Issues:0

bert

TensorFlow code and pre-trained models for BERT

Language:PythonLicense:Apache-2.0Stargazers:37546Issues:0Issues:0

IguanaTex

A PowerPoint add-in allowing you to insert LaTeX equations into PowerPoint presentations on Windows and Mac

Language:VBALicense:NOASSERTIONStargazers:808Issues:0Issues:0