luyizhou4

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval capabilities with context expansion. We also include key experimental results and instructions for reproducing and building on them.

Language: Python | License: Apache-2.0 | 57000

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | 538800

llm-hallucination-survey

Reading list on hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

86800

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | 2029500

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

License: CC0-1.0 | 1635100

tango

A family of diffusion models for text-to-audio generation.

Language: Python | License: NOASSERTION | 96600

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language: Python | License: NOASSERTION | 990900

ReinMax

Beyond Straight-Through

Language: Python | License: MIT | 8000

Large-Audio-Models

Keeps track of large models in the audio domain, including speech, singing, music, etc.

41700

KeSpeech

The repo provides information about the KeSpeech dataset.

License: NOASSERTION | 9800

FateZero

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Language: Jupyter Notebook | License: MIT | 108000

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

187300

FasterTransformer

Transformer-related optimization, including BERT and GPT

Language: C++ | License: Apache-2.0 | 568100

tiny-training

On-Device Training Under 256KB Memory [NeurIPS'22]

Language: Python | License: MIT | 42000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.

Language: Python | License: NOASSERTION | 501600