Beast code in Giters

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Apache-2.0000

LLaMA-Adapter

Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

GPL-3.0000

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

Apache-2.0000

open_flamingo

An open-source framework for training large multimodal models.

MIT000

OpenFace

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

NOASSERTION000

PaddleSpeech

An Easy-to-use Speech Toolkit including SOTA ASR pipeline, influential TTS with text frontend and End-to-End Speech Simultaneous Translation.

Language:PythonApache-2.0000

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.

Apache-2.0000

rnn-transducer

A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition

Language:Python000

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Apache-2.0000

li563042811

Jason's Lab's repositories

av_hubert

avsr-conformer

AVSR_papers

ColossalAI

Conference-Acceptance-Rate

diffusers

e2e_lfmmi

fairseq

hugo

icefall

Leveraging-Self-Supervised-Learning-for-AVSR

lit-llama

LLaMA-Adapter

mediapipe

Multimodal-GPT

open_flamingo

OpenFace

PaddleSpeech

ray

rnn-transducer

RWKV-LM

sentencepiece

sherpa

sherpa-onnx

transformers

voxceleb_trainer

voxpopuli

wenet

whisper

youtube-dl