AI-X-King

0 followers | 0 following

AI-X-King's repositories

AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

apps

A benchmark for LLM coding

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License: Apache-2.0 | Stargazers: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

axolotl

Go ahead and axolotl questions

License: Apache-2.0 | Stargazers: 0 | Issues: 0

CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

data-preparation

Code used for sourcing and cleaning the BigScience ROOTS corpus

License: Apache-2.0 | Stargazers: 0 | Issues: 0

data_management_LLM

Collection of training data management explorations for large language models

Stargazers: 0 | Issues: 0

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

License: Apache-2.0 | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

License: CC-BY-4.0 | Stargazers: 0 | Issues: 0

ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)

License: MIT | Stargazers: 0 | Issues: 0

FasterTransformer

Transformer-related optimization, including BERT and GPT

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0

lhotse

Tools for handling speech data in machine learning projects.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

License: Apache-2.0 | Stargazers: 0 | Issues: 0

llama.cpp

LLM inference in C/C++

License: MIT | Stargazers: 0 | Issues: 0

promptbase

All things prompt engineering

License: MIT | Stargazers: 0 | Issues: 0

PSST

Prosodic Speech Segmentation with Transformers

License: MIT | Stargazers: 0 | Issues: 0

pyctcdecode

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

pytorch-docker

Pure PyTorch Docker images.

Language: Shell | License: Apache-2.0 | Stargazers: 0 | Issues: 0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0

sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

sherpa

Streaming and non-streaming ASR server for next-gen Kaldi

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

License: Apache-2.0 | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

License: Apache-2.0 | Stargazers: 0 | Issues: 0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

whisper-finetune

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from Hugging Face.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0