Atnafu Lambebo Tonja's repositories
LLM-Finetuning-Toolkit
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
ALMA
This is the repository for the ALMA translation models.
alpaca-lora
Instruct-tune LLaMA on consumer hardware
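A minimal sketch of the LoRA-style instruct-tuning this repo enables, assuming the Hugging Face transformers and peft libraries; the checkpoint name and hyperparameters are illustrative, not the repo's exact configuration.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "decapoda-research/llama-7b-hf"  # example checkpoint; adjust to your setup
# 8-bit loading (requires bitsandbytes) keeps memory within consumer-GPU limits.
model = AutoModelForCausalLM.from_pretrained(base, load_in_8bit=True, device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(base)

# LoRA injects small trainable rank-decomposition matrices into the attention
# projections, so only a tiny fraction of weights is updated during tuning.
config = LoraConfig(
    r=8, lora_alpha=16, target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05, task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # typically well under 1% of all weights
```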
arxiv2024-triple-encoders
triple-encoders is a library for contextualizing distributed Sentence Transformers representations.
Aurora
🐳 Aurora is a Chinese-language MoE model, built on Mixtral-8x7B and further trained to activate the model's Chinese open-domain chat capability.
axolotl
Go ahead and axolotl questions
bible_scapper
Python script that scrapes any version of the Bible from bible.com
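A hypothetical sketch of the scraping approach, using requests and BeautifulSoup; the URL pattern, version ID, and CSS selector are assumptions for illustration, not the repository's actual code.

```python
import requests
from bs4 import BeautifulSoup

def fetch_chapter(version_id: int, book: str, chapter: int) -> str:
    # Assumed bible.com URL layout; verify against the live site before use.
    url = f"https://www.bible.com/bible/{version_id}/{book}.{chapter}"
    html = requests.get(url, timeout=30).text
    soup = BeautifulSoup(html, "html.parser")
    # Assumed container class for verse text.
    verses = soup.select("span.verse")
    return " ".join(v.get_text(" ", strip=True) for v in verses)

print(fetch_chapter(1, "GEN", 1))  # version 1 = KJV on bible.com (assumption)
```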
Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
Chinese-Mixtral
Chinese Mixtral mixture-of-experts large language models (Chinese Mixtral MoE LLMs)
datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
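A short usage sketch loading one of fairseq's pretrained translation models through torch.hub, following the pattern in the fairseq README; weights download on first use.

```python
import torch

# Pretrained WMT'19 En-De transformer; tokenizer and BPE settings match
# the model's published configuration.
en2de = torch.hub.load(
    "pytorch/fairseq", "transformer.wmt19.en-de",
    checkpoint_file="model1.pt", tokenizer="moses", bpe="fastbpe",
)
en2de.eval()
print(en2de.translate("Hello world!"))  # -> "Hallo Welt!"
```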
indaba-pracs-2023
Notebooks for the Practicals at the Deep Learning Indaba 2023.
Llama-2
All the projects related to Llama
llm-translator
Mixtral-based Ja-En (En-Ja) Translation model
lm-evaluation-harness
A framework for few-shot evaluation of language models.
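A minimal sketch using the harness's Python entry point (`lm_eval.simple_evaluate`); the model checkpoint and task choice are illustrative.

```python
import lm_eval

# Evaluate a small Hugging Face model zero-shot on one benchmark task.
results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=EleutherAI/pythia-160m",
    tasks=["hellaswag"],
    num_fewshot=0,
)
print(results["results"]["hellaswag"])  # per-task accuracy metrics
```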
masakhane-news
MasakhaNEWS: News Topic Classification for African Languages
masakhane-pos
POS for African languages
mgpt
Multilingual Generative Pretrained Model
mot
Multilingual Open Text
OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
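A short usage sketch with the sentence-transformers API; the multilingual checkpoint named here is one of the library's published models.

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("paraphrase-multilingual-MiniLM-L12-v2")
sentences = ["This is an example sentence.", "Ceci est une phrase d'exemple."]
embeddings = model.encode(sentences)  # shape: (2, 384)

# Cross-lingual similarity: both sentences map into one shared vector space.
print(util.cos_sim(embeddings[0], embeddings[1]))
```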
simple-nmt
This repo contains simple source code for advanced neural machine translation based on the sequence-to-sequence architecture.
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
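A minimal sketch of loading a TinyLlama checkpoint with transformers; the chat checkpoint name reflects the project's Hugging Face releases.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

# Generate a short continuation to sanity-check the model.
inputs = tokenizer("The capital of France is", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```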
TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B for free