Beast code in Giters

i4never's starred repositories

HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Language:DockerfileUnlicense66624 402 665

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonApache-2.040465 394 1293

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.034959 342 2746

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language:PythonApache-2.032482 170 4779

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonApache-2.011692 206 2247

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonNOASSERTION10157 162 729

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonMIT7007 43 1001

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonApache-2.05386 55 540

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonApache-2.04532 76 89

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonApache-2.04474 47 191

GPT-4-LLM

Instruction Tuning with GPT-4

Language:HTMLApache-2.04175 43 34

NumCpp

C++ implementation of the Python Numpy library

Language:C++MIT3528 80 194

trafilatura

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Language:PythonApache-2.03499 31 365

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

MIT3416 64 53

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonNOASSERTION3325 40 169

Chinese-LangChain

中文langchain项目|小必应，Q.Talk，强聊，QiangTalk

Language:Python2672 25 56

chain-of-thought-hub

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Language:Jupyter NotebookMIT2537 37 34

TigerBot

TigerBot: A multi-language multi-task LLM

Language:PythonApache-2.02233 31 126

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language:PythonApache-2.01971 44 125

simple-computer

the scott CPU from "But How Do It Know?" by J. Clark Scott

Language:Go1884 43 2

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language:PythonApache-2.01852 35 325

MOSS-RLHF

Language:PythonApache-2.01274 34 52

clarity-ai

Come join the best place on the internet to learn AI skills. Use code "clarityai" for an extra 20% off.

Language:TypeScriptMIT1203 22 7

ml-aim

This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models

Language:PythonNOASSERTION685 18 5

xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

Language:PythonApache-2.0547 4 68

selfcheckgpt

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

Language:PythonMIT444 6 26

long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Language:PythonApache-2.0321 4 16

NBCE

Naive Bayes-based Context Extension

Language:Python311 6 7

perplexityai

A python api to use perplexity.ai

Language:Python207 8 51

i4never