TissueC

Hao's starred repositories

elasticsearch

Free and Open, Distributed, RESTful Search Engine

Language: Java | License: NOASSERTION | 68711 2685 35677

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | License: Apache-2.0 | 32232 476 18068

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Python | License: Apache-2.0 | 7506 111 289

outlines

Structured Text Generation

Language: Python | License: Apache-2.0 | 7324 46 514

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | 6497 40 681

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language: Python | License: MIT | 6170 69 151

FasterTransformer

Transformer-related optimization, including BERT, GPT

Language: C++ | License: Apache-2.0 | 5681 64 623

MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Language: Python | License: Apache-2.0 | 4460 52 139

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | 4248 42 175

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | 4172 47 261

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Language: Python | License: Apache-2.0 | 3899 30 338

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python | License: Apache-2.0 | 3391 24 430

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language: Python | License: MIT | 1900 18 45

mup

maximal update parametrization (µP)

Language: Jupyter Notebook | License: MIT | 1255 29 59

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language: Python | License: MIT | 931 15 35

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language: Python | License: Apache-2.0 | 810 7 18

Yuan-2.0

Yuan 2.0 Large Language Model

Language: Python | License: NOASSERTION | 672 5 91

tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training & inference

Language: Python | License: MIT | 604 7 65

Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | 577 7 90

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language: Python | License: MIT | 495 31 89

Your browser's reference manager: automatic paper detection (Arxiv, OpenReview & more), publication venue matching and code repository discovery! Also enhances ArXiv: BibTex citation, Markdown link, direct download and more!

Language: JavaScript | License: MIT | 478 7 76