Christmas's starred repositories

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

Stargazers: 283557

Natural_Language_Processing_with_Transformers

Chinese translation of Natural Language Processing with Transformers, the most authoritative Transformers tutorial.

Stargazers: 333

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama 3, Mistral, InternLM2, GPT-4, LLaMA 2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python · License: Apache-2.0 · Stargazers: 3220

how-to-optim-algorithm-in-cuda

How to optimize some algorithms in CUDA.

Language: Cuda · Stargazers: 1187

CEPE

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Language: Python · License: MIT · Stargazers: 117

mmrazor

OpenMMLab Model Compression Toolbox and Benchmark.

Language: Python · License: Apache-2.0 · Stargazers: 1412

EAGLE

Official Implementation of EAGLE-1 and EAGLE-2

Language: Python · License: Apache-2.0 · Stargazers: 646
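
EAGLE belongs to the speculative-decoding family: a lightweight draft head proposes several tokens and the target model verifies them in a single pass. Below is a minimal sketch of the generic draft-and-verify loop for the greedy case; EAGLE's actual feature-level drafting is more involved, and draft_next / target_next are hypothetical callables that return a next-token id for a given prefix.

    def speculative_step(tokens, draft_next, target_next, k=4):
        # 1) Draft: propose k tokens cheaply and autoregressively.
        proposal = list(tokens)
        for _ in range(k):
            proposal.append(draft_next(proposal))
        # 2) Verify: accept drafted tokens while they match the target model's
        #    greedy choice; a real implementation scores all k positions with a
        #    single target forward pass instead of one call per position.
        accepted = list(tokens)
        for i in range(len(tokens), len(proposal)):
            t = target_next(proposal[:i])
            accepted.append(t)
            if t != proposal[i]:
                break  # first mismatch: discard the rest of the draft
        return accepted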

fastllm

A pure C++ LLM acceleration library for all platforms, with Python bindings. ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU; supports GLM, LLaMA, and MOSS base models and runs smoothly on mobile devices.

Language: C++ · License: Apache-2.0 · Stargazers: 3200

ntk_alibi

NTK-scaled version of the ALiBi position encoding for Transformers.

Stargazers: 61
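
For reference, standard ALiBi adds a per-head linear penalty to attention logits that grows with query-key distance. A NumPy sketch of the vanilla slopes and bias is below; the ntk_scale factor that softens the slopes for longer contexts is only an assumed illustration of the idea, not this repo's exact scaling rule.

    import numpy as np

    def alibi_slopes(n_heads):
        # Standard ALiBi slopes: a geometric sequence 2^(-8/n), 2^(-16/n), ...
        # (assumes n_heads is a power of two, as in the ALiBi paper).
        start = 2 ** (-8.0 / n_heads)
        return start ** np.arange(1, n_heads + 1)

    def alibi_bias(seq_len, n_heads, ntk_scale=1.0):
        pos = np.arange(seq_len)
        distance = pos[None, :] - pos[:, None]             # (seq, seq), j - i
        # Hypothetical NTK-style softening: shrink the slopes so the penalty
        # grows more slowly when the sequence exceeds the training length.
        slopes = alibi_slopes(n_heads) / ntk_scale
        return slopes[:, None, None] * distance[None, :, :]   # add to attention logits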

laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Language: Python · License: MIT · Stargazers: 344
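
LASER replaces selected weight matrices with low-rank approximations and reports that reasoning can improve. A minimal PyTorch sketch of the core operation, truncated SVD of a single weight matrix; which layers to target and how aggressively to truncate are the paper's contribution and are only placeholders here.

    import torch

    def low_rank_replace(weight, keep_frac=0.1):
        # Truncated SVD: keep only the top singular directions of the matrix.
        U, S, Vh = torch.linalg.svd(weight, full_matrices=False)
        k = max(1, int(keep_frac * S.numel()))     # illustrative rank budget
        return U[:, :k] @ torch.diag(S[:k]) @ Vh[:k, :]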

roformer

Rotary Transformer

Language: Python · License: Apache-2.0 · Stargazers: 740
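
RoFormer's rotary position embedding (RoPE) rotates pairs of query/key channels by a position-dependent angle, so attention scores depend only on relative positions. A minimal NumPy sketch of the interleaved-pair formulation:

    import numpy as np

    def rope(x, base=10000.0):
        # x: float array of shape (seq_len, dim) with dim even.
        seq_len, dim = x.shape
        pos = np.arange(seq_len)[:, None]                 # (seq, 1)
        freqs = base ** (-np.arange(0, dim, 2) / dim)     # (dim/2,)
        angles = pos * freqs                              # (seq, dim/2)
        cos, sin = np.cos(angles), np.sin(angles)
        x1, x2 = x[:, 0::2], x[:, 1::2]                   # interleaved channel pairs
        out = np.empty_like(x)
        out[:, 0::2] = x1 * cos - x2 * sin
        out[:, 1::2] = x1 * sin + x2 * cos
        return out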

Chinese-Vicuna

Chinese-Vicuna: a Chinese instruction-following LLaMA-based model; a low-resource Chinese LLaMA + LoRA recipe, structured after Alpaca.

Language: C · License: Apache-2.0 · Stargazers: 4144
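
The LoRA part of this recipe fine-tunes LLaMA by learning a low-rank update on top of frozen pretrained weights. A minimal PyTorch sketch of a LoRA-wrapped linear layer; the rank and scaling values are illustrative, not the repo's settings.

    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        # Effective weight is W + (alpha / r) * B @ A, with W kept frozen.
        def __init__(self, base: nn.Linear, r=8, alpha=16):
            super().__init__()
            self.base = base
            self.base.weight.requires_grad_(False)        # pretrained weight stays frozen
            self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
            self.B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: no change at start
            self.scale = alpha / r

        def forward(self, x):
            return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)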

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python · License: MIT · Stargazers: 5809
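
A hedged sketch of how the harness is typically driven from Python; the entry point and argument names vary across versions, so treat the call below as an assumption and check the project's README (the checkpoint and task names are just examples).

    import lm_eval

    results = lm_eval.simple_evaluate(
        model="hf",                                        # Hugging Face backend
        model_args="pretrained=meta-llama/Llama-2-7b-hf",  # example checkpoint
        tasks=["hellaswag", "arc_easy"],
        num_fewshot=0,
    )
    print(results["results"])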

exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Language: Python · License: MIT · Stargazers: 2668

baichuan-Dynamic-NTK-ALiBi

Implementation of Dynamic NTK-ALiBi for Baichuan: inference over longer texts without any fine-tuning.

Language: Python · Stargazers: 45

LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language: Python · License: MIT · Stargazers: 539

VisualGLM-6B

Chinese and English multimodal conversational language model.

Language: Python · License: Apache-2.0 · Stargazers: 4036

CogView2

Official code repo for the paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers".

Language: Python · License: Apache-2.0 · Stargazers: 932

CogView

Text-to-image generation. The repo for the NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".

Language: Python · License: Apache-2.0 · Stargazers: 1631

CodeGeeX2

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Language: Python · License: Apache-2.0 · Stargazers: 7600

CodeGeeX

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

Language: Python · License: Apache-2.0 · Stargazers: 7966

ChatGLM3

ChatGLM3 series: open bilingual chat LLMs.

Language: Python · License: Apache-2.0 · Stargazers: 13037
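
A usage sketch following the repo's documented quick start, assuming the transformers remote-code path; the chat method is provided by the model's custom code, and the exact invocation may differ between releases.

    from transformers import AutoModel, AutoTokenizer

    model_id = "THUDM/chatglm3-6b"
    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModel.from_pretrained(model_id, trust_remote_code=True).half().cuda().eval()

    # chat() comes from the model's remote code, not the core transformers API.
    response, history = model.chat(tokenizer, "Hello, what can you do?", history=[])
    print(response)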

reformer-pytorch

Reformer, the efficient Transformer, in PyTorch.

Language: Python · License: MIT · Stargazers: 2078
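
A hedged usage sketch of the package's language-model wrapper; the constructor arguments below follow its README as I recall it and may not match the current release exactly.

    import torch
    from reformer_pytorch import ReformerLM

    model = ReformerLM(
        num_tokens=20000,     # vocabulary size
        dim=512,
        depth=6,
        heads=8,
        max_seq_len=8192,
        causal=True,          # autoregressive LM with LSH attention
    )
    tokens = torch.randint(0, 20000, (1, 8192))
    logits = model(tokens)    # (1, 8192, 20000)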

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language: Python · License: MIT · Stargazers: 1256
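
YaRN extends the RoPE context window by interpolating frequency bands differently and adding an attention-temperature correction. The one-liner below only shows the simpler "NTK-aware" base rescaling that YaRN refines, not the full method.

    def ntk_scaled_rope_base(base=10000.0, dim=128, scale=4.0):
        # Enlarge the RoPE frequency base so that, for a context `scale` times
        # longer than training, high-frequency channels stay nearly untouched
        # while low-frequency channels are interpolated.
        return base * scale ** (dim / (dim - 2))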

Long-Context

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project, including evaluation scripts and benchmark tasks that test a model's information-retrieval capabilities with context expansion, as well as key experimental results and instructions for reproducing and building on them.

Language: Python · License: Apache-2.0 · Stargazers: 568

CLEX

[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models

Language: Python · License: MIT · Stargazers: 70

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language: Python · License: Apache-2.0 · Stargazers: 2552

LLMLingua

Compresses prompts and the KV cache to speed up LLM inference and improve the model's perception of key information, achieving up to 20x compression with minimal performance loss.

Language: Python · License: MIT · Stargazers: 4178
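
A hedged usage sketch based on LLMLingua's documented PromptCompressor interface; argument names and defaults may differ between releases, and the texts are placeholders.

    from llmlingua import PromptCompressor

    compressor = PromptCompressor()            # loads the default small compression model
    context = ["...long retrieved document chunk 1...", "...chunk 2..."]
    compressed = compressor.compress_prompt(
        context,
        instruction="Answer the question using the context.",
        question="What does the benchmark measure?",
        target_token=500,                      # rough budget for the compressed prompt
    )
    print(compressed["compressed_prompt"])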