yuuxiaooqingg

0

followers

0

following

stars

yuuxiaooqingg's starred repositories

ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language:PythonApache-2.033900

bowdpr

Codebase for [Paper] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval

Language:PythonApache-2.01200

tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training & inference

Language:PythonMIT61800

token_visualizer

Token level visualization tools for large language models

Language:Python4600

LongAlign

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Language:PythonApache-2.019900

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language:Jupyter NotebookApache-2.0695400

st-moe-pytorch

Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch

Language:PythonMIT28200

TransnormerLLM

Official implementation of TransNormerLLM: A Faster and Better LLM

Language:PythonApache-2.022400

mutransformers

some common Huggingface transformers in maximal update parametrization (µP)

Language:Jupyter NotebookMIT7600

math-lm

Language:PythonMIT103200

CLEX

[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models

Language:PythonMIT7200

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

MIT341100