yuuxiaooqingg

yuuxiaooqingg

Geek Repo

0

followers

0

following

Github PK Tool:Github PK Tool

yuuxiaooqingg's starred repositories

ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:339Issues:0Issues:0

bowdpr

Codebase for [Paper] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval

Language:PythonLicense:Apache-2.0Stargazers:12Issues:0Issues:0

tensor_parallel

Automatically split your PyTorch models on multiple GPUs for training & inference

Language:PythonLicense:MITStargazers:618Issues:0Issues:0

token_visualizer

Token level visualization tools for large language models

Language:PythonStargazers:46Issues:0Issues:0

LongAlign

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Language:PythonLicense:Apache-2.0Stargazers:199Issues:0Issues:0

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6954Issues:0Issues:0

st-moe-pytorch

Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch

Language:PythonLicense:MITStargazers:282Issues:0Issues:0

TransnormerLLM

Official implementation of TransNormerLLM: A Faster and Better LLM

Language:PythonLicense:Apache-2.0Stargazers:224Issues:0Issues:0

mutransformers

some common Huggingface transformers in maximal update parametrization (µP)

Language:Jupyter NotebookLicense:MITStargazers:76Issues:0Issues:0
Language:PythonLicense:MITStargazers:1032Issues:0Issues:0

CLEX

[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models

Language:PythonLicense:MITStargazers:72Issues:0Issues:0

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

License:MITStargazers:3411Issues:0Issues:0