Piji Li's starred repositories
pythia-mlkv
Multi-Layer Key-Value sharing experiments on Pythia models
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
GuoFeng-Webnovel
Multilingual Corpus of Web Fiction
HIT-dataset
This is a dataset of inter-shaft bearing based on the vibration signal of rotors and casings, which comes from a aero-engine test with inter-shaft bearing fault. Due to the large size of the data set file, we uploaded the dataset to Google Drive with the link as: https://drive.google.com/drive/folders/1Km1Go4ilB_bI033SBJ7eJ0uCzbqEqbgt?usp=sharing
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
MS-MARCO-Web-Search
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
StableDiffusionReconstruction
Takagi and Nishimoto, CVPR 2023
Awesome-TimeSeries-SpatioTemporal-LM-LLM
A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.
llm-resource
LLM全栈优质资源汇总