ChasonShi

Chengshun SHI's starred repositories

llm-datasets

High-quality datasets, tools, and concepts for LLM fine-tuning.

awesome-llm-role-playing-with-persona

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

42100

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language: Jupyter Notebook | License: NOASSERTION | 138300

Counting-Stars

Counting-Stars (★)

Language: Jupyter Notebook | License: MIT | 6700

LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language: Python | License: MIT | 57400

Streamer-Sales

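Streamer-Sales (Sales Champion): a sales-streamer LLM 🛒🎁 that, given a product's features, presents the product in a way designed to spark viewers' desire to buy. 🚀⭐ Includes a detailed data-generation pipeline ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, and ASR (speech-to-text) 🎙️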

Language: Python | License: Apache-2.0 | 209200

sad

Situational Awareness Dataset

Language: HTML | License: CC-BY-4.0 | 1000

WeChatMsg

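Extracts WeChat chat history and exports it to HTML, Word, and Excel documents for permanent storage; analyzes the chat history to generate an annual chat report; and uses the chat data to train a personal AI chat assistant.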

Language: Python | License: GPL-3.0 | 3211600

torchtitan

A native PyTorch Library for large model training

Language: Python | License: BSD-3-Clause | 145000

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python | License: MIT | 112200

NLP_ability

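A summary and overview of the knowledge a natural language processing (NLP) engineer needs to build up, including interview questions, fundamentals, and engineering skills, to strengthen core competitiveness.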

Language: Python | 659300

awesome-transformers-LM-analytics

This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.

Language: Python | 2100

LiGO

[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David Cox, Zhangyang Wang, Yoon Kim

Language: Python | License: MIT | 8000