Chengshun SHI (ChasonShi)

ChasonShi

Geek Repo

Company:Shandong University

Github PK Tool:Github PK Tool


Organizations
irlab-sdu

Chengshun SHI's starred repositories

Language:PythonStargazers:153Issues:0Issues:0

llm-datasets

High-quality datasets, tools, and concepts for LLM fine-tuning.

Stargazers:1186Issues:0Issues:0

awesome-llm-role-playing-with-persona

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

Stargazers:421Issues:0Issues:0

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1383Issues:0Issues:0

Counting-Stars

Counting-Stars (★)

Language:Jupyter NotebookLicense:MITStargazers:67Issues:0Issues:0

LongBench

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Language:PythonLicense:MITStargazers:574Issues:0Issues:0

Streamer-Sales

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️

Language:PythonLicense:Apache-2.0Stargazers:2092Issues:0Issues:0

sad

Situational Awareness Dataset

Language:HTMLLicense:CC-BY-4.0Stargazers:10Issues:0Issues:0
Language:PythonStargazers:17Issues:0Issues:0

WeChatMsg

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Language:PythonLicense:GPL-3.0Stargazers:32116Issues:0Issues:0

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1450Issues:0Issues:0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonLicense:MITStargazers:1122Issues:0Issues:0

NLP_ability

总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力

Language:PythonStargazers:6593Issues:0Issues:0

awesome-transformers-LM-analytics

This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalization ability, and other properties of language models through theoretical analysis, empirical analysis, or a combination of both.

Language:PythonStargazers:21Issues:0Issues:0

LiGO

[ICLR 2023] "Learning to Grow Pretrained Models for Efficient Transformer Training" by Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogerio Feris, David Cox, Zhangyang Wang, Yoon Kim

Language:PythonLicense:MITStargazers:80Issues:0Issues:0

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language:SystemVerilogStargazers:6789Issues:0Issues:0

qwen2_moe_mergekit

根据Qwen2(Qwen1.5)模型生成qwen2 MoE模型的工具

Language:PythonLicense:Apache-2.0Stargazers:7Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:25367Issues:0Issues:0

gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

Language:JavaScriptStargazers:740Issues:0Issues:0

RecMamba

Uncovering Selective State Space Model's Capabilities in Lifelong Sequential Recommendation

Language:PythonStargazers:22Issues:0Issues:0

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Language:PythonStargazers:1323Issues:0Issues:0

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4290Issues:0Issues:0

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language:PythonLicense:Apache-2.0Stargazers:817Issues:0Issues:0

laserRMT

This is our own implementation of 'Layer Selective Rank Reduction'

Language:PythonLicense:Apache-2.0Stargazers:228Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:24526Issues:0Issues:0

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:7215Issues:0Issues:0

CAA

Steering Llama 2 with Contrastive Activation Addition

Language:Jupyter NotebookLicense:MITStargazers:71Issues:0Issues:0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:6775Issues:0Issues:0

leetcode-master

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Language:ShellStargazers:49689Issues:0Issues:0

DiffMIC

[MICCAI 2023] DiffMIC: Dual-Guidance Diffusion Network for Medical Image Classification

Language:PythonStargazers:131Issues:0Issues:0