RundongLi (李润东) (Rundong-Li)

Rundong-Li

Geek Repo

Company:University of Chinese Academy of Sciences

Location:Beijing, PRC

Home Page:lirundong16@mails.ucas.edu.cn

Github PK Tool:Github PK Tool

RundongLi (李润东)'s starred repositories

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language:PythonLicense:MITStargazers:935Issues:0Issues:0

promptsource

Toolkit for creating, sharing and using natural language prompts.

Language:PythonLicense:Apache-2.0Stargazers:2614Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54851Issues:0Issues:0

InternLM

Official release of InternLM2.5 7B base and chat models. 1M context support

Language:PythonLicense:Apache-2.0Stargazers:5938Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8834Issues:0Issues:0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:6664Issues:0Issues:0

geektime-spring-family

极客时间视频课程《玩转Spring全家桶》

Language:JavaStargazers:1085Issues:0Issues:0

CLUE

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Language:PythonStargazers:3933Issues:0Issues:0

ChatYuan

ChatYuan: Large Language Model for Dialogue in Chinese and English

Language:PythonLicense:NOASSERTIONStargazers:1902Issues:0Issues:0

PromptCLUE

PromptCLUE, 全中文任务支持零样本学习模型

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:647Issues:0Issues:0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:40157Issues:0Issues:0

streamlit

Streamlit — A faster way to build and share data apps.

Language:PythonLicense:Apache-2.0Stargazers:33752Issues:0Issues:0

semantic-kernel

Integrate cutting-edge LLM technology quickly and easily into your apps

Language:C#License:MITStargazers:20909Issues:0Issues:0

llm-action

本项目旨在分享大模型相关技术原理以及实战经验。

Language:HTMLLicense:Apache-2.0Stargazers:8293Issues:0Issues:0

WechatAnnualReport

微信聊天记录导出、微信年度报告生成!记录你的2023!

Language:PythonStargazers:119Issues:0Issues:0

OpenLLM

Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.

Language:PythonLicense:Apache-2.0Stargazers:9492Issues:0Issues:0

GPTs

leaked prompts of GPTs

Stargazers:27935Issues:0Issues:0

chatbot-ui

AI chat for every model.

Language:TypeScriptLicense:MITStargazers:27708Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:35075Issues:0Issues:0

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4230Issues:0Issues:0

self-llm

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6912Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:12740Issues:0Issues:0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:12889Issues:0Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15328Issues:0Issues:0

Yi

A series of large language models trained from scratch by developers @01-ai

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7526Issues:0Issues:0

airoboros

Customizable implementation of the self-instruct paper.

Language:PythonLicense:Apache-2.0Stargazers:983Issues:0Issues:0

GPT-4-LLM

Instruction Tuning with GPT-4

Language:HTMLLicense:Apache-2.0Stargazers:4122Issues:0Issues:0

UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Language:PythonLicense:MITStargazers:2184Issues:0Issues:0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29225Issues:0Issues:0

self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Language:PythonLicense:Apache-2.0Stargazers:3987Issues:0Issues:0