Shudong Liu (sudanl)


Company: University of Macau

Location: Macau SAR, China

Home Page: sudanl.github.io

Twitter: @shudong_liu



Organizations
microsoft
MicrosoftCopilot
NLP2CT

Shudong Liu's starred repositories

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 34928 | Issues: 364 | Issues: 65

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 24971 | Issues: 206 | Issues: 212

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 23924 | Issues: 217 | Issues: 3685

WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 3595 | Issues: 33 | Issues: 1157

CampusShame

The internet still remembers! Companies that have reneged on verbal offers, letters of intent, or tripartite agreements during campus recruitment! However small our voices, we still want to do what little we can!

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1353 | Issues: 7 | Issues: 133

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language: Makefile | License: MIT | Stargazers: 1349 | Issues: 23 | Issues: 32

textgrad

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.

Language: Python | License: MIT | Stargazers: 1263 | Issues: 19 | Issues: 48

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language: Python | License: MIT | Stargazers: 934 | Issues: 15 | Issues: 35

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language: Python | License: Apache-2.0 | Stargazers: 812 | Issues: 7 | Issues: 18

DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Language: Python | License: MIT | Stargazers: 731 | Issues: 13 | Issues: 24

Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break KD down into knowledge elicitation and distillation algorithms, and explore the skill and vertical distillation of LLMs.

deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Language: Python | License: Apache-2.0 | Stargazers: 429 | Issues: 6 | Issues: 24

dl4math

Resources of deep learning for mathematical reasoning (DL4MATH).

magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing" — an efficient, high-quality synthetic data generation pipeline.

Language: Python | License: MIT | Stargazers: 273 | Issues: 6 | Issues: 13

Token-level-Direct-Preference-Optimization

Reference implementation for Token-level Direct Preference Optimization (TDPO)

Language: Python | License: Apache-2.0 | Stargazers: 76 | Issues: 1 | Issues: 3

easy-to-hard-generalization

Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"

Language: Python | License: Apache-2.0 | Stargazers: 44 | Issues: 6 | Issues: 0

CalibratedMath

Teaching Models to Express Their Uncertainty in Words

awesome-cultural-nlp

Resources for cultural NLP research

awesome-instruction-selector

Paper list and datasets for the paper "A Survey on Data Selection for LLM Instruction Tuning"

Stargazers: 22 | Issues: 0 | Issues: 0

MathCheck

Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

Language: Python | Stargazers: 18 | Issues: 0 | Issues: 0

privacy-preserving-prompt

Privacy-Preserving Prompt Tuning for Large Language Models

Language: Python | Stargazers: 9 | Issues: 0 | Issues: 0

TempoSum

Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization