lijinfeng0713

Jinfeng Li's repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.

Language: Python · License: Apache-2.0 · 1 · 0 · 0

Adan

Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models

Language: Python · License: Apache-2.0 · 0 · 0 · 0

adversarial-robustness-toolbox

Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

Language: Python · License: MIT · 0 · 0 · 0

AGIEval

License: MIT · 0 · 0 · 0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

License: Apache-2.0 · 0 · 0 · 0

ChatGLM-Finetuning

Fine-tuning of the ChatGLM-6B and ChatGLM2-6B models on specific downstream tasks, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more.

0 · 0 · 0

ChatGLM-Tuning

An affordable ChatGPT implementation based on ChatGLM-6B + LoRA.

License: MIT · 0 · 0 · 0

ChineseNLPCorpus

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

0 · 0 · 0

DeepMatch

A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors which can be used for ANN search.

License: Apache-2.0 · 0 · 0 · 0

EasyRec

A framework for large scale recommendation algorithms.

License: Apache-2.0 · 0 · 0 · 0

EconML

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

Language: Jupyter Notebook · License: NOASSERTION · 0 · 1 · 0

ganbert

Enhancing the BERT training with Semi-supervised Generative Adversarial Networks

License: Apache-2.0 · 0 · 0 · 0

GPT-4-LLM

Instruction Tuning with GPT-4

License: Apache-2.0 · 0 · 0 · 0

helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).

License: Apache-2.0 · 0 · 0 · 0

imbalanced-dataset-sampler

A (PyTorch) imbalanced dataset sampler for oversampling low-frequency classes and undersampling high-frequency ones.

License: MIT · 0 · 0 · 0

jailbreak_llms

A dataset consisting of 6,387 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 666 jailbreak prompts).

License: MIT · 0 · 0 · 0

langchain-ChatGLM

langchain-ChatGLM: ChatGLM question answering over a local knowledge base, built with langchain.

License: Apache-2.0 · 0 · 0 · 0

llm-attacks

Universal and Transferable Attacks on Aligned Language Models

License: MIT · 0 · 0 · 0

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

License: MIT · 0 · 0 · 0

LLMZoo

⚡LLM Zoo is a project that provides data, models, and an evaluation benchmark for large language models.⚡

License: Apache-2.0 · 0 · 0 · 0

mt-dnn

Multi-Task Deep Neural Networks for Natural Language Understanding

License: MIT · 0 · 0 · 0

pCLUE

pCLUE: a multi-task prompt learning dataset of 1,000,000+ samples

0 · 0 · 0

recmetrics

A library of metrics for evaluating recommender systems

License: MIT · 0 · 0 · 0

S-Eval

S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models

License: NOASSERTION · 0 · 0 · 0

shap

A game theoretic approach to explain the output of any machine learning model.

License: MIT · 0 · 0 · 0

T2Ranking

T2Ranking: A large-scale Chinese benchmark for passage ranking.

0 · 0 · 0

transformers-tutorials

GitHub repo with tutorials for fine-tuning transformers on different NLP tasks

License: MIT · 0 · 0 · 0

transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

0 · 0 · 0

xtuner

XTuner is a toolkit for efficiently fine-tuning LLMs

License: Apache-2.0 · 0 · 0 · 0

zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

License: MIT · 0 · 0 · 0