GUORUIWANG's repositories
ALCE
Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
bert_seq2seq
pytorch实现 Bert 做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持t5模型,支持GPT2进行文章续写。
ChatGLM-RLHF
对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF
ColossalAI
Making large AI models cheaper, faster and more accessible
ContextualSP
Multiple paper open-source codes of the Microsoft Research Asia DKI group
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
echarts
Apache ECharts is a powerful, interactive charting and data visualization library for browser
Entity_Alignment_Papers
Must-read papers on entity alignment published in recent years
evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
fastllm
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llmam, moss基座,手机端流畅运行
FlagEmbedding
Dense Retrieval and Retrieval-augmented LLMs
HanLP
中文分词 词性标注 命名实体识别 依存句法分析 语义依存分析 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Knover
Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle
LeCaRD
A Chinese legal case retrieval dataset.
llama_index
LlamaIndex (GPT Index) is a data framework for your LLM applications
MTBook
《机器翻译:基础与模型》肖桐 朱靖波 著 - Machine Translation: Foundations and Models
NBCE
Naive Bayes-based Context Extension
open-chatgpt
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. 从0开始实现一个ChatGPT.
Python-Study-Notes
fastreid
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
rlhf_chatglm
chatglm_rlhf_finetuning
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
tagger_rewriter
对话改写介绍文章
tf_geometric
Efficient and Friendly Graph Neural Network Library for TensorFlow 1.x and 2.x
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
zero_nlp
中文nlp应用(大模型、数据、模型、训练、推理)