Nealcly's starred repositories

Explainable_GEC

The official code of the 2023 ACL paper "Enhancing Grammatical Error Correction Systems with Explanations"

Language:PythonStargazers:25Issues:0Issues:0

WikiSQL

A large annotated semantic parsing corpus for developing natural language interfaces.

Language:HTMLLicense:BSD-3-ClauseStargazers:1597Issues:0Issues:0

kg-2019

2019年百度的三元组抽取比赛,“科学空间队”源码

Language:PythonStargazers:764Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1099Issues:0Issues:0

fairseq-detect-hallucination

Detect hallucinated tokens for conditional sequence generation.

Language:PythonLicense:MITStargazers:61Issues:0Issues:0

disco-pointer

Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span Selection

Language:PythonStargazers:13Issues:0Issues:0

ALCE

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

Language:PythonLicense:MITStargazers:415Issues:0Issues:0

TransformerPrograms

[NeurIPS 2023] Learning Transformer Programs

Language:PythonStargazers:154Issues:0Issues:0

RRHF

[NIPS2023] RRHF & Wombat

Language:PythonStargazers:781Issues:0Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4398Issues:0Issues:0

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language:PythonLicense:MITStargazers:1355Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:164Issues:0Issues:0

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:5669Issues:0Issues:0

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language:PythonLicense:Apache-2.0Stargazers:1252Issues:0Issues:0

NaSGEC

Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)

Language:PythonStargazers:70Issues:0Issues:0
Language:PythonStargazers:923Issues:0Issues:0

OpenAlpaca

OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA

Language:PythonLicense:Apache-2.0Stargazers:302Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:2467Issues:0Issues:0

instruction-datasets

All available datasets for Instruction Tuning of Large Language Models

Stargazers:230Issues:0Issues:0

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Language:PythonLicense:Apache-2.0Stargazers:36850Issues:0Issues:0

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License:MITStargazers:1509Issues:0Issues:0

LLMZoo

⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡

Language:PythonLicense:Apache-2.0Stargazers:2906Issues:0Issues:0

GPT-4-LLM

Instruction Tuning with GPT-4

Language:HTMLLicense:Apache-2.0Stargazers:4098Issues:0Issues:0

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:Apache-2.0Stargazers:5859Issues:0Issues:0

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15102Issues:0Issues:0

ParroT

The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.

Language:PythonStargazers:166Issues:0Issues:0

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonLicense:MITStargazers:9847Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38382Issues:0Issues:0

MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

License:MITStargazers:3240Issues:0Issues:0