Nealcly's starred repositories

License: NOASSERTION | Stargazers: 443 | Issues: 0

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language: Python | License: Apache-2.0 | Stargazers: 1582 | Issues: 0

CTCWordBeamSearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model.

Language: C++ | License: MIT | Stargazers: 549 | Issues: 0

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

Language: Python | License: Apache-2.0 | Stargazers: 17941 | Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 9453 | Issues: 0

Language: Jupyter Notebook | Stargazers: 4 | Issues: 0

CHEF

Source code for the paper "CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking"

Language: Python | Stargazers: 65 | Issues: 0

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 29179 | Issues: 0

HMT

Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"

Language: Python | License: MIT | Stargazers: 21 | Issues: 0

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | Stargazers: 8414 | Issues: 0

Language: Python | License: MIT | Stargazers: 33 | Issues: 0

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 54285 | Issues: 0

Automated-Fact-Checking-Resources

Links to conference/journal publications in automated fact-checking (resources for the TACL22/EMNLP23 paper).

License: MIT | Stargazers: 369 | Issues: 0

detect-gpt

DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature

Language: Python | License: MIT | Stargazers: 341 | Issues: 0

Language: Jupyter Notebook | Stargazers: 18 | Issues: 0

neural-Jacana

Code for the neural-Jacana aligner and data for the MultiMWA dataset.

Language: Python | Stargazers: 19 | Issues: 0

genius

💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.

Language: Python | Stargazers: 175 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 13 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 23 | Issues: 0

m2scorer

MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.

Language: Python | License: GPL-2.0 | Stargazers: 144 | Issues: 0

editpro

Learning to Model Editing Processes

Stargazers: 26 | Issues: 0

kilogram

The KiloGram Tangrams dataset

Language: Jupyter Notebook | Stargazers: 50 | Issues: 0

EditScorer

Code for the EMNLP 2022 paper "Improved grammatical error correction by ranking elementary edits"

Language: Python | Stargazers: 18 | Issues: 0

CoSDA-ML

CoSDA-ML: Multi-Lingual Code-Switching Data Augmentation for Zero-Shot Cross-Lingual NLP

Language: Python | Stargazers: 49 | Issues: 0

Cross-Align

EMNLP2022 "Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment"

Language: Python | License: Apache-2.0 | Stargazers: 15 | Issues: 0

NMLA-NAT

Code for the NeurIPS 2022 Spotlight paper "Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"

Language: Python | License: MIT | Stargazers: 20 | Issues: 0

GLUE-X

We use 14 datasets as out-of-distribution (OOD) test data and evaluate 21 widely used models on 8 NLU tasks. Our findings confirm that OOD accuracy in NLP tasks deserves more attention, since significant performance decay relative to in-distribution (ID) accuracy appears in all settings.

Language: Python | Stargazers: 114 | Issues: 0

UniSumm

UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning

Language: Python | License: MIT | Stargazers: 60 | Issues: 0