arian-askari

This dataset contains human judgements of answer equivalence. It is based on SQuAD (Stanford Question Answering Dataset) and comprises 9k human judgements of answer candidates generated by ALBERT on the SQuAD train set, plus 14k human judgements of answer candidates produced by BiDAF, LUKE, and XLNet on the SQuAD dev set.

License: Apache-2.0

awesome-pretrained-models-for-information-retrieval

A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

BioGPT

Language: Python, License: MIT

convert-tf

Implementation of the ConveRT (Conversational Representations from Transformers) paper in TensorFlow.

License: Apache-2.0

detect-gpt

DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python, License: Apache-2.0

Directional-Stimulus-Prompting

Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting"

License: Apache-2.0

dlkp

A deep learning library for identifying keyphrases from text

License: MIT

examples

Home for Elasticsearch examples available to everyone. It's a great way to get started.

License: Apache-2.0

finBERT

Financial Sentiment Analysis with BERT

License: Apache-2.0

GenRead

Code and Checkpoints for "Generate rather than Retrieve: Large Language Models are Strong Context Generators" in ICLR 2023.

hands-on-with-pke

inPars

Inquisitive Parrots for Search

License: Apache-2.0

IOT-Match

lex-glue

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

llama

Inference code for LLaMA models

Language: Python, License: GPL-3.0

Parrot_Paraphraser

A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.

Language: Python, License: Apache-2.0