Markus Dreyer's starred repositories

simpletransformers

Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

Language:PythonLicense:Apache-2.0Stargazers:3978Issues:62Issues:1112

gdown

Google Drive Public File Downloader when Curl/Wget Fails

Language:PythonLicense:MITStargazers:3877Issues:22Issues:161

lmql

A language for constraint-guided and efficient LLM programming.

Language:PythonLicense:Apache-2.0Stargazers:3288Issues:22Issues:238

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2081Issues:26Issues:53

lichobile

lichess.org mobile application

Language:TypeScriptLicense:GPL-3.0Stargazers:1968Issues:66Issues:1969

python-tabulate

Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.

Language:PythonLicense:MITStargazers:1965Issues:22Issues:215

libfsm

DFA regular expression library & friends

Language:CLicense:BSD-2-ClauseStargazers:897Issues:24Issues:83

st-annotated-text

A simple component to display annotated text in Streamlit apps.

Language:PythonLicense:Apache-2.0Stargazers:473Issues:11Issues:30

falcontune

Tune any FALCON in 4-bit

Language:PythonLicense:Apache-2.0Stargazers:472Issues:13Issues:35

python_example

Example pybind11 module built with a Python-based build system

Language:PythonLicense:NOASSERTIONStargazers:465Issues:15Issues:51

openskill.py

Multiplayer Rating System. No Friction.

Language:Jupyter NotebookLicense:MITStargazers:240Issues:6Issues:26

ACLPUB

The official tool for creating proceedings for conferences of the Association for Computational Linguistics (ACL).

seahorse

Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 quality dimensions: comprehensibility, repetition, grammar, attribution, main idea(s), and conciseness, covering 6 languages, 9 systems and 4 datasets.

Commonsense-Dialogues

A crowdsourced dataset of dialogues grounded in social contexts involving utilization of commonsense.

MACE

Multi-Annotator Competence Estimation tool

qags

Question Answering and Generation for Summarization

chess-image-generator

Accepts FEN, PGN or array data for chess board and generates PNG or buffer.

Language:JavaScriptLicense:MITStargazers:53Issues:2Issues:13

chess-graph

A program that will produce a graphical sunburst chart of chess openings from the PGN that is provided to it

fact-graph

Implementation of the paper "FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations (NAACL 2022)"

Language:PythonLicense:NOASSERTIONStargazers:45Issues:7Issues:5
Language:Jupyter NotebookLicense:MITStargazers:44Issues:1Issues:0

cnn-dailymail

Code to obtain the CNN / Daily Mail dataset (non-anonymized) for summarization (Python3)

Language:PythonStargazers:43Issues:2Issues:0

lattice-generation

Code for Massive-scale Decoding for Text Generation using Lattices

Language:HTMLStargazers:41Issues:2Issues:0

BartGraphSumm

Implementation of the paper "Efficiently Summarizing Text and Graph Encodings of Multi-Document Clusters (NAACL 2021)"

Language:PythonLicense:MIT-0Stargazers:25Issues:8Issues:5

abstractive-factual-tradeoff

Code and data for the Dreyer et al (2023) paper on abstractiveness and factuality in abstractive summarization

Language:PythonLicense:MIT-0Stargazers:10Issues:7Issues:0

py_common_subseq

A re-usable Python micro-library that finds all of the subsequences shared between two sequences (like strings or lists) in polynomial time.

Language:PythonLicense:MITStargazers:10Issues:2Issues:0

bouncing-ball-game-with-pygame

simple bouncing ball with python and pygame

Language:PythonStargazers:3Issues:2Issues:0

Bouncing_Balls

Simple python pygame program with basic physics.

Language:PythonLicense:MITStargazers:2Issues:1Issues:0