MaiNLP (mainlp)

MaiNLP research lab at CIS, LMU Munich

Location: Germany

Home Page: https://mainlp.github.io/

Twitter: @MaiNLPlab

MaiNLP's repositories

awesome-human-label-variation

A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, accompanying the paper "The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation" (EMNLP 2022)

CrossRE

CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022)

Language: Python | License: GPL-3.0 | Stargazers: 44 | Issues: 2 | Issues: 0

germanic-lrl-corpora

A survey of corpora for Germanic low-resource languages and dialects

How-to-distill-your-BERT

Code for the paper: How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives (ACL 2023)

Language: Python | License: MIT | Stargazers: 9 | Issues: 3 | Issues: 0

escoxlmr

Repository for the paper ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain (ACL 2023)

Language: Python | License: Apache-2.0 | Stargazers: 8 | Issues: 1 | Issues: 0

ActiveAED

This repository contains the code for the paper "ActiveAED: A Human in the Loop Improves Annotation Error Detection". The goal of ActiveAED is to improve the performance of Annotation Error Detection (AED) models by involving a human annotator in the prediction loop.

Language: Python | License: MIT | Stargazers: 3 | Issues: 2 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

noisydialect

Does manipulating tokenization aid cross-lingual transfer? A study on POS tagging for non-standardized languages

Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0

convert-qcri-4dialects

Converts the Four Arabic Dialects POS-tagged Dataset (Darwish et al., 2018) to UPOS

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

maibaam-code

Code for preprocessing data for UD annotations and for tagging/parsing experiments of MaiBaam

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

NaLiBaSID

Repository with data and code for "Slot and Intent Detection Resources for Bavarian and Lithuanian: Assessing Translations vs Natural Queries to Digital Assistants"

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | Stargazers: 1 | Issues: 0 | Issues: 0

nnose

Codebase for NNOSE: Nearest Neighbor Occupational Skill Extraction

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

syntax-pre-training-for-RE

Silver Syntax Pre-training for Cross-Domain Relation Extraction (Findings of ACL 2023)

Language: Python | License: GPL-3.0 | Stargazers: 1 | Issues: 0 | Issues: 0
Language: NewLisp | License: CC-BY-SA-4.0 | Stargazers: 1 | Issues: 2 | Issues: 0

dialect-ToD-robustness

Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties (EACL 2024)

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

mainlp.github.io

MaiNLP research lab

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Shell | Stargazers: 0 | Issues: 0 | Issues: 0

common-voice

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.

Language: TypeScript | License: MPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

conllueditor

Fork of Orange-OpenSource/conllueditor

Language: Java | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

convert-restaure-occitan

Converts the Annotated Corpus for Occitan (10.5281/zenodo.1182948, Bras et al., 2018) to UPOS by splitting contractions

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

Eevee

An Easy Annotation Tool for Natural Language Processing

Stargazers: 0 | Issues: 0 | Issues: 0

el_esco

Codebase for Entity Linking in the Job Market Domain

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

label-variation-nli

Code used in "More Labels or Cases? Assessing Label Variation in Natural Language Inference"

Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

RC-analysis

Code for "What’s wrong with your model? A Quantitative Analysis of Relation Classification"

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SkillSpan

SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings

Language: Perl | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

subspace-chronicles

How Linguistic Information Emerges, Shifts and Interacts during Language Model Training (EMNLP 2023)

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0