tommasoc80's repositories

EventStoryLine

Event StoryLine Corpus - annotated data, baselines and evaluation scripts, evaluation data.

Language: DM | License: NOASSERTION | Stargazers: 73 | Issues: 7 | Issues: 12

AbuseEval

Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"

Language: Python | License: NOASSERTION | Stargazers: 18 | Issues: 3 | Issues: 0

DALC

Dutch abusive language data

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 11 | Issues: 5 | Issues: 0

DNT

Diachronic News and Travel (DNT) corpus

License: CC0-1.0 | Stargazers: 2 | Issues: 2 | Issues: 0

cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels

Language: Python | License: AGPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

CLiC-it_2023_tutorial

This repository hosts materials from the CLiC-IT 2023 tutorial

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

connhyp

Connotative Hyperplane for computing connotative shift in dia* corpora

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 2 | Issues: 0

contextualized-topic-models

A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

COVID-19-disinformation

Dataset: Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society

Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

COVID19Tweet

WNUT-2020 Task 2: Identification of informative COVID-19 English Tweets

Stargazers: 0 | Issues: 0 | Issues: 0

crowd_expert_time

Data used for the comparison of event and temporal expression annotation between crowd and experts

License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

emory-qtm340

Practical Approaches to Data Science with Text

Language: Jupyter Notebook | Stargazers: 0 | Issues: 1 | Issues: 0

event_interoperability

Repository for experiments on interoperability of semantically annotated corpora for events in English

Language: Python | License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

GrofLex

A Dutch lexicon of abusive language

Stargazers: 0 | Issues: 1 | Issues: 0

hatespeechdata

Catalog of abusive language data

Stargazers: 0 | Issues: 0 | Issues: 0

LCL2023-Lab2

Lab2: Exploring Multi-Modal Neural Models and their applications

Stargazers: 0 | Issues: 0 | Issues: 0

multimodal-ml-music

List of academic resources on Multimodal ML for Music

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

news-please

news-please - an integrated web crawler and information extractor for news that just works.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

partisan-news2019

Dataset for partisan news detection

Stargazers: 0 | Issues: 0 | Issues: 0

probability-statistics-notebook

Probability and Statistics repository for Python code and coursework review

Stargazers: 0 | Issues: 0 | Issues: 0

spectral-probing

Spectral Probing (EMNLP 2022)

Stargazers: 0 | Issues: 0 | Issues: 0

sql-mysteries

Inspired by @veltman's command-line mystery, use SQL to research clues and find out whodunit!

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

stat_rethinking_2023

Statistical Rethinking Course for Jan-Mar 2023

License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

tlink_probing

Set of probing experiments using a multilingual language model (XLM-RoBERTa) for temporal relation classification between events in English, Italian, Spanish and French.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0
Language: HTML | Stargazers: 0 | Issues: 2 | Issues: 0

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0