Beast code in Giters

The Spanish Fake News Corpus contains a collection of 971 news divided into 491 real news and 480 fake news. The corpus covers news from 9 different topics: Science, Sport, Economy, Education, Entertainment, Politics, Health, Security, and Society

CC-BY-4.0000

files2rouge

Calculating ROUGE score between two files (line-by-line)

Language:PerlMIT000

ganbert-pytorch

Enhancing the BERT training with Semi-supervised Generative Adversarial Networks in Pytorch/HuggingFace

Apache-2.0000

Legal-Docs-Large-MLTC

Multi Label Text Classification for Legal documents. Work on mono-lingual and multilingual parallel data

000

lmtc-eurlex57k

Large-Scale Multi-Label Text Classification on EU Legislation

Apache-2.0000

mlm-scoring

Python library & examples for Masked Language Model Scoring (ACL 2020)

Apache-2.0000

multi-eurlex

MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer

000

multilingual-fake-news

The code related to the paper

Apache-2.0000

Multimodal-Toolkit

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

MIT000

neural-document-aligner

Document aligner which uses neural technologies to search matches across bilingual documents

GPL-3.0000

Nimbus

NOASSERTION000

question_generator

An NLP system for generating reading comprehension questions

MIT000

quick-tips

000

spatialdata

An open and universal framework for processing spatial omics data

BSD-3-Clause000

TopicalChange

Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.

000

trafilatura

Web scraping library and command-line tool for text discovery and extraction (main content, metadata, comments)

GPL-3.0000

Voice-Privacy-Challenge-2020

Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf

000

word2word

Easy-to-use word-to-word translations for 3,564 language pairs.

Apache-2.0000

wordfreq

Access a database of word frequencies, in various natural languages.

MIT000

ArneDefauw

ArneD's repositories

BERT_doc_classification

bert_document_classification

BERT_NER

cache-conda-envs

CVDD-PyTorch

Demo

diffgram

dkpro-cassis

doc_classification_tfidf

DPR

fake_news_semantics

FakeNewsCorpusSpanish

files2rouge

ganbert-pytorch

Legal-Docs-Large-MLTC

lmtc-eurlex57k

mlm-scoring

multi-eurlex

multilingual-fake-news

Multimodal-Toolkit

neural-document-aligner

Nimbus

question_generator

quick-tips

spatialdata

TopicalChange

trafilatura

Voice-Privacy-Challenge-2020

word2word

wordfreq