ArneD (ArneDefauw)

ArneD's repositories

BERT_doc_classification

Document classification with BERT

Language: Python | Stargazers: 4 | Issues: 0

bert_document_classification

Architectures and pre-trained models for long document classification.

Language: Python | Stargazers: 0 | Issues: 0

BERT_NER

NER with BERT

Language: Python | Stargazers: 0 | Issues: 0

cache-conda-envs

Speed up your builds by caching Anaconda environments on GitHub Actions

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

CVDD-PyTorch

A PyTorch implementation of Context Vector Data Description (CVDD), a method for Anomaly Detection on text.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

Demo

Demo repo for tutorial articles on Opensource.com

Stargazers: 0 | Issues: 0

diffgram

Training Data (Data Labeling, Annotation, Catalog, Workflow) for all Data Types (Image, Video, 3D, Text, Geo, Audio, more) at scale.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

dkpro-cassis

UIMA CAS processing library written in Python

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0

DPR

Dense Passage Retriever (DPR) is a set of tools and models for open-domain Q&A tasks.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

fake_news_semantics

Code for the paper "Do Sentence Interactions Matter? Leveraging Sentence Level Representations for Fake News Classification"

Stargazers: 0 | Issues: 0

FakeNewsCorpusSpanish

The Spanish Fake News Corpus contains a collection of 971 news items, of which 491 are real and 480 are fake. The corpus covers news from 9 different topics: Science, Sport, Economy, Education, Entertainment, Politics, Health, Security, and Society.

License: CC-BY-4.0 | Stargazers: 0 | Issues: 0

files2rouge

Calculating ROUGE score between two files (line-by-line)

Language: Perl | License: MIT | Stargazers: 0 | Issues: 0

ganbert-pytorch

Enhancing BERT training with semi-supervised generative adversarial networks in PyTorch/HuggingFace

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Legal-Docs-Large-MLTC

Multi-label text classification for legal documents. Works on monolingual and multilingual parallel data.

Stargazers: 0 | Issues: 0

lmtc-eurlex57k

Large-Scale Multi-Label Text Classification on EU Legislation

License: Apache-2.0 | Stargazers: 0 | Issues: 0

mlm-scoring

Python library & examples for Masked Language Model Scoring (ACL 2020)

License: Apache-2.0 | Stargazers: 0 | Issues: 0

multi-eurlex

MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer

Stargazers: 0 | Issues: 0

multilingual-fake-news

The code related to the paper

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Multimodal-Toolkit

Multimodal model for text and tabular data, with HuggingFace transformers as the building block for text data

License: MIT | Stargazers: 0 | Issues: 0

neural-document-aligner

Document aligner that uses neural technologies to search for matches across bilingual documents

License: GPL-3.0 | Stargazers: 0 | Issues: 0
License: NOASSERTION | Stargazers: 0 | Issues: 0

question_generator

An NLP system for generating reading comprehension questions

License: MIT | Stargazers: 0 | Issues: 0
Stargazers: 0 | Issues: 0

spatialdata

An open and universal framework for processing spatial omics data

License: BSD-3-Clause | Stargazers: 0 | Issues: 0

TopicalChange

Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.

Stargazers: 0 | Issues: 0

trafilatura

Web scraping library and command-line tool for text discovery and extraction (main content, metadata, comments)

License: GPL-3.0 | Stargazers: 0 | Issues: 0

Voice-Privacy-Challenge-2020

Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf

Stargazers: 0 | Issues: 0

word2word

Easy-to-use word-to-word translations for 3,564 language pairs.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

wordfreq

Access a database of word frequencies, in various natural languages.

License: MIT | Stargazers: 0 | Issues: 0