Luca Di Liello's repositories
pytorch-apple-silicon-benchmarks
Performance of PyTorch on Apple Silicon
bleurt-pytorch
BLEURT implementation in PyTorch
semantic-loss-pytorch
PyPSDD porting to Python 3 + PyTorch equivalent tree construction.
compressed-dictionary
A dictionary which values are compressed to save memory.
sweep-line-algorithm-python
Python2 Implementation of the Sweep Line Algorithm
transformers-framework
SOTA training framework based on PyTorch Lightning and Transformers
answer-selection
New datasets for Answer Sentence Selection task
datasets_augmentation
Increment datasets size retrieving similar sentences from large sources
mrqa-lightning
MRQA test suite on PyTorch Lightning
natural-question-answering
Natural Question dataset adapted to be a Question-Answering benchmark.
bart-small
bart-small model release page
python_regex_generator
A C++ program to generate Python regex from a list of strings
asnq-challenging
ASNQ without trivial negative answers.
italian-faq-dataset
FAQs in italian collected from companies websites worldwide.
python-multiprocessing-generator
Easily process data from a generator in multicore and return another generator.
wqa-multi-sentence-inference
This repository contains code used for our Multi Sentence Inference NAACL'22 paper.
CCQA
CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training
datasets-news-please
Download Common Crawl data directly into an HuggingFace dataset.
lucadiliello
Profile description
lucadiliello.github.io
My personal website
news-please
news-please - an integrated web crawler and information extractor for news that just works
opus-dataset-parser
Parse OPUS parallel dataset to create multilingual parallel corpora ready to be used for NLP
wikiextractor
A tool for extracting plain text from Wikipedia dumps