Julien Tourille's repositories
download_oscar
Downloading all files of a language from the OSCAR (Open Super-large Crawled Aggregated coRpus)
Language:PythonMIT000
gpt-2-simple
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
jtourille.github.io
Personal website
oscar-corpus-downloader
Simple tool to download the OSCAR corpus.
pubmed_parser
:clipboard: A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset
quickumls-simstring
SimString
text-generation-inference
Large Language Model Text Generation Inference