EleutherAI / pile-uspto

A script for collecting the USPTO Backgrounds dataset in a language modelling friendly format.

pile_uspto

To run, do:

git clone https://github.com/cfoster0/pile_uspto.git
cd pile_uspto
virtualenv env
. env/bin/activate
pip install -r requirements.txt
python main.py
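
The same steps can also be driven from Python, which can be convenient when the collection runs as part of a larger pipeline. The following is a minimal sketch, not part of the repository: `build_commands` and `run_all` are hypothetical helper names, the standard-library `venv` module is assumed as a stand-in for the `virtualenv` command, and the environment's interpreter is called directly instead of sourcing `env/bin/activate`. By default it only prints the commands.

```python
import subprocess
import sys
from pathlib import Path

# Clone URL taken from the instructions above.
REPO_URL = "https://github.com/cfoster0/pile_uspto.git"


def build_commands(workdir, python=sys.executable):
    """Return (argv, cwd) pairs mirroring the manual setup steps (hypothetical helper)."""
    workdir = Path(workdir)
    repo = workdir / "pile_uspto"
    # Virtual environments keep their executables in Scripts/ on Windows, bin/ elsewhere.
    env_bin = repo / "env" / ("Scripts" if sys.platform == "win32" else "bin")
    env_python = str(env_bin / "python")
    return [
        (["git", "clone", REPO_URL], workdir),
        # Assumption: stdlib `venv` replaces the `virtualenv` command.
        ([python, "-m", "venv", "env"], repo),
        # Using the environment's interpreter replaces `. env/bin/activate`.
        ([env_python, "-m", "pip", "install", "-r", "requirements.txt"], repo),
        ([env_python, "main.py"], repo),
    ]


def run_all(workdir, dry_run=True):
    """Print each step; execute it only when dry_run is False."""
    for argv, cwd in build_commands(workdir):
        print(f"[{cwd}] {' '.join(argv)}")
        if not dry_run:
            # check=True stops at the first failing step.
            subprocess.run(argv, cwd=cwd, check=True)


# Dry run: lists the commands without cloning or installing anything.
run_all(Path.cwd())
```

Calling `run_all(Path.cwd(), dry_run=False)` would execute the steps for real; this needs `git` on the PATH and network access.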

About

License: MIT License


Languages

Python: 100.0%