arnicas / text-gen-arxiv-papers

Code for searching ArXiv for text generation papers, with very manual Jekyll site creation.

text-gen-arxiv-papers

These are the raw files for the GitHub Pages site: https://arnicas.github.io/text-gen-arxiv-papers.

The code is kind of a nightmare, but it is being gradually cleaned up and checked in. Basically, I do most of the site building manually with pandas, since Jekyll is pretty bad at what I needed. It pretty much needs a giant refactor.

The file scrape.py has the search strings and saves a pickle of the latest data from ArXiv.
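The scrape step described above can be sketched roughly as follows. This is a minimal illustration, not the contents of scrape.py: the search strings, the `fetch_results` placeholder (which stands in for the actual ArXiv request), the record fields, and the output file name are all assumptions, and it uses the standard-library pickle module on plain dicts rather than pandas so that it runs on its own.

```python
import pickle
from datetime import date
from pathlib import Path

# Hypothetical search strings; the real ones live in scrape.py.
SEARCH_STRINGS = ["text generation", "story generation"]


def fetch_results(query):
    """Placeholder for the ArXiv request; returns a list of paper records."""
    # Assumption: each record is a dict with at least an id and a title.
    slug = query.replace(" ", "-")
    return [{"id": f"demo-{slug}", "title": f"Example paper on {query}", "query": query}]


def scrape(out_dir="."):
    """Run every search, drop duplicate ids, and pickle the combined records."""
    seen = {}
    for query in SEARCH_STRINGS:
        for record in fetch_results(query):
            seen.setdefault(record["id"], record)  # keep the first hit per paper
    # Hypothetical naming scheme: one dated pickle per run.
    out_path = Path(out_dir) / f"arxiv_{date.today().isoformat()}.pkl"
    with open(out_path, "wb") as handle:
        pickle.dump(list(seen.values()), handle)
    return out_path
```

Calling `scrape()` returns the path of the pickle it wrote, which is what a later build step would take as its input.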

The file build_pages.py takes the pickle as an argument and processes it. It depends on certain files and directories being in place, which are not documented yet. I'll try to document more and clean it up for re-use.
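As a rough idea of what "takes the pickle as an argument and processes it" could look like, here is a small sketch. It is not the actual build_pages.py: the function names, the `--out-dir` option, the `pages` directory, the record fields (`id`, `title`, `summary`), and the `layout: page` front matter are all hypothetical, and it again uses plain dict records instead of pandas to stay self-contained.

```python
import argparse
import pickle
from pathlib import Path


def render_page(record):
    """Return a Jekyll-style page: YAML front matter followed by a body."""
    title = record["title"].replace('"', "'")  # keep the quoted YAML value valid
    # Assumption: the site's theme provides a layout called "page".
    return f'---\nlayout: page\ntitle: "{title}"\n---\n\n{record.get("summary", "")}\n'


def build_pages(pickle_path, out_dir="pages"):
    """Load the pickled records and write one page file per record."""
    with open(pickle_path, "rb") as handle:
        records = pickle.load(handle)
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)  # create the output directory if missing
    written = []
    for record in records:
        path = out / f"{record['id']}.md"
        path.write_text(render_page(record), encoding="utf-8")
        written.append(path)
    return written


def main(argv=None):
    """Command-line entry point: the pickle path is the required argument."""
    parser = argparse.ArgumentParser(description="Build site pages from a pickle.")
    parser.add_argument("pickle_path", help="pickle produced by the scrape step")
    parser.add_argument("--out-dir", default="pages")
    args = parser.parse_args(argv)
    pages = build_pages(args.pickle_path, args.out_dir)
    print(f"wrote {len(pages)} pages")
```

For example, `main(["latest.pkl", "--out-dir", "pages"])` would write one Markdown file per pickled record into `pages`, where `latest.pkl` is a made-up file name. Only load pickles you produced yourself, since unpickling runs arbitrary code.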

Languages

CSS 45.2%, Python 43.5%, HTML 11.0%, Ruby 0.2%