fitrialif / sotawhat

Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

sotawhat

This script runs using Python 3.

First, install the required packages. This script only requires nltk and PyEnchant.

$ pip3 install -r requirements.txt

If you run the error that the package punkt doesn't exist, download it by going into your Python environment and running:

$ python3

>>> import nltk
>>> nltk.download('punkt')

In MacOS, you can get the SSL error

[nltk_data] Error loading punkt: <urlopen error [SSL:
[nltk_data]     CERTIFICATE_VERIFY_FAILED] certificate verify failed:
[nltk_data]     unable to get local issuer certificate (_ssl.c:1045)>

this will be fixed by reinstalling certificates

$ /Applications/Python\ 3.x/Install\ Certificates.command

To query for a certain keyword, run:

$ python3 sotawhat.py "[keyword]" [number of results]

For example:

$ python3 sotawhat.py "perplexity" 10

If you don't specify the number of results, by default, the script returns 5 results. Each result contains the title of the paper with author and published date, a summary of the abstract, and link to the paper.

We've found that this script works well with keywords that are:

  • a model (e.g. transformer, wavenet, ...)
  • a dataset (e.g. wikitext, imagenet, ...)
  • a task (e.g. 'language model', 'machine translation', 'fuzzing', ...)
  • a metric (e.g. BLEU, perplexity, ...)
  • random stuff

About

Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday.


Languages

Language:Python 100.0%