hank110 / acl-anthology-structurizer

Home Page:http://www.aclweb.org/anthology

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ACL trend survey

Requirements

  • Python 2.7

  • Install scrapy

      $pip install scrapy
    

Run

Data crawling

  • Configure year and journal in crawler/crawler/settings.py. (Haven't tried crawling other proceedings/journals than ACL though)

      $ cd crawler
      $ scrapy crawl acl -o items.csv -t csv
      $ scrapy crawl acl -o items.json -t json
    
  • Be careful of running the code twice because the json file gets appended, rather than overwritten.

Calculate frequent authors

$ python count.py

Author

License

About

http://www.aclweb.org/anthology


Languages

Language:Python 100.0%