WorldsEndless / gc-to-orgmode-scraper

Scrape LDS General Conference and convert to orgmode

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

gc-scraper

2018 April edition. A quick-and-dirty webscraper for LDS General Conference, using Pandoc to convert to orgmode files.

Prerequisites

You will need Leiningen 2.0.0 or above installed.

You will also need Pandoc which is available free and also through linux repos. Pandoc performs the format conversion.

Running

Right now this can only be run from the REPL, like:

gc-scraper.general-conference> (get-web-gc "my/slash/ending/output/directory/"

Improvements

  • [ ] Clean up handling of references
  • [ ] More robust output-dir handling
  • [ ] Commandline Operability
  • [ ] Allow specifying of different conference years or urls

License

Copyright © 2018 Tory S. Anderson with the GPL v3 License. Use and modification is fine; attribution appreciated.

About

Scrape LDS General Conference and convert to orgmode


Languages

Language:Clojure 98.9%Language:Lua 1.1%