patverga / Proteus

Million Book Project

Proteus

Or the Million Books Repository

  • Pharos: Named entity recognition and linking (jdalton)
  • Phokas: toktei to mbtei (dasmith)
  • Pontos: DjVu and other formats to toktei (dasmith)
  • Homer: Subproject containing all the indexing code Proteus needs. Uses Galago 3.6-SNAPSHOT.

Languages

  • Java 51.6%
  • JavaScript 18.7%
  • Scala 12.5%
  • Clojure 10.7%
  • Python 4.3%
  • CSS 1.5%
  • Shell 0.6%