oxcar / wiki-links-serde

Serialization for wiki-links data.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Building

To build an uberjar:

$ ./sbt11 package-dist

The uberjar (with all dependencies included) will be in target/scala-2.9.2/thrifting-stage-1-for-wiki-link_2.9.2-0.1-SNAPSHOT-one-jar.jar.

Viewing serialized files

$ ./sbt11
... Downloading deps ...
> console

scala> import com.github.brianmartin.wiki.Viewer
import com.github.brianmartin.wiki.Viewer

scala> Viewer(new
java.io.File("/Users/brian/wrk-umass/wiki-links-thrift/thrift-sample/000000003.thrift"))
SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder".
SLF4J: Defaulting to no-operation (NOP) logger implementation
SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further
details.
res0: String = 
"docID: 3
url:
ftp://Autoidread:read@ftp.rrc.ru/!!!Motorola/Motorola%20MSP%20SCHOOL/Moscow%20Oct%202010/student/091102_MCD_Class.pdf
mention.head:
Mention(http://en.wikipedia.org/wiki/Bootstrapping,bootstrapping,598297)
rareWords.head: RareWord(reducing,518133)
raw length: 2297374
"

About

Serialization for wiki-links data.


Languages

Language:Scala 92.8%Language:Thrift 7.0%Language:Shell 0.2%