koshinryuu / Snowball

Snowball: Extracting Relations from Large Plain-Text Collections

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Snowball: Extracting Relations from Large Plain-Text Collections

This is my own implementation of the the Snowball system to bootstrap relationship instances. You can find more details here:

A sample file containing sentences where the named-entities are already tagged can be downloaded, which has 1 million sentences taken from the New York Times articles part of the English Gigaword Collection.

NOTE: look at the desription of BREDS to understand how to give a tagged document collection and seeds to setup the bootstrapping of relationship instances with Snowball, both systems have a similar setup.

About

Snowball: Extracting Relations from Large Plain-Text Collections

License:GNU General Public License v3.0


Languages

Language:Python 100.0%