JnBrymn / BestTimeToUseStackExchange

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

AutoTaxonomyExtractionAndTagging

To start Solr:

cd solr_example_dir
java -jar -Dsolr.solr.home=<full_path_to_this_dir>/solr_home start.jar

To index documents:

python extractDocs.py "<full_path_to_stack_exchange_dump>/posts.xml" | curl -d @- http://localhost:8983/solr/update?commit=true -v -H "Content-Type:text/xml"

testtoken1 testtoken2 testtoken3 testtoken4 testtoken5 testtoken6 testtoken7 testtoken8

About


Languages

Language:XSLT 47.5%Language:JavaScript 40.7%Language:CSS 6.3%Language:Python 3.4%Language:HTML 2.2%