gi11es / articlequality

A library for performing automatic detection of assessment classes of Wikipedia article text.

Home Page:http://pythonhosted.org/wikiclass

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Wikipedia article quality classification

This library provides a set of utilities for performing automatic detection of assessment classes of Wikipedia articles. For more information, see the full documentation at http://pythonhosted.org/wikiclass.

Compatible with Python 3.x only. Sorry.

Basic usage

>>> import articlequality
>>> from revscoring import Model
>>>
>>> scorer_model = Model.load(open("models/enwiki.nettrom_wp10.gradient_boosting.model", "rb"))
>>>
>>> text = "I am the text of a page.  I have a <ref>word</ref>"
>>> articlequality.score(scorer_model, text)
{'prediction': 'stub',
 'probability': {'stub': 0.27156163795807853,
                 'b': 0.14707452309674252,
                 'fa': 0.16844898943510833,
                 'c': 0.057668704007171959,
                 'ga': 0.21617801281707663,
                 'start': 0.13906813268582238}}

Authors

About

A library for performing automatic detection of assessment classes of Wikipedia article text.

http://pythonhosted.org/wikiclass

License:MIT License


Languages

Language:Python 83.3%Language:Makefile 16.7%