dmafanasyev / rulexicon

Russian Lexicons for Text Analysis

Home Page:https://dmafanasyev.github.io/rulexicon/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

rulexicon

Project Status: Active - The project has reached a stable, usable state and is being actively developed. Lifecycle: stable

An R data package containing a collection of Russian lexicons, polarity tables, stop words lists, lemmatization tables and etc. for text analysis.

Naming convention

The naming convention based on the idea of lexicon package, and help to categorize the data types.

Prefix

Description

key_

A data.frame with a lookup and return value

hash_

A keyed data.table hash table

hash_lemmas_

A lemmatization hash table

hash_sentiment_

A sentiment polarity hash table

profanity_

A profane words vector

sw_

A stopword vector

freq_

A data.table of terms with frequencies

\_en An English version of dictionary

Installation

You can install this package from GitHub with:

# install.packages("devtools")
devtools::install_github("dmafanasyev/rulexicon")