GRAAL-Research / tcc

Text Complexity Computer (TCC) is a package to compute various complexity metrics on text

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PyPI - Python Version PyPI Status PyPI Status Downloads

Formatting Linting Tests pr welcome License: LGPL v3

Here is TextComplexityComputer.

TextComplexityComputer is a state-of-the-art library for assessing complexity of a text in French. Use it to assess the complexity of a text in French. it is compatible with the latest version of Scikit-Learn and Python >= 3.8.

Getting Started:

from text_complexity_computer import TextComplexityComputer

tcc = TextComplexityComputer()
print(tcc.get_metrics_scores("Alibaba et les 40 voleurs."))
print(tcc.compute("Alibaba et les 40 voleurs."))

Installation

  • Install the stable version of TextComplexityComputer:
pip install text_complexity_computer
  • Install the latest development version of TextComplexityComputer:
pip install -U git+https://github.com/GRAAL-Research/tcc.git@dev

Cite

Use the following to cite this package and our article;

@article{Primpied2022Quantifying,
	author = {Primpied, Vincent and Beauchemin, David and Khoury, Richard},
	journal = {Proceedings of the Canadian Conference on Artificial Intelligence},
	year = {2022},
	month = {may 27},
	note = {https://caiac.pubpub.org/pub/iaeeogod},
	publisher = {Canadian Artificial Intelligence Association (CAIAC)},
	title = {Quantifying {French} {Document} {Complexity} },
}

Contributing to TextComplexityComputer

We welcome user input, whether it is regarding bugs found in the library or feature propositions ! Make sure to have a look at our contributing guidelines for more details on this matter.

License

TextComplexityComputer is LGPLv3 licensed, as found in the LICENSE file.


About

Text Complexity Computer (TCC) is a package to compute various complexity metrics on text

License:GNU General Public License v3.0


Languages

Language:Python 100.0%