Frix-x / FrenchLefffLemmatizer

A French Lemmatizer in Python based on LEFFF, a large-scale morphological and syntactic lexicon for French.

Introduction

This is a French lemmatizer in Python based on the LEFFF (Lexique des Formes Fléchies du Français / Lexicon of French Inflected Forms), a large-scale morphological and syntactic lexicon for French.

Main reference:

[Sagot,2010] Sagot, B. (2010). The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French. In 7th international conference on Language Resources and Evaluation (LREC 2010). Retrieved from https://hal.archives-ouvertes.fr/file/index/docid/521242/filename/lrec10lefff.pdf

Benoît Sagot's webpage about LEFFF:
http://alpage.inria.fr/~sagot/lefff-en.html

More precisely, we use only the morphological lexicon (the .mlex file), which has a simple CSV-like format: 4 fields per line, separated by tabs ('\t').
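To make the 4-field, tab-separated layout concrete, here is a minimal sketch of how such a file could be read into a lookup table. It is not the lemmatizer's actual code. The field order (inflected form, part-of-speech tag, lemma, morphological features) is an assumption, the sample rows are made up for illustration, and `load_mlex` and `lemmatize` are hypothetical helper names.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows, made up for illustration (not copied from the real file).
# Assumed field order: inflected form, POS tag, lemma, morphological features.
SAMPLE_MLEX = (
    "chevaux\tnc\tcheval\tmp\n"
    "abaissa\tv\tabaisser\tJ3s\n"
    "belles\tadj\tbeau\tfp\n"
)


def load_mlex(stream):
    """Build a {inflected form: {POS tag: lemma}} table from tab-separated rows."""
    table = defaultdict(dict)
    # QUOTE_NONE: treat quote characters as ordinary text, split on tabs only.
    reader = csv.reader(stream, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 4:
            continue  # skip blank or malformed lines
        form, pos, lemma, _features = row
        table[form][pos] = lemma
    return dict(table)


def lemmatize(table, word, pos):
    """Return the lemma for (word, pos), or the word itself if it is unknown."""
    return table.get(word, {}).get(pos, word)


table = load_mlex(io.StringIO(SAMPLE_MLEX))
print(lemmatize(table, "chevaux", "nc"))
print(lemmatize(table, "belles", "adj"))
```

With a real lexicon, the same function could be given a file object opened with `open(path, encoding="utf-8", newline="")`, where `path` points to wherever the .mlex file was downloaded. If the real file orders its fields differently, adjust the unpacking line.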

LEFFF download hyperlink

Tagset format FRMG - from the ALPAGE project since 2004
Tagset

License

Copyright (C) 2017 Claude Coulombe

Licensed under the Apache License, Version 2.0 (the 'License'); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an 'AS IS' BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
