Xavier Tannier (xtannier)

xtannier

Geek Repo

Company:Sorbonne Université, Inserm, Limics, Polytech Sorbonne

Location:Paris, France

Home Page:http://xavier.tannier.free.fr/

Github PK Tool:Github PK Tool

Xavier Tannier's repositories

WebAnnotator

WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/firefox/addon/webannotator/), allowing annotation of both offline and inline pages. The HTML rendering is fully preserved and all annotations consist in new HTML spans with specific styles. WebAnnotator provides an easy and general-purpose framework and is made available under CeCILL free license (close to GNU GPL — see the license text), so that use and further contributions are made simple. All parts of an HTML document can be annotated: text, images, videos, tables, menus, etc. The annotations are created by simply selecting a part of the document and clicking on the relevant type and subtypes. The annotated elements are then highlighted in a specific color. Annotation schemas can be defined by the user by creating a simple DTD representing the types and subtypes that must be highlighted. Finally, annotations can be saved (HTML with highlighted parts of documents) or exported (in a machine-readable format).

DCTFinder

Extract title and creation time from web page.

Language:JavaLicense:NOASSERTIONStargazers:2Issues:0Issues:0

EZAnnot

Tool for fast concept and rule-based extraction for dummies.

Language:PythonLicense:Apache-2.0Stargazers:2Issues:1Issues:0

hyperopt-sklearn

Hyper-parameter optimization for sklearn

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

kea

A tokenizer for French

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

NCRFpp

NCRF++, an Open-source Neural Sequence Labeling Toolkit. It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components. (code for COLING/ACL 2018 paper)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

nlstruct

Natural language structuring library

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PyRATA

"Python Rule-based feAture sTructure Analysis" or "Python Rule-bAsed Text Analysis"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

simpletransformers

Transformers made simple with training, evaluation, and prediction possible with one line each. Currently supports Sequence Classification (binary, multiclass, multilabel, sentence pair), Token Classification (NER), Question Answering, Language Modeling, Regression, Conversational AI, and Multi-Modal tasks. Built on top of the Hugging Face Transformer library.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

term-extractor

Extraction de termes

Stargazers:0Issues:0Issues:0

yaset

Yet Another SEquence Tagger

Language:PythonStargazers:0Issues:0Issues:0