MTA-PPKE Hungarian Language Technology Research Group (ppke-nlpg)

Location: Budapest

Home Page: http://nlpg.itk.ppke.hu

MTA-PPKE Hungarian Language Technology Research Group's repositories

purepos

PurePos is an open source hybrid morphological tagger.

Language: Java | License: LGPL-3.0 | Stargazers: 15 | Issues: 9 | Issues: 10

boilerplateResults

Results of boilerplate removal algorithms

Language: Python | Stargazers: 8 | Issues: 0 | Issues: 0

HunTag3

A sequential tagger for NLP using Maximum Entropy Learning and Hidden Markov Models

Language: Lex | License: LGPL-3.0 | Stargazers: 8 | Issues: 7 | Issues: 0

pywnxml

Python3 API for WordNet XML (Hungarian WordNet / BalkaNet / VisDic format)

Language: Python | License: GPL-2.0 | Stargazers: 5 | Issues: 8 | Issues: 0

whats-wrong-python

What's Wrong With My NLP? is a visualizer and graphical diff tool for Natural Language Processing problems. We are reimplementing this program in Python 3. For more information about the original program, see http://whatswrong.googlecode.com

Language: Python | License: GPL-3.0 | Stargazers: 5 | Issues: 6 | Issues: 11

manocska

Manócska: an integrated database of verbal argument frames

Language: Python | Stargazers: 4 | Issues: 8 | Issues: 0

emmorphpy

A wrapper, a lemmatizer, and a REST API implemented in Python for the emMorph (Humor) Hungarian morphological analyzer

Language: Python | License: LGPL-3.0 | Stargazers: 3 | Issues: 7 | Issues: 0

purepos-python3

PurePos rewritten in Python 3

Language: Python | License: LGPL-3.0 | Stargazers: 3 | Issues: 10 | Issues: 8

AraSum

Arabic Summarization Corpus

AnaGramma-Parser

A psycholinguistically motivated parsing model

Language: Python | License: LGPL-3.0 | Stargazers: 1 | Issues: 8 | Issues: 0

CleanPortalEval

Boilerplate removal test set for portals (more sites from the same domain)

Language: HTML | Stargazers: 1 | Issues: 0 | Issues: 0

commoncrawl-downloader

Simple Python command-line tools for retrieving a list of URLs and specific files in bulk

Language: Python | License: LGPL-3.0 | Stargazers: 1 | Issues: 8 | Issues: 0

Language: HTML | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

gut-besser-chunker

The program used in the paper 'Gut, Besser, Chunker – Selecting the best models for text chunking with voting' by Balázs Indig and István Endrédy

Language: Python | License: LGPL-3.0 | Stargazers: 1 | Issues: 9 | Issues: 0

less-is-more

The program used in the paper 'Less is More, More or Less... – Finding the Optimal Threshold for Lexicalisation in Chunking' by Balázs Indig

Language: Python | License: GPL-3.0 | Stargazers: 1 | Issues: 8 | Issues: 0

nom-or-not

Algorithm for case disambiguation

Language: Python | Stargazers: 1 | Issues: 2 | Issues: 0

purepospy

Python wrapper for PurePos

Language: Java | License: LGPL-3.0 | Stargazers: 1 | Issues: 8 | Issues: 0

SS05

The original SS05 algorithm from Hong Shen and Anoop Sarkar used in the paper 'Voting Between Multiple Data Representations for Text Chunking'

Language: Perl | License: LGPL-3.0 | Stargazers: 1 | Issues: 9 | Issues: 2

NYTK-NerKor-Cars-OntoNotesPP

A 1M+-token Hungarian named entity dataset with ~30 entity types derived from NYTK-NerKor

Stargazers: 0 | Issues: 4 | Issues: 0

Stargazers: 0 | Issues: 0 | Issues: 0

Language: Python | License: LGPL-3.0 | Stargazers: 0 | Issues: 8 | Issues: 0

Stargazers: 0 | Issues: 2 | Issues: 0

nom-or-what

The Nom-or-what algorithm, designed to disambiguate case endings on nouns, adjectives, numerals, etc. in Hungarian.

Language: Python | Stargazers: 0 | Issues: 7 | Issues: 0

postp

Data of the study on postpositions (PhD thesis, Noémi Ligeti-Nagy)

Stargazers: 0 | Issues: 2 | Issues: 0

vframe

A method for constraining possible verbal frames based on the preverb and the infinitival argument for Hungarian verbs

Language: Python | License: LGPL-3.0 | Stargazers: 0 | Issues: 7 | Issues: 0