There are 93 repositories under linguistics topic.
😀😄😂😭 A curated list of Sentiment Analysis methods, implementations and misc. 😥😟😱😤
LexNLP by LexPredict
微信公众号语料库
Monolingual wordlists with pronunciation information in IPA
Rime Cantonese input schema | 粵語拼音輸入方案
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
A curated list of anything remotely related to linguistics
Cantonese Linguistics and NLP
Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.
A web-based engine for creating and annotating textual corpora
Tweets when words are published for the first time in the NYT
Chooses correct Korean particle morphs for arbitrary words.
Crawler for linguistic corpora
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.
Read, write, and manipulate Praat TextGrid files with Python
FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
Syntax tree generator for linguistic research
A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.
Unannotated Spanish 3 Billion Words Corpora
Libvoikko and essential linguistic resources
Nüshu script enters the Noto Sans font family...