Language Technology at the University of Helsinki (Helsinki-NLP)

Language Technology at the University of Helsinki

Helsinki-NLP

Geek Repo

Projects and resources developed in the Language Technology Research Group at the University of Helsinki.

Location:Helsinki, Finland

Home Page:https://blogs.helsinki.fi/language-technology/

Twitter:@HelsinkiNLP

Github PK Tool:Github PK Tool

Language Technology at the University of Helsinki's repositories

HBMP

Sentence Embeddings in NLI with Iterative Refinement Encoders

Language:PythonLicense:MITStargazers:77Issues:6Issues:9

XED

XED multilingual emotion datasets

Language:Jupyter NotebookStargazers:54Issues:9Issues:5

UkrainianLT

A collection of links to Ukrainian language tools

Language:PerlLicense:LGPL-3.0Stargazers:14Issues:3Issues:1

FoTraNMT

Open Source Neural Machine Translation in PyTorch

Language:PythonLicense:MITStargazers:13Issues:10Issues:2

OPUS-MT-testsets

benchmarks for evaluating MT models

Language:SmalltalkLicense:NOASSERTIONStargazers:9Issues:3Issues:14
Language:PerlLicense:MITStargazers:4Issues:3Issues:0

americasnlp2021-st

AmericasNLP 2021 shared task

Language:JavaScriptStargazers:3Issues:0Issues:0
Language:HTMLLicense:GPL-3.0Stargazers:3Issues:3Issues:1
Language:PythonStargazers:2Issues:4Issues:0

murreviikko

Dialectologically annotated and normalized dataset of dialectal Finnish tweets

Language:PythonStargazers:1Issues:2Issues:0

ndc-aligned

Word-aligned version of the Norwegian Dialect Corpus

Language:PythonStargazers:1Issues:2Issues:0

OpusFilter-hub

A hub of OpusFilter configurations

Language:PythonStargazers:1Issues:6Issues:0

SELF-FEIL

Emotion Lexicons for Finnish

Stargazers:1Issues:0Issues:0

uralicNLP

An NLP library for small Uralic languages such as Skolt Sami, Moksha and so on

Language:PythonLicense:NOASSERTIONStargazers:1Issues:3Issues:0

americasnlp2023-st

AmericasNLP 2023 shared task (Helsinki fork)

Language:JavaScriptStargazers:0Issues:0Issues:0

building-nlp-apps-notebooks

Python notebook demos for the Building NLP Applications course

Language:Jupyter NotebookStargazers:0Issues:3Issues:0

controlled_simplification_ru

A project on controlled Russian text simplification.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

dial_align

Character alignment for normalized dialect corpora

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:2Issues:0

murre

The amazing 🐕will normalize non-standard Finnish/Swedish and dialectalize standard Finnish!

License:NOASSERTIONStargazers:0Issues:0Issues:0

OPUS-MT-bot

Translation Bot between Ukrainian and Czech.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

OPUS-MT-devsets

development data for OPUS-MT

Stargazers:0Issues:3Issues:0

OPUS-MT-map

A map of available translation models

Language:PHPLicense:MITStargazers:0Issues:3Issues:0

RuAdapt

A Parallel Russian-Simple Russian Dataset

License:MITStargazers:0Issues:2Issues:0

syntaxmaker

The NLG tool for Finnish

Language:PythonLicense:Apache-2.0Stargazers:0Issues:4Issues:0

wikitextprocessor

Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. For data extraction, bulk syntax checking, error detection, and offline formatting.

Language:LuaLicense:NOASSERTIONStargazers:0Issues:2Issues:0

wiktextract

Wiktionary dump file parser and multilingual data extractor

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

yasa

yasa is a program that aligns two translations of a text sentence by sentence in order to produce a bi-text

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0