CLARIN.SI (clarinsi)

CLARIN.SI

clarinsi

Organization data from Github https://github.com/clarinsi

Slovenian research infrastructure for language resources and technologies

Location:Ljubljana, Slovenia

Home Page:https://www.clarin.si/

GitHub:@clarinsi

Twitter:@ClarinSlovenia

CLARIN.SI's repositories

classla

CLASSLA Fork of the Official Stanford NLP Python Library for Many Human Languages

Language:PythonLicense:NOASSERTIONStargazers:40Issues:4Issues:45

mte-msd

MULTEXT-East morphosyntactic specifications

babushka-bench

Benchmarking NLP tools on Slovene, Croatian and Serbian

parlaspeech

Code for bootstrapping ASR datasets from parliamentary recordings and transcripts

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5Issues:4Issues:1

reldi-tokeniser

A two-mode (standard, nonstandard) tokeniser for South Slavic languages

Language:PythonLicense:Apache-2.0Stargazers:5Issues:4Issues:3
Language:PythonLicense:Apache-2.0Stargazers:4Issues:7Issues:0

benchich

BENCHić - the benchmark for Bosnian, Croatian, Montenegrin, Serbian (and friends)

dialect-copa

Data for the DIALECT-COPA unshared task of dialectal causal common-sense reasoning

Slovenian-Language-Technologies-Overview

An ever-expanding overview of the knowledge on large language models (LLMs), speech technologies, and other NLP technologies for Slovenian language.

Stargazers:2Issues:0Issues:0

TEI-schema

Recommended TEI schema for CLARIN.SI resources, cf. also https://clarinsi.github.io/TEI-schema/

Language:XSLTStargazers:2Issues:4Issues:0
Language:PythonLicense:MITStargazers:1Issues:6Issues:2

drevesnik

Web portal for searching and displaying syntacically annotated corpora

Language:JavaScriptStargazers:1Issues:5Issues:0

slobench-eval-docker

Repository for SloBench evaluation docker images

Language:PerlStargazers:1Issues:2Issues:0

Slovene_normalizator

Slovene text normalization tool

Language:PythonLicense:Apache-2.0Stargazers:1Issues:6Issues:0

clarin-dspace

LINDAT/CLARIN digital repository based on DSpace

Language:JavaLicense:BSD-3-ClauseStargazers:0Issues:5Issues:41
Language:TypeScriptLicense:Apache-2.0Stargazers:0Issues:4Issues:0

slovene_g2p

A converter that converts Slovene words to their IPA and/or SAMPA transcriptions.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:4Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:6Issues:0

classla-training

Training scripts for the CLASSLA pipeline

Language:PythonLicense:Apache-2.0Stargazers:0Issues:6Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:4Issues:0
Language:PythonStargazers:0Issues:4Issues:0

hbs_features

Tool for extracting linguistic features with highest (known) variation among the HBS standards

Language:PythonStargazers:0Issues:3Issues:0

mezzanine_resources

Repo for tracking resources for the Mezzanine project

Stargazers:0Issues:3Issues:0

parlasent_analysis

Code for ParlaSent research note

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:0Issues:3Issues:0
Language:ElixirStargazers:0Issues:0Issues:0

rsdo_gos

Software for the GOS corpus of spoken Slovenian

Language:C#Stargazers:0Issues:5Issues:0

swell-editor

Editor for normalising learner texts (error annotation and tagging.)

Language:TypeScriptLicense:MITStargazers:0Issues:2Issues:0
Language:PythonLicense:MITStargazers:0Issues:6Issues:0