Sławomir Dadas (sdadas)

sdadas

Geek Repo

Location:Warsaw

Github PK Tool:Github PK Tool

Sławomir Dadas's repositories

polish-nlp-resources

Pre-trained models and language resources for Natural Language Processing in Polish

warsaw-transport

A visualization of Warsaw public transport

Language:TypeScriptLicense:GPL-3.0Stargazers:88Issues:5Issues:2

polish-roberta

RoBERTa models for Polish

Language:PythonLicense:LGPL-3.0Stargazers:81Issues:10Issues:8

fsbrowser

Fast desktop client for Hadoop Distributed File System

Language:JavaLicense:GPL-3.0Stargazers:31Issues:5Issues:3

polish-sentence-evaluation

Evaluation of Sentence Representations in Polish

Language:PythonLicense:GPL-3.0Stargazers:20Issues:8Issues:0

spring2ts

Generate TypeScript REST client directly from Spring MVC application source

Language:JavaLicense:MITStargazers:8Issues:4Issues:1

commoncrawl-downloader

Application for downloading text data from Common Crawl

gitdmp

A tool for automatic export of commits from git repositories

Language:JavaStargazers:2Issues:4Issues:0

RankGPT

Is ChatGPT Good at Search? LLMs as Re-Ranking Agent

Language:PythonStargazers:1Issues:0Issues:0

scinote

A personal bibliography manager and paper recommendation engine

Language:JavaLicense:GPL-3.0Stargazers:1Issues:2Issues:0

vwsd

Code for SemEval 2023 Task 1: Visual Word Sense Disambiguation

Language:PythonStargazers:1Issues:0Issues:0

yast

Yet Another Sequence Tagging library

Language:PythonLicense:Apache-2.0Stargazers:1Issues:3Issues:0

boundary-aware-nested-ner

The Implementation of Boundary-aware Model for Nested Named Entity Recognition

Language:PythonStargazers:0Issues:2Issues:0

DiPS

NAACL 2019: Submodular optimization-based diverse paraphrasing and its effectiveness in data augmentation

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

elasticsearch-analysis-morfologik

Morfologik Polish Lemmatizer plugin for Elasticsearch

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

fake-smtp-server

A simple SMTP Server for Testing purposes. Emails are stored in an in-memory database and rendered in a Web UI

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:2Issues:0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:1Issues:0

LASER

Language-Agnostic SEntence Representations

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

nested-ner-2019-bert

Implementation of Nested Named Entity Recognition using BERT

Language:PythonLicense:GPL-3.0Stargazers:0Issues:2Issues:0

pawls

Software that makes labeling PDFs easy.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:JavaStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

splade

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

tevatron

Tevatron - A flexible toolkit for dense retrieval research and development.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

wiki-index

Simple full text indexing for Wikipedia

Language:JavaStargazers:0Issues:1Issues:0