Sławomir Dadas's repositories
polish-nlp-resources
Pre-trained models and language resources for Natural Language Processing in Polish
warsaw-transport
A visualization of Warsaw public transport
polish-roberta
RoBERTa models for Polish
polish-sentence-evaluation
Evaluation of Sentence Representations in Polish
commoncrawl-downloader
Application for downloading text data from Common Crawl
boundary-aware-nested-ner
Implementation of a boundary-aware model for nested named entity recognition
DiPS
NAACL 2019: Submodular optimization-based diverse paraphrasing and its effectiveness in data augmentation
elasticsearch-analysis-morfologik
Morfologik Polish Lemmatizer plugin for Elasticsearch
fake-smtp-server
A simple SMTP server for testing purposes. Emails are stored in an in-memory database and rendered in a web UI
label-studio
Label Studio is a multi-type data labeling and annotation tool with a standardized output format
LASER
Language-Agnostic SEntence Representations
Megatron-DeepSpeed
Ongoing research on training transformer language models at scale, including BERT & GPT-2
nested-ner-2019-bert
Implementation of Nested Named Entity Recognition using BERT
pawls
Software that makes labeling PDFs easy.
splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
tevatron
Tevatron - A flexible toolkit for dense retrieval research and development.
wiki-index
Simple full text indexing for Wikipedia