Vladimir Gurevich's repositories

yandex-practicum

tasks and projects from the data science course by Yandex.Practicum

Language:HTMLStargazers:24Issues:3Issues:0

jupyter-notebook-viewer

chrome extension for viewing Jupyter Notebooks in the browser without Jupyter Server

Language:JavaScriptLicense:MITStargazers:23Issues:3Issues:4

AnkiTools4j

anki decks creation in Java

Language:HTMLStargazers:6Issues:2Issues:0

deep_learning_school

tasks and projects from the deep learning school by MIPT

Language:Jupyter NotebookStargazers:3Issues:2Issues:0

hebrew_summarizer

finetuning experiments on summarization tasks for Hebrew

Language:PythonStargazers:2Issues:1Issues:0

wav2vec2-hebrew

Speech Recognition for Hebrew (using wav2vec2 models)

distiller

knowledge distillations for bert (classification, token classification models)

Language:PythonStargazers:1Issues:5Issues:0

news_scrapers

This repository contains scripts for scraping news from different sources

Language:PythonStargazers:1Issues:1Issues:0

abydos

Abydos NLP/IR library for Python [imvladikon] made some changes

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

annotations_deduplications

scripts to deduplicate annotations and to refine NER spans or to analyze the differences

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:2Issues:0

bm25_vectorizer

sklearn compatible bm25 vectorizers

Language:PythonStargazers:0Issues:2Issues:0

campus-dl

A simple tool to download video lectures from campus.gov.il (based on edx-dl)

Language:HTMLLicense:LGPL-3.0Stargazers:0Issues:1Issues:0

cdatasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble [imvladikon] added cython implementations

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

deduplicator

Simple entity deduplication package

Language:PythonStargazers:0Issues:1Issues:0

distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Stargazers:0Issues:2Issues:0

indonesian_nlp_experiments

some experiments in Indonesian NLP (information extraction from the courts reports)

Language:PythonStargazers:0Issues:1Issues:0

pysubs3

A Python library for editing subtitle files (fork of pysubs2 with changes)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:2Issues:8

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

seqeval

A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

spacy-trankit

💥 Trankit models directly in spaCy💥

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

string-embed

😆 string embed for fast edit distance computation, codes for [Convolutional Embedding for Edit Distance (SIGIR 20)].

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

telegram-bot-hebrew

telegram (spring boot, java) with some language services for hebrew (translation, inflection)

Language:JavaLicense:MITStargazers:0Issues:2Issues:0

wikitalk_parser

Fetching and parsing Wikipedia Talks

Language:PythonStargazers:0Issues:1Issues:0

ydata

YDATA school assignments

Language:Jupyter NotebookStargazers:0Issues:2Issues:0