Grisha (itisgrisha)

Organizations
fewshotguys

Grisha's starred repositories

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

nvm

Node Version Manager - POSIX-compliant bash script to manage multiple active node.js versions

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language: Python | License: MIT | Stargazers: 29916 | Issues: 426 | Issues: 4173

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

goreplay

GoReplay is an open-source tool for capturing and replaying live HTTP traffic into a test environment in order to continuously test your system with real data. It can be used to increase confidence in code deployments, configuration changes and infrastructure changes.

Language: Go | License: NOASSERTION | Stargazers: 18466 | Issues: 469 | Issues: 742

twemproxy

A fast, light-weight proxy for memcached and redis

Language: C | License: Apache-2.0 | Stargazers: 12100 | Issues: 812 | Issues: 435

mlops-zoomcamp

Free MLOps course from DataTalks.Club

Language: Jupyter Notebook | Stargazers: 10775 | Issues: 180 | Issues: 92

sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

Language: C++ | License: Apache-2.0 | Stargazers: 9894 | Issues: 124 | Issues: 734

stat_rethinking_2022

Statistical Rethinking course winter 2022

Data-science

Collection of useful data science topics along with articles, videos, and code

Language: Jupyter Notebook | Stargazers: 4001 | Issues: 141 | Issues: 8

MT-Reading-List

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

Language: TeX | License: BSD-3-Clause | Stargazers: 2418 | Issues: 166 | Issues: 23

userver

Production-ready C++ Asynchronous Framework with rich functionality

Language: C++ | License: Apache-2.0 | Stargazers: 2328 | Issues: 50 | Issues: 239

pgx_scripts

A collection of useful little scripts for database analysis and administration, created by our team at PostgreSQL Experts.

Language: Shell | License: NOASSERTION | Stargazers: 1360 | Issues: 112 | Issues: 8

designing-distributed-systems-labs

Labs for the Designing Distributed Systems book.

badwolf

A Vim color scheme.

Language: Vim Script | License: MIT | Stargazers: 1245 | Issues: 40 | Issues: 23

sacrebleu

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

Language: Python | License: Apache-2.0 | Stargazers: 1021 | Issues: 19 | Issues: 155

ADBench

Official implementation of "ADBench: Anomaly Detection Benchmark", NeurIPS 2022.

Language: Python | License: BSD-2-Clause | Stargazers: 825 | Issues: 16 | Issues: 19

pragmatic_segmenter

Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.

Language: Ruby | License: MIT | Stargazers: 539 | Issues: 16 | Issues: 59

aiohttp-cors

CORS support for aiohttp

Language: Python | License: Apache-2.0 | Stargazers: 203 | Issues: 14 | Issues: 54

ReadabiliPy

A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-python mode.

Language: HTML | License: MIT | Stargazers: 203 | Issues: 15 | Issues: 48

web2text

Source code for the paper "Web2Text: Deep Structured Boilerplate Removal", full paper @ ECIR'18

Language: HTML | License: MIT | Stargazers: 166 | Issues: 13 | Issues: 16

WikiTableQuestions

A dataset of complex questions on semi-structured Wikipedia tables

Language: HTML | License: CC-BY-SA-4.0 | Stargazers: 141 | Issues: 10 | Issues: 2

good-translation-wrong-in-context

This is a repository with the data and code for the ACL 2019 paper "When a Good Translation is Wrong in Context: ..." and the EMNLP 2019 paper "Context-Aware Monolingual Repair for Neural Machine Translation"

boilernet

Boilerplate Removal using Deep Learning

Language: Python | License: MIT | Stargazers: 77 | Issues: 3 | Issues: 14

contextual-mt

A repository with the code related to experiments around context-aware machine translation

awesome-AR

A collection of augmented reality resources for XR enthusiasts 😎

infer-pytorch-pyspark

Coupling PySpark with PyTorch Models

License: MIT | Stargazers: 13 | Issues: 2 | Issues: 0

BUG

A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021

Language: Python | License: MIT | Stargazers: 12 | Issues: 1 | Issues: 2

Language: JavaScript | Stargazers: 7 | Issues: 5 | Issues: 0

DELA-Project

DELA stands for Document-level machinE transLation evAluation.

Stargazers: 4 | Issues: 0 | Issues: 0