Aleksandr Chuklin (varepsilon)

Company: @google

Location: Switzerland

Home Page: https://twitter.com/varphi

Aleksandr Chuklin's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 67588 | Issues: 559 | Issues: 710

langchain

⚡ Building applications with LLMs through composability ⚡

Language: Python | License: MIT | Stargazers: 55829 | Issues: 537 | Issues: 2889

Prompt-Engineering-Guide

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

powerlevel10k

A Zsh theme

Language: Shell | License: MIT | Stargazers: 45612 | Issues: 181 | Issues: 2479

iodine

Official git repo for iodine dns tunnel

YaLM-100B

Pretrained language model with 100B parameters

Language: Python | License: Apache-2.0 | Stargazers: 3733 | Issues: 48 | Issues: 28

SDV

Synthetic data generation for tabular data

Language: Python | License: NOASSERTION | Stargazers: 2303 | Issues: 43 | Issues: 1296

portfolio

Track and evaluate the performance of your investment portfolio across stocks, cryptocurrencies, and other assets.

Language: Java | License: EPL-1.0 | Stargazers: 2287 | Issues: 74 | Issues: 1727

pyserini

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Language: Python | License: Apache-2.0 | Stargazers: 1637 | Issues: 18 | Issues: 542

bleurt

BLEURT is a metric for Natural Language Generation based on transfer learning.

Language: Python | License: Apache-2.0 | Stargazers: 685 | Issues: 13 | Issues: 51

aclpubcheck

Tools for checking ACL paper submissions

Language: Python | License: MIT | Stargazers: 554 | Issues: 5 | Issues: 45

ua-gec

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

Language: Macaulay2 | License: CC-BY-4.0 | Stargazers: 255 | Issues: 13 | Issues: 6

PIE

Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models for Local Sequence Transduction": www.aclweb.org/anthology/D19-1435.pdf (EMNLP-IJCNLP 2019)

Language: Macaulay2 | License: MIT | Stargazers: 226 | Issues: 9 | Issues: 24

C4_200M-synthetic-dataset-for-grammatical-error-correction

This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the dataset are described in more detail by Stahlberg and Kumar (2021) (https://www.aclweb.org/anthology/2021.bea-1.4/)

Language: Python | License: CC-BY-4.0 | Stargazers: 152 | Issues: 11 | Issues: 7

m2scorer

MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.

Language: Python | License: GPL-2.0 | Stargazers: 146 | Issues: 4 | Issues: 7

e2e-metrics

E2E NLG Challenge Evaluation metrics

Language: Python | License: NOASSERTION | Stargazers: 90 | Issues: 5 | Issues: 3

UNION

UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation

yandex-tank

Technical fork. All issues, requests etc. should be done in yandex/yandex-tank

Language: Python | License: LGPL-2.1 | Stargazers: 48 | Issues: 4 | Issues: 0

X-MAML

Code base for "Zero-Shot Cross-Lingual Transfer with Meta Learning" paper

FairRecSys

[Official Codes] Experiments on Generalizability of User-Oriented Fairness in Recommender Systems (SIGIR 2022)

Language: Jupyter Notebook | Stargazers: 32 | Issues: 4 | Issues: 0

user-satisfaction-simulation

"Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems" in SIGIR'21

Language: Macaulay2 | Stargazers: 16 | Issues: 2 | Issues: 0
Language: Python | License: MIT | Stargazers: 9 | Issues: 3 | Issues: 4

clse

The Corpus of Linguistically Significant Entities (CLSE) is a dataset of named entities annotated by linguist experts. It includes 34 languages and covers 74 different semantic types to support various applications from airline ticketing to video games. The aim of the corpus is to facilitate the creation of more linguistically diverse NLG datasets.

Language: Python | Stargazers: 7 | Issues: 4 | Issues: 0

RULEC-GEC

RULEC-GEC is a dataset of sentences written by learners of Russian and annotated for mistakes.

Language: Ruby | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

telegram-bot-help-ua-ch

Telegram bot that helps refugees from the war in Ukraine who are seeking information about refuge in Switzerland.

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0