Beast code in Giters

Irina Proskurina's repositories

la-tda

Code for EACL Workshop paper Can BERT eat RuCoLA? Topological Data Analysis to Explain

Language:Jupyter Notebook5 1 1

grammar-checker

Essay Grammar Checker trained on REALEC Corpus using SpaCy

Language:Jupyter Notebook3 1 1

ul2-atelier-data-science

Practical Deep Learning course at the University of Lyon 2

Language:Jupyter Notebook3 10

corpora-manipulation

Tool for converting error corpora to parallel datasets

Language:Python100

covid-mgpr-based-model

Code for the research project on Predicting the impacts of intervention strategies on COVID-19 trajectory (Clemson University - Université Clermont Auvergne)

Language:Python1 10

quantized-lm-confidence

Code for NAACL paper When Quantization Affects Confidence of Large Language Models?

Language:Jupyter Notebook100

small-language-models

Code for CoNLL BabyLM workshop Mini Minds: Exploring Bebeshka and Zlata Baby Models

Language:Jupyter Notebook1 20

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonMIT000

BabyBERTa

Source code for CoNLL 2021 paper by Huebner et al. 2021

Language:Python010

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Language:PythonApache-2.0000

CoLA

Demo for Grammaticality Judgement (Acceptability) task

Language:JavaScript000

fair-pruning

Code for the paper The Other Side of Compression: Measuring Bias in Pruned Transformers (IDA23)

Language:Python020

Feature_selection-based-on-IFS

Feature selection via intuitionistic fuzzy sets

Language:Jupyter Notebook000

grammar-optim

Code and Results for "Universals of word order reflect optimization of grammars for efficient communication"

Language:TeX000

Topology_for_BERT_CoLA

Code for Feature Space Analysis from the paper Acceptability Judgements via Examining the Topology of Attention Maps (EMNLP22)

Language:Jupyter Notebook040

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.

Language:PythonApache-2.0000

evaluation-pipeline

Evaluation pipeline for the BabyLM Challenge 2023.

Language:PythonMIT000

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language:PythonApache-2.0000

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT000

moral_stories

Data and code for the "Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences" (Emelin et al., 2021) paper.

Language:PythonMIT000

Provide unified APIs for SOTA model compression techniques, such as low precision (INT8/INT4/FP4/NF4) quantization, sparsity, pruning, and knowledge distillation on mainstream AI frameworks such as TensorFlow, PyTorch, and ONNX Runtime.

Language:PythonApache-2.0000

TruthfulQA

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Language:Jupyter NotebookApache-2.0000

ul2-nlp-course

NLP for Social Sciences course at the University of Lyon 2

Language:Jupyter Notebook010

upunaprosk

Irina Proskurina's repositories

la-tda

grammar-checker

ul2-atelier-data-science

corpora-manipulation

covid-mgpr-based-model

quantized-lm-confidence

small-language-models

ADWISER

AutoGPTQ

BabyBERTa

BIG-bench

CNF-DNF-converter

CoLA

fair-pruning

Feature_selection-based-on-IFS

grammar-optim

Topology_for_BERT_CoLA

transformers

writing-assistant

evaluation-pipeline

label-studio

lm-evaluation-harness

moral_stories

neural-compressor

TruthfulQA

ul2-nlp-course