Manan Dey's repositories
bias_machine_translation
Code for: How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contexts. (EMNLP Findings 2022)
awesome-align
A neural word aligner based on multilingual BERT
BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
bigscience
Codebase for the project
BLINK
Entity Linker solution
COVID-19-People-Resource-Provider-Mapping-App
Mapping People to Resource Providers (NGOs, Volunteers)
data_tooling
Tools for managing datasets for governance and training.
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Detecting-Depression-through-YouTube-history
It is aimed at calculation of affective scores of videos using the audio visual feature and further analyzing the affective pattern generated from the YouTube watching history of an individual to predict his depression severity score.
evals
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
Evaluating-gender-bias
Evaluating Gender Bias in NLI models
evaluation
Code and Data for Evaluation WG
flax-sentence-embeddings
Shared code for training sentence embeddings with Flax / JAX
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
manandey.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
mteb
MTEB: Massive Text Embedding Benchmark
natural-instructions
Expanding natural instructions
NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
promptsource
Toolkit for creating, sharing and using natural language prompts.
REL
REL: Radboud Entity Linker
simple-salesforce
A very simple Salesforce.com REST API client for Python
Snowfakery
A tool for generating fake data that has relations between tables.
t-zero
Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
weblm
Drive a browser with Cohere