Ashim Gupta's starred repositories

mosh

Mobile Shell

Language: C++, License: GPL-3.0, Stargazers: 12460, Issues: 215, Issues: 916

trl

Train transformer language models with reinforcement learning.

Language: Python, License: Apache-2.0, Stargazers: 8804, Issues: 77, Issues: 1004

pattern

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Language: Python, License: BSD-3-Clause, Stargazers: 8710, Issues: 544, Issues: 206

machine-learning-notes

My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides), with video links

Language: Jupyter Notebook, Stargazers: 8313, Issues: 388, Issues: 31

nlpaug

Data augmentation for NLP

Language: Jupyter Notebook, License: MIT, Stargazers: 4355, Issues: 41, Issues: 221

lit

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

Language: TypeScript, License: Apache-2.0, Stargazers: 3435, Issues: 68, Issues: 134

TextBrewer

A PyTorch-based knowledge distillation toolkit for natural language processing

Language: Python, License: Apache-2.0, Stargazers: 1567, Issues: 27, Issues: 104
Language: Python, License: Apache-2.0, Stargazers: 1098, Issues: 13, Issues: 92

awesome-instruction-dataset

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

DataAug4NLP

Collection of papers and resources for data augmentation for NLP.

textflint

Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing

Language: Python, License: GPL-3.0, Stargazers: 630, Issues: 18, Issues: 32

Awesome-LLM-Uncertainty-Reliability-Robustness

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

indicnlp_catalog

A collaborative catalog of NLP resources for Indic languages

Leetcode

Leetcode questions (Company-wise, Paradigm-wise and much more)

catwalk

This project studies the performance and robustness of language models and task-adaptation methods.

Language: Python, License: Apache-2.0, Stargazers: 138, Issues: 7, Issues: 24

alexa-with-dstc9-track1-dataset

DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access

Language: Python, License: Apache-2.0, Stargazers: 105, Issues: 15, Issues: 4

GLUECoS

A benchmark for code-switched NLP, ACL 2020

Language: Python, License: MIT, Stargazers: 73, Issues: 9, Issues: 23

torch

TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch

Language: Python, License: BSD-3-Clause, Stargazers: 70, Issues: 8, Issues: 20

bnn_priors

Code for the paper "Bayesian Neural Network Priors Revisited"

Language: Python, License: MIT, Stargazers: 55, Issues: 6, Issues: 0

fake-news-detection-resources

📖 A curated list of resources dedicated to Fake News Detection

License: CC0-1.0, Stargazers: 49, Issues: 5, Issues: 0

certified-word-sub

Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)

Language: Python, License: MIT, Stargazers: 39, Issues: 3, Issues: 4

metadat

Meta-analytic datasets for R

Language: Python, Stargazers: 27, Issues: 1, Issues: 0
Language: Python, License: MIT, Stargazers: 24, Issues: 2, Issues: 2

datasets_multiling_dialogue

Multilingual Dialogue Datasets

RelEx

RelEx - A simple framework for Relation Extraction built on AllenNLP

Language: Jsonnet, License: Apache-2.0, Stargazers: 16, Issues: 5, Issues: 3

dual_decomposition

Python code for dual decomposition for different model pairs

Language: Python, Stargazers: 3, Issues: 1, Issues: 0