M Saiful Bari's starred repositories

jina

☁️ Build multimodal AI applications with cloud-native stack

Language:PythonLicense:Apache-2.0Stargazers:20536Issues:208Issues:1938

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7341Issues:99Issues:1466

PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

GLM

GLM (General Language Model)

Language:PythonLicense:MITStargazers:3101Issues:46Issues:191

promptsource

Toolkit for creating, sharing and using natural language prompts.

Language:PythonLicense:Apache-2.0Stargazers:2579Issues:31Issues:162

cpu_features

A cross platform C99 library to get cpu features at runtime.

Language:C++License:Apache-2.0Stargazers:2395Issues:1219Issues:118

tsne-cuda

GPU Accelerated t-SNE for CUDA with Python bindings

Language:CudaLicense:BSD-3-ClauseStargazers:1748Issues:28Issues:124

TransCoder

Public release of the TransCoder research project https://arxiv.org/pdf/2006.03511.pdf

Language:PythonLicense:NOASSERTIONStargazers:1677Issues:57Issues:54

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonLicense:NOASSERTIONStargazers:1270Issues:24Issues:143

NL-Augmenter

NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations

Language:PythonLicense:MITStargazers:764Issues:23Issues:52

t-zero

Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)

Language:PythonLicense:Apache-2.0Stargazers:454Issues:24Issues:21

ACL-anthology-corpus

This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs

Language:Jupyter NotebookStargazers:165Issues:7Issues:3

pptod

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022)

Language:PythonLicense:Apache-2.0Stargazers:156Issues:8Issues:18

deep-learning-models

Natural language processing & computer vision models optimized for AWS

Language:PythonLicense:NOASSERTIONStargazers:139Issues:16Issues:15

LTP

[KDD'22] Learned Token Pruning for Transformers

Language:PythonLicense:Apache-2.0Stargazers:87Issues:3Issues:10

LaplacianShot

Laplacian Regularized Few Shot Learning

multilingual-modeling

BLOOM+1: Adapting BLOOM model to support a new unseen language

Language:PythonLicense:Apache-2.0Stargazers:66Issues:16Issues:24

tokenizer

Convert source code into numerical tokens

Language:C++License:NOASSERTIONStargazers:63Issues:3Issues:12

ml_nlp_paper_data

Dataset of ML and NLP papers

carbon-footprint

A repository for `codecarbon` logs.

Language:Jupyter NotebookStargazers:10Issues:14Issues:1

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:5Issues:2Issues:0

multilingual-t0

Multilingual extension of T0

Language:PythonStargazers:5Issues:2Issues:0
Language:C++Stargazers:3Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

UXLA

We propose UXLA, a novel data augmentation framework for self-supervised learning in zero-resource transfer learning scenarios.

Language:PythonStargazers:2Issues:1Issues:0
License:MITStargazers:1Issues:0Issues:0
Language:JavaStargazers:1Issues:0Issues:0

BIG-bench

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

eval_t0_deepspeed

Evaluate T0 with DeepSpeed

Language:PythonStargazers:1Issues:1Issues:0