Bijay Gurung's starred repositories

the-book-of-secret-knowledge

A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

License: MIT | Stargazers: 133421 | Issues: 2389 | Issues: 0

professional-programming

A collection of learning resources for curious software engineers

Language: Python | License: MIT | Stargazers: 45601 | Issues: 978 | Issues: 26

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | License: MIT | Stargazers: 18719 | Issues: 293 | Issues: 1310

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 11368 | Issues: 105 | Issues: 818

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language: Jupyter Notebook | License: MIT | Stargazers: 9569 | Issues: 85 | Issues: 244

pytorch-metric-learning

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Language: Python | License: MIT | Stargazers: 5822 | Issues: 63 | Issues: 491

BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

Language: Python | License: MIT | Stargazers: 5670 | Issues: 52 | Issues: 1617

lit-gpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Language: Python | License: Apache-2.0 | Stargazers: 5189 | Issues: 63 | Issues: 476

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | Stargazers: 4094 | Issues: 41 | Issues: 157

data-oriented-design

A curated list of data oriented design resources.

KeyBERT

Minimal keyword extraction with BERT

Language: Python | License: MIT | Stargazers: 3269 | Issues: 32 | Issues: 189

towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Language: Python | License: Apache-2.0 | Stargazers: 3030 | Issues: 29 | Issues: 654

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | License: BSD-3-Clause | Stargazers: 2957 | Issues: 60 | Issues: 86

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language: Jupyter Notebook | License: MIT | Stargazers: 2197 | Issues: 24 | Issues: 221

pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2087 | Issues: 33 | Issues: 97

setfit

Efficient few-shot learning with Sentence Transformers

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2029 | Issues: 20 | Issues: 289

decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

Language: C++ | License: Apache-2.0 | Stargazers: 1677 | Issues: 30 | Issues: 236

lightning-bolts

Toolbox of models, callbacks, and datasets for AI/ML researchers.

Language: Python | License: Apache-2.0 | Stargazers: 1651 | Issues: 23 | Issues: 359

advice

A repository of links with advice related to grad school applications, research, PhD, etc.

ALBEF

Code for ALBEF: a new vision-language pre-training method

Language: Python | License: BSD-3-Clause | Stargazers: 1418 | Issues: 11 | Issues: 138

unlimiformer

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Language: Python | License: MIT | Stargazers: 1035 | Issues: 22 | Issues: 57

evalplus

Rigorous evaluation of LLM-synthesized code - NeurIPS 2023

Language: Python | License: Apache-2.0 | Stargazers: 957 | Issues: 8 | Issues: 153

csvs-to-sqlite

Convert CSV files into a SQLite database

Language: Python | License: Apache-2.0 | Stargazers: 862 | Issues: 18 | Issues: 70

curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

Language: Python | License: MIT | Stargazers: 848 | Issues: 14 | Issues: 31

ml_collections

ML Collections is a library of Python Collections designed for ML use cases.

Language: Python | License: Apache-2.0 | Stargazers: 844 | Issues: 14 | Issues: 17

Language: Python | License: Apache-2.0 | Stargazers: 312 | Issues: 7 | Issues: 9

unmasked_teacher

[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Language: Python | License: MIT | Stargazers: 252 | Issues: 14 | Issues: 38

concept-erasure

Erasing concepts from neural representations with provable guarantees

Language: Python | License: MIT | Stargazers: 195 | Issues: 9 | Issues: 5

canals

A component orchestration engine

Language: Python | License: Apache-2.0 | Stargazers: 27 | Issues: 2 | Issues: 43

mdp-playground

A python package to design and debug RL agents.

Language: Python | License: Apache-2.0 | Stargazers: 25 | Issues: 8 | Issues: 1