manjrekarom

Omkar Manjrekar's starred repositories

New-Grad-Positions

A collection of full time roles in SWE, Quant, and PM for new grads.

10473 1347 171

hugo-PaperMod

A fast, clean, responsive Hugo theme.

Language:HTMLMIT9314 40 522

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonMIT8482 60 1440

nlp-recipes

Natural Language Processing Best Practices & Examples

Language:PythonMIT6353 187 211

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language:PythonNOASSERTION5012 35 178

lark

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Language:PythonMIT4672 59 884

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonMIT4408 49 287

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language:PythonMIT3781 34 34

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonMIT3538 67 229

LoveIt

❤️A clean, elegant but advanced blog theme for Hugo 一个简洁、优雅且高效的 Hugo 主题

Language:JavaScriptMIT3352 30 501

minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Language:PythonMIT2813 49 40

biobert

Bioinformatics'2020: BioBERT: a pre-trained biomedical language representation model for biomedical text mining

Language:PythonNOASSERTION1898 63 174

rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Language:Jupyter NotebookMIT1816 26 31

Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

MIT1443 16 4

d3rlpy

An offline deep reinforcement learning library

Language:PythonMIT1267 27 327

Summarization-Papers

Summarization Papers

Language:TeX978 23 3

pg-is-all-you-need

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Language:Jupyter NotebookMIT836 11 12

RRHF

[NIPS2023] RRHF & Wombat

Language:Python782 10 47

cuda_programming

Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch

Language:CudaGPL-3.0683 19 13

rl-tutorial-jnrr19

Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019

Language:Jupyter NotebookMIT582 11 13

bluebert

BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).

Language:PythonNOASSERTION545 23 36

llama

User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.

Language:Python327 2 13

cherry

A PyTorch Library for Reinforcement Learning Research

Language:PythonApache-2.0198 17 9

saber

Saber is a deep-learning based tool for information extraction in the biomedical domain. Pull requests are welcome! Note: this is a work in progress. Many things are broken, and the codebase is not stable.

Language:PythonMIT102 18 105