Omkar Manjrekar (manjrekarom)

Company: @E-yantra

Location: India

Omkar Manjrekar's starred repositories

New-Grad-Positions

A collection of full time roles in SWE, Quant, and PM for new grads.

hugo-PaperMod

A fast, clean, responsive Hugo theme.

Language: HTML | License: MIT | Stargazers: 9314 | Issues: 40 | Issues: 522

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python | License: MIT | Stargazers: 8482 | Issues: 60 | Issues: 1440

nlp-recipes

Natural Language Processing Best Practices & Examples

Language: Python | License: MIT | Stargazers: 6353 | Issues: 187 | Issues: 211

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python | License: NOASSERTION | Stargazers: 5012 | Issues: 35 | Issues: 178

lark

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Language: Python | License: MIT | Stargazers: 4672 | Issues: 59 | Issues: 884

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | Stargazers: 4408 | Issues: 49 | Issues: 287

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Language: Python | License: MIT | Stargazers: 3781 | Issues: 34 | Issues: 34

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language: Python | License: MIT | Stargazers: 3538 | Issues: 67 | Issues: 229

LoveIt

❤️ A clean, elegant but advanced blog theme for Hugo

Language: JavaScript | License: MIT | Stargazers: 3352 | Issues: 30 | Issues: 501

minimalRL

Implementations of basic RL algorithms in minimal lines of code! (PyTorch-based)

Language: Python | License: MIT | Stargazers: 2813 | Issues: 49 | Issues: 40

biobert

Bioinformatics'2020: BioBERT: a pre-trained biomedical language representation model for biomedical text mining

Language: Python | License: NOASSERTION | Stargazers: 1898 | Issues: 63 | Issues: 174

rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Language: Jupyter Notebook | License: MIT | Stargazers: 1816 | Issues: 26 | Issues: 31

Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.

d3rlpy

An offline deep reinforcement learning library

Language: Python | License: MIT | Stargazers: 1267 | Issues: 27 | Issues: 327

pg-is-all-you-need

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Language: Jupyter Notebook | License: MIT | Stargazers: 836 | Issues: 11 | Issues: 12

RRHF

[NIPS2023] RRHF & Wombat

cuda_programming

Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch

Language: Cuda | License: GPL-3.0 | Stargazers: 683 | Issues: 19 | Issues: 13

rl-tutorial-jnrr19

Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019

Language: Jupyter Notebook | License: MIT | Stargazers: 582 | Issues: 11 | Issues: 13

bluebert

BlueBERT, pre-trained on PubMed abstracts and clinical notes (MIMIC-III).

Language: Python | License: NOASSERTION | Stargazers: 545 | Issues: 23 | Issues: 36

llama

User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.

cherry

A PyTorch Library for Reinforcement Learning Research

Language: Python | License: Apache-2.0 | Stargazers: 198 | Issues: 17 | Issues: 9

saber

Saber is a deep-learning based tool for information extraction in the biomedical domain. Pull requests are welcome! Note: this is a work in progress. Many things are broken, and the codebase is not stable.

Language: Python | License: MIT | Stargazers: 102 | Issues: 18 | Issues: 105

Language: Python | License: Apache-2.0 | Stargazers: 78 | Issues: 2 | Issues: 13

CA-MTL

Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data

habitat-imitation-baselines

Code for training embodied agents using imitation learning at scale in Habitat-Lab

Language: Python | License: MIT | Stargazers: 32 | Issues: 2 | Issues: 17

blue_benchmark_with_transformers

Implementation of the BLUE benchmark with Transformers.

Language: Python | License: Apache-2.0 | Stargazers: 20 | Issues: 3 | Issues: 1

SuMe

[LREC2022] SuMe: A Dataset towards Summarizing biomedical Mechanisms

Language: Python | Stargazers: 4 | Issues: 10 | Issues: 0

rust-book-tryouts

Rust Book chapters in programmed format

Language: Rust | Stargazers: 1 | Issues: 1 | Issues: 0