ovcharenkoo

Oleg Ovcharenko's starred repositories

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonMIT50728 433 126

chatbot-ui

AI chat for every model.

Language:TypeScriptMIT27184 243 932

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookNOASSERTION20495 232 50

Scrapegraph-ai

Python scraper based on AI

Language:PythonMIT12207 82 155

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookNOASSERTION10248 83 289

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonMIT8378 67 193

text-generation-inference

Large Language Model Text Generation Inference

Language:PythonApache-2.08280 99 1162

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonBSD-3-Clause7666 139 3570

jetson-inference

Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.

Language:C++MIT7523 273 1787

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.07288 85 1492

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT5657 36 900

openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Language:PythonApache-2.05101 51 185

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language:PythonApache-2.01573 34 247

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookApache-2.01303 23 6

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookMIT1272 18 54

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonNOASSERTION1264 24 143

deepops

Tools for building GPU clusters

Language:ShellBSD-3-Clause1217 51 426

open-gpu-kernel-modules

NVIDIA Linux open GPU with P2P support

Language:CNOASSERTION765 14 7

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonApache-2.0575 22 403

tutorials

This repository contains tutorials and examples for Triton Inference Server

Language:PythonBSD-3-Clause457 130

json_repair

A python module to repair invalid JSON, commonly used to parse the output of LLMs

Language:PythonMIT416 3 41

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language:PythonApache-2.0407 9 53

NeMo-Curator

Scalable toolkit for data curation

Language:PythonApache-2.0322 12 52

modulus-makani

Massively parallel training of machine-learning based weather and climate models

Language:PythonNOASSERTION212 12 3

cartography

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Language:Jupyter NotebookApache-2.0187 7 9

earth2mip

Earth-2 Model Intercomparison Project (MIP) is a python framework that enables climate researchers and scientists to inter-compare AI models for weather and climate.

Language:PythonApache-2.0165 6 100

modulus-sym

Framework providing pythonic APIs, algorithms and utilities to be used with Modulus core to physics inform model training as well as higher level abstraction for domain experts

Language:PythonApache-2.0130 13 66

mipt_course

Language:C++31 20

EAGE-Hackathon-2024-Instructions

Here you will find all the info you need to know to participate in the 2024 EAGE Annual Hackathon in Oslo!

GPL-3.0400

rapids-examples

Language:Jupyter Notebook100