Oleg Ovcharenko's starred repositories
ndc_dapt_playbook
Scalable toolkit for data curation
Perplexica
Perplexica is an AI-powered search engine and an open-source alternative to Perplexity AI.
Scrapegraph-ai
Python scraper based on AI
llama-recipes
Scripts for fine-tuning Meta Llama 3 with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and Q&A, and a number of inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama 3 for WhatsApp & Messenger.
open-gpu-kernel-modules
NVIDIA Linux open GPU kernel modules with P2P support
NeMo-Curator
Scalable toolkit for data curation
EAGE-Hackathon-2024-Instructions
Here you will find all the info you need to know to participate in the 2024 EAGE Annual Hackathon in Oslo!
NeMo-Aligner
Scalable toolkit for efficient model alignment
lm-evaluation-harness
A framework for few-shot evaluation of language models.
LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
tensorrtllm_backend
The Triton TensorRT-LLM Backend
json_repair
A python module to repair invalid JSON, commonly used to parse the output of LLMs
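The idea behind JSON repair can be illustrated with a minimal sketch that fixes two mistakes LLMs commonly make: single-quoted strings and trailing commas. This is a hypothetical standalone function for illustration, not the json_repair library's API, which handles many more failure modes.

```python
import json
import re

def repair_json_sketch(text: str) -> str:
    """Minimal sketch of JSON repair for common LLM output mistakes.

    Handles only single-quoted strings and trailing commas; a real
    repairer (e.g. json_repair) covers far more cases.
    """
    # Swap single quotes for double quotes (naive: breaks on apostrophes
    # inside strings, which a real parser-based repairer handles).
    repaired = re.sub(r"'", '"', text)
    # Drop trailing commas before a closing brace or bracket.
    repaired = re.sub(r",\s*([}\]])", r"\1", repaired)
    # Validate that the result now parses as JSON.
    json.loads(repaired)
    return repaired

print(repair_json_sketch("{'a': 1, 'b': [1, 2,],}"))
```

Running the sketch on the malformed string above yields the valid JSON `{"a": 1, "b": [1, 2]}`.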
modulus-sym
Framework providing Pythonic APIs, algorithms, and utilities to be used with Modulus core for physics-informed model training, as well as higher-level abstractions for domain experts
jetson-inference
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
modulus-makani
Massively parallel training of machine-learning based weather and climate models
chatbot-ui
AI chat for every model.
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
cartography
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
annotated_deep_learning_paper_implementations
🧑‍🏫 60 implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
text-generation-inference
Large Language Model Text Generation Inference