Sumanth R Hegde's repositories
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
tokenization
A comprehensive deep dive into the world of tokens
personal-website
My personal website, built on top of Wowchemy
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
ICL_Support_Example
The official implementation of the paper "Finding Support Examples for In-Context Learning".
peft
Fork of 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. Our implementation for IA3, a new fine-tuning method is now a part of the official Huggingface library!
ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
python-mastery
My solutions for Advanced Python Mastery (course by @dabeaz)
nanotron
Minimalistic large language model 3D-parallelism training
llmperf
LLMPerf is a library for validating and benchmarking LLMs
cuda-resource-stream
CUDA related news and material links
unsloth
5X faster 50% less memory LLM finetuning
text-to-meme
A Text to Meme model that can generate a full meme given user text.
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
ia_3_test
Fork of Chao's test with peftt ia^3. Trying to get to the bottom of IA3 training errors.
wowchemy-hugo-themes
🔥 Hugo website builder, Hugo themes & Hugo CMS. No code, easily build with blocks! 创建在线课程,学术简历或初创网站。#OpenScience
starter-hugo-academic
🎓 Hugo Academic Theme 创建一个学术网站. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
FastChat
Fork of FastChat, an open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Chain-of-ThoughtsPapers
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
chatbot-deployment
Deployment of PyTorch chatbot with Flask
ulogme
Automatically collect and visualize usage statistics in Ubuntu/OSX environments.
AdapNet-pp
Code for the EE6132 Course Project, Fall 2019. Code forked from https://github.com/DeepSceneSeg/AdapNet-pp with project-specific changes
EE5111_Estimation_Theory
A repository of mini projects and projects carried out as part of the EE5111 Estimation Theory couse, Spring 2020.
CS6790_GPCV
Assignments of the course "Geometry and Photometry for Computer Vision" , Spring 2020.
Learning-to-See-Moving-Objects-in-the-Dark
A fork of Learning to See Moving Objects in the Dark