Shaunak Ketkar's starred repositories
pyspark-style-guide
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
styleguide
Style guides for Google-originated open-source projects
awesome-workflow-engines
A curated list of awesome open source workflow engines
Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
beyond-jupyter
Software design principles for machine learning applications
NeuralFlow
Visualize the intermediate output of Mistral 7B
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
awesome-instruction-datasets
A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
awesome-scalability
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
PromethAI-Backend
Open-source framework that gives you AI Agents that help you navigate decision-making, get personalized goals and execute them
public-apis
A collective list of free APIs
distribution-is-all-you-need
The basic distribution probability Tutorial for Deep Learning Researchers