Venkat Raman's starred repositories
system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
system-design-resources
These are the best resources for System Design on the Internet
path-to-senior-engineer-handbook
All the resources you need to get to Senior Engineer and beyond
mlops-zoomcamp
Free MLOps course from DataTalks.Club
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for π€ Hugging Face transformer models π
cs249r_book
Collaborative book Machine Learning Systems
LLM-Finetuning-Hub
Repository that contains LLM fine-tuning and deployment scripts along with our research findings.
jetbrains-reset-trial-evaluation-mac
Reset Intellij IDEA, WebStorm, DataGrip, PhpStorm, CLion, PyCharm, RubyMine, GoLand and Rider evaluation (2019 / 2020 / Mac OS)
baklava
Baklava is the build and packaging system for ML models. Baklava leverages the python standard "setuptools" packaging system, and extends it to build docker containers that run Machine learning models. These containers are compatible with SageMaker, and in future, they will be compatible with Kubeflow.