Anjan Nepal's starred repositories
long-range-arena
Long Range Arena for Benchmarking Efficient Transformers
tree-of-thoughts
Plug-and-Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
gpt-engineer
Specify what you want it to build; the AI asks for clarification and then builds it. Not actively maintained.
are-16-heads-really-better-than-1
Code for the paper "Are Sixteen Heads Really Better than One?"
coding-interview-university
A complete computer science study plan to become a software engineer.
resilience4j
Resilience4j is a fault tolerance library designed for Java 8 and functional programming
Grokking-System-Design
Systems design is the process of defining the architecture, modules, interfaces, and data for a system to satisfy specified requirements. Systems design could be seen as the application of systems theory to product development.
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
greedy-layer-pruning
Greedy layer pruning for transformer models.
transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀