Anoop's repositories
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
gorilla
Gorilla: An API store for LLMs
arcee-trainium-recipes
This repository contains the setup required to run Trainium training jobs.
grammar-based-agents
Modular open LLM agents via prompt chaining and schema-guided generation
lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
peft_lora
Repository accompanying the PEFT/LoRA article.
ipyexperiments
Automatic GPU+CPU memory profiling, re-use, and memory-leak detection using Jupyter/IPython experiment containers
TinyChatEngine
TinyChatEngine: On-Device LLM Inference Library
tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
qlora
QLoRA: Efficient Finetuning of Quantized LLMs
NexusRaven
NexusRaven-13B, a new SOTA open-source LLM for function calling. This repo contains everything needed to reproduce our evaluation of NexusRaven-13B and the baselines.