singhranjodh

India

@_ranjodh_singh

Ranjodh Singh's starred repositories

Prompt-Engineering-Guide

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

Language: MDX | License: MIT | 44821 501 130

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | 17117 157 263

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | 8366 79 31

magika

Detect file content types with deep learning

Language: Python | License: Apache-2.0 | 7440 36 319

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | 6908 83 1368

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | 6301 50 573

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | 5224 59 86

cutlass

CUDA Templates for Linear Algebra Subroutines

Language: C++ | License: NOASSERTION | 4674 106 868

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language: Python | License: BSD-3-Clause | 3278 37 288

leafmap

A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment

Language: Python | License: MIT | 2934 54 237

deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Language: Python | License: NOASSERTION | 2897 55 128

jepa

PyTorch code and models for V-JEPA self-supervised learning from video.

Language: Python | License: NOASSERTION | 2441 31 41

sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Language: Python | License: Apache-2.0 | 1991 47 198

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language: Jupyter Notebook | License: Apache-2.0 | 1922 34 75

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language: Python | License: Apache-2.0 | 1655 29 208

localllm

Language: Python | License: Apache-2.0 | 1465 34 9

SoM

Set-of-Mark Prompting for LMMs

Language: Python | License: MIT | 975 21 28

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | 528 18 14

adapter-bert

Language: Python | License: Apache-2.0 | 466 9 10

long-form-factuality

Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".

Language: Python | License: NOASSERTION | 465 10 1

EfficientLoFTR

Language: Jupyter Notebook | 425 40 17

LLM-Workshop

LLM Workshop by Sourab Mangrulkar

Language: Jupyter Notebook | License: Apache-2.0 | 281 6 16

ao

Native PyTorch library for quantization and sparsity

Language: Python | License: BSD-3-Clause | 271 18 35

gaussian-head

Official repository for 'GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation'

Language: Python | License: MIT | 229 22 23

indicnlp_corpus

Describes the IndicNLP corpus and associated datasets

Language: Python | 147 10 14

fast-llm.rs

Language: Rust | 127 4 2

RAGTruth

GitHub repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"

License: MIT | 64 10 2

languagecodec_tmp

Temporary anonymous version

Language: Python | License: Apache-2.0 | 2200

hyllama

llama.cpp GGUF file parser for JavaScript

Language: JavaScript | License: MIT | 21 2 2

torch-bnb-fp4

Faster PyTorch bitsandbytes 4bit fp4 nn.Linear ops

Language: Python | License: MIT | 16 40