Ayush Shridhar (ayush-1506)

ayush-1506

Geek Repo

Location:West Lafayette, IN

Home Page:ayush-1506.github.io

Twitter:@ayuSHridhar

Github PK Tool:Github PK Tool


Organizations
FluxML
KnetML
openmainframeproject
purduecyan

Ayush Shridhar's starred repositories

godot

Godot Engine – Multi-platform 2D and 3D game engine

open-interpreter

A natural language interface for computers

Language:PythonLicense:AGPL-3.0Stargazers:49564Issues:373Issues:864

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:32555Issues:233Issues:4171

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language:PythonLicense:Apache-2.0Stargazers:30180Issues:162Issues:4311

netron

Visualizer for neural network, deep learning and machine learning models

Language:JavaScriptLicense:MITStargazers:26569Issues:295Issues:1089

mojo

The Mojo Programming Language

Language:MojoLicense:NOASSERTIONStargazers:21881Issues:264Issues:1825

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:20658Issues:212Issues:115

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonLicense:Apache-2.0Stargazers:17503Issues:166Issues:1140

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17377Issues:156Issues:1347

mlx

MLX: An array framework for Apple silicon

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language:PythonLicense:MITStargazers:11643Issues:86Issues:286

cv

Print-friendly, minimalist CV page

Language:TypeScriptLicense:MITStargazers:8513Issues:23Issues:28

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language:TypeScriptLicense:Apache-2.0Stargazers:7209Issues:49Issues:58

autograd

Efficiently computes derivatives of numpy code.

Language:PythonLicense:MITStargazers:6842Issues:219Issues:394

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6305Issues:61Issues:76

AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language:PythonLicense:Apache-2.0Stargazers:4472Issues:82Issues:241

valhalla

Open Source Routing Engine for OpenStreetMap

Language:C++License:NOASSERTIONStargazers:4255Issues:104Issues:2243

HIP

HIP: C++ Heterogeneous-Compute Interface for Portability

dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

Language:PythonLicense:NOASSERTIONStargazers:2452Issues:40Issues:22

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Language:PythonLicense:MITStargazers:1339Issues:11Issues:309

matmulfreellm

Implementation for MatMul-free LM.

Language:PythonLicense:Apache-2.0Stargazers:1263Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1089Issues:17Issues:48

how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

dlpack

common in-memory tensor structure

Language:PythonLicense:Apache-2.0Stargazers:865Issues:47Issues:67
Language:PythonLicense:Apache-2.0Stargazers:857Issues:9Issues:0

CUDA-Learn-Notes

🎉CUDA 笔记 / 大模型手撕CUDA / C++笔记,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.

Language:CudaLicense:GPL-3.0Stargazers:665Issues:8Issues:5

cpufp

A CPU tool for benchmarking the peak of floating points

Language:AssemblyLicense:GPL-3.0Stargazers:433Issues:16Issues:12

MatmulTutorial

A Easy-to-understand TensorOp Matmul Tutorial

Language:C++License:Apache-2.0Stargazers:201Issues:7Issues:7