linxin2429

followers

following

stars

Wuhan,China

邓鑫林's starred repositories

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookMIT46909 417 86

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonApache-2.028725 326 5236

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language:PythonMIT24604 266 628

awesome-deep-learning

A curated list of awesome Deep Learning tutorials, projects and communities.

pingora

A library for building fast, reliable and evolvable network services.

Language:RustApache-2.019943 166 134

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT13088 114 206

DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Language:Jupyter Notebook12798 296 820

triton

Development repository for the Triton language and compiler

Language:C++MIT11617 182 1237

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.011300 80 470

mlops-zoomcamp

Free MLOps course from DataTalks.Club

Language:Jupyter Notebook10496 177 89

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookMIT10112 69 12

juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Language:GoApache-2.09946 112 1287

quiche

🥧 Savoury implementation of the QUIC transport protocol and HTTP/3

Language:RustBSD-2-Clause9035 160 580

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language:Jupyter NotebookMIT8639 143 25

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonBSD-3-Clause7587 138 3546

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.07087 82 1434

LWM

Language:PythonApache-2.06948 67 64

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++Apache-2.05577 65 623

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Language:PythonNOASSERTION4941 70 183

MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Language:Jupyter NotebookApache-2.04231 53 122

dlssg-to-fsr3

Adds AMD FSR 3 Frame Generation to games by replacing Nvidia DLSS-G Frame Generation (nvngx_dlssg).

Language:C++GPL-3.03790 39 417

ml-interviews-book

https://huyenchip.com/ml-interviews-book/

Language:HTML3195 43 14

fastllm

纯c++的全平台llm加速库，支持python调用，chatglm-6B级模型单卡可达10000+token / s，支持glm, llama, moss基座，手机端流畅运行

Language:C++Apache-2.03146 36 349

tvm_mlir_learn

compiler learning resources collect.

Language:Python1871 35 4

VideoPipe

跨平台的视频结构化（视频分析）框架，觉得有帮助的请给个星星 : ) 。**VideoPipe下一版本正在开发中，在保证跨平台、易上手的前提下，预计性能直逼deepstream等各硬件平台官方框架**。

Language:C++Apache-2.01074 17 20

micro-arch-training

How to make undergraduates or new graduates ready for advanced computer architecture research or modern CPU design

optd

CMU-DB's Cascades optimizer framework

Language:RustMIT322 32 31

CS149-parallel-computing

Learning materials for Stanford CS149 : Parallel Computing

Language:C129 2 2

CUDATutorial

A CUDA tutorial to make people learn CUDA program from 0

Language:Cuda121 2 3

core

The core library and APIs implementing the Triton Inference Server.

Language:C++BSD-3-Clause95 130