邓鑫林 (linxin2429)

Location: Wuhan, China

邓鑫林's starred repositories

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language: Jupyter Notebook | License: MIT | Stargazers: 46909 | Issues: 417 | Issues: 86

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language: Python | License: Apache-2.0 | Stargazers: 28725 | Issues: 326 | Issues: 5236

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language: Python | License: MIT | Stargazers: 24604 | Issues: 266 | Issues: 628

awesome-deep-learning

A curated list of awesome Deep Learning tutorials, projects and communities.

pingora

A library for building fast, reliable and evolvable network services.

Language: Rust | License: Apache-2.0 | Stargazers: 19943 | Issues: 166 | Issues: 134

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | License: MIT | Stargazers: 13088 | Issues: 114 | Issues: 206

DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Language: Jupyter Notebook | Stargazers: 12798 | Issues: 296 | Issues: 820

triton

Development repository for the Triton language and compiler

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 11300 | Issues: 80 | Issues: 470

mlops-zoomcamp

Free MLOps course from DataTalks.Club

Language: Jupyter Notebook | Stargazers: 10496 | Issues: 177 | Issues: 89

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | Stargazers: 10112 | Issues: 69 | Issues: 12

juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Language: Go | License: Apache-2.0 | Stargazers: 9946 | Issues: 112 | Issues: 1287

quiche

🥧 Savoury implementation of the QUIC transport protocol and HTTP/3

Language: Rust | License: BSD-2-Clause | Stargazers: 9035 | Issues: 160 | Issues: 580

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with a PyTorch-like API

Language: Jupyter Notebook | License: MIT | Stargazers: 8639 | Issues: 143 | Issues: 25

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language: Python | License: BSD-3-Clause | Stargazers: 7587 | Issues: 138 | Issues: 3546

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 7087 | Issues: 82 | Issues: 1434
Language: Python | License: Apache-2.0 | Stargazers: 6948 | Issues: 67 | Issues: 64

FasterTransformer

Transformer-related optimization, including BERT and GPT

Language: C++ | License: Apache-2.0 | Stargazers: 5577 | Issues: 65 | Issues: 623

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Language: Python | License: NOASSERTION | Stargazers: 4941 | Issues: 70 | Issues: 183

MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4231 | Issues: 53 | Issues: 122

dlssg-to-fsr3

Adds AMD FSR 3 Frame Generation to games by replacing Nvidia DLSS-G Frame Generation (nvngx_dlssg).

Language: C++ | License: GPL-3.0 | Stargazers: 3790 | Issues: 39 | Issues: 417

ml-interviews-book

https://huyenchip.com/ml-interviews-book/

fastllm

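A pure C++ cross-platform LLM acceleration library, callable from Python. ChatGLM-6B-class models can reach 10000+ tokens/s on a single card; supports GLM, LLaMA, and MOSS base models; runs smoothly on mobile devices.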

Language: C++ | License: Apache-2.0 | Stargazers: 3146 | Issues: 36 | Issues: 349

tvm_mlir_learn

A collection of compiler learning resources.

VideoPipe

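A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star :). **The next version of VideoPipe is under development; while staying cross-platform and easy to pick up, its performance is expected to approach that of the official frameworks for each hardware platform, such as DeepStream.**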

Language: C++ | License: Apache-2.0 | Stargazers: 1074 | Issues: 17 | Issues: 20

micro-arch-training

How to prepare undergraduates or new graduates for advanced computer architecture research or modern CPU design

optd

CMU-DB's Cascades optimizer framework

Language: Rust | License: MIT | Stargazers: 322 | Issues: 32 | Issues: 31

CS149-parallel-computing

Learning materials for Stanford CS149: Parallel Computing

CUDATutorial

A CUDA tutorial for learning CUDA programming from scratch

core

The core library and APIs implementing the Triton Inference Server.

Language: C++ | License: BSD-3-Clause | Stargazers: 95 | Issues: 13 | Issues: 0