Jie Xin (ftxj)

Company: NVIDIA

Location: Shanghai

Home Page: ftxj.github.io

Jie Xin's starred repositories

modulus

Open-source deep-learning framework for building, training, and fine-tuning deep learning models using state-of-the-art Physics-ML methods

Language: Python, License: Apache-2.0, Stargazers: 839, Issues: 0

py-spy

Sampling profiler for Python programs

Language: Rust, License: MIT, Stargazers: 12249, Issues: 0

ml-engineering

Machine Learning Engineering Open Book

Language: Python, License: CC-BY-SA-4.0, Stargazers: 10322, Issues: 0

the-art-of-debugging

The Art of Debugging

Language: C, License: CC-BY-SA-4.0, Stargazers: 766, Issues: 0
Language: Python, License: Apache-2.0, Stargazers: 846, Issues: 0
Language: C++, License: Apache-2.0, Stargazers: 131, Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python, License: Apache-2.0, Stargazers: 23922, Issues: 0

Fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

Language: C++, License: NOASSERTION, Stargazers: 239, Issues: 0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language: Python, License: Apache-2.0, Stargazers: 38418, Issues: 0

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language: Python, License: Apache-2.0, Stargazers: 9095, Issues: 0

Awesome-GPU

Awesome resources for GPUs

License: BSD-3-Clause, Stargazers: 443, Issues: 0

Tensor-Puzzles

Solve puzzles. Improve your PyTorch.

Language: Jupyter Notebook, License: MIT, Stargazers: 2954, Issues: 0

Pyjion

Pyjion - A JIT for Python based upon CoreCLR

Language: C++, License: MIT, Stargazers: 1418, Issues: 0

codon

A high-performance, zero-overhead, extensible Python compiler using LLVM

Language: C++, License: NOASSERTION, Stargazers: 14009, Issues: 0

ocolos-public

Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.

Language: C++, License: BSD-2-Clause, Stargazers: 48, Issues: 0

treebeard

An optimizing compiler for decision tree ensemble inference.

Language: C++, License: MIT, Stargazers: 15, Issues: 0

CompilerGym

Reinforcement learning environments for compiler and program optimization tasks

Language: Python, License: MIT, Stargazers: 888, Issues: 0

Hermes

A speculative mechanism to accelerate long-latency off-chip load requests by removing on-chip cache access latency from their critical path, as described in the MICRO 2022 paper by Bera et al. (https://arxiv.org/pdf/2209.00188.pdf)

Language: C++, License: MIT, Stargazers: 63, Issues: 0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: C++, License: NOASSERTION, Stargazers: 26, Issues: 0

ceras

Ceras is yet another tiny deep learning engine, written in pure C++ and header-only.

Language: C++, Stargazers: 119, Issues: 0

compile-time-regular-expressions

Compile Time Regular Expression in C++

Language: C++, License: Apache-2.0, Stargazers: 3247, Issues: 0

Rust-CUDA

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

Language: Rust, License: Apache-2.0, Stargazers: 2981, Issues: 0

CppCoreGuidelines

The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++

Language: CSS, License: NOASSERTION, Stargazers: 42195, Issues: 0

ReadingList

Papers on Graph Analytics, Mining, and Learning

Stargazers: 121, Issues: 0

fluid-engine-dev

Fluid simulation engine for computer graphics applications

Language: C++, License: MIT, Stargazers: 1847, Issues: 0

DL_Compiler

Study Group of Deep Learning Compiler

Stargazers: 148, Issues: 0

AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language: Python, License: Apache-2.0, Stargazers: 4503, Issues: 0

Game-Programmer-Study-Notes

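A collection of reading notes from my career as a game programmer. You can think of it as an enhanced blog. It covers graphics, real-time rendering, programming practice, GPU programming, design patterns, software engineering, and more. Keep Reading, Keep Writing, Keep Coding.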

Stargazers: 8917, Issues: 0

awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

Stargazers: 2278, Issues: 0

tvm_mlir_learn

A collection of compiler learning resources.

Language: Python, Stargazers: 1976, Issues: 0