jianzi123

jianzi123's repositories

adaptdl

Resource-adaptive cluster scheduler for deep learning training.

Apache-2.0 · 0 · 0 · 0

There can be more than Notion and Miro. AFFiNE (pronounced [ə'fain]) is a next-gen knowledge base that brings planning, sorting, and creating all together. Privacy-first, open-source, customizable, and ready to use.

Language: TypeScript · NOASSERTION · 0 · 0 · 0

algorithm_design

Solutions to common problems using several algorithm design methods, written in C++11.

Language: C++ · MIT · 0 · 0 · 0

checkgo

0 · 1 · 0

cricket

cricket is a virtualization solution for GPUs

MIT · 0 · 0 · 0

cuda-api-wrappers

Thin, unified, C++-flavored wrappers for the CUDA APIs

BSD-3-Clause · 0 · 0 · 0

cuda_hook

Hooks CUDA-related dynamic libraries using automated code generation tools.

Language: C · MIT · 0 · 0 · 0

cuda_scheduling_examiner_mirror

A tool for examining GPU scheduling behavior.

Language: Cuda · NOASSERTION · 0 · 0 · 0

deep_learning_study

Language: Python · 0 · 1 · 0

DeepLearningSystem

An introduction to the core principles of deep learning systems.

Apache-2.0 · 0 · 0 · 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python · Apache-2.0 · 0 · 0 · 0

FleetX

Paddle Distributed Training Extended: an extension package for PaddlePaddle distributed training.

Language: Shell · Apache-2.0 · 0 · 0 · 0

godel-scheduler

A unified scheduler for online and offline tasks.

Apache-2.0 · 0 · 0 · 0

h2ogpt

Private Q&A and summarization of documents+images or chat with local GPT, 100% private, Apache 2.0. Supports LLaMa2, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

Language: Python · Apache-2.0 · 0 · 0 · 0

HiCCL

A hierarchical collective communications library with portable optimizations

Language: C++ · 0 · 0 · 0

kluster-capacity

Cluster capacity analysis tool for capacity estimation, scheduler simulation, cluster compression, fragmentation analysis, etc.

Language: Go · Apache-2.0 · 0 · 0 · 0

kubernetes-1

Notes on k8s

0 · 0 · 0

langchain

⚡ Building applications with LLMs through composability ⚡

Language: Python · MIT · 0 · 0 · 0

learn-vgpu-the-hard-way

qemu, cuda, virtio, kernel drivers, etc., none of which I understand; I am just in awe.

Language: C · 0 · 0 · 0

ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

0 · 0 · 0

note

Study notes

Language: Go · 0 · 1 · 0

nvidia-patch

This patch removes the restriction on the maximum number of simultaneous NVENC video encoding sessions that Nvidia imposes on consumer-grade GPUs.

Language: Python · 0 · 0 · 0

pipedream_experiment

private repo of msr-fiddle/pipedream

Language: Python · MIT · 0 · 0 · 0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.

Language: Python · Apache-2.0 · 0 · 0 · 0

rockmate

Language: Python · GPL-3.0 · 0 · 0 · 0

some_script

0 · 1 · 0

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Apache-2.0 · 0 · 0 · 0

vcuda-controller

NOASSERTION · 0 · 0 · 0

vgpu_unlock

Unlock vGPU functionality for consumer-grade GPUs.

Language: C · MIT · 0 · 0 · 0

xgo

Go CGO cross compiler

Language: Shell · MIT · 0 · 1 · 0