ZZK (MARD1NO)

MARD1NO

Geek Repo

Company:SiliconFlow

Location:Neverland

Home Page:https://mard1no.github.io/

Github PK Tool:Github PK Tool

ZZK's repositories

open-resume

OpenResume is a powerful open-source resume builder and resume parser. https://open-resume.com/

Language:TypeScriptLicense:AGPL-3.0Stargazers:1Issues:0Issues:0

tutorial-multi-gpu

Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial

Language:CudaLicense:MITStargazers:1Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

ByteTransformer

optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

CUDALibrarySamples

CUDA Library Samples

Language:CudaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

CV-CUDA

CV-CUDA™ is an open-source, graphics processing unit (GPU)-accelerated library for cloud-scale image processing and computer vision.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

docs

Documentations for PaddlePaddle

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dynolog

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.

Language:C++License:MITStargazers:0Issues:0Issues:0

EdgeGPT

Reverse engineered API of Microsoft's Bing Chat AI

Language:PythonLicense:UnlicenseStargazers:0Issues:0Issues:0

FlexGen

Running large language models like OPT-175B/GPT-3 on a single GPU. Up to 100x faster than other offloading systems.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

GPTQ-triton

GPTQ inference Triton kernel

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

InferLLM

a lightweight LLM model inference framework

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

kernl

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

LLMSurvey

A collection of papers and resources related to Large Language Models.

Stargazers:0Issues:0Issues:0

matxscript

The model pre-processing and post-processing framework

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

nccl-tests

NCCL Tests

Language:CudaLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0
Language:CStargazers:0Issues:0Issues:0

PTX-ISA

CUDA PTX-ISA Document 中文翻译版

License:Apache-2.0Stargazers:0Issues:0Issues:0

QuickMathHPP

a single-header math library

Language:C++License:MITStargazers:0Issues:0Issues:0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

taichi-nerfs

Implementations of NeRF variants based on Taichi + PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

typst

A new markup-based typesetting system that is powerful and easy to learn.

Language:RustLicense:Apache-2.0Stargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0