zfy3000's starred repositories

HPC-Notes

Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]

License:GPL-3.0Stargazers:51Issues:0Issues:0

CUDA-Optimization-Guide

Xiao's CUDA Optimization Guide [Active Adding New Contents]

License:GPL-3.0Stargazers:210Issues:0Issues:0

xdp-tutorial

XDP tutorial

Language:CStargazers:2332Issues:0Issues:0

pcm

Intel® Performance Counter Monitor (Intel® PCM)

Language:C++License:BSD-3-ClauseStargazers:2664Issues:0Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:7853Issues:0Issues:0

ollama

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Language:GoLicense:MITStargazers:77112Issues:0Issues:0

muduo

Event-driven network library for multi-threaded Linux server in C++11

Language:C++License:NOASSERTIONStargazers:14472Issues:0Issues:0

libgdsync

GPUDirect Async support for IB Verbs

Language:C++Stargazers:86Issues:0Issues:0

oneAPI-samples

Samples for Intel® oneAPI Toolkits

Language:C++License:MITStargazers:890Issues:0Issues:0

oneCCL

oneAPI Collective Communications Library (oneCCL)

Language:C++License:NOASSERTIONStargazers:183Issues:0Issues:0

docker-infiniband

Infiniband base image based on Centos

Language:ShellLicense:GPL-3.0Stargazers:5Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:21428Issues:0Issues:0

tutorials

Tutorials for the usage of the Uni.lu HPC platform

Language:HTMLLicense:NOASSERTIONStargazers:124Issues:0Issues:0

osu-micro-benchmarks

MPI Microbenchmarks

Language:CLicense:NOASSERTIONStargazers:28Issues:0Issues:0

cbpfc

cBPF to C or eBPF compiler

Language:GoLicense:BSD-3-ClauseStargazers:179Issues:0Issues:0

Learn-CUDA-Programming

Learn CUDA Programming, published by Packt

Language:CudaLicense:MITStargazers:946Issues:0Issues:0

clone_anonymous_github

clone/download repositories from https://anonymous.4open.science/

Language:PythonStargazers:85Issues:0Issues:0

bk-bcs-saas

蓝鲸智云容器管理平台SaaS(Blueking Container Service)

Language:PythonLicense:NOASSERTIONStargazers:269Issues:0Issues:0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

Language:C++License:NOASSERTIONStargazers:19686Issues:0Issues:0

wepy

小程序组件化开发框架

Language:JavaScriptLicense:NOASSERTIONStargazers:22485Issues:0Issues:0

tccl

Thunder Research Group's Collective Communication Library

Language:C++License:NOASSERTIONStargazers:16Issues:0Issues:0

ml-systems-papers

Curated collection of papers in machine learning systems

Stargazers:75Issues:0Issues:0

BLINKplus_cache

[Under Development] BLINK+, UC Berkeley CS 267 Final Project

Language:CudaLicense:Apache-2.0Stargazers:2Issues:0Issues:0

glake

GLake: optimizing GPU memory management and IO transmission.

Language:C++License:Apache-2.0Stargazers:316Issues:0Issues:0

ucc

Unified Collective Communication Library

Language:CLicense:BSD-3-ClauseStargazers:177Issues:0Issues:0

runlike

Given an existing docker container, prints the command line necessary to run a copy of it.

Language:PythonLicense:NOASSERTIONStargazers:1871Issues:0Issues:0

byteps

A high performance and generic framework for distributed DNN training

Language:PythonLicense:NOASSERTIONStargazers:3585Issues:0Issues:0

Alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

Language:JavaLicense:Apache-2.0Stargazers:3540Issues:0Issues:0

blinkplus-nccl-base

[Under Development] BLINK+ based on NCCL

Language:C++License:NOASSERTIONStargazers:2Issues:0Issues:0

upcxx_collectives

collectives library for upc++

Language:C++License:BSL-1.0Stargazers:3Issues:0Issues:0