zhangkaihuo's repositories

Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

Language:C++License:Apache-2.0Stargazers:2Issues:0Issues:0
Language:CudaStargazers:1Issues:0Issues:0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CGBN

CGBN: CUDA Accelerated Multiple Precision Arithmetic (Big Num) using Cooperative Groups

Language:CudaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

CINN

Compiler Infrastructure for Neural Networks

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

License:NOASSERTIONStargazers:0Issues:0Issues:0

CUDALibrarySamples

CUDA Library Samples

Language:CudaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

docs

Documentations for PaddlePaddle

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

glake

GLake: optimizing GPU memory management and IO transmission.

License:Apache-2.0Stargazers:0Issues:0Issues:0

hipSPARSE

ROCm SPARSE marshalling library

License:MITStargazers:0Issues:0Issues:0

libff

C++ library for Finite Fields and Elliptic Curves

License:NOASSERTIONStargazers:0Issues:0Issues:0

llama.cpp

Port of Facebook's LLaMA model in C/C++

License:MITStargazers:0Issues:0Issues:0

llmfarm_core.swift

Swift library to work with llama and other large language models.

Language:CLicense:MITStargazers:0Issues:0Issues:0

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:CudaStargazers:0Issues:0Issues:0

MiniCPM

MiniCPM-2B: An end-side LLM outperforms Llama2-13B.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mlc-MiniCPM

MiniCPM on Android platform.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

License:NOASSERTIONStargazers:0Issues:0Issues:0

Paddle3D

A 3D computer vision development toolkit based on PaddlePaddle. It supports point-cloud object detection, segmentation, and monocular 3D object detection models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleFleetX

Paddle Distributed Training Examples. 飞桨分布式训练示例 Resnet Bert GPT MOE DataParallel ModelParallel PipelineParallel HybridParallel AutoParallel Zero Sharding Recompute GradientMerge Offload AMP DGC LocalSGD Wide&Deep

License:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleNLP

An NLP library with Awesome pre-trained Transformer models and easy-to-use interface, supporting wide-range of NLP tasks from research to industrial applications.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

rapidsnark

fast zksnark prover

License:GPL-3.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

second.pytorch

SECOND for KITTI/NuScenes object detection

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

spconv

Spatial Sparse Convolution Library

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

sppark

Zero-knowledge template library

Language:CudaLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0