sneaxiy's repositories

Language:CudaStargazers:1Issues:1Issues:0

OpAccStableFramework

The accuracy and stability test framework.

Language:PythonStargazers:1Issues:1Issues:0

Paddle

PArallel Distributed Deep LEarning

Language:C++License:Apache-2.0Stargazers:1Issues:1Issues:0

apex

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

CINN

Compiler Infrastructure for Neural Networks

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

couler

Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

cub

Cooperative primitives for CUDA C++.

Language:CudaLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

License:NOASSERTIONStargazers:0Issues:0Issues:0

DeepLearningExamples

Deep Learning Examples

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:C++License:BSD-3-ClauseStargazers:0Issues:0Issues:0

logging

MLPerf™ logging library

License:NOASSERTIONStargazers:0Issues:0Issues:0

models

Model configurations

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

models-2

Premade models for SQLFlow

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

nccl

Optimized primitives for collective multi-GPU communication

License:NOASSERTIONStargazers:0Issues:0Issues:0

NVBug

NVIDIA Bug

Language:CudaStargazers:0Issues:0Issues:0

NVIDIA-MxNet

NVIDIA optimized MxNet framework for MLPerf

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleFleetX

Paddle Distributed Training Examples. 飞桨分布式训练示例 Resnet Bert GPT MOE DataParallel ModelParallel PipelineParallel HybridParallel AutoParallel Zero Sharding Recompute GradientMerge Offload AMP DGC LocalSGD Wide&Deep

License:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleNLP

Easy-to-use and powerful NLP library with Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including Neural Search, Question Answering, Information Extraction and Sentiment Analysis end-to-end system.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleScience

PaddleScience is SDK and library for developing AI-driven scientific computing applications based on PaddlePaddle.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

PaddleSpeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

License:Apache-2.0Stargazers:0Issues:0Issues:0

sqlflow

Brings SQL and AI together.

Language:GoLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

training_results_v2.0

MLPerf™ Training v2.0 results

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:0Issues:0Issues:0