yiakwy-xpu-ml-framework-team's repositories
NV-DOCA-code-examples
DOCA Application code sharing Contest
NV-nccl-tests
NCCL Tests
NV_grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM for MoE.
AMD-CK-fork
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
AMD-ROCm-docker-fork
Dockerfiles for the various software layers defined in the ROCm software platform
AMD-ROCm-fork
AMD ROCm™ Software - GitHub Home
AMD_matrix_instruction_calculator-fork
A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators
coweaves-k8s-pytorchjobs-nccl-tests
NVIDIA NCCL Tests for Distributed Training
CUSTOMER-RTP-LLM-fork
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
GC-Partner-IPUDOOM
DOOM (1993) on IPU 👿
groqflow
GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing those programs on GroqChip™ processors.
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Longctx_ChunkLlama
Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
META-llama3
The official Meta Llama 3 GitHub site
ml_dtypes
A stand-alone implementation of several NumPy dtype extensions used in machine learning.
MS_Pix2Text
An Open-Source Python3 tool for recognizing layouts, tables, math formulas, and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
NV-DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
NV-gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
NV-libcudacxx-fork
[ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl
NV-nccl-rdma-sharp-plugins
RDMA and SHARP plugins for the NCCL library
OpenAI-triton
Development repository for the Triton language and compiler
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
prompt-injection-defenses
Every practical and proposed defense against prompt injection.
skyworkai-Vitron
A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing