Le's repositories
AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
astra-sim
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
chakra_new
Repository for MLCommons Chakra schema and tools
ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
dlrm
An implementation of a deep learning recommendation model (DLRM)
FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. |语音识别工具包,包含丰富的性能优越的开源预训练模型,支持语音识别、语音端点检测、文本后处理等,具备服务部署能力。
FunClip
一款基于FunASR高准确率开源语音识别模型的智能视频剪辑工具 / A video clipping tool based on FunASR open source model and Gradio.
katalyst-core
Katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This is the core components in Katalyst system, including multiple agents and centralized components
knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
koordinator
A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, etc.
kwok
Kubernetes WithOut Kubelet - Simulates thousands of Nodes and Clusters.
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
Megatron-LM
Ongoing research training transformer models at scale
nccl
Optimized primitives for collective multi-GPU communication
nccl-tests
NCCL Tests
OpenSCA-cli
OpenSCA is an open source software supply chain security solution that supports the detection of open source dependencies, vulnerabilities and license compliance with a widely noticed accuracy by the community.
otg-examples
Open Traffic Generator examples available to everyone. It's a great way to get started.
param
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.
self-llm
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程
sonic-mgmt
Configuration management examples for SONiC
testing
Public repository for test-cases contributed by different organizations
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (ggml/gguf), Llama models.
vulhub
Pre-Built Vulnerable Environments Based on Docker-Compose
WasmEdge
WasmEdge is a lightweight, high-performance, and extensible WebAssembly runtime for cloud native, edge, and decentralized applications. It powers serverless apps, embedded functions, microservices, smart contracts, and IoT devices.
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Yuan-2.0
Yuan 2.0 Large Language Model