Yanming W. (ymwangg)

ymwangg

Geek Repo

Company:@aws

Github PK Tool:Github PK Tool


Organizations
neo-ai

Yanming W.'s repositories

Language:AssemblyLicense:MITStargazers:3Issues:2Issues:1

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:3Issues:0Issues:0
Language:Jupyter NotebookStargazers:2Issues:2Issues:0

transformers

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

xla

Enabling PyTorch on Google TPU

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0

AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

alpa

Training and serving large-scale neural networks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ColossalAI-Documentation

Documentation for Colossal-AI

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0

compiler-explorer

Run compilers interactively from your web browser and interact with the assembly

License:BSD-2-ClauseStargazers:0Issues:0Issues:0

detr

End-to-End Object Detection with Transformers

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

djl-serving

A universal scalable machine learning model deployment solution

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:2Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

llama.cpp

Port of Facebook's LLaMA model in C/C++

Language:CLicense:MITStargazers:0Issues:0Issues:0

llm-awq

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

License:MITStargazers:0Issues:0Issues:0

maskrcnn-benchmark

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:AssemblyStargazers:0Issues:2Issues:0

PipeEdge

PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0
Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

tensorflow-fork

An Open Source Machine Learning Framework for Everyone

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (ggml/gguf), Llama models.

Language:PythonLicense:AGPL-3.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:2Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

vllm-test

Misc test and benchmark code for vllm

Language:PythonStargazers:0Issues:0Issues:0