Sihan Chen (Spycsh)

Spycsh

Geek Repo

Company:Intel

Location:Shanghai

Github PK Tool:Github PK Tool

Sihan Chen's starred repositories

manim

Animation engine for explanatory math videos

Language:PythonLicense:MITStargazers:59996Issues:884Issues:1123

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:27647Issues:184Issues:877

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:22243Issues:183Issues:174

jaeger

CNCF Jaeger, a Distributed Tracing Platform

Language:GoLicense:Apache-2.0Stargazers:19757Issues:329Issues:1846

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Language:PythonLicense:Apache-2.0Stargazers:10553Issues:195Issues:2121

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonLicense:MITStargazers:9577Issues:65Issues:102

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonLicense:BSD-3-ClauseStargazers:7634Issues:139Issues:3563

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7226Issues:84Issues:1477

text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Language:PythonLicense:Apache-2.0Stargazers:5987Issues:104Issues:405

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:5603Issues:65Issues:623

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5297Issues:63Issues:89

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5093Issues:39Issues:34

notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3661Issues:61Issues:93

oneDNN

oneAPI Deep Neural Network Library (oneDNN)

Language:C++License:Apache-2.0Stargazers:3505Issues:185Issues:1256

kserve

Standardized Serverless ML Inference Platform on Kubernetes

Language:PythonLicense:Apache-2.0Stargazers:3229Issues:63Issues:1729

docarray

Represent, send, store and search multimodal data

Language:PythonLicense:Apache-2.0Stargazers:2819Issues:43Issues:637

Awesome-Pruning

A curated list of neural network pruning resources.

tensorflow-onnx

Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2251Issues:58Issues:1031

neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Language:PythonLicense:Apache-2.0Stargazers:2053Issues:34Issues:187

intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Language:PythonLicense:Apache-2.0Stargazers:2025Issues:27Issues:144

noisereduce

Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Language:Jupyter NotebookLicense:MITStargazers:1323Issues:22Issues:71

optimization-manual

Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual"

Language:AssemblyLicense:0BSDStargazers:751Issues:41Issues:1

awesome-ml-model-compression

Awesome machine learning model compression research papers, tools, and learning material.

License:MITStargazers:449Issues:23Issues:0

streamingbook

Code snippets from the Streaming Systems book (streamingbook.net).

Language:JavaLicense:Apache-2.0Stargazers:238Issues:10Issues:2
Language:C++License:BSL-1.0Stargazers:229Issues:9Issues:8

oneAPI_course

oneAPI - Data Parallel C++ course for students

Language:C++License:Apache-2.0Stargazers:35Issues:1Issues:2

GenAIComps

GenAI components at micro-service level; GenAI service composer to create mega-service

Language:PythonLicense:Apache-2.0Stargazers:19Issues:6Issues:0

statefun-aws-demo

A stateful serverless demo app running on AWS Lambda, using Apache Flink Stateful Functions

Language:PythonLicense:Apache-2.0Stargazers:14Issues:3Issues:1

GenAIEval

Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety, and hallucination

Language:PythonLicense:Apache-2.0Stargazers:12Issues:8Issues:0
Language:JavaStargazers:7Issues:0Issues:0