marssss's repositories

Selective_Context

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

Language: Python | Stargazers: 1 | Issues: 0 | Issues: 0

artificial_intelligence

My C++ deep learning framework & other machine learning algorithms

Stargazers: 0 | Issues: 0 | Issues: 0

Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

CUDA-From-Correctness-To-Performance-Code

Code & examples for "CUDA - From Correctness to Performance"

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

cuda-mode-lectures

Material for cuda-mode lectures

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

d2l-zh

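"Dive into Deep Learning": written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.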

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

DeepSpeedExamples

Example models using DeepSpeed

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

gpu-optimization-workshop

Slides, notes, and materials for the workshop (cuda-mode)

Stargazers: 0 | Issues: 0 | Issues: 0

how-to-optim-algorithm-in-cuda

How to optimize some algorithms in CUDA.

Stargazers: 0 | Issues: 0 | Issues: 0

intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Kcx_Learning

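Personal knowledge base documenting my journey of learning computer science and artificial intelligence: lifelong learning, lifelong updates.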

Stargazers: 0 | Issues: 0 | Issues: 0
License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

KuiperInfer

Implement a high-performance deep learning inference library from scratch, step by step, with support for inference on models such as Unet, Yolov5, and Resnet.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

llm-resource

A curated collection of high-quality full-stack LLM resources

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers: 0 | Issues: 0 | Issues: 0

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Stargazers: 0 | Issues: 0 | Issues: 0
License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MyTinySTL

Achieve a tiny STL in C++11

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

onnx-tensorrt

ONNX-TensorRT: TensorRT backend for ONNX

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

onnxruntime-inference-examples

Examples for using ONNX Runtime for machine learning inferencing.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

puck

Puck is a high-performance ANN search engine

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

tinyml-papers-and-projects

This is a list of interesting papers and projects about TinyML.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

tlx

TLX - A Collection of Sophisticated C++ Data Structures, Algorithms, and Miscellaneous Helpers

License: BSL-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

transfomers-silicon-research

Research and Materials on Hardware implementation of Transformer Model

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

udlbook

Understanding Deep Learning - Simon J.D. Prince

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

UnderstandingDeepLearning-ZH-CN

Chinese translation of Understanding Deep Learning

Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 1 | Issues: 0