bobliao (foreverlms)

foreverlms

Geek Repo

Company:Bytedance

Location:Shanghai, China

Home Page:https://blog.bobliao.xyz

Github PK Tool:Github PK Tool

bobliao's starred repositories

llama.cpp

LLM inference in C/C++

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:38491Issues:383Issues:1639

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:34354Issues:340Issues:2852

interview

📚 C/C++ 技术面试基础知识总结,包括语言、程序库、数据结构、算法、系统、网络、链接装载库等知识及面试经验、招聘、内推等信息。This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, including language, program library, data structure, algorithm, system, network, link loading library, interview experience, recruitment, recommendation, etc.

Language:C++License:NOASSERTIONStargazers:34153Issues:870Issues:62

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29269Issues:339Issues:267

llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

Language:LLVMLicense:NOASSERTIONStargazers:27513Issues:587Issues:74742

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:22675Issues:222Issues:129

llama2.c

Inference Llama 2 in one file of pure C

awesome-cheatsheets

超级速查表 - 编程语言、框架和开发工具的速查表,单个文件包含一切你需要知道的东西 :zap:

Language:ShellLicense:MITStargazers:11011Issues:269Issues:23

AISystem

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:10086Issues:140Issues:32

einops

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Language:PythonLicense:MITStargazers:8287Issues:68Issues:172

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language:PythonLicense:NOASSERTIONStargazers:8214Issues:79Issues:506
Language:PythonLicense:Apache-2.0Stargazers:7045Issues:67Issues:70

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:5718Issues:63Issues:624

mindspore

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

Language:C++License:Apache-2.0Stargazers:4197Issues:149Issues:259

web-stable-diffusion

Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3526Issues:36Issues:44

gprof2dot

Converts profiling output to a dot graph.

Language:PythonLicense:LGPL-3.0Stargazers:3178Issues:78Issues:58

CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

Language:C++License:NOASSERTIONStargazers:2309Issues:45Issues:156

note-of-interview

《互联网面试笔记》收集和分析互联网常见面试题,并将这些面试知识整理成文方便大家查阅。主要是面向java程序员,但基础知识部分不同语言程序员(c,c++,python等)都可以参考。---持续更新中, 欢迎编辑~

autodiff

automatic differentiation made easier for C++

Language:C++License:MITStargazers:1607Issues:43Issues:193

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1477Issues:35Issues:118

onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

Language:JavaScriptLicense:MITStargazers:1245Issues:12Issues:101

CPPNotes

【C++ 面试 + C++ 学习指南】 一份涵盖大部分 C++ 程序员所需要掌握的核心知识。

matxscript

A high-performance, extensible Python AOT compiler.

Language:C++License:Apache-2.0Stargazers:402Issues:15Issues:49

HPC-Learning-Notes

高性能计算相关知识学习笔记,包含学习笔记和相关知识的代码demo,在持续完善中。 如果有帮助的话请Star一下,对作者帮助很大,谢谢!

Language:Jupyter NotebookStargazers:343Issues:6Issues:1

ArxivDigest

ArXiv Digest and Personalized Recommendations using Large Language Models

Language:PythonLicense:MITStargazers:298Issues:11Issues:3

FeatureDetector

What features does your CPU and OS support?

Language:C++License:CC0-1.0Stargazers:270Issues:11Issues:9

tflite2tensorflow

Generate saved_model, tfjs, tf-trt, EdgeTPU, CoreML, quantized tflite, ONNX, OpenVINO, Myriad Inference Engine blob and .pb from .tflite. Support for building environments with Docker. It is possible to directly access the host PC GUI and the camera to verify the operation. NVIDIA GPU (dGPU) support. Intel iHD GPU (iGPU) support. Supports inverse quantization of INT8 quantization model.

Language:PythonLicense:MITStargazers:260Issues:15Issues:39

ECE408

Code base and slides for ECE408:Applied Parallel Programming On GPU.