Col_In_Coding's starred repositories

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 53856 | Issues: 511 | Issues: 924

text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

Language: Python | License: AGPL-3.0 | Stargazers: 37645 | Issues: 325 | Issues: 3463

chinese-independent-developer

👩🏿‍💻👨🏾‍💻👩🏼‍💻👨🏽‍💻👩🏻‍💻 List of independent developers' projects: sharing what everyone is working on

HowToLiveLonger

A programmer's guide to living longer

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 17389 | Issues: 159 | Issues: 277

Kalman-and-Bayesian-Filters-in-Python

Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters, extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 16026 | Issues: 473 | Issues: 314

triton

Development repository for the Triton language and compiler

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 11523 | Issues: 106 | Issues: 827

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 10095 | Issues: 82 | Issues: 283

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 7096 | Issues: 82 | Issues: 1441

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6305 | Issues: 61 | Issues: 76

bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

Language: Python | License: MIT | Stargazers: 5641 | Issues: 47 | Issues: 954

cutlass

CUDA Templates for Linear Algebra Subroutines

Language: C++ | License: NOASSERTION | Stargazers: 4739 | Issues: 107 | Issues: 898

statistical-learning-method-solutions-manual

Solutions to the exercises in "Statistical Learning Methods" (统计学习方法). Read online at https://datawhalechina.github.io/statistical-learning-method-solutions-manual

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1631 | Issues: 24 | Issues: 21

tomesd

Speed up Stable Diffusion with this one simple trick!

Language: Python | License: MIT | Stargazers: 1227 | Issues: 19 | Issues: 47

onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

Language: JavaScript | License: MIT | Stargazers: 1165 | Issues: 10 | Issues: 94

cccl

CUDA C++ Core Libraries

Language: C++ | License: NOASSERTION | Stargazers: 884 | Issues: 30 | Issues: 1039

deq

[NeurIPS'19] Deep Equilibrium Models

Language: Python | License: MIT | Stargazers: 707 | Issues: 21 | Issues: 29

CUDA-Learn-Notes

🎉 CUDA notes / hand-written CUDA kernels for large models / C++ notes, updated sporadically: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Language: Cuda | License: GPL-3.0 | Stargazers: 665 | Issues: 8 | Issues: 5

photometric_optimization

Photometric optimization code for creating the FLAME texture space and other applications

Language: Python | License: MIT | Stargazers: 496 | Issues: 9 | Issues: 21

llama.onnx

LLaMa/RWKV ONNX models, quantization, and test cases

Language: Python | License: GPL-3.0 | Stargazers: 331 | Issues: 13 | Issues: 18

aisys-building-blocks

Building blocks for foundation models.

TensorRT-Model-Optimizer

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.

Language: Python | License: NOASSERTION | Stargazers: 250 | Issues: 8 | Issues: 24

Facial-Landmarks-Annotation-Tool

A visual editor for manually annotating facial landmarks in images of human faces.

Language: C++ | License: GPL-3.0 | Stargazers: 212 | Issues: 12 | Issues: 12

MixSTE

Official implementation of the CVPR 2022 paper "MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video"

facial-landmark-dataset

A collection of facial landmark datasets and Python code to make use of them.

Language: Python | License: MIT | Stargazers: 96 | Issues: 3 | Issues: 11

STCFormer

[CVPR 2023] 3D Human Pose Estimation with Spatio-Temporal Criss-cross Attention