preFiredman's starred repositories

deep-learning-for-image-processing

deep learning for image processing including classification and object-detection etc.

Language:PythonLicense:GPL-3.0Stargazers:21878Issues:0Issues:0

ZLUDA

CUDA on AMD GPUs

Language:RustLicense:Apache-2.0Stargazers:8464Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:62119Issues:0Issues:0

FasterTransformer

Transformer related optimization, including BERT, GPT

Language:C++License:Apache-2.0Stargazers:5674Issues:0Issues:0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:3479Issues:0Issues:0

TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

Language:C++License:NOASSERTIONStargazers:1455Issues:0Issues:0

MoneyPrinterPlus

使用AI大模型技术,一键批量生成各类短视频,自动批量混剪短视频,自动把视频发布到抖音,快手,小红书,视频号上,赚钱从来没有这么容易过! 支持本地语音模型chatTTS,fasterwhisper。Generate short videos with one click using AI LLM,print money together!

Language:PythonLicense:GPL-3.0Stargazers:449Issues:0Issues:0

aimoneyhunter

ai副业赚钱大集合,教你如何利用ai做一些副业项目,赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English version for more insights.

Stargazers:12275Issues:0Issues:0

self-llm

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6652Issues:0Issues:0

ShortGPT

🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation

Language:PythonLicense:NOASSERTIONStargazers:5342Issues:0Issues:0

MoneyPrinterTurbo

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonLicense:MITStargazers:15199Issues:0Issues:0

MoneyPrinter

Automate Creation of YouTube Shorts using MoviePy.

Language:PythonLicense:MITStargazers:9885Issues:0Issues:0

EasySpider

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

Language:JavaScriptLicense:NOASSERTIONStargazers:30478Issues:0Issues:0

taskflow

A General-purpose Task-parallel Programming System using Modern C++

Language:C++License:NOASSERTIONStargazers:9840Issues:0Issues:0
Language:PythonLicense:MITStargazers:371Issues:0Issues:0

CGraph

【A common used C++ DAG framework】 一个通用的、无三方依赖的、跨平台的、收录于awesome-cpp的、基于流图的并行计算框架。欢迎star & fork

Language:C++License:MITStargazers:1599Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7634Issues:0Issues:0

FlagGems

FlagGems is an operator library for large language models implemented in Triton Language.

Language:PythonLicense:Apache-2.0Stargazers:177Issues:0Issues:0

master-cudnn

解读cudnn文档,掌握其用法

License:MITStargazers:8Issues:0Issues:0

sd-webui-controlnet

WebUI extension for ControlNet

Language:PythonLicense:GPL-3.0Stargazers:16545Issues:0Issues:0

cudnn-frontend

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

Language:C++License:MITStargazers:377Issues:0Issues:0

so-large-lm

大模型基础: 一文了解大模型基础知识

Stargazers:2070Issues:0Issues:0

cuda_hgemm

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Language:CudaLicense:MITStargazers:239Issues:0Issues:0

YHs_Sample

Yinghan's Code Sample

Language:CudaLicense:GPL-3.0Stargazers:259Issues:0Issues:0

CUDA_gemm

A simple high performance CUDA GEMM implementation.

Language:CudaStargazers:303Issues:0Issues:0

How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

Language:CudaLicense:Apache-2.0Stargazers:766Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54364Issues:0Issues:0

Awesome-AITools

Collection of AI-related utilities. Welcome to submit issues and pull requests /收藏AI相关的实用工具,欢迎提交issues 或者pull requests

Stargazers:3946Issues:0Issues:0

ChatLaw

ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型

License:AGPL-3.0Stargazers:6699Issues:0Issues:0

radeon_gpu_profiler

Radeon GPU Profiler (RGP) is a tool from AMD that allows for deep inspection of GPU workloads.

Stargazers:379Issues:0Issues:0