Ma-Dan's starred repositories

calm

CUDA/Metal accelerated language model inference

Language: C | License: MIT | Stargazers: 325 | Issues: 0

infini-mini-transformer

A personal reimplementation of Google's Infini-transformer using a small 2B model. The project includes both model and training code.

Language: Python | Stargazers: 48 | Issues: 0

gemma-sft

Gemma-SFT: fine-tuning (transformers), LoRA (peft), and inference for gemma-2b/gemma-7b

Language: Python | License: Apache-2.0 | Stargazers: 24 | Issues: 0

Language: C++ | Stargazers: 6 | Issues: 0

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | Stargazers: 8179 | Issues: 0

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Language: Python | License: MIT | Stargazers: 3096 | Issues: 0

json

JSON for Modern C++

Language: C++ | License: MIT | Stargazers: 40828 | Issues: 0

llamafile

Distribute and run LLMs with a single file.

Language: C++ | License: NOASSERTION | Stargazers: 16240 | Issues: 0

chinese-independent-developer

👩🏿‍💻👨🏾‍💻👩🏼‍💻👨🏽‍💻👩🏻‍💻 A list of projects by independent developers: sharing what everyone is building

Stargazers: 35607 | Issues: 0

ruapu

Detect CPU features with a single file

Language: C | License: MIT | Stargazers: 248 | Issues: 0

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda | License: MIT | Stargazers: 20421 | Issues: 0

Qwen1.5-0.5b-chat-android

🔥 Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat

Language: C++ | Stargazers: 18 | Issues: 0

Language: C++ | License: Apache-2.0 | Stargazers: 15 | Issues: 0

GitHub-Chinese-Top-Charts

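🇨🇳 GitHub Chinese-language rankings, with separate "Software" and "Resources" charts for each language, to pinpoint good Chinese projects. Take what you need and learn efficiently.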

Language: Java | License: NOASSERTION | Stargazers: 92019 | Issues: 0

llvm-project

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

Language: C++ | License: NOASSERTION | Stargazers: 26323 | Issues: 0

CUDA_gemm

A simple high performance CUDA GEMM implementation.

Language: Cuda | Stargazers: 292 | Issues: 0

RAGOnMedicalKG

RAGOnMedicalKG combines LLM-based RAG with a knowledge graph (KG) to build demo-level question answering, aiming to provide a basic approach.

Language: Python | Stargazers: 114 | Issues: 0

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 2362 | Issues: 0

faster-whisper

Faster Whisper transcription with CTranslate2

Language: Python | License: MIT | Stargazers: 9607 | Issues: 0

Language: Verilog | Stargazers: 3 | Issues: 0

RasaGPT

💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram

Language: Python | License: MIT | Stargazers: 2219 | Issues: 0

streaming-tts-webui

Streaming Text to Speech Web UI

Language: HTML | License: Apache-2.0 | Stargazers: 10 | Issues: 0

sqlflow

Brings SQL and AI together.

Language: Go | License: Apache-2.0 | Stargazers: 5036 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6746 | Issues: 0

flash_attn_jax

JAX bindings for Flash Attention v2

Language: C++ | License: BSD-3-Clause | Stargazers: 58 | Issues: 0

CUDA-Learn-Notes

🎉 CUDA notes / hand-written CUDA kernels for large models / C++ notes, updated irregularly: flash_attn, sgemm, sgemv, warp reduce, block reduce, dot product, elementwise, softmax, layernorm, rmsnorm, hist, etc.

Language: Cuda | License: GPL-3.0 | Stargazers: 653 | Issues: 0

diffusion_schrodinger_bridge

PyTorch Implementation of DSB for Score Based Generative Modeling. Experiments managed using Hydra.

Language: Python | License: MIT | Stargazers: 116 | Issues: 0

euanka

Full local deployment of an ASR (K2), NLP (Rasa, Spacy), LLM (Chatglm2), and TTS (Vits) pipeline

Language: Go | Stargazers: 101 | Issues: 0

rwkv-qualcomm

Inference rwkv5 with Qualcomm AI Engine Direct SDK

Language: C++ | Stargazers: 25 | Issues: 0

AllNewsSpider

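The Paper, Sina News, Tencent News, Sohu News, Xinwen Lianbo, The Times, The New York Times, BBC News: aims to crawl news from all news portal sites. Commercial use of the collected data is prohibited!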

Language: Python | License: Apache-2.0 | Stargazers: 312 | Issues: 0