Ziwei Fan (zfan20)

zfan20

Geek Repo

Company:University of Illinois at Chicago

Home Page:https://ziwei-fan.github.io/

Github PK Tool:Github PK Tool

Ziwei Fan's starred repositories

LLM101n

LLM101n: Let's build a Storyteller

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:22986Issues:225Issues:131

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonLicense:Apache-2.0Stargazers:14842Issues:97Issues:746

Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

Language:TypeScriptLicense:MITStargazers:12682Issues:94Issues:217

ggml

Tensor library for machine learning

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language:SystemVerilogStargazers:6864Issues:68Issues:22

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Language:PythonLicense:Apache-2.0Stargazers:4712Issues:49Issues:280

matmulfreellm

Implementation for MatMul-free LM.

Language:PythonLicense:Apache-2.0Stargazers:2837Issues:44Issues:29

reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonLicense:MITStargazers:2252Issues:30Issues:33

c-style

My favorite C programming practices.

ThunderKittens

Tile primitives for speedy kernels

Language:CudaLicense:MITStargazers:1458Issues:26Issues:22

GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Language:PythonLicense:Apache-2.0Stargazers:1324Issues:17Issues:49

distributed-llama

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

Language:C++License:MITStargazers:1279Issues:25Issues:49

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonLicense:MITStargazers:1103Issues:22Issues:36

granite-code-models

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

PiPPy

Pipeline Parallelism for PyTorch

Language:PythonLicense:BSD-3-ClauseStargazers:697Issues:37Issues:258

unet.cu

UNet diffusion model in pure CUDA

Language:CudaStargazers:556Issues:2Issues:0

RepoAgent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Language:PythonLicense:Apache-2.0Stargazers:282Issues:9Issues:26

awesome-emulators-simulators

A curated list of software emulators and simulators of PCs, home computers, mainframes, consoles, robots and much more...

CEPE

[ACL 2024] Long-Context Language Modeling with Parallel Encodings

Language:PythonLicense:MITStargazers:126Issues:5Issues:5

bpe.c

Simple Byte pair Encoding mechanism used for tokenization process . written purely in C

Language:CLicense:MITStargazers:111Issues:0Issues:0

Awesome-Mainframes

Awesome list of mainframe related resources & projects

bark.cpp

Port of Suno AI's Bark in C/C++ for fast inference

Language:C++License:MITStargazers:49Issues:0Issues:0

farel-bench

Testing LLM reasoning abilities with family relationship quizzes.

Language:PythonLicense:MITStargazers:40Issues:0Issues:0

PLTranslationEmpirical

Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In Proceedings of The 46th IEEE/ACM International Conference on Software Engineering (ICSE 2024), Lisbon, Portugal, April 2024

Language:PythonLicense:MITStargazers:37Issues:2Issues:1

llama_duo

asynchronous/distributed speculative evaluation for llama3

Language:C++License:MITStargazers:35Issues:2Issues:1
Language:PythonLicense:Apache-2.0Stargazers:12Issues:1Issues:0