s1van

followers

following

stars

The Ohio State University

Siyuan Ma's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0133414 1119 15912

imgui

Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies

Language:C++MIT60501 1037 5955

llama

Inference code for Llama models

Language:PythonNOASSERTION56010 526 969

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonGPL-3.053211 392 3386

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonApache-2.035100 343 2759

google-research

Google Research

Language:Jupyter NotebookApache-2.034061 750 1252

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonMIT30327 428 4190

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language:PythonBSD-3-Clause28046 235 671

babyagi

Language:PythonMIT20166 301 151

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonNOASSERTION14095 118 948

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonApache-2.012533 134 211

agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Language:PythonApache-2.05213 62 74

awesome-chatgpt

🤖 Awesome list for ChatGPT — an artificial intelligence chatbot developed by OpenAI

CC0-1.05072 600

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonMIT4476 50 290

headshots-starter

Language:TypeScriptMIT3819 21 47

human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Language:PythonMIT2366 130 36

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonApache-2.02161 29 139

Webpilot

Language:VueGPL-3.01771 22 55

gsgen

[CVPR 2024] Text-to-3D using Gaussian Splatting

Language:PythonMIT771 11 47

knnlm

Language:PythonMIT312 8 8

BlackMamba

Code repository for Black Mamba

Language:Python228 4 7

falkon

Large-scale, multi-GPU capable, kernel solver

Language:PythonMIT180 6 55

grokking

Language:Jupyter Notebook54 30

Neural-Collapse

[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features

Language:Python51 3 2

recursive_feature_machines

Language:PythonMIT38 2 2

grokking

Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.

Language:PythonMIT32 1 2

EigenPro2

EigenPro2 iteration in Tensorflow (Keras)

Language:PythonMIT23 20

Epoch_wise_Double_Descent

Official implementation of "Multi-scale Feature Learning Dynamics: Insights for Double Descent".

Language:PythonMIT16 40

convrfm

Code for convolutional neural feature ansatz and (deep convrfm)

Language:Python6 20

EigenPro

Latest and fastest EigenPro that scales to billions of examples

Language:Python400