Siyuan Ma (s1van)

s1van

Geek Repo

Company:The Ohio State University

Github PK Tool:Github PK Tool

Siyuan Ma's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:133414Issues:1119Issues:15912

imgui

Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:56010Issues:526Issues:969

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:53211Issues:392Issues:3386

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:35100Issues:343Issues:2759

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:34061Issues:750Issues:1252

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:30327Issues:428Issues:4190

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language:PythonLicense:BSD-3-ClauseStargazers:28046Issues:235Issues:671

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonLicense:NOASSERTIONStargazers:14095Issues:118Issues:948

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:12533Issues:134Issues:211

agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Language:PythonLicense:Apache-2.0Stargazers:5213Issues:62Issues:74

awesome-chatgpt

🤖 Awesome list for ChatGPT — an artificial intelligence chatbot developed by OpenAI

License:CC0-1.0Stargazers:5072Issues:60Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4476Issues:50Issues:290

human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Language:PythonLicense:MITStargazers:2366Issues:130Issues:36

AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Language:PythonLicense:Apache-2.0Stargazers:2161Issues:29Issues:139

gsgen

[CVPR 2024] Text-to-3D using Gaussian Splatting

Language:PythonLicense:MITStargazers:771Issues:11Issues:47
Language:PythonLicense:MITStargazers:312Issues:8Issues:8

BlackMamba

Code repository for Black Mamba

falkon

Large-scale, multi-GPU capable, kernel solver

Language:PythonLicense:MITStargazers:180Issues:6Issues:55
Language:Jupyter NotebookStargazers:54Issues:3Issues:0

Neural-Collapse

[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features

grokking

Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.

Language:PythonLicense:MITStargazers:32Issues:1Issues:2

EigenPro2

EigenPro2 iteration in Tensorflow (Keras)

Language:PythonLicense:MITStargazers:23Issues:2Issues:0

Epoch_wise_Double_Descent

Official implementation of "Multi-scale Feature Learning Dynamics: Insights for Double Descent".

Language:PythonLicense:MITStargazers:16Issues:4Issues:0

convrfm

Code for convolutional neural feature ansatz and (deep convrfm)

Language:PythonStargazers:6Issues:2Issues:0

EigenPro

Latest and fastest EigenPro that scales to billions of examples

Language:PythonStargazers:4Issues:0Issues:0