Sotamaker's starred repositories

Pix2Text

An open-source Python 3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images and converting them to Markdown. A free alternative to Mathpix that supports 80+ languages.

Language: Jupyter Notebook | License: MIT | Stargazers: 1807 | Issues: 0

AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7638 | Issues: 0
Language: Python | Stargazers: 7 | Issues: 0

deep-tempest

Restoration for TEMPEST images using deep-learning

Language: Python | License: NOASSERTION | Stargazers: 447 | Issues: 0
Language: Python | Stargazers: 8 | Issues: 0

minRF

Minimal implementation of scalable rectified flow transformers, based on SD3's approach

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 421 | Issues: 0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python | License: MIT | Stargazers: 1214 | Issues: 0

BFN-Solver

Official PyTorch implementation for "Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations"

Language: Python | Stargazers: 28 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 7092 | Issues: 0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 4031 | Issues: 0

awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

Language: TeX | License: MIT | Stargazers: 348 | Issues: 0

Awesome-Diffusion-Model-Based-Image-Editing-Methods

Diffusion Model-Based Image Editing: A Survey (arXiv)

License: MIT | Stargazers: 419 | Issues: 0

Awesome-Controllable-T2I-Diffusion-Models

A collection of resources on controllable generation with text-to-image diffusion models.

License: MIT | Stargazers: 861 | Issues: 0

Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

License: MIT | Stargazers: 2096 | Issues: 0

Awesome-Controllable-Diffusion

Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, T2I-Adapter, IP-Adapter.

License: MIT | Stargazers: 363 | Issues: 0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language: Python | License: AGPL-3.0 | Stargazers: 2712 | Issues: 0
Language: HTML | Stargazers: 3 | Issues: 0

Generative-AI

[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era

Language: TeX | Stargazers: 780 | Issues: 0

GGOT

GGOT: A Gaussian graphical optimal transport method for detecting disease tipping points

Language: Python | License: MIT | Stargazers: 3 | Issues: 0

pytorch-template

To be the world's best PyTorch project template.

Language: Python | Stargazers: 416 | Issues: 0

Prompt-Engineering-Guide

🐙 Guides, papers, lectures, notebooks, and resources for prompt engineering

Language: MDX | License: MIT | Stargazers: 48022 | Issues: 0

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language: Python | License: Apache-2.0 | Stargazers: 9505 | Issues: 0

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | Stargazers: 140346 | Issues: 0

LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Language: Python | Stargazers: 10093 | Issues: 0

sd-webui-controlnet

WebUI extension for ControlNet

Language: Python | License: GPL-3.0 | Stargazers: 16880 | Issues: 0

Diffusion-SpaceTime-Attn

Official implementation of the paper "Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis"

Language: Jupyter Notebook | License: MIT | Stargazers: 88 | Issues: 0

In-Context-Learning_PaperList

Paper List for In-context Learning 🌷

Stargazers: 165 | Issues: 0

iclr2024_stats

ICLR2024 statistics

Language: HTML | Stargazers: 45 | Issues: 0

ArXivQA

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)

Language: Python | Stargazers: 285 | Issues: 0

stable_signature

Official implementation of the paper "The Stable Signature: Rooting Watermarks in Latent Diffusion Models"

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 367 | Issues: 0