Shahmatov Arseniy's starred repositories

ComfyUI-to-Python-Extension

A powerful tool that translates ComfyUI workflows into executable Python code.

Language:PythonLicense:MITStargazers:880Issues:0Issues:0

stable-fast

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Language:PythonLicense:MITStargazers:1062Issues:0Issues:0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2575Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18333Issues:0Issues:0

deforum-kandinsky

Kandinsky x Deforum — generating short animations

Language:PythonLicense:NOASSERTIONStargazers:102Issues:0Issues:0

align_sd

Better Aligning Text-to-Image Models with Human Preference. ICCV 2023

Language:PythonLicense:Apache-2.0Stargazers:258Issues:0Issues:0

consistency_models

Official repo for consistency models.

Language:PythonLicense:MITStargazers:6035Issues:0Issues:0

Kandinsky-2

Kandinsky 2 — multilingual text2image latent diffusion model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2730Issues:0Issues:0

openai-cookbook

Examples and guides for using the OpenAI API

Language:MDXLicense:MITStargazers:57812Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54451Issues:0Issues:0

nccl

Optimized primitives for collective multi-GPU communication

Language:C++License:NOASSERTIONStargazers:3014Issues:0Issues:0

Adabelief-Optimizer

Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:1040Issues:0Issues:0
Language:PythonLicense:MITStargazers:17Issues:0Issues:0

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8865Issues:0Issues:0

ghost

A new one shot face swap approach for image and video domains

Language:PythonLicense:Apache-2.0Stargazers:1151Issues:0Issues:0

ptp

[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》

Language:PythonLicense:Apache-2.0Stargazers:147Issues:0Issues:0

novelai-aspect-ratio-bucketing

Implementation of aspect ratio bucketing for training generative image models as described in: https://blog.novelai.net/novelai-improvements-on-stable-diffusion-e10d38db82ac

Language:PythonLicense:MITStargazers:343Issues:0Issues:0

rutransform

RuTransform: python framework for adversarial attacks and text data augmentation for Russian

Language:PythonLicense:Apache-2.0Stargazers:17Issues:0Issues:0

Versatile-Diffusion

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

Language:PythonLicense:MITStargazers:1301Issues:0Issues:0

LinearAlgebra2021-2022

Linear Algebra Course being taught in HSE in 2021/2022 (in russian)

Language:TeXStargazers:24Issues:0Issues:0

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language:Jupyter NotebookLicense:MITStargazers:9602Issues:0Issues:0
License:Apache-2.0Stargazers:2Issues:0Issues:0
Language:PythonStargazers:278Issues:0Issues:0

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language:PythonLicense:BSD-3-ClauseStargazers:1379Issues:0Issues:0

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9309Issues:0Issues:0

TenderHack

Development of a prototype engine for searching for goods on the tender procurement portal

Language:Jupyter NotebookStargazers:27Issues:0Issues:0

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33492Issues:0Issues:0

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language:PythonLicense:MITStargazers:2205Issues:0Issues:0

L-SAMPLER

Neural Sampler for mixing two sounds together.

Language:PythonStargazers:6Issues:0Issues:0

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:11179Issues:0Issues:0