Alexey Borsky (volotat)

Location: Belgrade, Serbia

Twitter: @volotat

Alexey Borsky's starred repositories

llama.cpp

LLM inference in C/C++

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 25676 | Issues: 211 | Issues: 229

InvokeAI

InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.

Language: TypeScript | License: Apache-2.0 | Stargazers: 22676 | Issues: 200 | Issues: 2920

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 21365 | Issues: 179 | Issues: 451

ZeroNet

ZeroNet - Decentralized websites using Bitcoin crypto and BitTorrent network

Language: JavaScript | License: NOASSERTION | Stargazers: 18302 | Issues: 841 | Issues: 2168

MoneyPrinterTurbo

Generate high-definition short videos with one click using large AI models (LLMs).

Language: Python | License: MIT | Stargazers: 15724 | Issues: 132 | Issues: 361

FreeAskInternet

FreeAskInternet is a completely free, private, locally running search aggregator and answer generator that uses multiple LLMs and needs no GPU. The user asks a question; the system runs a multi-engine search, passes the combined results to an LLM, and generates an answer based on them. It is entirely free to use.

Language: Python | License: Apache-2.0 | Stargazers: 8407 | Issues: 55 | Issues: 78

magika

Detect file content types with deep learning

Language: Rust | License: Apache-2.0 | Stargazers: 7666 | Issues: 35 | Issues: 396

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 7359 | Issues: 88 | Issues: 121

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Language: Python | License: Apache-2.0 | Stargazers: 7053 | Issues: 67 | Issues: 70

reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

Language: TypeScript | License: Apache-2.0 | Stargazers: 6107 | Issues: 36 | Issues: 87

moondream

A tiny vision language model

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4785 | Issues: 51 | Issues: 111

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language: Python | License: Apache-2.0 | Stargazers: 4441 | Issues: 62 | Issues: 177

SUPIR

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Language: Python | License: NOASSERTION | Stargazers: 4128 | Issues: 68 | Issues: 130

parler-tts

Inference and training library for high-quality TTS models.

Language: Python | License: Apache-2.0 | Stargazers: 3883 | Issues: 53 | Issues: 79

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language: Python | License: Apache-2.0 | Stargazers: 3685 | Issues: 37 | Issues: 90

yggdrasil-go

An experiment in scalable routing as an encrypted IPv6 overlay network

Language: Go | License: NOASSERTION | Stargazers: 3448 | Issues: 81 | Issues: 494

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 3145 | Issues: 26 | Issues: 129

DynamiCrafter

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language: Python | License: Apache-2.0 | Stargazers: 2286 | Issues: 31 | Issues: 118

Anything-3D

Segment-Anything + 3D. Let's lift anything to 3D.

Language: Python | License: MIT | Stargazers: 1534 | Issues: 35 | Issues: 15

AQLM

Official PyTorch repository for "Extreme Compression of Large Language Models via Additive Quantization": https://arxiv.org/pdf/2401.06118.pdf

Language: Python | License: Apache-2.0 | Stargazers: 1091 | Issues: 19 | Issues: 65

edm2

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Language: Python | License: NOASSERTION | Stargazers: 459 | Issues: 12 | Issues: 5

Matte-Anything

[Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models

Language: Python | License: MIT | Stargazers: 455 | Issues: 8 | Issues: 22

ViewDiff

ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).

Language: Python | License: NOASSERTION | Stargazers: 291 | Issues: 5 | Issues: 15

bonsai

A voxel engine in a pot

Language: C | License: WTFPL | Stargazers: 180 | Issues: 2 | Issues: 37

MambaByte

Implementation of MambaByte, from "MambaByte: Token-free Selective State Space Model", in PyTorch and Zeta

Language: Python | License: MIT | Stargazers: 98 | Issues: 6 | Issues: 1

llama2d

2D Positional Embeddings for Webpage Structural Understanding 🦙👀

Language: Python | License: GPL-3.0 | Stargazers: 91 | Issues: 1 | Issues: 1

PQDiff

[ICLR 2024] Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach. Link: https://arxiv.org/abs/2401.15652