Alex Rozgo's repositories
LooseControl
Lifting ControlNet for Generalized Depth Conditioning
aici
AICI: Prompts as (Wasm) Programs
CogVLM
a state-of-the-art-level open visual language model | multimodal pretrained model
ControlNet_AnimalPose
Adding a quadruped pose control model to ControlNet!
CRM
Single Image to 3D Textured Mesh in 10 seconds.
dreamgaussian
Generative Gaussian Splatting for Efficient 3D Content Creation
dysts
More than a hundred strange attractors
flowty-realtime-lcm-canvas
A realtime sketch to image demo using LCM and the gradio library.
gpt-crawler
Crawl a site to generate knowledge files to create your own custom GPT from a URL
jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
llama
Simple llama usage example
LLaMA-Adapter
Fine-tuning LLaMA to follow instructions within 1 Hour and 1.2M Parameters
moondream
tiny vision language model
NeuS2
Official code for NeuS2
NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
pdfGPT
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The only open source solution to turn your PDF files into a chatbot!
PoNQ
Official implementation of PoNQ
preach
Platform independent data channels for WebRTC/Rust.
prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
shap-e
Generate 3D objects conditioned on text or images
stable-diffusion-xl-burn
Stable Diffusion XL ported to Rust's burn framework
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
tensorken
A fun, hackable, GPU-accelerated, neural network library in Rust, written by an idiot
Wonder3D
Single Image to 3D using Cross-Domain Diffusion
xelis-blockchain
A private blockDAG using Homomorphic Encryption with Smart Contract support
zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
ZoeDepth
Metric depth estimation from a single image