haikuoxin's repositories
clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
diffusers
š¤ Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
GLIGEN
Open-Set Grounded Text-to-Image Generation
GroundingDINO
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
HanLP
äøęåčÆ čÆę§ę ę³Ø å½åå®ä½čÆå« ä¾åå„ę³åę ęåå„ę³åę čÆä¹ä¾ååę čÆä¹č§č²ę ę³Ø ę代ę¶č§£ é£ę ¼č½¬ę¢ čÆä¹ēøä¼¼åŗ¦ ę°čÆåē° å ³é®čÆēčÆęå čŖåØęč¦ ęę¬åē±»čē±» ę¼é³ē®ē¹č½¬ę¢ čŖē¶čÆčØå¤ē
HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
imagen-pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
InternLM-XComposer
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
InternVL
[CVPR 2024] InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks āā An Open-Source Alternative to ViT-22B
latexify_py
A library to generate LaTeX expression from Python code.
LaVague
Automate automation with Large Action Model framework
learnopencv
Learn OpenCV : C++ and Python Examples
lightning-thunder
Source to source compiler for PyTorch. It makes PyTorch programs faster on single accelerators and distributed.
llama3
The official Meta Llama 3 GitHub site
llama3-from-scratch
llama3 implementation one matrix multiplication at a time
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Megatron-LM
Ongoing research training transformer models at scale
MiniGemini
Official implementation for Mini-Gemini
multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
pulp
A python Linear Programming API
pywin32
Python for Windows (pywin32) Extensions
quickwit
Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
stable-diffusion-webui
Stable Diffusion web UI
SyncDreamer
[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
tantivy-py
Python bindings for Tantivy
trl
Train transformer language models with reinforcement learning.
unsloth
Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory
VAR
[GPT beats diffusionš„] [scaling laws in visual generationš] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!