yoosan

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT4054 115 81

transformer-debugger

Language:PythonMIT4022 25 14

Queryable

Run OpenAI's CLIP and Apple's MobileCLIP model on iOS to search photos.

Language:SwiftMIT2626 16 36

gemma

Open weights LLM from Google DeepMind.

Language:PythonApache-2.02422 32 32

mpire

A Python package for easy multiprocessing, but faster than multiprocessing

Language:PythonMIT2004 15 83

SearchEngine

搜索引擎原理

1412 20 6

minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Language:PythonApache-2.01182 18 63

WonderJourney

Language:PythonMIT661 48 9

multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Language:PythonMIT638 7 81

text-dedup

All-in-one text de-duplication

Language:PythonApache-2.0594 4 67

distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Language:PythonMIT562 8 23

llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Language:PythonApache-2.0548 12 62

KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language:Python285 13 15

vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

Language:CMIT195 2 7

Full-Segment-Anything

This is Pytorch Implementation Code for adding new features in code of Segment-Anything. Here, the features support batch-input on the full-grid prompt (automatic mask generation) with post-processing: removing duplicated or small regions and holes, under flexible input image size

Language:PythonMIT134 2 9

the-stack-v2

Code for the curation of The Stack v2 and StarCoder2 training data

Language:Jupyter NotebookApache-2.089 5 5

GeoReasoner

GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Mode

22 7 2