yxchng's starred repositories
LinearAttentionArena
Here we will test various linear attention designs.
objaverse-xl
🪐 Objaverse-XL is a universe of 10M+ 3D objects. Contains API scripts for downloading and processing!
DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Awesome_Mamba
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
inceptionnext
InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024)
EfficientVMamba
Code Implementation of EfficientVMamba
imagenet_d
[CVPR2024 Highlight] Official Code for "ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object"
LLM-Inheritune
This is the official repository for Inheritune.
VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
Diffusion-RWKV
Scaling RWKV-Like Architectures for Diffusion Models
RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embedding.
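The description above highlights RWKV's dual nature: parallelizable training like a GPT, but sequential inference like an RNN. A minimal NumPy sketch of the sequential ("RNN mode") WKV recurrence is shown below; the function name `wkv_recurrent`, the per-channel decay `w`, and the current-token bonus `u` are illustrative assumptions, and the real implementation uses numerical-stability tricks omitted here:

```python
import numpy as np

def wkv_recurrent(k, v, w, u):
    """Sequential WKV sketch: an exponentially decayed weighted
    average of past values, with a "bonus" u for the current token.
    k, v: (T, C) keys and values; w, u: (C,) with w >= 0."""
    T, C = k.shape
    num = np.zeros(C)          # running decayed sum of exp(k_i) * v_i
    den = np.zeros(C)          # running decayed sum of exp(k_i)
    out = np.empty((T, C))
    for t in range(T):
        cur = np.exp(u + k[t])                 # current token gets bonus u
        out[t] = (num + cur * v[t]) / (den + cur)
        decay = np.exp(-w)                     # per-channel decay in (0, 1]
        num = decay * num + np.exp(k[t]) * v[t]
        den = decay * den + np.exp(k[t])
    return out
```

Because the state is just the pair `(num, den)` per channel, inference cost per token is O(C) regardless of sequence length, which is the source of the "infinite ctx_len" and low-VRAM claims.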
frequency_determines_performance
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance"