uenian33

followers

following

stars

uenian33's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.025391 219 4105

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonApache-2.05800 65 415

moondream

tiny vision language model

Language:Jupyter NotebookApache-2.04795 51 111

koboldcpp

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

Language:C++AGPL-3.04686 67 713

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonNOASSERTION4608 50 421

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonApache-2.03147 26 129

FoundationPose

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Language:PythonNOASSERTION1275 31 201

InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language:PythonApache-2.01260 29 148

TinyGPT-V

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Language:PythonBSD-3-Clause1229 19 33

3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Language:PythonMIT880 16 62

octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Language:PythonMIT735 19 92

dift

[NeurIPS'23] Emergent Correspondence from Image Diffusion

Language:PythonMIT573 7 22

dobb-e

Dobb·E: An open-source, general framework for learning household robotic manipulation

Language:G-codeMIT558 15 7

omniglue

Code release for CVPR'24 submission 'OmniGlue'

Language:PythonApache-2.0500 10 23

tabbyAPI

An OAI compatible exllamav2 API that's both lightweight and fast

Language:PythonAGPL-3.0421 9 89

alfworld

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Language:PythonMIT318 8 72

embodied-generalist

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Language:PythonMIT308 15 42

Object-Goal-Navigation

Pytorch code for NeurIPS-20 Paper "Object Goal Navigation using Goal-Oriented Semantic Exploration"

Language:PythonMIT295 6 32

awesome-temporal-action-segmentation

A curated list of awesome temporal action segmentation resources.

simple-diffusion

A minimal implementation of a denoising diffusion model in PyTorch.

Language:PythonMIT80 2 1

GeoAware-SC

Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"

Language:Python73 3 10

LDM_correspondences

Unsupervised Semantic Correspondence Using Stable Diffusion

Language:PythonApache-2.048 3 3

deformable_gym

A collection of RL gymnasium environments for learning to grasp 3D deformable objects.

Language:PythonNOASSERTION19 9 22

lunar_planner

Language:PythonMIT19 4 1

grounding-predicates

Language:Python19 1 1

robovqa

Language:Jupyter NotebookApache-2.013 8 1

softgym_tfn

Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (simulation environments)

Language:C++MIT10 6 1

ACTR

Language:Python8 1 4

BiVLC

Language:PythonMIT4 10

SSSCWEB

This repository contains the official implementation of Self-supervised Learning of Semantic Correspondence Using Web Videos that has been accepted to 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024).

Language:Python1 20