Zihan Wang (ImNotPrepared)

ImNotPrepared

Geek Repo

Company:Carnegie Mellon University

Location:Berkeley

Github PK Tool:Github PK Tool


Organizations
Project-MONAI

Zihan Wang's repositories

Language:PythonStargazers:2Issues:2Issues:0

tapnet

Tracking Any Point (TAP)

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

AlphaCLIP

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Papers and Datasets on Multimodal Large Language Models, and Their Evaluation.

Stargazers:0Issues:0Issues:0

awesome-segment-anything

Tracking and collecting papers/projects/others related to Segment Anything.

License:MITStargazers:0Issues:0Issues:0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

dynibar

Implementation of DynIBaR Neural Dynamic Image-Based Rendering (CVPR 2023)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

License:Apache-2.0Stargazers:0Issues:0Issues:0

llama

Inference code for LLaMA models

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Metaworld

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

License:MITStargazers:0Issues:0Issues:0

MONAI

AI Toolkit for Healthcare Imaging

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Neural-Scene-Flow-Fields

PyTorch implementation of paper "Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes"

License:MITStargazers:0Issues:0Issues:0

open_clip

An open source implementation of CLIP.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

projectaria_tools

projectaria_tools is an C++/Python open-source toolkit to interact with Project Aria data

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

prompt-dt

Official code repository for Prompt-DT.

Language:PythonStargazers:0Issues:0Issues:0

pypose

To connect classic robotics with modern learning methods seamlessly.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

VIMA

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

License:MITStargazers:0Issues:0Issues:0

vision-language-models-are-bows

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

License:MITStargazers:0Issues:0Issues:0

visprog

Official code for VisProg (CVPR 2023 Best Paper!)

License:Apache-2.0Stargazers:0Issues:0Issues:0

visual_gpt_score

VisualGPTScore for visio-linguistic reasoning

License:MITStargazers:0Issues:0Issues:0