yytzsy's starred repositories

TimeMarker

A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability

License:Apache-2.0Stargazers:27Issues:0Issues:0

HuggingFace-Download-Accelerator

利用HuggingFace的官方下载工具从镜像网站进行高速下载。

Language:PythonStargazers:827Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:484Issues:0Issues:0

MIC

MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

Language:PythonStargazers:334Issues:0Issues:0

Downstream-Dinov2

Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such as Classification, Semantic Segmentation and Monocular depth estimation.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:196Issues:0Issues:0

POPE

The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''

Language:PythonLicense:MITStargazers:179Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:20256Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:35475Issues:0Issues:0

PolygonObjectDetection

This repository is based on Ultralytics/yolov5, with adjustments to enable polygon prediction boxes.

Language:PythonStargazers:355Issues:0Issues:0

3D-Box-Segment-Anything

We extend Segment Anything to 3D perception by combining it with VoxelNeXt.

Language:Jupyter NotebookStargazers:539Issues:0Issues:0

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:142958Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:47645Issues:0Issues:0

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language:PythonLicense:Apache-2.0Stargazers:8272Issues:0Issues:0

DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Language:PythonLicense:MITStargazers:11147Issues:0Issues:0

skeleton-tracing

A new algorithm for retrieving topological skeleton as a set of polylines from binary images

Language:CLicense:MITStargazers:510Issues:0Issues:0

contrastive_association

Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Language:PythonLicense:MITStargazers:46Issues:0Issues:0

lmconv

Code for UAI 2020 paper "Locally Masked Convolution for Autoregressive Models"

Language:PythonLicense:NOASSERTIONStargazers:77Issues:0Issues:0

2DPASS

2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds (ECCV 2022) :fire:

Language:PythonLicense:MITStargazers:405Issues:0Issues:0

Panoptic-SegFormer

This is the official repo of Panoptic SegFormer [CVPR'22]

Language:PythonLicense:Apache-2.0Stargazers:218Issues:0Issues:0

MaskFormer

Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)

Language:PythonLicense:NOASSERTIONStargazers:1354Issues:0Issues:0

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language:PythonLicense:MITStargazers:2565Issues:0Issues:0

3D-PointCloud

Papers and Datasets about Point Cloud.

Language:PythonStargazers:2479Issues:0Issues:0

DS-Net

[CVPR 2021/TPAMI 2023] Rank 1st in the public leaderboard of SemanticKITTI Panoptic Segmentation (2020-11-16)

Language:PythonLicense:MITStargazers:243Issues:0Issues:0

Panoptic-PolarNet

Implementation for Panoptic-PolarNet (CVPR 2021)

Language:PythonLicense:BSD-3-ClauseStargazers:168Issues:0Issues:0

Rotated_IoU

Differentiable IoU of rotated bounding boxes using Pytorch

Language:PythonLicense:MITStargazers:415Issues:0Issues:0

lift-splat-shoot

Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)

Language:PythonLicense:NOASSERTIONStargazers:1054Issues:0Issues:0
Language:PythonStargazers:12Issues:0Issues:0

mmdetection3d

OpenMMLab's next-generation platform for general 3D object detection.

Language:PythonLicense:Apache-2.0Stargazers:5316Issues:0Issues:0

OpenPCDet

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Language:PythonLicense:Apache-2.0Stargazers:4697Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:485Issues:0Issues:0