yytzsy's starred repositories

TimeMarker

A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability

Apache-2.0 · 2700

HuggingFace-Download-Accelerator

Uses HuggingFace's official download tool to download at high speed from a mirror site.

Language: Python · 82700

adapter-bert

Language: Python · Apache-2.0 · 48400

MIC

MMICL, a state-of-the-art VLM with in-context learning ability, from ICL, PKU

Language: Python · 33400

Downstream-Dinov2

Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such as Classification, Semantic Segmentation and Monocular depth estimation.

Language: Jupyter Notebook · NOASSERTION · 19600

POPE

The official GitHub page for "Evaluating Object Hallucination in Large Vision-Language Models"

Language: Python · MIT · 17900

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python · Apache-2.0 · 2025600

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python · Apache-2.0 · 3547500

PolygonObjectDetection

This repository is based on Ultralytics/yolov5, with adjustments to enable polygon prediction boxes.

Language: Python · 35500

3D-Box-Segment-Anything

We extend Segment Anything to 3D perception by combining it with VoxelNeXt.

Language: Jupyter Notebook · 53900

stable-diffusion-webui

Stable Diffusion web UI

Language: Python · AGPL-3.0 · 14295800

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook · Apache-2.0 · 4764500

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language: Python · Apache-2.0 · 827200

DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch

Language: Python · MIT · 1114700

skeleton-tracing

A new algorithm for retrieving topological skeleton as a set of polylines from binary images

Language: C · MIT · 51000

contrastive_association

Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Language: Python · MIT · 4600

lmconv

Code for UAI 2020 paper "Locally Masked Convolution for Autoregressive Models"

Language: Python · NOASSERTION · 7700

2DPASS

2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds (ECCV 2022)

Language: Python · MIT · 40500

Panoptic-SegFormer

This is the official repo of Panoptic SegFormer [CVPR'22]

Language: Python · Apache-2.0 · 21800

MaskFormer

Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight)

Language: Python · NOASSERTION · 135400

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language: Python · MIT · 256500

3D-PointCloud

Papers and Datasets about Point Cloud.

Language: Python · 247900

DS-Net

[CVPR 2021/TPAMI 2023] Rank 1st in the public leaderboard of SemanticKITTI Panoptic Segmentation (2020-11-16)

Language: Python · MIT · 24300

Panoptic-PolarNet

Implementation for Panoptic-PolarNet (CVPR 2021)

Language: Python · BSD-3-Clause · 16800

Rotated_IoU

Differentiable IoU of rotated bounding boxes using PyTorch

Language: Python · MIT · 41500

lift-splat-shoot

Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)

Language: Python · NOASSERTION · 105400

afdet

Language: Python · 1200

mmdetection3d

OpenMMLab's next-generation platform for general 3D object detection.

Language: Python · Apache-2.0 · 531600

OpenPCDet

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Language: Python · Apache-2.0 · 469700

ttfnet

Language: Python · Apache-2.0 · 48500