pean1128

followers

following

stars

Tencent Game

Chengdu

pean1128's repositories

CoACD

[SIGGRAPH2022] Approximate Convex Decomposition for 3D Meshes with Collision-Aware Concavity and Tree Search

Language:C++MIT100

awesome-3d-reconstruction-papers

A collection of 3D reconstruction papers in the deep learning era.

000

Awesome-MVS

Awesome list of multi-view stereo papers

000

canvas-vae

Implementation of CanvasVAE: Learning to Generate Vector Graphic Documents, ICCV 2021

Apache-2.0000

ChatPaper

Use ChatGPT to summarize the arXiv papers.

NOASSERTION000

Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT（中文BERT-wwm系列模型）

Apache-2.0000

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Apache-2.0000

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

MIT000

Cream

This is a collection of our NAS and Vision Transformer work.

MIT000

cuda-samples

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

NOASSERTION000

FreeReg

[Arxiv 2023] FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators

000

generalized_contrastive_loss

MIT000

GLIGEN

Open-Set Grounded Text-to-Image Generation

MIT000

gptrpg

A demo of an GPT-based agent existing in an RPG-like environment

000

GroundingDINO

The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Apache-2.0000

kapture

kapture is a file format as well as a set of tools for manipulating datasets, and in particular Visual Localization and Structure from Motion data.

BSD-3-Clause000

nerf-learn

记录对nerf各种算法、应用、软件等等的学习过程

000

psd.js

A Photoshop PSD file parser for NodeJS and browsers

MIT000

PSD2UGUI_X

Convert psd file to ugui prefab, text, image, raw image, button, slider, scroll view, dropdown, toggle, textmeshpro...

000

RGC

[ACM MM 2023] An official source code for paper Reinforcement Graph Clustering with Unknown Cluster Number.

MIT000

rico_semantics

Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations between selected general UI elements and their text labels. Annotations also include human annotated bounding boxes which are more accurate and have a greater coverage of UI elements.

CC-BY-SA-4.0000

screen_qa

ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K question-answer pairs collected by human annotators for ~35K screenshots from Rico. It should be used to train and evaluate models capable of screen content understanding via question answering.

CC-BY-4.0000

sentence-transformers

Multilingual Sentence & Image Embeddings with BERT

Apache-2.0000

SimCLR

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

MIT000

StreamRF

Official implementation of our NeurIPS paper "Streaming Radiance Fields for 3D Video Synthesis"

BSD-2-Clause000

SuperGlobal

ICCV 2023 Paper Global Features are All You Need for Image Retrieval and Reranking Official Repository

MIT000

UEyes-CHI2023

000

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

MIT000

visual-chatgpt

VisualChatGPT

000

webui

NOASSERTION000