pean1128's repositories
sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
LayoutGPT
Official repo for LayoutGPT
SimCLR
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
screen_qa
ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K question-answer pairs collected by human annotators for ~35K screenshots from Rico. It should be used to train and evaluate models capable of screen content understanding via question answering.
RGC
[ACM MM 2023] An official source code for paper Reinforcement Graph Clustering with Unknown Cluster Number.
PSD2UGUI_X
Convert psd file to ugui prefab, text, image, raw image, button, slider, scroll view, dropdown, toggle, textmeshpro...
layout-dm
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation [Inoue+, CVPR2023]
FreeReg
[Arxiv 2023] FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators
kapture
kapture is a file format as well as a set of tools for manipulating datasets, and in particular Visual Localization and Structure from Motion data.
rico_semantics
Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations between selected general UI elements and their text labels. Annotations also include human annotated bounding boxes which are more accurate and have a greater coverage of UI elements.
SuperGlobal
ICCV 2023 Paper Global Features are All You Need for Image Retrieval and Reranking Official Repository
cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
StreamRF
Official implementation of our NeurIPS paper "Streaming Radiance Fields for 3D Video Synthesis"
Cream
This is a collection of our NAS and Vision Transformer work.
GLIGEN
Open-Set Grounded Text-to-Image Generation
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
nerf-learn
记录对nerf各种算法、应用、软件等等的学习过程
gptrpg
A demo of an GPT-based agent existing in an RPG-like environment
Awesome-MVS
Awesome list of multi-view stereo papers
awesome-3d-reconstruction-papers
A collection of 3D reconstruction papers in the deep learning era.
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)