haoranD / GPT4Point

[CVPR 2024] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.

Home Page:https://gpt4point.github.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GPT4Point : A Unified Framework for Point-Language Understanding and Generation

version Status-building PRs-Welcome

πŸ”₯ News

πŸ”₯ 2024/02/27: Our paper GPT4Point is accepted by CVPR'24!

πŸ”₯ 2024/01/19: We release the Objaverse-XL (Point Cloud Format) Extraction way.

πŸ”₯ 2024/01/10: We release the Objaverse-XL (Point Cloud Format) Download way.

πŸ”₯ 2023/12/05: The paper GPT4Point (arxiv) has been released, we unified the Point-language Understanding and Generation.

πŸ”₯ 2023/08/13: Two-stage Pre-training code of PointBLIP has been released.

πŸ”₯ 2023/08/13: Part of datasets used and result files has been uploaded.

🏠 Overview

This project presents GPT4Point , a 3D multi-modality model that aligns 3D point clouds with language. More details are shown in project page.

  • Unified Framework for Point-language Understanding and Generation. We present the unified framework for point-language understanding and generation GPT4Point, including the 3D MLLM for point-text tasks and controlled 3D generation.

  • Automated Point-language Dataset Annotation Engine Pyramid-XL. We introduce the automated point-language dataset annotation engine Pyramid-XL based on Objaverse-XL, currently encompassing 1M pairs of varying levels of coarseness and can be extended cost-effectively.

  • Object-level Point Cloud Benchmark. Establishing a novel object-level point cloud benchmark with comprehensive evaluation metrics for 3D point cloud language tasks. This benchmark thoroughly assesses models' understanding capabilities and facilitates the evaluation of generated 3D objects.

πŸ“¦ Point Dataset and Data Annotation Engine

Objaverse-XL Point Dataset Download Way

Note that you should cd in the Objaverse-xl_Download directory.

cd ./Objaverse-xl_Download

Then please see the folder Objaverse-xl_Download for details.

Objaverse-XL Point Cloud Data Generation

Please see the Extract_Pointcloud for details.

πŸ“ TODO List

Dataset and Data Engine

  • [βœ”] Release the arxiv and the project page.
  • [βœ”] Release the dataset (Objaverse-Xl) Download way.
  • [βœ”] Release the dataset (Objaverse-Xl) rendering (points) way.
  • Release dataset and data annotation engine (Pyramid-XL).
  • Add inferencing codes with checkpoints.
  • Add Huggingface DemoπŸ€—.
  • Add training codes.
  • Add evaluation codes.
  • Add gradio demo codes.

πŸ”— Citation

If you find our work helpful, please cite:

@misc{qi2023gpt4point,
  title={GPT4Point: A Unified Framework for Point-Language Understanding and Generation}, 
  author={Zhangyang Qi and Ye Fang and Zeyi Sun and Xiaoyang Wu and Tong Wu and Jiaqi Wang and Dahua Lin and Hengshuang Zhao},
  year={2023},
  eprint={2312.02980},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

πŸ“„ License

Creative Commons License
This work is under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

πŸ“š Related Work

Together, Let's make LLM for 3D great!

  • Point-Bind & Point-LLM: It aligns point clouds with Image-Bind to reason multi-modality input without 3D-instruction data training.
  • 3D-LLM: employs 2D foundation models to encode multi-view images of 3D point clouds.
  • PointLLM: employs 3D point clouds with LLaVA.

About

[CVPR 2024] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.

https://gpt4point.github.io/

License:MIT License


Languages

Language:Python 98.9%Language:Jupyter Notebook 1.0%Language:Shell 0.1%