sosppxo's starred repositories
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
JerryYin777.github.io
My Academic Website:https://jerrysys.top
Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
fish-speech
Brand new TTS solution
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
semantic-gaussians
Official implemetation of the paper "Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting".
Remote-Sensing-in-CVPR2024
Papers related to remote sensing in CVPR 2024
expert_readed_books
2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,**类,数学类,人物传记书籍
Replica-Dataset
The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .
concept-graphs
Official code release for ConceptGraphs
PointTransformerV3
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
VideoAgent
This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)