Chenxin Li (XGGNet)

XGGNet

Geek Repo

Company:The Chinese University of Hong Kong

Home Page:https://xggnet.github.io/

Twitter:@XGGNet

Github PK Tool:Github PK Tool

Chenxin Li's repositories

Endora

Endora: Video Generation Models as Endoscopy Simulators

StegaNeRF

Official Pytorch implementation of "StegaNeRF: Embedding Invisible Information within Neueral Radiance Fields"

Language:PythonLicense:BSD-2-ClauseStargazers:35Issues:4Issues:3

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Audio Inputs

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1Issues:0Issues:0

llama-dl

High-speed download of LLaMA, Facebook's 65B parameter GPT model

Language:ShellLicense:GPL-3.0Stargazers:1Issues:0Issues:0

multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:1Issues:0

Academic-project-page-template

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

Language:JavaScriptStargazers:0Issues:0Issues:0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Awesome-Dataset-Distillation

Awesome Dataset Distillation Papers

License:MITStargazers:0Issues:0Issues:0

Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

License:MITStargazers:0Issues:0Issues:0

CF-ViT

Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"

License:Apache-2.0Stargazers:0Issues:0Issues:0

CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Endo-FM

[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train

License:Apache-2.0Stargazers:0Issues:0Issues:0

Endo-FM-1

[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

generative-ai-roadmap

生成式AI的应用路线图 The roadmap of generative AI: use cases and applications

License:CC-BY-4.0Stargazers:0Issues:0Issues:0

generative-models

Generative Models by Stability AI

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

License:Apache-2.0Stargazers:0Issues:0Issues:0

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

LightGaussian

"LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

Language:PythonStargazers:0Issues:0Issues:0

PhysGaussian

[CVPR 2024] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics

Stargazers:0Issues:0Issues:0

SEED

Official implementation of SEED-LLaMA (ICLR 2024).

License:NOASSERTIONStargazers:0Issues:0Issues:0

SOMA

[ICCV' 23 ORAL] Novel Scenes & Classes: Towards Adaptive Open-set Object Detection

License:MITStargazers:0Issues:0Issues:0

Source-Free-Domain-Generalization

An open-world scenario domain generalization code base

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:1Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

VisualGLM-6B

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0