Xiang An (anxiangsir)

Organizations
deepinsight

Xiang An's starred repositories

Language: C# | License: MIT | Stargazers: 14 | Issues: 0

id2reflectance

[CVPR 2024] ID2Reflectance: Monocular Identity-Conditioned Facial Reflectance Reconstruction

Language: Python | Stargazers: 12 | Issues: 0

self-cognition-instuctions

A dataset template for guiding chat-models to self-cognition, including information about the model’s identity, capabilities, usage, limitations, etc.

Language: Python | Stargazers: 20 | Issues: 0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook | License: MIT | Stargazers: 5643 | Issues: 0

LaPA_model

[CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering

Language: Python | Stargazers: 9 | Issues: 0

COMG_model

[WACV 2024] Complex Organ Mask Guided Radiology Report Generation

Language: Python | License: MIT | Stargazers: 29 | Issues: 0

Arc2Face

[ECCV 2024🔥] Arc2Face: A Foundation Model of Human Faces

Language: Python | License: MIT | Stargazers: 517 | Issues: 0

CelebAMask-HQ

A large-scale face dataset for face parsing, recognition, generation and editing.

Language: Python | Stargazers: 2060 | Issues: 0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language: Python | License: Apache-2.0 | Stargazers: 1623 | Issues: 0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 8165 | Issues: 0

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

License: MIT | Stargazers: 1591 | Issues: 0

FRHandbook

Handbook of Face Recognition (Third Edition)

Language: Python | License: MIT | Stargazers: 4 | Issues: 0

Github-personal-homepage

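This project aims to provide GitHub users with a collection of carefully designed and curated personal homepage README templates, making your profile more distinctive and professional.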

License: MIT | Stargazers: 4 | Issues: 0

RWKV-CLIP

The official code of "RWKV-CLIP: A Robust Vision-Language Representation Learner"

Language: Python | License: MIT | Stargazers: 71 | Issues: 0

Mantis

Official code for Paper "Mantis: Multi-Image Instruction Tuning"

Language: Python | License: Apache-2.0 | Stargazers: 129 | Issues: 0

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language: Python | License: Apache-2.0 | Stargazers: 1700 | Issues: 0

Bunny

A family of lightweight multimodal models.

Language: Python | License: Apache-2.0 | Stargazers: 834 | Issues: 0

VAR-CLIP

Implements VAR+CLIP for image generation

Stargazers: 24 | Issues: 0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3902 | Issues: 0

mamba

The Fast Cross-Platform Package Manager

Language: C++ | License: BSD-3-Clause | Stargazers: 6615 | Issues: 0

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49228 | Issues: 0

ffhq-dataset

Flickr-Faces-HQ Dataset (FFHQ)

Language: Python | License: NOASSERTION | Stargazers: 3634 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model with performance approaching GPT-4o.

Language: Python | License: MIT | Stargazers: 4591 | Issues: 0

FaceStudio

Put Your Face Everywhere in Seconds.

License: Apache-2.0 | Stargazers: 308 | Issues: 0

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 4740 | Issues: 0

GPTs

leaked prompts of GPTs

Stargazers: 27941 | Issues: 0

CogVLM

A state-of-the-art-level open visual language model | Multimodal pre-trained model

Language: Python | License: Apache-2.0 | Stargazers: 5733 | Issues: 0

urban_seg

Remote Sensing Semantic Segmentation

Language: Python | License: Apache-2.0 | Stargazers: 401 | Issues: 0

FaRL

FaRL for Facial Representation Learning [Official, CVPR 2022]

Language: Python | License: MIT | Stargazers: 357 | Issues: 0

react

REACT (CVPR 2023, Highlight 2.5%)

Language: Python | License: MIT | Stargazers: 126 | Issues: 0