Xiang An's starred repositories

id2reflectance

[CVPR 2024] ID2Reflectance: Monocular Identity-Conditioned Facial Reflectance Reconstruction

Language: Python | 12 | 0 | 0

self-cognition-instuctions

A dataset template for guiding chat-models to self-cognition, including information about the model’s identity, capabilities, usage, limitations, etc.

Language: Python | 20 | 0 | 0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language: Jupyter Notebook | MIT | 5643 | 0 | 0

LaPA_model

[CVPRW 2024] LaPA: Latent Prompt Assist Model For Medical Visual Question Answering

Language: Python | 9 | 0 | 0

COMG_model

[WACV 2024] Complex Organ Mask Guided Radiology Report Generation

Language: Python | MIT | 29 | 0 | 0

Arc2Face

[ECCV 2024🔥] Arc2Face: A Foundation Model of Human Faces

Language: Python | MIT | 517 | 0 | 0

CelebAMask-HQ

A large-scale face dataset for face parsing, recognition, generation and editing.

Language: Python | 2060 | 0 | 0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language: Python | Apache-2.0 | 1623 | 0 | 0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | Apache-2.0 | 8165 | 0 | 0

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

MIT | 1591 | 0 | 0

FRHandbook

Handbook of Face Recognition (Third Edition)

Language: Python | MIT | 4 | 0 | 0

Github-personal-homepage

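This project aims to provide GitHub users with a collection of carefully designed and curated personal homepage README templates, to make your profile page more distinctive and professional.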

MIT | 4 | 0 | 0

RWKV-CLIP

The official code of "RWKV-CLIP: A Robust Vision-Language Representation Learner"

Language: Python | MIT | 71 | 0 | 0

Mantis

Official code for Paper "Mantis: Multi-Image Instruction Tuning"

Language: Python | Apache-2.0 | 129 | 0 | 0

CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Language: Python | Apache-2.0 | 1700 | 0 | 0

Bunny

A family of lightweight multimodal models.

Language: Python | Apache-2.0 | 834 | 0 | 0

VAR-CLIP

Implements VAR+CLIP for image generation

24 | 0 | 0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | MIT | 3902 | 0 | 0

anxiangsir

dgxr-integration-sdk

VAR

mamba

grok-1

ffhq-dataset

InternVL

FaceStudio

IP-Adapter

GPTs

CogVLM

urban_seg

FaRL

react