star (starhiking)

starhiking

Geek Repo

Company:University of Chinese Academy of Sciences

Location:Beijing

Github PK Tool:Github PK Tool

star's starred repositories

Language:PythonLicense:Apache-2.0Stargazers:2544Issues:0Issues:0

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Language:PythonStargazers:798Issues:0Issues:0

FMAE-IAT

official implementation for the paper 'Representation Learning and Identity Adversarial Training for Facial Behavior Understanding'

Language:PythonLicense:NOASSERTIONStargazers:16Issues:0Issues:0
Language:PythonLicense:MITStargazers:73Issues:0Issues:0

scrfd_demo

A demo for inference of scrfd model

Language:PythonLicense:Apache-2.0Stargazers:2Issues:0Issues:0

Pytorch_Retinaface

Retinaface get 80.99% in widerface hard val using mobilenet0.25.

Language:PythonLicense:MITStargazers:22Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:341Issues:0Issues:0

CV_DL_Gather

Gather research papers, corresponding codes (if having), reading notes and any other related materials about Hot🔥🔥🔥 fields in Computer Vision based on Deep Learning.

Stargazers:59Issues:0Issues:0

fairface

FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age

Stargazers:410Issues:0Issues:0

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:12750Issues:0Issues:0

self-llm

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8248Issues:0Issues:0

cgft-llm

Practice to LLM.

Language:Jupyter NotebookLicense:MITStargazers:361Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:132996Issues:0Issues:0

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:3328Issues:0Issues:0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Language:PythonLicense:MITStargazers:5656Issues:0Issues:0

idea2img

Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation, ECCV 2024

Language:PythonLicense:MITStargazers:13Issues:0Issues:0

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31921Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:3841Issues:0Issues:0

FoodSAM

FoodSAM: Any Food Segmentation

Language:PythonLicense:Apache-2.0Stargazers:141Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:13647Issues:0Issues:0

Video-LLaVA

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:2887Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:26491Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:19606Issues:0Issues:0

DragDiffusion

[CVPR2024, Highlight] Official code for DragDiffusion

Language:PythonLicense:Apache-2.0Stargazers:1141Issues:0Issues:0

MDT

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Language:PythonLicense:Apache-2.0Stargazers:514Issues:0Issues:0

Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

Stargazers:1733Issues:0Issues:0

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25331Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:25425Issues:0Issues:0

Collaborative-Diffusion

[CVPR 2023] Collaborative Diffusion

Language:PythonLicense:NOASSERTIONStargazers:395Issues:0Issues:0

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language:PythonLicense:BSD-3-ClauseStargazers:3228Issues:0Issues:0