Huanxin Yang's starred repositories

fonttools

A library to manipulate font files from Python.

Language: Python | License: MIT | Stargazers: 4345 | Issues: 0

fewshot-font-generation

The unified repository for few-shot font generation methods. This repository includes FUNIT (ICCV'19), DM-Font (ECCV'20), LF-Font (AAAI'21) and MX-Font (ICCV'21).

Language: Python | License: NOASSERTION | Stargazers: 211 | Issues: 0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 14075 | Issues: 0

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 68355 | Issues: 0

Awesome-LLM-Prune

Awesome list for LLM pruning.

Stargazers: 160 | Issues: 0

HUST-OBC

Oracle Bone Script data collected by VLRLab of HUST

Language: Python | Stargazers: 30 | Issues: 0

Open-Oracle

AI-assisted Deciphering Oracle Bone Script

Stargazers: 36 | Issues: 0

ToC3D

[ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression

Language: Python | License: NOASSERTION | Stargazers: 37 | Issues: 0

CrowdCLIP

[CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model

Language: Jupyter Notebook | Stargazers: 73 | Issues: 0

generative-models

Generative Models by Stability AI

Language: Python | License: MIT | Stargazers: 24603 | Issues: 0

Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Language: Python | Stargazers: 525 | Issues: 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 26190 | Issues: 0

animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

Language: Python | License: MIT | Stargazers: 783 | Issues: 0

ControlNeXt

Controllable video and image generation: SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Language: Python | License: Apache-2.0 | Stargazers: 1398 | Issues: 0

CogVideo

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language: Python | License: Apache-2.0 | Stargazers: 9050 | Issues: 0

attention-map

🚀 Cross attention map tools for huggingface/diffusers

Language: Python | License: MIT | Stargazers: 149 | Issues: 0

MFH

This project is for MFH: Marrying Frequency Domain with Handwritten Mathematical Expression Recognition. We implement our method based on CoMER.

Language: Python | Stargazers: 3 | Issues: 0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language: Python | License: MIT | Stargazers: 1313 | Issues: 0

FreeMask

[NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models

Language: Python | License: MIT | Stargazers: 129 | Issues: 0

Visualizer

Assistant tools for attention visualization in deep learning

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1008 | Issues: 0

Uni-ControlNet

[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Language: Python | License: MIT | Stargazers: 609 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Language: Python | License: MIT | Stargazers: 6004 | Issues: 0

freecontrol

Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"

Language: Python | Stargazers: 443 | Issues: 0

CSRNet-pytorch

CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes

Language: Jupyter Notebook | Stargazers: 654 | Issues: 0

Atlantis

Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion (CVPR2024, Highlight)

Language: Python | License: MIT | Stargazers: 74 | Issues: 0

Awesome-Crowd-Counting

Awesome Crowd Counting

Stargazers: 2406 | Issues: 0

DAPT

[CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis

Language: Python | License: Apache-2.0 | Stargazers: 175 | Issues: 0

PointMamba

[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis

Language: Python | License: Apache-2.0 | Stargazers: 355 | Issues: 0

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Language: Python | License: NOASSERTION | Stargazers: 23082 | Issues: 0

Case-Sensitive-Scene-Text-Recognition-Datasets

This dataset contains re-annotations of 4 popular Latin/English scene text recognition datasets.

Stargazers: 49 | Issues: 0