Beast code in Giters

Huanxin Yang's starred repositories

fonttools

A library to manipulate font files from Python.

Language:PythonMIT434500

fewshot-font-generation

The unified repository for few-shot font generation methods. This repository includes FUNIT (ICCV'19), DM-Font (ECCV'20), LF-Font (AAAI'21) and MX-Font (ICCV'21).

Language:PythonNOASSERTION21100

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonApache-2.01407500

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookNOASSERTION6835500

Awesome-LLM-Prune

Awesome list for LLM pruning.

16000

HUST-OBC

Oracle Bone Script data collected by VLRLab of HUST

Language:Python3000

Open-Oracle

AI-assisted Deciphering Oracle Bone Script

3600

ToC3D

[ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression

Language:PythonNOASSERTION3700

CrowdCLIP

[CVPR 2023] CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model

Language:Jupyter Notebook7300

generative-models

Generative Models by Stability AI

Language:PythonMIT2460300

Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Language:Python52500

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.02619000

animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

Language:PythonMIT78300

ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Language:PythonApache-2.0139800

CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language:PythonApache-2.0905000

attention-map

🚀 Cross attention map tools for huggingface/diffusers

Language:PythonMIT14900

MFH

This project is for MFH:Marrying Frequency Domain with Handwritten Mathematical Expression Recognition. We implement our method based on CoMER.

Language:Python300

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonMIT131300

FreeMask

[NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models

Language:PythonMIT12900

Visualizer

assistant tools for attention visualization in deep learning

Language:Jupyter NotebookApache-2.0100800

Uni-ControlNet

[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Language:PythonMIT60900

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Language:PythonMIT600400

freecontrol

Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"

Language:Python44300

CSRNet-pytorch

CSRNet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes

Language:Jupyter Notebook65400

Atlantis

Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion (CVPR2024, Highlight)

Language:PythonMIT7400

Awesome-Crowd-Counting

Awesome Crowd Counting

240600

DAPT

[CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis

Language:PythonApache-2.017500

PointMamba

[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis

Language:PythonApache-2.035500

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Language:PythonNOASSERTION2308200

Case-Sensitive-Scene-Text-Recognition-Datasets

This dataset contains re-annotations of 4 popular Latin/English scene text recognition datasets.

4900