Mingkun Yang (ayumiymk)

Company: Huazhong University of Science and Technology

Location: Luoyu Road 1037, Wuhan, China

Home Page: https://scholar.google.com/citations?user=3EfF1qgAAAAJ&hl=en

Mingkun Yang's starred repositories

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 28761, Issues: 312, Issues: 47

maybe

The OS for your personal finances

Language: Ruby, License: AGPL-3.0, Stargazers: 26566, Issues: 137, Issues: 196

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python, License: Apache-2.0, Stargazers: 13786, Issues: 105, Issues: 869

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python, License: MIT, Stargazers: 10085, Issues: 148, Issues: 142

Language: Python, License: Apache-2.0, Stargazers: 9380, Issues: 97, Issues: 273

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language: Python, License: GPL-3.0, Stargazers: 5654, Issues: 62, Issues: 54

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python, License: NOASSERTION, Stargazers: 5078, Issues: 44, Issues: 68

aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Language: Python, License: Apache-2.0, Stargazers: 4782, Issues: 43, Issues: 977

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python, License: Apache-2.0, Stargazers: 3764, Issues: 110, Issues: 109

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language: Python, License: Apache-2.0, Stargazers: 3738, Issues: 52, Issues: 79

awesome-productivity-cn

Awesome personal productivity (Awesome Productivity - Chinese version)

Vim

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 1747, Issues: 22, Issues: 61

Vary

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

VMamba

VMamba: Visual State Space Models; the code is based on Mamba

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python, License: Apache-2.0, Stargazers: 1319, Issues: 12, Issues: 118

GLEE

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Language: Python, License: MIT, Stargazers: 882, Issues: 41, Issues: 19

Uformer

[CVPR 2022] Official implementation of the paper "Uformer: A General U-Shaped Transformer for Image Restoration".

Language: Python, License: MIT, Stargazers: 733, Issues: 12, Issues: 76

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language: Python, License: Apache-2.0, Stargazers: 702, Issues: 8, Issues: 18

Vary-toy

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

DCNv4

[CVPR 2024] Deformable Convolution v4

Language: Python, License: MIT, Stargazers: 324, Issues: 3, Issues: 41

FiT

FiT: Flexible Vision Transformer for Diffusion Model

MambaTransformer

Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling

Language: Python, License: MIT, Stargazers: 120, Issues: 3, Issues: 3

RevisitingCIL

The code repository for "Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need" in PyTorch.

outfit-anyone

Project page for Outfit Anyone

ITER

PyTorch code for "Iterative Token Evaluation and Refinement for Real-World Super-Resolution", AAAI 2024

DeepEraser

The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”.

catvision

A large multimodal model that performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the open-source Qwen-VL-7B-Chat.