ayumiymk

Mingkun Yang's starred repositories

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookApache-2.028761 312 47

maybe

The OS for your personal finances

Language:RubyAGPL-3.026566 137 196

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonApache-2.013786 105 869

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonMIT10085 148 142

mamba

Language:PythonApache-2.09380 97 273

surya

OCR, layout analysis, reading order, line detection in 90+ languages

Language:PythonGPL-3.05654 62 54

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION5078 44 68

aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Language:PythonApache-2.04782 43 977

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonApache-2.03764 110 109

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Language:PythonApache-2.03738 52 79

awesome-productivity-cn

绝妙的个人生产力（Awesome Productivity - Chinese version）

CC0-1.02573 47 5

Vim

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:Python2067 26 70

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language:Jupyter NotebookApache-2.01747 22 61

Vary

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language:Python1550 52 104

VMamba

VMamba: Visual State Space Models，code is based on mamba

Language:Python1444 15 172

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据！

Language:PythonApache-2.01319 12 118

GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Language:PythonMIT882 41 19

Uformer

[CVPR 2022] Official implementation of the paper "Uformer: A General U-Shaped Transformer for Image Restoration".

Language:PythonMIT733 12 76

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language:PythonApache-2.0702 8 18

Vary-toy

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Language:Python530 12 29

DCNv4

[CVPR 2024] Deformable Convolution v4

Language:PythonMIT324 3 41

FiT

FiT: Flexible Vision Transformer for Diffusion Model

Apache-2.0321 26 4

MambaTransformer

Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling

Language:PythonMIT120 3 3

RevisitingCIL

The code repository for "Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need" in PyTorch.

Language:Python90 2 8

outfit-anyone

About Project Page for Outfit Anyone

Language:JavaScript86 15 1

meta-prompts

Language:PythonMIT57 4 10

ITER

PyTorch codes for "Iterative Token Evaluation and Refinement for Real-World Super-Resolution", AAAI 2024

NOASSERTION39 4 1

DeepEraser

The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”.

Language:Python16 3 2

CAM

Language:Python15 2 1

catvision

A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the performance of the open-source model Qwen-VL-7B-Chat.

Language:Python14 2 1