crossLi's starred repositories

stable-diffusion-webui

Stable Diffusion web UI

Language:PythonLicense:AGPL-3.0Stargazers:129007Issues:1022Issues:7295

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Language:PythonLicense:NOASSERTIONStargazers:15466Issues:138Issues:610

gpt4-pdf-chatbot-langchain

GPT4 & LangChain Chatbot for large PDF docs

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:9784Issues:103Issues:135

Bob

Bob 是一款 macOS 平台的翻译和 OCR 软件。

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

Language:PythonLicense:MITStargazers:7981Issues:67Issues:184

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

captum

Model interpretability and understanding for PyTorch

Language:PythonLicense:BSD-3-ClauseStargazers:4556Issues:239Issues:513

Firefly

Firefly: 大模型训练工具,支持训练Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

mind-map

一个还算强大的Web思维导图。A relatively powerful web mind map.

Language:VueLicense:MITStargazers:3709Issues:24Issues:560

CPM-Bee

百亿参数的中英文双语基座大模型

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:2511Issues:28Issues:339

MedSAM

Segment Anything in Medical Images

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2163Issues:18Issues:201

IQA-PyTorch

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, NIMA, DBCNN, WaDIQaM, BRISQUE, PI and more...

Language:PythonLicense:NOASSERTIONStargazers:1427Issues:16Issues:117

ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Language:PythonLicense:Apache-2.0Stargazers:1351Issues:17Issues:202

Med-ChatGLM

Repo for Chinese Medical ChatGLM 基于中文医学知识的ChatGLM指令微调

Language:PythonLicense:Apache-2.0Stargazers:909Issues:11Issues:56

Medical-SAM-Adapter

Adapting Segment Anything Model for Medical Image Segmentation

Language:PythonLicense:GPL-3.0Stargazers:819Issues:10Issues:81

Awesome-diffusion-model-for-image-processing

one summary of diffusion-based image processing, including restoration, enhancement, coding, quality assessment

RapidASR

商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.

Language:C++License:MITStargazers:402Issues:16Issues:25

LiLT

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Language:PythonLicense:MITStargazers:317Issues:6Issues:46

FudanOCR

A toolbox of scene text super-resolution and recognition

DatasetDM

[NeurIPS2023] DatasetDM:Synthesizing Data with Perception Annotations Using Diffusion Models

DocDiff

ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.

Language:PythonLicense:MITStargazers:167Issues:4Issues:23

ncnn_paddleocr

Android paddleocr demo infer by ncnn

colornamer

Given a color, return a hierarchy of names.

Language:PythonLicense:Apache-2.0Stargazers:83Issues:10Issues:1

docdiff

Compares two text files by word, by character, or by line

Language:RubyStargazers:54Issues:0Issues:0

DPMN

Improving Scene Text Image Super-Resolution via Dual Prior Modulation Network (AAAI 2023)

Language:PythonLicense:MITStargazers:39Issues:2Issues:8

OphGLM

The first ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue