Etpoem's starred repositories

aria2

aria2 is a lightweight, multi-protocol, multi-source, cross-platform download utility operated from the command line. It supports HTTP/HTTPS, FTP, SFTP, BitTorrent and Metalink.

Language: C++ | License: GPL-2.0 | Stargazers: 35356 | Issues: 738 | Issues: 1848

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | Stargazers: 31282 | Issues: 184 | Issues: 524

Umi-OCR

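Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and QR code scanning and generation. Multi-language libraries are built in.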

Language: Python | License: MIT | Stargazers: 26170 | Issues: 140 | Issues: 563

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 19633 | Issues: 158 | Issues: 1497

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 7542 | Issues: 65 | Issues: 189

SUPIR

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

Language: Python | License: NOASSERTION | Stargazers: 4276 | Issues: 67 | Issues: 139

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language: Python | License: Apache-2.0 | Stargazers: 4225 | Issues: 50 | Issues: 97

recognize-anything

Open-source and strong foundation image recognition models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2778 | Issues: 26 | Issues: 157

mingw-builds-binaries

MinGW-W64 compiler binaries

Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

SCrawler

🏳️‍🌈 Media downloader from any sites, including Twitter, Reddit, Instagram, Threads, Facebook, OnlyFans, YouTube, Pinterest, PornHub, XHamster, XVIDEOS, ThisVid etc.

Language: Visual Basic .NET | License: GPL-3.0 | Stargazers: 1257 | Issues: 22 | Issues: 138

chalk.ist

📷 Create beautiful images of your source code

Language: Vue | License: MIT | Stargazers: 948 | Issues: 5 | Issues: 25

PASD

[ECCV2024] Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization

Language: Python | License: Apache-2.0 | Stargazers: 876 | Issues: 10 | Issues: 68

ResShift

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)

Language: Python | License: NOASSERTION | Stargazers: 876 | Issues: 15 | Issues: 92

WeChatQRCode

⛄ A QR code scanning and recognition library ported from the WeChat QR code engine open-sourced in OpenCV

Language: C++ | License: Apache-2.0 | Stargazers: 618 | Issues: 9 | Issues: 52

VSET

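A VapourSynth-based GUI tool for batch video encoding and processing, with super-resolution, frame interpolation, and a full set of VapourSynth filters.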

Language: Python | License: GPL-3.0 | Stargazers: 578 | Issues: 4 | Issues: 4

SeeSR

[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution

Language: Python | License: Apache-2.0 | Stargazers: 393 | Issues: 7 | Issues: 66

streetview

Python package for retrieving current and historical photos from Google Street View

CLoT

CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation".

pybind11-Chinese-docs

pybind11 Chinese documentation (personal translation)

MARCONet

Learning Generative Structure Prior for Blind Text Image Super-resolution [CVPR 2023]

Language: Python | License: NOASSERTION | Stargazers: 192 | Issues: 5 | Issues: 22

CapsFusion

[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale

CLIPPyX

AI-powered image search tool offering system-wide content-based, text, and visual similarity search.

Language: Python | License: MIT | Stargazers: 129 | Issues: 6 | Issues: 8

Phantom

Repository for DreaMoving-Phantom (https://www.modelscope.cn/studios/vigen/DreaMoving_Phantom/summary). DreaMoving-Phantom is a general and automatic image enhancement and super-resolution framework.

Language: Python | License: Apache-2.0 | Stargazers: 125 | Issues: 3 | Issues: 4

android-sdk-image

Docker image for Android SDK builds

Language: Dockerfile | License: MIT | Stargazers: 45 | Issues: 3 | Issues: 6

QA-CLIP

Chinese CLIP models with SOTA performance.

Language: Python | License: Apache-2.0 | Stargazers: 44 | Issues: 3 | Issues: 2

ID-Card-Passport-Recognition-SDK-Linux

Robust ID card, passport, and driver's license OCR SDK for Linux

Language: Python | Stargazers: 41 | Issues: 12 | Issues: 0

Face-Recognition-SDK-Linux

Fast, Accurate, Mask-Aware Face Recognition SDK with Liveness Detection

Language: C | Stargazers: 2 | Issues: 0 | Issues: 0