Li Zhang's starred repositories

Efficient-Multimodal-LLMs-Survey

Efficient Multimodal Large Language Models: A Survey

License: Apache-2.0 · Stargazers: 224 · Issues: 0

remove-bg

Remove background directly in your browser, powered by WebGPU

Language: TypeScript · Stargazers: 368 · Issues: 0

13ft

My own custom 12ft.io replacement

Language: Python · License: MIT · Stargazers: 2557 · Issues: 0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python · License: BSD-2-Clause · Stargazers: 10703 · Issues: 0

brutal

A neobrutalism Astro starter theme

Language: Astro · License: MIT · Stargazers: 297 · Issues: 0

squoosh

Make images smaller using best-in-class codecs, right in the browser.

Language: TypeScript · License: Apache-2.0 · Stargazers: 21635 · Issues: 0

L1B3RT45

JAILBREAK PROMPTS FOR ALL MAJOR AI MODELS

License: AGPL-3.0 · Stargazers: 2370 · Issues: 0

Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language: Python · License: NOASSERTION · Stargazers: 2993 · Issues: 0

orbit

First framework to create radial interfaces using only CSS.

Language: SCSS · License: MIT · Stargazers: 273 · Issues: 0

RMS-Runtime-Mobile-Security

Runtime Mobile Security (RMS) 📱🔥 is a powerful web interface that helps you manipulate Android and iOS apps at runtime

Language: JavaScript · License: GPL-3.0 · Stargazers: 2560 · Issues: 0

buster

Captcha solver extension for humans, available for Chrome, Edge and Firefox

Language: JavaScript · License: GPL-3.0 · Stargazers: 7760 · Issues: 0

nopecha-extension

Automated CAPTCHA solver for your browser. Works with Selenium, Puppeteer, Playwright, and more.

Language: JavaScript · License: MIT · Stargazers: 6293 · Issues: 0

OpenVoice

Instant voice cloning by MIT and MyShell.

Language: Python · License: MIT · Stargazers: 28196 · Issues: 0

PasteBarApp

PasteBar - Limitless, Free Clipboard Manager for Mac and Windows

Language: TypeScript · License: NOASSERTION · Stargazers: 551 · Issues: 0

metube

Self-hosted YouTube downloader (web UI for youtube-dl / yt-dlp)

Language: Python · License: AGPL-3.0 · Stargazers: 5995 · Issues: 0

astro-theme-typography

Rediscover the beauty of typography.

Language: Astro · License: MIT · Stargazers: 240 · Issues: 0

Astro-Theme-Creek

A theme for Astro

Language: Astro · License: MIT · Stargazers: 220 · Issues: 0

Transcribro

Private and on-device speech recognition keyboard and service for Android.

Language: Kotlin · License: ISC · Stargazers: 360 · Issues: 0

astro-theme-mia

A minimalist, powerful astro theme with integrated rough-notation for engaging, informative content.

Language: Astro · License: MIT · Stargazers: 72 · Issues: 0

ripgrep-all

rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.

Language: Rust · License: NOASSERTION · Stargazers: 6517 · Issues: 0

rudroid

Rudroid - Writing the World's worst Android Emulator in Rust 🦀

Language: Rust · License: MIT · Stargazers: 146 · Issues: 0

OpenAdapt

AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

Language: Python · License: MIT · Stargazers: 839 · Issues: 0

screen_qa

The ScreenQA dataset was introduced in the paper "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots". It contains ~86K question-answer pairs collected by human annotators for ~35K screenshots from Rico, and is intended for training and evaluating models that understand screen content via question answering.

License: CC-BY-4.0 · Stargazers: 84 · Issues: 0

ActivityLauncher

Activity launcher creates shortcuts for any installed app and hidden activities to launch them with ease

Language: Kotlin · License: ISC · Stargazers: 843 · Issues: 0

lossless-cut

The swiss army knife of lossless video/audio editing

Language: TypeScript · License: GPL-2.0 · Stargazers: 25811 · Issues: 0

Android-Emulator-image

This Docker image simplifies running an Android emulator inside a Docker container

Language: Shell · License: MIT · Stargazers: 92 · Issues: 0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language: Python · License: Apache-2.0 · Stargazers: 11362 · Issues: 0

Language: Python · License: NOASSERTION · Stargazers: 8285 · Issues: 0

ShortcutsBench

ShortcutsBench: A Large-Scale Real-World Benchmark for API-Based Agents

Language: Python · License: Apache-2.0 · Stargazers: 70 · Issues: 0

Agent-Eval-Refine

Code for the paper "Autonomous Evaluation and Refinement of Digital Agents"

Language: Python · License: BSD-3-Clause · Stargazers: 78 · Issues: 0