qi (ListenQ)

User data from GitHub: https://github.com/ListenQ

Company: 深圳三代人有限公司

Location: 华中科技大厦, Nanshan District, Shenzhen

GitHub: @ListenQ

qi's starred repositories

retrofit-spring-boot-starter

A Spring Boot starter for Retrofit that supports rapid integration and feature enhancements.

Language: Java | License: Apache-2.0 | Stargazers: 1900 | Issues: 0

FastEdit

🩹Editing large language models within 10 seconds⚡

Language: Python | License: Apache-2.0 | Stargazers: 1352 | Issues: 0

Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.

License: MIT | Stargazers: 3275 | Issues: 0

mindsdb

Federated query engine for AI - The only MCP Server you'll ever need

Language: Python | License: NOASSERTION | Stargazers: 37309 | Issues: 0

kspider

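Kspider is a crawler platform that defines crawler workflows graphically, so a crawler workflow can be built without writing code. Kspider is not limited to crawling; it can also be used for web automation testing, with more features left to explore.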

Language: Java | License: MIT | Stargazers: 1285 | Issues: 0

LabelLLM

The Open-Source Data Annotation Platform

Language: TypeScript | License: Apache-2.0 | Stargazers: 960 | Issues: 0

labelU

Data annotation toolbox that supports image, audio and video data.

Language: Python | License: Apache-2.0 | Stargazers: 1416 | Issues: 0

opencv

Open Source Computer Vision Library

Language: C++ | License: Apache-2.0 | Stargazers: 84889 | Issues: 0

SenseVoice

Multilingual Voice Understanding Model

Language: Python | License: NOASSERTION | Stargazers: 6945 | Issues: 0

label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

Language: JavaScript | License: Apache-2.0 | Stargazers: 25439 | Issues: 0

tika

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Language: Java | License: Apache-2.0 | Stargazers: 3415 | Issues: 0

data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Language: Python | License: Apache-2.0 | Stargazers: 5519 | Issues: 0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python | License: Apache-2.0 | Stargazers: 6294 | Issues: 0

dingo

Dingo: A Comprehensive AI Data Quality Evaluation Tool

Language: JavaScript | License: Apache-2.0 | Stargazers: 552 | Issues: 0

doccano

Open source annotation tool for machine learning practitioners.

Language: Python | License: MIT | Stargazers: 10390 | Issues: 0

cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Language: Python | License: MIT | Stargazers: 14755 | Issues: 0

labelme

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Language: Python | License: GPL-3.0 | Stargazers: 15254 | Issues: 0

video-analyzer

Analyze videos using LLMs, Computer Vision and Automatic Speech Recognition

Language: Python | License: Apache-2.0 | Stargazers: 1131 | Issues: 0

EasySpider

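A visual no-code/code-free web crawler/spider. EasySpider (易采集) is visual software for browser automation testing, data collection, and crawling that lets you design and run crawler tasks graphically without code. Alias: ServiceWrapper, an intelligent service wrapping system for web applications.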

Language: JavaScript | License: AGPL-3.0 | Stargazers: 43402 | Issues: 0

crawlee

Crawlee: A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

Language: TypeScript | License: Apache-2.0 | Stargazers: 20527 | Issues: 0

crawl4ai

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Language: Python | License: Apache-2.0 | Stargazers: 55808 | Issues: 0

feapder

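🚀🚀🚀 feapder is an easy-to-use, powerful Python crawler framework. It ships with four built-in spiders (AirSpider, Spider, TaskSpider, and BatchSpider) to cover different scenarios, and supports resumable crawling, monitoring and alerting, browser rendering, and large-scale data deduplication. The crawler management system feaplat provides convenient deployment and scheduling for it.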

Language: Python | License: NOASSERTION | Stargazers: 3471 | Issues: 0

scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Language: Python | License: BSD-3-Clause | Stargazers: 58946 | Issues: 0

pyspider

A Powerful Spider (Web Crawler) System in Python.

Language: Python | License: Apache-2.0 | Stargazers: 16968 | Issues: 0

webmagic

A scalable web crawler framework for Java.

Language: Java | License: Apache-2.0 | Stargazers: 11662 | Issues: 0

cgft-llm

Practice to LLM.

Language: Jupyter Notebook | License: MIT | Stargazers: 2029 | Issues: 0

ApeRAG

ApeRAG: Production-ready GraphRAG with multi-modal indexing, AI agents, MCP support, and scalable K8s deployment

Language: Python | License: Apache-2.0 | Stargazers: 925 | Issues: 0

awesome-public-datasets

A topic-centric list of HQ open datasets.

License: MIT | Stargazers: 70466 | Issues: 0

easy-dataset

A powerful tool for creating fine-tuning datasets for LLM

Language: JavaScript | License: NOASSERTION | Stargazers: 11812 | Issues: 0

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 62495 | Issues: 0