There are 15 repositories under xpath topic.
Agent for collecting, processing, aggregating, and writing metrics, logs, and other arbitrary data.
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
Html Agility Pack (HAP) is a free and open-source HTML parser written in C# to read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
Command-line XML and HTML beautifier and content extractor
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章
Finally, a JSONPath implementation for Python that aims to be standard compliant. That's all. Enjoy!
camaro is a Node.js library that transform XML to JSON, using Node.js binding to native XML parser pugixml, one of the fastest XML parser around.
纯Java实现的支持W3C Xpath 1.0标准语法的HTML解析器。A html parser with xpath base on Jsoup and Antlr4. Maybe it is the best in java.Just try it.
dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators
Site-specific article extraction rules to aid content extractors, feed readers, and 'read later' applications.
High-performance HTML5 parser for Ruby based on Lexbor, with support for both CSS selectors and XPath.
A fluent api for working with XML in PHP
An Elixir library for parsing and extracting data from HTML and XML with CSS or XPath selectors.
Undetected web-scraping & seamless HTML parsing in Python!