360fish

360fish

Geek Repo

Github PK Tool:Github PK Tool

360fish's starred repositories

GoogleScraper

A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.

Language:HTMLLicense:Apache-2.0Stargazers:2603Issues:0Issues:0

Monster-Crawler

A Tutorial Showing Scrapy Web Scraping and Data Visulization

Language:PythonStargazers:16Issues:0Issues:0

lemon-agent

Plan-Validate-Solve (PVS) Agent for accurate, reliable and reproducable workflow automation

Language:TypeScriptLicense:MITStargazers:307Issues:0Issues:0

Plan-and-Solve-Prompting

Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models".

Language:PythonStargazers:560Issues:0Issues:0

factory-pattern-vectorstore-interface

A pattern to let you try several vector databases and change a little code as possible

Language:PythonStargazers:34Issues:0Issues:0

hr-gpt

An AI HR Agent who lives in Slack (GPT-powered)

Language:PythonLicense:NOASSERTIONStargazers:58Issues:0Issues:0

Uscrapper

Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.

Language:PythonLicense:MITStargazers:453Issues:0Issues:0

dataflowkit

Extract structured data from web sites. Web sites scraping.

Language:GoLicense:BSD-3-ClauseStargazers:654Issues:0Issues:0

amazon-scraper

A simple web scraper to extract Product Data and Pricing from Amazon

Language:PythonStargazers:316Issues:0Issues:0

crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

Language:TypeScriptLicense:Apache-2.0Stargazers:13816Issues:0Issues:0