SumaiyaShaikh / Web_search_indexing

A simple processing pipeline that turns a Website into structured knowledge. The system takes HTML pages as input, process them one at a time and output an index of terms identified in the documents.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Web_search_indexing

A simple processing pipeline that turns a Website into structured knowledge. The system takes HTML pages as input, process them one at a time and output an index of terms identified in the documents.

About

A simple processing pipeline that turns a Website into structured knowledge. The system takes HTML pages as input, process them one at a time and output an index of terms identified in the documents.


Languages

Language:Python 100.0%