Ajinkgupta / khoj-Ai-Search-Engine

Khoj is a web-based search engine built using PHP, MySQL, and Ajax. It includes a web crawler that can index websites .

Home Page:https://github.com/Ajinkgupta/khoj-Ai-Search-Engine

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How the crawler works?

asac1995 opened this issue · comments

Hello, how are websites indexed? Is there a predefined site list? Isn't there an option for visitors to add their sites to be indexed?

For visitors, there is an option to submit their website via the spider: http://khoj.wexr.site/spider/

However, in the current demo version, I have added a simple protection to prevent others from entering their website.

you can find it from sourcecode in spider folder !!
As of now this project is not meant for production !

image