raphaelquintao / QSE

A search engine written for academic purposes.

QSE

A search engine written for academic purposes.

  • Implemented:
    • Indexer
    • Search
    • Web Interface
  • Not fully implemented:
    • Nothing!

Requirements

Usage - Indexer

QSE-Indexer.exe -d foo/bar/ -n corpus [-s] [-a] [-r] [-t] [-v] [-h]

Parameter  Description
-d         Base directory of the collection.
-n         Database name.
-s         Maximum amount of raw data to read before flushing (default: 10MB).
-a         Indicates that the directory contains raw HTML files. If this option is not set, the directory is expected to contain the data in a binary file, with index.txt as the reference.
-r         Read the collection recursively. Use only together with -a.
-t         Use Tidy as the HTML parser, which drastically increases time and memory usage. The default parser is HTMLCXX, which is faster but gives worse results.
-v         Show version and about information.
-h         Show help.
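
The rules in the parameter list can be sketched as a small argument parser. This is only an illustration under stated assumptions, not the actual QSE-Indexer code: the names IndexerOptions and parse_indexer_args are hypothetical, and the sketch assumes that -s takes a numeric value in megabytes, which the usage line above does not show explicitly.

```cpp
#include <cstddef>
#include <optional>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical container for the documented options; the field names are
// illustrative and are not taken from the QSE sources.
struct IndexerOptions {
    std::string base_dir;        // -d: collection base directory
    std::string db_name;         // -n: database name
    std::size_t flush_mb = 10;   // -s: assumed to be a value in MB (documented default: 10MB)
    bool raw_html = false;       // -a: directory contains raw HTML files
    bool recursive = false;      // -r: read recursively (only valid with -a)
    bool use_tidy = false;       // -t: use Tidy instead of the default HTMLCXX
    bool show_version = false;   // -v
    bool show_help = false;      // -h
};

// Parses the arguments that follow the program name.
// Returns std::nullopt for unknown flags, missing values, or invalid combinations.
std::optional<IndexerOptions> parse_indexer_args(const std::vector<std::string>& args) {
    IndexerOptions opts;
    for (std::size_t i = 0; i < args.size(); ++i) {
        const std::string& flag = args[i];
        if (flag == "-d" || flag == "-n" || flag == "-s") {
            // These flags consume the next argument as their value.
            if (i + 1 >= args.size()) return std::nullopt;
            const std::string& value = args[++i];
            if (flag == "-d") {
                opts.base_dir = value;
            } else if (flag == "-n") {
                opts.db_name = value;
            } else {
                try {
                    opts.flush_mb = std::stoul(value);
                } catch (const std::exception&) {
                    return std::nullopt;  // value was not a usable number
                }
            }
        } else if (flag == "-a") {
            opts.raw_html = true;
        } else if (flag == "-r") {
            opts.recursive = true;
        } else if (flag == "-t") {
            opts.use_tidy = true;
        } else if (flag == "-v") {
            opts.show_version = true;
        } else if (flag == "-h") {
            opts.show_help = true;
        } else {
            return std::nullopt;  // unknown flag
        }
    }
    // Help and version requests do not need a collection or database.
    if (opts.show_help || opts.show_version) return opts;
    // -d and -n are the only non-optional parameters in the usage line.
    if (opts.base_dir.empty() || opts.db_name.empty()) return std::nullopt;
    // The parameter list says -r is only meaningful together with -a.
    if (opts.recursive && !opts.raw_html) return std::nullopt;
    return opts;
}
```

With the arguments from the usage line (-d foo/bar/ -n corpus) and no optional flags, this sketch leaves flush_mb at 10 and raw_html unset, which matches the documented defaults: a binary collection with index.txt as reference, parsed with HTMLCXX.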

Languages

C++ 60.2%, C 15.1%, JavaScript 14.1%, HTML 7.8%, CMake 2.8%