There are 3 repositories under the hocr-documents topic.
A Gtk/Qt front-end to tesseract-ocr.
Resource repository for Document Layout Analysis development with PdfPig.
Python package for combining .hocr files and images into searchable PDFs
Python parser for hOCR files using lxml
Quick and dirty visualization of hOCR bboxes on a page
Sample code using tesseract-ocr with .NET Core for optical character recognition. The result is formatted as HTML.