shrivastava95 / docparser

A multilingual document parser that processes PDFs. Built using Google's open source Tesseract OCR, and OpenAI's CLIP (Contrastive Language Image Pretraining).

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

shrivastava95/docparser Stargazers