CatchTheTornado / text-extract-api

Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

Home Page:https://demo.doctractor.com

Repository from Github https://github.comCatchTheTornado/text-extract-apiRepository from Github https://github.comCatchTheTornado/text-extract-api

CatchTheTornado/text-extract-api Stargazers