Burmese-Table-OCR

💻 Extract text from tables of images. Use OpenCV to detect margin lines and PyTesseract to detect Burmese text. ⌨️

🏷️ To-Do List

Unicode CSV encoding problem - when I try to export csv into google sheets , the font wasn't correct when using with Gspread 'import_csv' function.
- Solution -> open("angel.csv", "r").read().encode("utf8")
Nov 27,2020
- A lot of errors also today. I didn't note down everything but the solved tasks that I remember is
  - Adjusting threshold & minLinLength values to detect the table correctly (it's the most important thing)
  - Append the dictionary according to filenames
  - Generate CSV - row by row
- Overall result is satisfied.
- My code is full of comments & editions. Noone won't be able to understand at the first look.:satisfied:
  - I need to write a blog about this project and also record an explanation video.

Extract text from tables of images. Use OpenCV to detect margin lines and PyTesseract to detect Burmese text.

Language:Python 100.0%