thiswillbeyourgithub / LogseqPDFImporter

Import pdf into logseq but also import annotations made from other softwares

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

LogseqPDFImporter

Import PDF into logseq but also import annotations made from other softwares.

Status

  • work in progress
  • The text highlights are correctly parsed.
  • Other type of annotation (lines, shapes, rectangles, etc) are parsed as "area highlight" (open an issue if something goes wrong). The area is currently only one rectangle that surrounds the whole area, I have yet to code the exact rectangle geometry extractions (help welcome!)
  • Colors are correctly matched to logseq's available colors.
  • Creates both the .md and .edn files, as well as images of area highlights.

TODO (please help)

  • fix the text annotation by using small rectangles that cover exactly the text instead of one large overlapping area over the whole text

Usage

  • python -m pip install -r requirements.txt
  • python LogseqPDFImporter.py path_to_pdf --md_path path_to_md --edn_path path_to_edn

Example

1

2

credits

About

Import pdf into logseq but also import annotations made from other softwares

License:GNU General Public License v3.0


Languages

Language:Python 100.0%