You can use the python script to transfer pdf, docx and other document to paragraph for transfer them to embedding vector.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool