images in pulled book are not showing

Question

images in pulled book are not showing

digitalw00t opened this issue 3 years ago · comments

All images in the books I pull are not in the epub. I do see in the same directory an OEBPS folder, with an images folder in there. And I do see images in there. All the images are 30k in size, and the ubutnu image viewer says they are all an "unknown" image type.

V.It · Answer 1 · Mon Dec 13 2021 21:44:13 GMT+0800 (China Standard Time)

The parameter received asset_base_url is outdated, and that's the reason the images are not being found. It is a simple change in the script to fix it: just need to get the new URL (open a book in the website, right-click on it then copy image URL) and replace in code the usage of asset_base_url by the new format.

RenanSPLopes · Answer 2 · Wed Dec 15 2021 20:34:26 GMT+0800 (China Standard Time)

As said by @victormeloufrgs changing the script in this way solve the problem, is not ideal but helps:

            new_base_url = "https://learning.oreilly.com/api/v2/epubs/urn:orm:book:9781492086888/files/assets"
            if "images" in next_chapter and len(next_chapter["images"]):
                self.images.extend(urljoin(new_base_url, img_url)
                                   for img_url in next_chapter['images'])

You only need to change the id of the book in the URL to the id that you wants

Andrew Falgout · Answer 3 · Thu Dec 16 2021 23:21:10 GMT+0800 (China Standard Time)

I'll do some testing and see if this fixes it.

…

On Wed, Dec 15, 2021 at 6:34 AM RenanSPLopes ***@***.***> wrote: As said by @victormeloufrgs <https://github.com/victormeloufrgs> changing the script in this way solve the problem, is not ideal but helps: new_base_url = "https://learning.oreilly.com/api/v2/epubs/urn:orm:book:9781492086888/files/assets" if "images" in next_chapter and len(next_chapter["images"]): self.images.extend(urljoin(new_base_url, img_url) for img_url in next_chapter['images']) You only need to change the id of the book in the URL to the id that you wants — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#302 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAEQZFODB7EOIMOFBGYOLKTURCDN5ANCNFSM5JVRUM5A> . Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.

Bipin Nag · Answer 4 · Sun Jan 09 2022 01:23:55 GMT+0800 (China Standard Time)

My working solution based on snippet from @RenanSPLopes
asset_base_url was slightly different and needed to sub book_id as param

# Images
asset_base_url = "https://learning.oreilly.com/api/v2/epubs/urn:orm:book:%s/files/" % self.book_id
if "images" in next_chapter and len(next_chapter["images"]):
    self.images.extend(urljoin(asset_base_url, img_url)
                        for img_url in next_chapter['images'])

Andrew Falgout · Answer 5 · Tue Feb 15 2022 13:30:14 GMT+0800 (China Standard Time)

Looks like images are in there, having another issue. I'll put in a seperate issue report fo it. Closing this one. Thanks for the assist.