GoogleChrome / rendertron

A Headless Chrome rendering solution

Home Page:https://render-tron.appspot.com/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Question: What is the reason of cutting pathname in rendertron which is used in <base> as a page default URL

vcernomschi opened this issue · comments

Could you please help to understand why rendertron cuts page URL which is used in tag as a page default URL ?

Reference: https://github.com/GoogleChrome/rendertron/blob/main/src/renderer.ts#L226

I ams asking as it leads to invalid behaviour when crawler tries to check empty link defined as <a href="#">Empty link URL</a>.

For example, potentially for the following URL: "https://developer.mozilla.org/en-US/docs/Web/API/URL" it will add the following base tag (which in my specific case leads to "Not Found" resource):

<base href="https://developer.mozilla.org/en-US/docs/Web/API">

Just wondering if it is only applied for use case when the pathname ends with slash "/".

I'm having the same issue.

In general, as we're deprecating the project, you should look into alternative approaches to rendering on the web.