Detecting puppeteer-extra-stealth in Headed Mode

Demo

Link: https://serene-benz-a681c4.netlify.app/

Password: superstrong999@#$

Files

Note that index.html is the raw code, while public/index.html is the encrypted code which is exposed in netlify.

Preview

Regular Browser	Puppeteer with Stealth

Signals

Signal 1:

Signal 1 is a low hanging fruit in puppeteer-extra-stealth which has not been worked on yet. See it here.

Signal 2:

Signal 2 is a chromium bug which allows us to skip evaluateOnNewDocument. It's used in Amazon Bot Detection Script (See #injectIframe function)

Signal 3:

Signal 3 is a standard technique used by CreepJS to detect inconsistencies via worker properties.

Signal 4:

Signal 4 is a POC on sniffing Function & Object references in native functions.

Testing

Device / Browser	S1	S2	S3	S4
M1 iMac / Puppeteer 12.0.1 with Stealth 2.9.0	failed	failed	failed	failed
Samsung Galaxy Note 9 / Chrome 96.0.4664.92	passed	passed	passed	passed
Samsung Galaxy Note 9 / Samsung Browser 16.0.2.19	passed	passed	passed	passed
M1 iMac / Chrome 96.0.4664.94	passed	passed	passed	passed
M1 iMac / Safari 15.1	passed	passed	passed	passed
M1 iMac / Firefox 94.02	passed	passed	passed	passed
iPad Pro 2018 / Safari 14	passed	passed	failed	passed
iPad Pro 2018 / Chrome 96.0.4664.94	passed	passed	passed	passed
MBP Pro 15" 2018 / Chrome 96.0.4664.93	passed	passed	passed	passed
MBP Pro 15" 2018 / Brave 1.32.113	passed	passed	passed	passed
iPhone 12 Pro Max / Chrome 96.4664.94	passed	passed	passed	passed
iPhone 12 Pro Max / Safari 14.1.2	passed	passed	passed	passed
iPhone 6S Plus / Safari 11 on iOS 11.4	passed	passed	passed	passed

Note: IPad Pro 2018 / Safari 14 seems to incorrectly fail on S3 because navigator.platform is inconsistent. On that note, CreepJS also has a pretty low trust score for the same device / browser. I'll leave this issue to further investigation in the future.

Methodology

I followed three main branches of approach to find techniques:

Testing each evasion script on commercial bot detectors to find a weak evasion.
Reverse engineering some commercial bot detectors that still has some readability.
Code-reading puppeteer-extra-stealth and back reading github discussions for some unresolved flaws.

Personal Notes

I feel it's a little hard to detect stealth today than last year because they now use Proxy & a clone of Reflect class.
Shape Security bot detection (e.g. nordstrom.com) seems to be the strongest.

mightymercado / bot-detection