anilabhadatta / educative.io_scraper

Educative.io Course Downloader developed using Python and Selenium. Refer Readme.md for setup instructions.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

utf8 Encoding/Decoding error

taranjeetsingh257 opened this issue · comments

I am getting this error in my terminal(after successfully logging in and having a config file):
Exception, Driver exited 'utf-8' codec can't decode byte 0xff in position 0: invalid start byte
Main Exception local variable 'driver' referenced before assignment
Press Enter to continue

Do you know why I am getting this?
I cloned the latest updated repo

show to full terminal output
show me the process of creating config

@anilabhadatta

                    Educative Scraper (version 8.5), developed by Anilabha Datta
                    Project Link: https://github.com/anilabhadatta/educative.io_scraper
                    Please go through the ReadMe for more information about this project.

                    Press 1 and Enter to generate config
                    Press 2 and Enter to select a config [Currently selected config 0]
                    Press 3 and Enter to login Educative
                    Press 4 and Enter to start scraping
                    Press Enter to exit

Enter your choice: 1

    Leave Blank and Press Enter if you don't want to overwrite Previous Values

Enter the URL text file path: /Users/t/Downloads/educative.io_scraper-master/urls.txt
Enter Save Path: /Users/t/Downloads/educative.io_scraper-master/n
Headless T/F? T

                    Educative Scraper (version 8.5), developed by Anilabha Datta
                    Project Link: https://github.com/anilabhadatta/educative.io_scraper
                    Please go through the ReadMe for more information about this project.

                    Press 1 and Enter to generate config
                    Press 2 and Enter to select a config [Currently selected config 0]
                    Press 3 and Enter to login Educative
                    Press 4 and Enter to start scraping
                    Press Enter to exit

Enter your choice: 3

Driver Loaded
Press Enter to return to Main Menu after Login is successfull
Login Success!

                    Educative Scraper (version 8.5), developed by Anilabha Datta
                    Project Link: https://github.com/anilabhadatta/educative.io_scraper
                    Please go through the ReadMe for more information about this project.

                    Press 1 and Enter to generate config
                    Press 2 and Enter to select a config [Currently selected config 0]
                    Press 3 and Enter to login Educative
                    Press 4 and Enter to start scraping
                    Press Enter to exit

Enter your choice: 4

            Scraper Started, Log file can be found in Save directory

Exception, Driver exited 'utf-8' codec can't decode byte 0xff in position 0: invalid start byte
Main Exception local variable 'driver' referenced before assignment
Press Enter to continue

@taranjeetsingh257 what are you pasting in your url.txt file?

@taranjeetsingh257 you are able to open the browser and login right?

@taranjeetsingh257 Show me screenshot after running the scraper . Full terminal output is required

@taranjeetsingh257 This is my output, No problems in my end.

            Scraper Started, Log file can be found in Save directory

Driver Loaded

                        [Selected config: 0] Starting Scraping: 0, https://www.educative.io/courses/operating-systems-virtualization-concurrency-persistence/xl7XB03xZ4J

Load Webpage Function
Checking Login Function
Checking for captcha Function...
Create Course Folder Function
Getting File Name

This is a module page
Get File name module
Inside Course Folder operating systems virtualization concurrency persistence
Checking Login Function
Checking for captcha Function...
Scrolling Page
Getting File Name
This is a module page
Get File name module
Checking page
Removing Unnecessary Tags from page
Node deleted div[class*='ed-grid'] > nav
Node deleted div[aria-label*='Your Privacy']
Node deleted div[id*='view-collection-article-content-root']> :not(#handleArticleScroll) > *
Node deleted div[aria-labelledby*='simple-modal-title']
Adding Style Tag with Filter
Inside find_mark_down_quiz_containers function
No mark down quiz_container found
Inside take_quiz_screenshot function
Quiz not found
Remove Mark completed
Show Solution Function
No Solution found
Finding Slides Function
Slides Found
Show Hints Function
No hints found
Adding Name Tag in Next Back Button
Fixing SVG Tags inside Object Tags
Get HTML Page Content Using Single File Function
make_code_selectable function
make_code_selectable function executed
Creating HTML File
HTML File Created
HTML Page content taken.
Inside Widget Container Function
No widget container found
Code Container Download Type Function
No Code Container Downloadable Type found
Code Container Clipboard Type Function
No code containers found
Next Page Function
Going Next Page
--------------- 0 Complete-------------------

@taranjeetsingh257 I think you are not starting the chromedriver probably. open a separate terminal and run python chromedriver.py
I have mentioned them in Readme. Follow the steps once

@taranjeetsingh257 v3 released, reclone to a new folder and refer readme for setup instructions