pulak-gautam / spo-performa-scraper

A scraping tool to extract information from performa zip files provided by SPO

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

SPO Performas' Scraper/Organizer


Description

This is a tool for filtering out companies through the pdf performas provided by the Students' Placement Cell (SPO).

Motivation/Need

SPO Portal organizes all the announcements and performas well, and is easy to navigate. It makes easy for students to look out for suitable companies, and roles open for their profile.

After every internship or placement season, SPO makes the performas (containing the job description & other details of a role offered by a company, all in a pdf file) of the companies which participated in the internship/placement season. The organization of the performas is lost and what the students are given is a zipped file containing pdf files which is difficult to navigate.

This was mostly created out of a personal need, since I was very frustrated through the task of scrounging through hundreds of pdf files and noting which were open for my branch/degree or profile.

Functions

The most common use-case of sorting out / listing companies according to branch/degree has been automated through this tool.

About

A scraping tool to extract information from performa zip files provided by SPO


Languages

Language:Python 100.0%