abakamousa / Data_Extraction

Dataset generation

Home Page:https://abakamousa.github.io/blog/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

About

This repository presents an example of extraction of data from stackoverflow .

Objective

The objective of this project is to generate a dataset composed of questions, Summaries and answers extracted from stackoverflow. This dataset will help to build a chatbot.

🔧 Techniques used

In this project we used Web scraping, a widely used technique for public data extraction from web pages.

Overview of the generated dataset

image

About

Dataset generation

https://abakamousa.github.io/blog/


Languages

Language:Jupyter Notebook 100.0%