SrikantShubam / WebSite-Classifier

WebSite-Classifier is a machine learning project that classifies websites based on their content. The repository includes code for web scraping, data cleaning, feature extraction, and model building using scikit-learn and pandas. In addition, the project includes a Flask module for deploying the model.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

πŸ“š WebSite-Classifier 🌐

Welcome to the WebSite-Classifier repository! This is a machine learning project that can classify websites into different categories based on their content. This repository contains the code and necessary files to build and train a website classifier model.

πŸ“‹ Table of Contents

  • Installation
  • Usage
  • Dataset
  • Model Training
  • Contributing
  • License

πŸ’» Installation

To use this project, you will need to install the necessary dependencies. You can do this using pip:

pip install -r requirements.txt

πŸš€ Usage

The main script for this project is classify.py. To use it, simply run the following command:

python classify.py <url>

πŸ“Š Dataset

The dataset used for training this model is not included in this repository. However, you can find the dataset here

πŸ€– Model Training

To train the website classifier model, you can run the train.py script. This will use the dataset to train a machine learning model and save it to disk.

πŸš€ Flask Deployment

This project also includes a Flask web application for deploying the website classifier model. To run the Flask app, execute the following commands:

export FLASK_APP=app.py
flask run

The app will be available at http://127.0.0.1:5000/. You can enter a website URL and click the "Classify" button to classify the website into one of several categories.

🀝 Contributing

Contributions to this project are welcome! If you would like to contribute, please create a pull request.

πŸ“„ License

This project is licensed under the MIT License. See the LICENSE file for more information.

Thank you for visiting the WebSite-Classifier repository! If you have any questions or suggestions, please feel free to open an issue.

About

WebSite-Classifier is a machine learning project that classifies websites based on their content. The repository includes code for web scraping, data cleaning, feature extraction, and model building using scikit-learn and pandas. In addition, the project includes a Flask module for deploying the model.

License:MIT License


Languages

Language:CSS 60.6%Language:Python 24.8%Language:HTML 13.9%Language:Dockerfile 0.6%Language:Procfile 0.0%