emreyesilyurt / regex-dataset-cleaning

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Introduction

This Python script provides functions to clean columns in a pandas DataFrame using regular expressions. It includes functions to remove - Numeric Characters - Unknown Information Indicator-Related Strings - Includes Slash - Suffix Variation - Extra Special Characters - HTML Tags - Website Related Suffixes - Extra Quotation Marks

Dependencies

  • pandas
  • cleanco

You can install the required dependencies using pip:

pip install pandas cleanco
pip install pandas pandas

About


Languages

Language:Python 100.0%