There are 0 repository under csv-processing topic.
Data-Splitter is a Python script designed to split a large CSV file containing data into three different formats: JSON, a database table, and another CSV file. The script ensures a random distribution of data across the three output formats based on custom-defined ratios.
🇺🇸 Solution for importing and analyzing public Brazilian business data (CNPJ). 🇧🇷 Processamento de Dados CNPJ: Uma solução robusta e conteinerizada para importação e análise de dados empresariais brasileiros (CNPJ).
The Keyword Analysis Dashboard is a powerful tool designed to help marketers and SEO professionals analyze keyword data efficiently. This dashboard allows users to upload keyword data, configure analysis parameters, and visualize the results through interactive charts and tables.
Pipeline-Genie is an intelligent data pipeline that processes CSV datasets, identifies their schema, and leverages LLaMA 2.0 to extract business insights. Users can select relevant business needs, triggering automated ETL transformations using Apache Spark. The final transformed dataset is stored in AWS S3 and made available for download.
Reference project using AWS Lambda and Scala for CSV processing and loading into DynamoDB
A simple ETL and visualization project that loads employee data from CSV into a SQLite database and generates charts for salary distribution and hiring trends.
Real-time fraud detection platform – 4-model ML ensemble, sub-second scoring, interactive Streamlit dashboard. Open-source, Docker-ready.
This bot is very useful to search books and download them in Telegram
Proyecto de análisis de ventas de tiendas para Alura Latam con Python y visualización de datos.
A Python script to classify companies based on financial metrics like Piotroski F-Score and Stock Valuation, using CSV financial data for analysis and output.
In this project, I analyze commercial sales data using NumPy and pandas. I visualize total revenue per product using color-coded bar charts in Matplotlib. It’s a foundational step in business data analysis and project documentation.
This Python application analytically processes CSV data containing haunted site information, employing heatmap visualization techniques to discern the most paranormally active regions within the USA.
This repository showcases a complete Python-based ETL (Extract, Transform, Load) data pipeline designed to process, validate, and analyze weather data for multiple cities. The project demonstrates a structured approach to handling weather data, focusing on data accuracy, transformation, and insights generation.
A Python tool to clean, filter, and organize product annotation data.
Generate GPX for running routes based on destination and points of interest
A Python script to classify companies based on financial metrics like Piotroski F-Score and Stock Valuation, using CSV financial data for analysis and output.
Data-driven GTD task analysis for TickTick exports. Normalizes messy CSV data, identifies high-impact activities using five-dimension assessment, provides strategic productivity insights. Transforms reactive task management into systematic optimization with automated filtering and export-ready datasets.
A Robust Python script to migrate issues from Jira to Azure DevOps, ensuring data integrity and formatting consistency.
A high-performance polynomial regression implementation in pure C with gradient descent optimization and visualization support.
The goal of this project is to eliminate the need for paper by digitizing the process of handling client passport information.
A Python desktop application for image caption validation and annotation with an easy-to-use Tkinter GUI, featuring progress tracking and duplicate caption detection. Built with MVC architecture for clean separation of concerns.
An interactive Power BI project analyzing multi-year Spotify streaming history to uncover user listening patterns, peak activity times, and music preferences. The dashboard includes YOY growth analysis, heatmaps, top artist/album/track rankings, and quadrant segmentation of songs based on frequency and duration.
Python-приложение для сопоставления номеров из выгрузки Active Directory с данными из .csv/.txt, с выводом в CSV и логированием.
PowerShell script to process AMBR tank CSV data and upload to S3
📧 Validate large email lists efficiently with an async tool that checks deliverability while respecting rate limits and providing detailed provider feedback.
A comprehensive Python-based e-commerce sales data analyzer that processes and visualizes sales metrics, customer behavior, and business insights from CSV datasets. Features data cleaning, statistical analysis, and automated reporting capabilities.
A Python script that processes student grades from a CSV file, filters valid scores (0–100), assigns letter grades, calculates the average score of passing students, and writes results to a new CSV file. Uses list/dictionary comprehensions and reduce for efficient data processing.
📫 Asynchronous email deliverability verifier that performs MX lookups, SMTP RCPT probes, and catch-all detection with retries, resume support, and CSV/JSON reporting.
Automation with Shell Scripting from API data extraction to parallel processing
Professional MS Access Data Processing Script - Python automation for bulk file processing with Excel reporting
Fun university project @ IU (2020) for implementation of a smart service business model through dynamic tire rental pricing based on road quality data.
Python CLI tool for generating Sumsub share tokens from CSV data for bulk KYC verification with the Reap API
FastAPI backend service for lead qualification using rule-based scoring + AI reasoning. Accepts product offers and CSV leads, scores buying intent (High/Medium/Low) with dual-layer logic, exports results as JSON/CSV. Features Gemini AI integration, CORS support, Docker deployment, and robust error handling.
A Python-based web scraping tool to extract email addresses from Australian agricultural business websites for lead generation and business outreach purposes.