mechevarria / mbox-hana

Nodejs utility to parses an mbox format email export and uploads to a SAP HANA DB for text analysis of the content. Includes email attachments as well

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

mbox to hana

Node.js utility to parse email mbox files and persist to SAP HANA. For quick testing and deployment, take advantage of the SAP HDI Container for creating a silo-ed database

Requirements

  • A an email export named export.mbox to be read in the main directory of this project. This is the type of export that is typical of a Google Takeout

  • The project database can de deployed on the SAP Business Technology Platform if the Cloud Foundry CLI is installed

  • Login to your SAP Business Technology Platform space with the cf-login.sh script. You will need to add the CP_USER and CP_PASSWORD environment variables and edit the script for organization and project space

  • A SAP HANA platform service must be deployed for the project DB to be deployed as a container

  • You can deploy the SAP HANA HDI Container with the cf-db.sh script

  • Once the DB container is deployed you will need credentials to access it. Create a file named service-key.json with the SAP HANA database connection properties

You can also copy and paste a service key from a SAP HANA instance deployed in SAP Business Technology Platform

key.png

Example:

{
  "host": "zeus.hana.prod.some.region.com",
  "port": 555666,
  "user": "MBOX_SCHEMA_RT",
  "password": "some-long-password-with-numbers-888-AND-CAPS",
  "encrypt": true
}
  • Next install the dependencies to run the script with npm install

  • Finally run the script to load the email export with npm run start

About

Nodejs utility to parses an mbox format email export and uploads to a SAP HANA DB for text analysis of the content. Includes email attachments as well


Languages

Language:JavaScript 82.4%Language:Shell 17.6%