Time is money

If you ensure that the resources like compute/memory/storage/time are used efficiently at your end, and do not go into processing tasks that have already been stopped (and then to roll back the work done post the stop-action), you save time to devote to other tasks/clients and make money.

The Challenge

You have a variety of long-running tasks that require time and resources on the servers. As it stands now, once you have triggered off a long-running task, there is no way to tap into it and pause/stop/terminate the task, upon realizing that an erroneous request went through from one of the clients (mostly web or pipeline).

This repository holds my solution to the given challenge statement.

Technology Stack

Programming Language: JavaScript

Framework: ExpressJS

Runtime Engine: NodeJS

Database: MySQL

Containerization: Docker and Docker Compose

Caching: Redis

Documentation: apiDoc documentation

Node.js perspective

At a high level, Node.js falls into the category of concurrent computation. This is a direct result of the single-threaded event loop being the backbone of a Node.js application. The event-loop repeatedly takes an event and then sequentially executes all listeners interested in that event.

It also facilitates creation of child processes to leverage parallel processing on multi-core CPU based systems. When we spawn a new child process in Node.js, this take advantage of multi-core CPUs. A new process will be created and managed by the OS. That new process can be executed in parallel to the main process as long as your computer has at least 2 virtual CPU CORES.

What happens if we have 1 core and you spawn child process?

The same thing that happens when you have multiple cores. The operating system schedules the execution of the processes amongst N cores, which in this case, would mean that all processes are being executed by 1 CORE.

So do we still get some benefit? If you have one core, and you create child process, then processor will make them run in parallel (even one task is long running?).

If the processes can be executed in parallel by the processor, then yes, we will get the benefit (intel processors do virtual core which let even single core act like multi core).

Architecture

Fig.1: High Level - Client Interaction with the system

Fig.2: Middle Level - Logic for API handling client requests

Fig.3: Lower Level - Data flow diagram for the child process

Operating Instructions

Prerequisites

Make sure you have
- Docker installed

Clone

Download or clone this repository
- You can clone the repository executing below command in a location of your choice of your system. $ git clone https://github.com/adisakshya/time-is-money.git

Run Using Docker

Make sure you have docker installed before proceeding.
- In project-directory ./time-is-money, run the following command
  - docker-compose up --build
- Running docker-compose in non-detached mode (to be able to see the logs running)
Now you have successfully setup the project,
- Please find the API documentation here.
- It describe all routes that define the operation of the API.

API Routes

Get details of all the tasks
- http://<domain:port>/api/v1/task
Get details of a task by ID
- http://<domain:port>/api/v1/task/view?id=taskID
- Query Parameter: id [TaskID]
Start a long running task
- http://<domain:port>/api/v1/task/start
Terminate a long running task by id
- http://<domain:port>/api/v1/task/terminate?id=taskID
- Query Parameter: id [TaskID]
Pause long running task by id
- http://<domain:port>/api/v1/task/pause?id=taskID
- Query Parameter: id [TaskID]
Resume a paused long running task by id
- http://<domain:port>/api/v1/task/resume?id=taskID
- Query Parameter: id [TaskID]

Issues faced

Pause operation

In the challenge statement, it is written that the user can stop the long-running task and choose to resume or terminate it, but it doesn't specify how long the user can keep a task paused and what happens after is user exceeds the defined time limit for holding the pause operation?

Issue with MySQL docker container

I was validating the behaviour of the system for pause/resume and terminate actions on long-running tasks, which is parsing a CSV and inserting the fields in the database while doing so I encountered an issue with MySQL database container.

While validating the system with CSV containing 1,00,000-1,20,000 rows and 20 columns, I started some tasks (all task using the same CSV), after a tasks successfully performed committed insert operation in the database, then MySQL database flashed an error saying "Ran out of memory, need more space" and then flashes "mysql: server unexpectedly closed the connection" and processes exit, when I check the database tables for CSV data it shows 100000-120000 rows present in the database, the correct number for first committed task.

adisakshya / time-is-money