1. Workers
2. Coordinator
3. Two-way communication channel
4. Continuous query
5. Value Monitoring
6. Cash-Register Model
There are k distributed sites, each receiving a stream of elements over time, possibly at varying rates. Each site holds a local counter
Also, there is a central site or Coordinator whose job is to coordinate the whole process. In particular, Coordinator can communicate directly with each site, there is direct two-way communication with each of the sites. The sites do not communicate with each other directly but this is not a limitation since they can always send messages via the Coordinator. The goal is continuously at all times the Coordinator provides an estimate of the monitoring function such that the difference is always bounded(value monitoring) using the minimal communication cost. In this case, we focus on the Count-Tracking Problem, the Coordinator's job is to maintain an (ε,δ)-approximation of the total count continuously at all times with minimal communication cost.
(1-ε)∙n(t) ≤
$n_a$ ≤ (1+ε)∙n(t) with prob.=1-δ,∀t
-
The Coordinator maintains the exact total count continuously at all times
-
Each site send a message to the Coordinator whenever the counter increments by 1
-
Communication cost: O(N), Ν: Final value of total count
-
The Coordinator maintains an ε-approximation of the total count continuously at all times
-
Each site send a message to the Coordinator whenever the counter increments by 1+ε
-
One-way communication cost
-
Communication cost: O(k/ε ∙ logN), Ν: Final value of total count
-
The Coordinator maintains an (ε,δ)-approximation of the total count continuously at all times
-
The basic idea is that each site sends a message to the Coordinator with probability p. To achieve the goal, it is sufficient to define the probability p=Θ(√k/εn)
-
Two-way communication cost
-
Working in rounds – logN rounds
-
Communication cost: O(√k/ε ∙ logN), Ν: Final value of total count
-
The Coordinator maintains an ε-approximation of the total count continuously at all times
-
The basic idea is that the maximum error in the Coordinator to maintain an ε-approximation is equal εn. Accordingly, the maximum error that can be supported from each site is equal to εn/k, therefore each site sends a message to the Coordinator when the local counter is incremented by Θ(εn/k).
-
Two-way communication cost
-
Working in rounds – logN rounds
-
Communication cost: O(k/ε ∙ logN), Ν: Final value of total count
Count-Tracking Problem | Space(per site) | Communication cost |
---|---|---|
EXACT | O(1) | O(N) |
RANDOMIZED | O(1) | O(√k/ε ∙ logN) |
DETERMINISTIC | O(1) | O(k/ε ∙ logN) |
CONTINUOUS | O(1) | O(κ/ε ∙ logN) |