m3aggregator client blocks entirely if one writer is blocked

Question

m3aggregator client blocks entirely if one writer is blocked

shaan420 opened this issue 2 years ago · comments

Filing M3 Issues

General Issues

General issues are any non-performance related issues (data integrity, ease of use, error messages, configuration, documentation, etc).

Whenever a flush is invoked each writer is spawned off in its own go routine from the PooledWorkerPool. This pool initially creates multiple shards and each shard listens to incoming work (flush() in this case). It uses currTime%numShards to decide which shard to redirect the work. In case that shard is executing a flush() operation that is blocked then the entire pool is blocked. So essentially, even if the pool has other free goroutines it will not schedule the work on them but wait behind the blocked worker.

Please provide the following information along with a description of the issue that you're experiencing:

What service is experiencing the issue? (M3Coordinator, M3DB, M3Aggregator, etc)
M3aggregator-client
What is the configuration of the service? Please include any YAML files, as well as namespace / placement configuration (with any sensitive information anonymized if necessary).
Configure FlushWorkerCount on the reporter as 10 (something small)
How are you using the service? For example, are you performing read/writes to the service via Prometheus, or are you using a custom script?
See TestWriteManagerMultipleWriters unit test
Is there a reliable way to reproduce the behavior? If so, please provide detailed instructions.
Yes, run TestWriteManagerMultipleWriters with small flush worker count

Performance issues

If the issue is performance related, please provide the following information along with a description of the issue that you're experiencing:

What service is experiencing the performance issue? (M3Coordinator, M3DB, M3Aggregator, etc)
Approximately how many datapoints per second is the service handling?
What is the approximate series cardinality that the series is handling in a given time window? I.E How many unique time series are being measured?
What is the hardware configuration (number CPU cores, amount of RAM, disk size and types, etc) that the service is running on? Is the service the only process running on the host or is it colocated with other software?
What is the configuration of the service? Please include any YAML files, as well as namespace / placement configuration (with any sensitive information anonymized if necessary).
How are you using the service? For example, are you performing read/writes to the service via Prometheus, or are you using a custom script?

In addition to the above information, CPU and heap profiles are always greatly appreciated.

CPU / Heap Profiles

CPU and heap profiles are critical to helping us debug performance issues. All our services run with the net/http/pprof server enabled by default.

Instructions for obtaining CPU / heap profiles for various services are below, please attach these profiles to the issue whenever possible.

M3Coordinator

CPU
curl <HOST_NAME>:<PORT(default 7201)>/debug/pprof/profile?seconds=5 > m3coord_cpu.out

Heap
curl <HOST_NAME>:<PORT(default 7201)>/debug/pprof/heap > m3coord_heap.out

M3DB

CPU
curl <HOST_NAME>:<PORT(default 9004)>/debug/pprof/profile?seconds=5 > m3db_cpu.out

Heap
curl <HOST_NAME>:<PORT(default 9004)>/debug/pprof/heap -> m3db_heap.out

M3DB Grafana Dashboard Screenshots

If the service experiencing performance issues is M3DB and you're monitoring it using Prometheus, any screenshots you could provide using this dashboard would be helpful.