jpetazzo / container.training

Slides and code samples for training, tutorials, and workshops about Docker, containers, and Kubernetes.

Home Page: http://container.training/

errors from Prometheus?

bridgetkromhout opened this issue

Looking at http://70.37.55.196:31277/#!/pod/default/winsome-wasp-prometheus-alertmanager-784d9bddf6-4pcqw?namespace=default I see this:

[screenshot, 2018-07-16 at 3:11:59 pm]

[screenshot, 2018-07-16 at 3:12:07 pm]

I ran through that section starting at https://oscon2018.container.training/#325 pretty quickly so maybe I missed something, but I'm left with two questions:

  1. Is the persistent volume claim error, and the related "unavailable" percentage, expected?

[screenshot, 2018-07-16 at 3:14:09 pm]

  2. What am I missing in the Prometheus section? Shouldn't I have a URL I send people to, to look at it?

If my memory serves me well, there are two components (I think they are deployments but I'm not 100% sure so I'm using vague terminology on purpose here) that require persistent volumes in the Prometheus Helm Chart: the prometheus server itself (for data retention), and ... maybe the alerter or something?

So, trying to deploy it on a cluster without persistence basically fails. That's why I added the flags to disable persistence for the Prometheus server itself: they let it start and collect data (albeit in a fragile way). The other component doesn't start, but we don't care about it.
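For anyone following along, here is a rough sketch of what "flags to disable persistence" could look like. It only assembles and prints a `helm install` command; it does not run it. The `<component>.persistentVolume.enabled` values keys and the `stable/prometheus` chart name are assumptions based on the community Prometheus chart of that era and are not confirmed in this thread, and `helm_install_args` is a hypothetical helper, so check the chart's `values.yaml` before relying on it.

```python
import shlex


def helm_install_args(chart, components):
    """Build a `helm install` argv that turns off persistent volumes
    for each named chart component (hypothetical helper)."""
    args = ["helm", "install", chart]
    for component in components:
        # Assumed values key; verify it against the chart's values.yaml.
        args += ["--set", f"{component}.persistentVolume.enabled=false"]
    return args


# Assumed chart and component names, for illustration only.
cmd = helm_install_args("stable/prometheus", ["server", "alertmanager"])

# Print the command so it can be reviewed before running it by hand.
print(shlex.join(cmd))
```

Passing only `["server"]` would match what is described above, where the second component is left to fail. Note that newer Helm versions also expect a release name (or `--generate-name`) on `helm install`.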

Let me know if you'd like me to dig more to solidify this explanation!

(I'll definitely update that section later.)

> So, trying to deploy it on a cluster without persistence basically fails

So, the failure is expected. That's cool! I just wanted to make sure I wasn't uncovering a Surprise Failure.