validate changed items using datadog api

Question

validate changed items using datadog api

grosser opened this issue a year ago · comments

see https://docs.datadoghq.com/workflows/actions_catalog/monitor_validatemonitor/
only works for monitors ... another option could be to "apply + revert" for others and then get errors from that
... or validating with a deliberate error so we get "expected failure + unknown failure" and then sort out the unknown

Michael Grosser · Answer 1 · Mon Apr 17 2023 04:17:35 GMT+0800 (China Standard Time)

POC

desc "Validate resources against datadog api using generated/ (atm monitor only) [PROJECT|TRACKING_ID|FILE=]"
task validate: "kennel:environment" do
  files =
    if (project = ENV["PROJECT"])
      Dir["generated/#{project}/*.json"]
    elsif (id = ENV["TRACKING_ID"])
      Dir["generated/#{id.split(":").join("/")}.json"]
    elsif (file = ENV["FILE"])
      file
    else
      raise "Need PROJECT or TRACKING_ID or FILE"
    end

  monitors = files
    .map { |f| [f, JSON.parse(File.read(f))] }
    .select { |_, r| r["api_resource"] == "monitor" }
  raise "No monitors in selected files" if monitors.empty?

  monitors.each do |file, monitor|
    begin
      Kennel::Api.new.create("monitor/validate", monitor)
      puts "#{file}: Valid"
    rescue StandardError
      puts "#{file}:"
      body = $!.message.split("\n").last # parsing api output from what lib/kennel/api.rb adds
      puts JSON.parse(body).fetch("errors")
    end
  end
end

Michael Grosser · Answer 2 · Mon May 01 2023 05:44:50 GMT+0800 (China Standard Time)

to make this work for dashboard we could add a fake broken widget and then see if the error coming back is that widget, if it is then we know it's valid
(this way we don't do any real updates ... but it's still risky)
for slo we'd need a similar scheme but might be even harder / less reliable