HCL-TECH-SOFTWARE / HCL-Workload-Automation-Observability-for-Datadog

HCL Workload Automation Observability for Datadog

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

HCL Workload Automation Observability for Datadog

You can use HCL Workload Automation (HWA) Observability for Datadog to monitor HCL Workload Automation metrics, events, audit and infrastructure logs.

Content

Prerequisites

Following prerequisites must be satisfied prior to deploy the Datadog solution:

  • Kubernetes cluster (OCP V4.7 or above, GKE) with administrative access
  • HWA v10 installed on the Kubernetes cluster
  • Tools & Packages
    • Datadog Agent v7 and Datadog release 3.3.1

Note: To use the HWA Observability Dashboard for Datadog, HWA metrics must be available. For information about HWA exposed metrics, see Exposing metrics to monitor your workload.

Language support

For the dashboards provided with HWA Observability for Datadog, the only supported language is English.

Installation

1. Installating and configuring Datadog

  1. From HCL License Portal download the appropriate HWA Observability installation package: HWA Observability for HWA Observability Add-on
  2. Installation package for Datadog: HWA_OBSERVABILITY_APP_FOR_DATADOG_10.1.0.1.tar.gz
  3. Follow the below command to untar the gzip file

  tar -xvzf HWA_OBSERVABILITY_APP_FOR_DATADOG_10.1.0.1.tar.gz
  1. Follow the steps mentioned on the ReadMe file to continue solution setup.

2. Datadog installation

  1. To install the Datadog agent on your cluster, follow this procedure: Datadog Agent on Kubernetes.

  2. At step 2, select the Manifest template on Linux that supports (METRICS,LOGS,APM,PROCESS,NPM,SECURITY).

  3. Continue with the installation until you reach step 4.

  4. At step 5, if your Datadog site to datadoghq.com skip this step , otherwise set DD_SITE based on your Datadog website URL.

    Follow this documentation site selector and select the site parameter based on your Datadog website URL.

    Look at this command below as an example.

     - name: DD_SITE
       value: "{ADD_SITE_PARAMETER}"
    

    Inside its manifest daemonset.yaml, add the following environment variable for all containers.

  5. Before executing step 6, configure your YAML file for Forwarding metrics and logs steps as mentioned below.

3. Forwarding Logs to Datadog

Inside its manifest daemonset.yaml, add the following environment details:

- name: DD_CONTAINER_INCLUDE_LOGS
  value: "name:waserver-plan-auditing name:waserver-db-auditing name:waserver-bm-events"
- name: DD_CONTAINER_EXCLUDE_LOGS
  value: "name:.* image:.*"

For Openshift installtion, inside its manifest daemonset.yaml, add the following environment variable for all containers.

 - name: DD_KUBELET_TLS_VERIFY
   value: "false"

4. Forwarding Metrics to Datadog

Inside its manifest cluster-agent-deployment.yaml, add the following to annotations for the Cluster Agent:

      annotations:
        ad.datadoghq.com/cluster-agent.checks: |
          { "openmetrics": { "init_config": {}, 
          "instances": [ {
          "openmetrics_endpoint": "http://%%host%%:%%port%%/metrics",
          "tls_verify":"false",
          "namespace": "hwa_metrics",
          "metrics": [".*"] 
          } ] } }        

Replace openmetrics_endpoint host and port with your respective metrics endpoint host and port.

Note: Add property "tls_verify":"false" for not certified or unsecure metrics endpoint.

Inside its manifest cluster-agent-deployment.yaml, add the following environment variables for the Cluster Agent container:

- name: DD_PROMETHEUS_SCRAPE_ENABLED
  value: "true"
- name: DD_PROMETHEUS_SCRAPE_SERVICE_ENDPOINTS
  value: "true" 

After you have configured your YAML file, continute with steps 6 and 7 mentioned in the procedure: Datadog Agent on Kubernetes.

Note : For Openshift installation follow the same steps and replace kubectl with oc commands.

WA Monitoring Dashboards and Alerts setup in Datadog

1. Create Log Facets

markdown

In Logs page, create a new facet directly from the facet panel using the add facet button and create Log factes for the below attributes.

Attributes
@actionType
@auditType
@userName
@workstationName
@objectType
@event
@returnCode
@jobName

2. Create Dashboards and Monitors

Run the starter.sh with your DD_HOST_PATH,DD_API_KEY, DD_APP_KEY as parameters.

 $ sh ./starter.sh DD_HOST_PATH DD_API_KEY DD_APP_KEY 

Look at this command below as an example.

 $ sh ./starter.sh  https://XXX.datadoghq.com 934ac4513XXXXXXX 0efa7dc64XXXXXXX 

3. Configuring dashboard markdown link

The HWA Observability Overview Dashboard​ provides the link to all dashboards. Once the dashboard is created, a new dashboard id is generated for each and every dashboard. So every dashboard id needs to be updated on the HWA Observability Dashboard​ markdown component.

markdown

  1. In Dashboard List, open each and evey dahboard in a new window and copy the dashboard URLs.
  2. Select HWA Observability Dashboard.
  3. Edit all markdown components as mentioned above.
  4. Replace your respective dashboard URLs with markdown field.

Getting Started with Workload Automation Monitoring

The WA Monitoring Overview Dashboard provides a single, consolidated view for monitoring the workload status. By selecting a dashboard on the list, you can see the information related to that dashboard.

WA Monitoring overview Dashboard

In the WA Monitoring Overview Dashboard, you can view the whole status of your workload at a glance for one or more of the engines you have configured.

Overview Dashboard: Install App

By clicking a button or menu option, you open a new tab displaying the selected dashboard information. The following dashboards are supported:

  • Jobs and Job Streams. This dashboard shows the status of Jobs, Critical Jobs and Job Streams.

    Source of Information: WA Deployment events sidecar container and WA server Logs.

  • KPIs and Workstations. This dashboard shows the Workload Automation KPIs information for each host and allows drilldown to see the timeseries data in visual representation for defined KPIs.

    Source of Information : API exposed by WA Server on port 31116.

  • Auditing Dashboard. This dashboard shows details of workstations and audit information such as user actions. Users can view audit information for selected time range, user and object.

    Source of Information: WA Deployment sidecar audit containers, for example: waserver-db-auditing, waserver-plan-auditing etc.

  • Infrastructure Dashboard. This dashboard provides an overview of the infrastructure details of Workload Automation deployed on Kubernetes cluster.

    Source of Information: Infrastructure Logs exposed by Openshift cluster.

  • Monitors: Monitors are created out of HWA Monitoring Application. Follow this Monitors documentation to create your monitors and alerts. The following table shows some of the out-of-the-box monitors that come with the HWA Observability Dashboard:

Monitors Threshold Condition Priority
WA_Broker_Status ==0 (more than 5 minutes ) Critical
WA_DB_connected ==0 (more than 1 minutes ) Critical
WA_FINAL_Error ==1 (more than 1 minutes ) Critical
WA_FINAL_Late ==1 (more than 1 minutes ) Critical
WA_LicenseUncounted > 0 (more than 1day) Critical
WA_Server_link_status ==0 (more than 5 minutes ) Critical

FAQs

  1. How many dashboards are available with HCL Workload Automation Observability for Datadog?

    You can import and use the following 5 dashboards:

    • HWA Observability Dashboard​ (Overview Dashboard)
    • Jobs and Job Streams Dashboard
    • KPIs and Workstations Dashboard
    • Auditing Dashboard
    • Infrastructure Dashboard
  2. How to apply Dark theme to Datadog dashboards?

    View Datadog dashboards in dark mode by hovering over your avatar in the sidebar, or by pressing Ctrl+Opt+D / Ctrl+Alt+D.

  3. Which version of Datadog is supported?

    The solution supports and was verified on Datadog Agent v7 and Datadog release 3.3.1.

  4. How to save the filter history?

    Add your filter keys and select "save select view as". Then, add the view name.

  5. How to collect metrics from not certified or unsecure metrics endpoint?

    Add property "tls_verify":"false" to the metric instances.

  6. How to create Application Key?

    • To add a Datadog application key, navigate to Organization Settings > Application Keys. If you have the permission to create application keys, click New Key.

    Notes:

    • Application key names cannot be blank.
  7. How to create API key ?

    To add a Datadog API key or client token:

    • Navigate to Organization settings, then click the API keys or Client Tokens tab.
    • Click the New Key or New Client Token button, depending on which you’re creating.
    • Enter a name for your key or token.
    • Click Create API key or Create Client Token.

    Notes:

    • Your org must have at least one API key and up to 50 API keys.
    • Key names must be unique across your organization.
  8. How to create new alerts/monitors ?

    Follow this Monitors documentation to create your alerts/monitors.

  9. How to install this solution for openshift?

    Follow the Installation steps and replace kubectl with oc command.

  10. Error while getting hostname, exiting: unable to reliably determine the host name. You can define one in the agent config file or in your hosts file.

    Inside its manifest daemonset.yaml, add the following environment variable for all containers.

         - name: DD_KUBELET_TLS_VERIFY
           value: "false"
    
  11. How to view logs on Auditing Dashboard piechart component ?

    Select the piechart and select the View related logs.

About

HCL Workload Automation Observability for Datadog

License:Apache License 2.0