Each Dell EMC Integrated System SU comes configured with robust monitoring and alerting capabilities. The HLH has enabled the Hyper-V role to host several guest VMs, as shown in the following figure, including:
The Health Resource Provider exposes the health and alerts for the Microsoft Azure Stack Hub software infrastructure components. The provider is viewable through the Azure Stack Administration Portal or programmatically using REST APIs and PowerShell. The following figure summarizes the monitoring and alerting system:
There is a limited number of hardware alerts that are exposed through the Health Resource Provider. However, it complements the greater level of monitoring detail that OpenManage Enterprise provides for server hardware components.
The discovery of PowerEdge servers in OpenManage Enterprise occurs automatically during deployment. WS-MAN polls each of the server iDRACs for alerts and failures and forwards the SNMP traps to OpenManage Enterprise for tasks. The tasks are sent to SupportAssist Enterprise for automatic SR-ticket generation, as shown in the following figure:
OpenManage Network Manager fulfills several critical functions in the Azure Stack Hub scale-node architecture, and uses SNMP and SYSLOG for switch monitoring. When hardware or configuration problems occur, OpenManage Network Manager generates alerts that are forwarded to OpenManage Enterprise for action. If required, an SR ticket is generated.
Other capabilities of OpenManage Network Manager are customer reports, performance monitoring, traffic-flow analysis, and firmware and compliance checking. These optional capabilities are not set up by default, but are available if needed. For instance, if you need a detailed report of the network switches, go to the Reports tab and create a detailed view of the switches, as shown in the following figure:
The Health Resource Provider manages the Azure Stack Hub software infrastructure component health and alerts. The Region Management monitoring tile in the Azure Stack Hub Administration Portal displays this information, as shown in the following figure:
Each alert displays details such as severity, state, created time, updated time, and description of the event, as shown in the following figure:
Prescriptive guidance on remediating each issue is provided through REST APIs and PowerShell.
The System Center Management Pack (SCOM) for Microsoft Azure Stack Hub enables you to monitor the availability of the Azure Stack Hub infrastructure. The management pack runs on a specified resource pool, and then uses various Azure Stack Hub APIs to remotely discover and collect instrumentation information, such as deployments, regions, and other resources.
Note: There are numerous external monitoring options, including Splunk, Nagios, and other monitoring platforms. SCOM and Nagios can both be integrated into Azure Stack Hub.