Unplanned Downtime: QMetry Test Management Cloud Intermittently Not Accessible for ~7 Minutes, 15 Feb 2022

Unplanned downtime: QMetry Test Management Cloud was not accessible and returned the error message “503 Service Unavailable” on 15 Feb 2022, from 10:15 PM to 10:23 PM PST. This downtime affected all QMetry Test Management Cloud customers.

Users received a “503 Service Unavailable” error while accessing the QMetry URL.

All events related to this downtime will be posted on this page.

Day

Time (PST)

Event Details

21 February

-

  • Root cause: The application became inaccessible because the JRE (Java Runtime Environment) did not have sufficient memory available. The high memory utilization was triggered by a large number of Jira webhooks received by the QMetry server in a short period.

15 February

10:23 PM

  • QMetry services are restored. Our teams are continuing to investigate the root cause, which will be made available soon.

15 February

10:15 PM

  • QMetry Test Management returns the error “503 Service Unavailable” when the URL is accessed.

  • QMetry teams are working with high priority to restore the services to normal. Based on the initial analysis, QMetry services should be restored soon.

RCA and Permanent Fix

  • The application became inaccessible because the JRE (Java Runtime Environment) did not have sufficient memory available. The high memory utilization was triggered by a large number of Jira webhooks received by the QMetry server in a short period.

  • The issue has been permanently fixed with release v8.10.0 on QMetry Cloud on 12 March 2022.