GraphQL API outage for GitOps Platform

Incident Report for Codefresh

Postmortem

Impact
Our GitOps Platform’s API was unresponsive for an hour.

Detection:
We detected the issue via our automated monitoring.

Root Cause:
Some customer requests to our API triggered an error, which then caused a crashloop in some API pods.

Resolution:
We have updated our error handling to avoid this error in future.

Posted Jun 20, 2023 - 00:24 UTC

Resolved

This incident has been resolved.
Posted Jun 13, 2023 - 03:19 UTC

Monitoring

We have identified and resolved the issue with the API for the GitOps Platform. We are continuing to monitor this issue.
Posted Jun 12, 2023 - 22:00 UTC

Update

We are continuing to investigate this issue.
Posted Jun 12, 2023 - 20:51 UTC

Investigating

Our testing has detected an issue with one of the API's used for our GitOps platform. We are currently investigating and will update this incident as work develops.
Our Classic platform is unaffected.
Posted Jun 12, 2023 - 20:51 UTC
This incident affected: Codefresh Systems (Codefresh GitOps SLA, Codefresh GitOps UI, Codefresh API, Codefresh Hosted GitOps Services).