Delayed telemetry data [resolved]

2023-04-03 10:15 UTC - We are investigating delayed telemetry data. More information within the hour.

2023-04-03 11:30 UTC - Delayed telemetry data on all Keboola Connection stacks have been recorded since approximately 20:00 UTC on March 29. We were able to determine the root cause and perform a backfill. Now all telemetry data tables are up-to-date.

We are very sorry for the inconvenience. If you encounter any discrepancies, please contact us immediately.

Workspace table load fails (all stacks)

2023-03-31 09:50 We are investigating failing workspace (Python, R, SQL) loads on all stacks.

2023-03-31 09:58 Affected are all newly created user workspaces (Python, R, SQL,.. ).  A fix will be available soon. The next update will be provided in 30 minutes or as soon as new information becomes available.

2023-03-31 10:24 Problem occurred on when new table was added into workspace. This issue was resolved, we are now working to fix already corrupted workspaces. Workaround now is to remove tables from input mapping and add them again. The next update will be provided in 60 minutes or as soon as new information becomes available.

2023-03-31 11:00 We fixed remaining workspaces and preparing fix to prevent this problem in the future. If you encounter this issue please contact our support and mention this status post. 

We sincerely apologize for any inconvenience caused and appreciate your understanding.

Job start-up delays in Azure North Europe

2023-03-30 16:22 UTC - We are investigating the delays in job start-up within the https://connection.north-europe.azure.keboola.com stack. The next update will be provided in 30 minutes or as soon as new information becomes available.

2023-03-30 16:54 UTC - The investigation into the cause of the issue is still ongoing. The next update will be provided in 30 minutes or as soon as new information becomes available.

2023-03-30 17:56 UTC - The investigation into the cause of the issue is still ongoing. The next update will be provided in 30 minutes or as soon as new information becomes available.

2023-03-30 18:55 UTC - We have identified and fixed the root cause of the issue. The job backlog has now been cleared. We will continue to monitor the situation to ensure that everything remains stable.

2023-03-30 19:18 UTC - The service disruption has been resolved and the stack is now fully operational. 

Thank you for your patience.

Planned service maintenance on April 15th in AWS US and AWS EU stacks

Regrettably, we were unable to upgrade all necessary databases during the previous planned service disruption. Due to a strict deadline imposed by our service provider (AWS), we must carry out another service disruption for maintenance purposes.

This maintenance will impact both the AWS US and AWS EU stacks.

It is scheduled for Saturday, April 15, 2023,

  • between 07:00 and 08:00 UTC (09:00 and 10:00 CEST) for the AWS EU stack, and
  • between 09:00 and 10:00 UTC (02:00 and 03:00 PDT) for the AWS US stack.

During this time, Storage jobs will be paused or delayed, and the platform will be unavailable for a brief period (approximately 5 minutes). The platform will then generate a 500 HTTP response for the majority of API requests. Throughout the remainder of the maintenance window, the platform will be fully accessible but will not process any new or existing Storage jobs.

We sincerely apologize for any inconvenience caused and appreciate your understanding.

Limited service disruption for AWS US

A limited service disruption on AWS US stack will start at 10:00 a.m. UTC today, as announced earlier. Storage jobs, Queue v1, and Orchestration (in projects with Queue v1) processing will stop and new jobs will be delayed until the upgrade is completed. All running jobs will be cancelled, but will resume after the upgrade.

All APIs and other unaffected services, such as Workspaces and Queue v2 jobs, will remain operational, though their operations may be delayed due to the Storage job delays. We will provide an update when the service disruption starts and ends. 

We apologize for any inconvenience caused and thank you for your understanding.

Update 10:00 a.m. UTC: The limited service disruption has begun.

Update 10:10 a.m. UTC: The service disruption has been resolved and the stack is now fully operational. 

Thank you for your patience.


Brief metadata database outage in AWS US

We have encountered a brief metadata DB outage in AWS US at 15:07 UTC. Affected services are

  • legacy Transformations
  • legacy Orchestrations
  • projects and jobs running on old Queue V1

This outage may cause some jobs being executed during the outage fail or run twice in parallel.

We're sorry for this inconvenience. 

UPDATE 15:39 UTC: All affected jobs were restarted and any duplicate executions were terminated.

Limited service disruption for AWS EU

A limited service disruption on AWS EU stack will start at 12:00 p.m. UTC today, as announced earlier. Storage jobs, Queue v1, and Orchestration (in projects with Queue v1) processing will stop and new jobs will be delayed until the upgrade is completed. All running jobs will be cancelled, but will resume after the upgrade.

All APIs and other unaffected services, such as Workspaces and Queue v2 jobs, will remain operational, though their operations may be delayed due to the Storage job delays. We will provide an update when the service disruption starts and ends. 

We apologize for any inconvenience caused and thank you for your understanding.

Update 12:00 p.m. UTC: The limited service disruption has begun.

Update 12:33 p.m. UTC: The service disruption has been resolved and the stack is now fully operational. 

Thank you for your patience.

Failing Facebook/instagram components on Azure North Europe stack

2023-03-14 14:15 UTC - Since Sunday, we have been experiencing component failures when communicating with the Facebook Graph API on our Azure North Europe stack https://connection.north-europe.azure.keboola.com

Following components failing with application error:

keboola.ex-facebook-ads
keboola.ex-facebook
keboola.ex-instagram

We believe that issue is related to following reported bug in Facebook API https://developers.facebook.com/support/bugs/737701844772490

We are monitoring situation.

2023-03-20 08:00 UTC -  We have not received any update from Meta, last error occurred more than 72 hours ago, we will monitor situation. For now the situation looks stable and we are considering issue as resolved.

Jobs outage on connection.keboola.com (us-east-1)

2023-03-07 14:37 CET - We are investigating problem with jobs on connection.keboola.com (us-east-1 stack). 

2023-03-07 14:42 CET - We have identified a problem with one of our internal databases, containing metadata about jobs. As a result, no jobs can be run since 14:30 CET, and the rest of the platform may be behaving abnormally.

2023-03-07 15:05 CET - The database that was affected has been fixed, and operations should be running normally since 15:00 CET.


We apologize for any inconvenience caused and thank you for your understanding.