Job failures

There were jobs failures between 4:30 AM - 5:30 AM CET. Failures were caused by low disk space of one of worker servers.

We are sorry for this inconvenience and we're taking steps to mitigate this problem in the future.

Deprecated Facebook Ads API v2.8 and below

Facebook is deprecating all Marketing API versions prior to v2.9 on Wednesday, July 26, 2017 and recommends upgrading to version v2.9 immediately. Facebook Ads Extractor uses v2.8 by default for all newly created configurations but after July 26 all newly created configurations will have default api-version set to v2.9.

The migration should only require changing the api-version parameter in all existing configurations of our Facebook Ads Extractor. However, since there may be some breaking changes, we strongly recommend to change the api-version in your configurations manually, review any possible changes and take the appropriate actions. For more details on the new version, please read Marketing API Changes in v2.9 in the Facebook API changlelog. Of those changes we'd like to highlight the deprecation of date_preset values which are replaced with new ones, e.g. last_3_days, replaced by last_3d .

In other Facebook news, they have announced version v2.10 of Facebook API and Facebook Marketing API.

Storage Jobs Stuck in Processing State

We have noticed an increased number of Storage jobs stuck in the processing state, in rare cases causing a complete halt and queuing of all new Storage jobs in a project.

We have identified the root cause. Due to the Snowflake incident last night it seems some of the database transactions were still open and they blocked queries from consecutive jobs. We have terminated all orphaned transactions and jobs started processing and restarting. 

Please note, that queued jobs are processed in random order. 

We are sorry for this inconvenience.

Sudden Jobs Failures on July 6th, 2017

On July 6th, 2017, between 9:16-9:18am CET one of our internal databases was forced to update and restart by AWS. Result of this action was Application error failure of all running components jobs. The error may show up significantly later than the restart.  We recommend to review your orchestrations jobs and take action if needed. We are sorry for this inconvenience and we're taking steps to mitigate this problem in the future.

Jobs queues overload (resolved)

since 3:50am CET we are experiencing server queues overload. We are still investigating the issue and will inform about the progress.

UPDATE 5:45am CET- We found the possible root of the cause, Snowflake queries are being unusually queued, furthermore we are unable to raise power or number of cluster of our Snowflake warehouses and we waiting for Snowflake Support findings.

UPDATE  6:10am CET from Snowflake support: Engineering has started mitigation steps to address this issue, we will provide another update shortly.

UPDATE 7:00am CET: Snowflake queries queues has got empty and everything looks to be back to normal.

UPDATE RESOLVED 9:00am CET:  We confirm the issue has been resolved, the cause was the unexpected queueing of Snowflake queries. Snowflake confirmed and rolled back their latest release. However, as a consequence our waiting jobs reached backoff threshold and timeout, which resulted in orchestration failures. We recommend to review your project orchestrations jobs and take action if needed.

Apifier extractor (Public Beta)

We are pleased to announce the public beta release of the Apifier extractor. The Apifier extractor runs a web crawler and stores the results in Keboola storage. The crawler is created and configured in Apifier and can also be adjusted in extractor configuration. This way you can extract structured data from any website and import it directly do Keboola.

Apifier already contains many community web crawlers that can be reused, so you can seamlessly download restaurant reviews from Yelp, hotels from TripAdvisor, or request the creation of a custom crawler for an additional fee.

Week in Review -- June 28, 2017

RStudio and Jupyter Sandboxes

RStudio and Jupyter sandboxes are now in public beta, you can try them right now in your projects.
You can now also see and extend your sandbox expiration.

We are working on:

  • HTTPS Support.
  • Configurable (pay as you go) memory and disk limits.
  • Creating better UI and integration with transformations.
  • Allowing tables and files to be added to a running sandbox.

Keboola Connection in EU

We have launched Keboola Connection in a new region outside of the USA. You can now create your projects in the EU-Frankfurt region, please contact support@keboola.com if you are interested. This new region is now in beta.

Minor improvements

  • Improved performance of PostgreSQL extractor.  

Loading Events Application Errors (Updated)

Friday June 23rd, 20:30 PST

We're experiencing a large number of application errors when loading events from Storage in the UI. 

We're trying to debug and fix this issue, no other operations should be affected.

Friday June 23rd, 20:48 PST

We have rolled out new servers which seems to mitigate the issue. All operations back to normal.

We're sorry for this inconvenience.

Redshift Writer Job Failures

Today (Jun 21, 2017) between 16:00 and 18:30 CEST all Redshift Writer jobs created in this period failed with an application error. We have identified and fixed the issue.

We're sorry for this inconvenience.

GA Extractor Quota Limits

Recently (this week) we've begun hitting request quota limits with our Google Analytics Extractor.  The process of increasing the quota has begun, but until that happens we will unfortunately be required to put a hard cap on the extractor runtime of 30 minutes.  This means that only extractors that finish their jobs in under 30 minutes will complete successfully.

If you are affected by this limit, please try adjusting your configuration to reduce the amount of data being extracted.

Sorry for this inconvenience, we'll update here when/if the quota gets increased and we can remove this limitation.