Week in Review -- July 9, 2018

New Components

Updated Components

  • The GoodData Reports extractor now accepts an optional host parameter so it will work in the EU or any custom domain.
  • The Email Attachments extractor has been updated with the following improvements:
    • It now strictly enforces the one text file attachment per email rule
    • It ignores any non-textual attachments (signature images, etc.)
    • It can process multiple emails simultaneously
  • The Gmail Extractor no longer has a limit of 65k for fields (backend storage limits still apply)
  • The HTTP Extractor now accepts a "maxRedirects" parameter that limits how many times it retries the request when it receives an HTTP redirect status code.
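
To illustrate what a redirect limit means in practice, here is a minimal Python sketch. It is not the extractor's actual code: only the maxRedirects name comes from the note above, while the fetch callable, follow_redirects, and TooManyRedirects are hypothetical names used for illustration.

```python
from typing import Callable, Optional, Tuple

# Status codes that ask the client to repeat the request at another URL.
REDIRECT_CODES = {301, 302, 303, 307, 308}

# Hypothetical transport: takes a URL, returns (status_code, location_or_None).
Fetch = Callable[[str], Tuple[int, Optional[str]]]


class TooManyRedirects(Exception):
    """Raised when the redirect limit is exhausted."""


def follow_redirects(fetch: Fetch, url: str, max_redirects: int) -> str:
    """Return the final URL after following at most max_redirects redirects."""
    for _ in range(max_redirects + 1):
        status, location = fetch(url)
        if status not in REDIRECT_CODES or location is None:
            return url  # a non-redirect response ends the chain
        url = location  # follow the redirect and try again
    raise TooManyRedirects(f"gave up after {max_redirects} redirects")


# Usage with a fake transport: /a redirects to /b, /b to /c, /c answers 200.
hops = {"/a": (302, "/b"), "/b": (301, "/c"), "/c": (200, None)}
final_url = follow_redirects(hops.__getitem__, "/a", max_redirects=2)
print(final_url)
```

With a limit of 1 the same chain raises TooManyRedirects, because the second redirect is received but not followed.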

Minor Improvements

  • R has been updated to version 3.5.0 (for transformations and RStudio sandboxes)



Processing delays for GoodData Writer jobs in the EU region

[02.07.2018 8:06 CET]

We're experiencing some delays in the processing of GoodData Writer jobs in the EU region, and are currently investigating the cause of the issue.

UPDATE [02.07.2018 10:06 CET] 

The tasks are queued on the GoodData EU platform. We are investigating this issue with GoodData support.

UPDATE [02.07.2018 15:27 CET] 

The issue has been resolved, and all GoodData Writer tasks in the EU region are working normally again.


Week in Review -- June 6, 2018

New Features

Orchestration Description
  • You can describe your Orchestrations now. This can be useful when you collaborate with your colleagues or want to quickly find what you need among dozens of Orchestrations.


Orchestration Versioning
  • Also, Orchestrations just got their internal versioning system (joining Extractors, Writers and Applications). At this stage, the versioning is read-only, so you can compare two versions. Rollbacks and revision copies will be added soon.


Minor Improvements

MySQL Writer
  • Improved the performance of incremental loads. The writer now also checks whether the primary keys in its configuration match those in the target database table.


Fixes

  • Google Calendar Extractor is now also available in the EU region.

GoodData Writer Issues

Today between 2:30 and 7:00 CEST we experienced issues with GoodData Writer. Ironically, it failed to connect to a third-party service for utilization monitoring. The problem has been fixed, so there should be no further job failures. We are going to inspect the extent of the damage.

Week in Review -- May 16, 2018

Core

Components

Bugfixes

  • Component configuration state is no longer updated when an attached processor fails. For example, the AWS S3 extractor with the New Files Only option now leaves files marked as unprocessed if a processor fails, so they can be processed again until the whole pipeline of processors runs successfully.
  • MSSQL Writer - fixed support of unicode characters
  • Google Sheets Writer - fixed performance issues when writing large tables
  • When specifying transformation output mapping, the bucket name is automatically webalized as it is typed
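
The first fix in the list above amounts to committing component state only after every attached processor has succeeded. A minimal Python sketch of that rule follows. It is not Keboola's implementation: run_with_processors, the processor callables, and the layout of the state dictionary are all hypothetical.

```python
from typing import Callable, Dict, List

# Hypothetical processor: takes a list of file names, returns the processed list.
Processor = Callable[[List[str]], List[str]]


def run_with_processors(
    new_files: List[str],
    processors: List[Processor],
    state: Dict[str, List[str]],
) -> Dict[str, List[str]]:
    """Return the state to persist after one run.

    The state only advances when the whole processor chain succeeds,
    so a failed run leaves the new files marked as unprocessed.
    """
    files = list(new_files)
    try:
        for processor in processors:
            files = processor(files)
    except Exception:
        return state  # keep the old state: the same files are retried next run
    # Every processor succeeded, so record the files as processed.
    return {"processed": state.get("processed", []) + list(new_files)}


def failing_processor(files: List[str]) -> List[str]:
    raise RuntimeError("simulated processor failure")


state = {"processed": ["old.csv"]}
after_failure = run_with_processors(["new.csv"], [failing_processor], state)
after_success = run_with_processors(["new.csv"], [lambda f: f], after_failure)
print(after_failure)
print(after_success)
```

After the failed run the state still lists only old.csv, so new.csv is picked up again and recorded once the chain succeeds.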

Developers

We are happy to introduce the first version of Keboola Storage API Javascript client.


SQL Server Extractor Connection Issues

Between 17:00 on May 15, 2018 and 8:00 on May 16, 2018 CET we experienced issues with the SQL Server database extractor. We recommend reviewing your orchestrations and taking appropriate action if needed.

If you were affected by this, please accept our sincere apologies.

SQL Server writer failures

Between 9 May 2018, 10:53 CEST and 11 May 2018, 09:30 CEST there were job failures for SQL Server Writer configurations that had nullable data types. The issue was caused by a new version of the writer, so we have rolled it back to the previous version while we investigate the root cause.

We're sorry for any inconvenience. 

Week in Review -- April 30, 2018

Core

  • Improved generated configuration changes descriptions
  • Added configuration version to job results of Docker-based components (it is not yet available for legacy components like transformation and gooddata-writer)
  • Refreshed Manage API docs with working examples
  • Fixed loading of large tables for RStudio and Jupyter sandboxes
  • Fixed random CSV Import upload errors in EU region

Components

  • Improved "show details" experience for input and output mappings
  • Writers now show columns that do not exist in Storage
  • Increased query timeout for all Keboola Provisioned Snowflake writers from 15 seconds to 15 minutes
  • Added support of unconventional column names to MySQL extractor
  • Removed static state from MongoDB extractor

Processors

  • Added support of snappy format to processor-decompress
  • Added processor filter-files
  • Added support for sanitization of invalid utf-8 in processor-iconv
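
As an illustration of what sanitizing invalid UTF-8 can mean, the Python sketch below either replaces or drops bytes that do not decode. It relies on Python's standard codec error handlers and shows the general technique only; it is not a description of how processor-iconv is implemented, and sanitize_utf8 is a hypothetical helper.

```python
def sanitize_utf8(raw: bytes, drop_invalid: bool = False) -> str:
    """Decode bytes as UTF-8 without failing on invalid sequences.

    Invalid bytes are replaced with U+FFFD, or removed when drop_invalid is True.
    """
    errors = "ignore" if drop_invalid else "replace"
    return raw.decode("utf-8", errors=errors)


broken = b"caf\xc3\xa9 \xff ok"  # the byte 0xFF never appears in valid UTF-8
replaced = sanitize_utf8(broken)                  # invalid byte becomes U+FFFD
dropped = sanitize_utf8(broken, drop_invalid=True)  # invalid byte is removed
```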

Developers

A new Debug API call is available (it replaces the very rarely used sandbox, dry-run and input-data calls). It creates a snapshot of the data directory used for running the component and stores it in your KBC project. To learn more, feel free to go through the API Docs or through the tutorial. In short, the API call:

  • uses the same calling convention as the Run API,
  • filters encrypted values from the data directory,
  • works with all components (previously only those without encryption were supported),
  • works with Processors,
  • works with Configuration Rows,
  • works also with broken components and configurations (even if the run fails, you'll still get a snapshot of the data directory).

Python transformations

pip version 10 was released recently, and it removes the pip.main method (more reading). The recommended way to install packages from within Python is:

import subprocess
import sys
subprocess.call([sys.executable, '-m', 'pip', 'install', '--disable-pip-version-check', 'PACKAGE_NAME'])

Currently there are 70 transformations using the removed pip methods. If your projects are using them, we'll be contacting you with a list of affected transformations. This breaking change in pip is currently blocking us from upgrading Python to 3.6.5, where pip 10 is used by default.
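
If a failed install should stop the transformation rather than pass silently, a variant of the snippet above can use subprocess.check_call, which raises on a non-zero exit status. The sketch below is one way to wrap it; build_pip_command and install_package are hypothetical helper names, and 'PACKAGE_NAME' remains a placeholder.

```python
import subprocess
import sys
from typing import List


def build_pip_command(package: str) -> List[str]:
    """Build the pip invocation for the interpreter running this script."""
    return [sys.executable, "-m", "pip", "install",
            "--disable-pip-version-check", package]


def install_package(package: str) -> None:
    """Install a package; raises subprocess.CalledProcessError if pip fails."""
    subprocess.check_call(build_pip_command(package))


# Replaces the removed pip.main(['install', 'PACKAGE_NAME']) call:
# install_package('PACKAGE_NAME')
print(build_pip_command("PACKAGE_NAME")[1:])
```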

Orchestration Notification Updates Resulted in Deleted Tasks

There was an update to the orchestrator this week that had an unfortunate side effect: updating an orchestration's notifications deleted that orchestration's tasks.

Thankfully, the orchestrations are versioned, so if this happened to you, we will restore the tasks from the last version.
If you have any concerns about this, please contact us at support@keboola.com.

For what it's worth, updating notifications will no longer delete orchestration tasks. Please accept our humble apologies if you were affected.