michelangelus - Fotolia

Dell links with Syncsort to tune Cloudera Hadoop for offloaded ETL

Dell and others have a new ETL reference architecture. Its purpose is to ease migrations to Cloudera Hadoop. Also: Dell buys EMC; Syncsort is acquired.

Dell has allied with Hadoop vendor Cloudera and data integration house Syncsort to simplify extract, transform and load (ETL) functions on the Hadoop data platform. In short, Dell and the others have agreed on reference architecture for offloading ETL jobs, with the goal to make Cloudera Hadoop an easier alternative to established data warehouses.

The mix combines Syncsort's DMX-h tools with Dell and Cloudera products. Syncsort's DMX-h contribution includes visual tools, which handle collection, preparation and transformation of data within Hadoop. Syncsort is based in Woodcliff Lake, N.J., while Cloudera is in Palo Alto, Calif.

A natural target for the architecture is application migration. Data transformation work in existing relational databases and enterprise data warehouses (EDWs) is the specific objective. Such work can bog down data warehouses or incur high costs if software is added to address the problem.

"We've discovered those enterprise data warehouses people are trying to find new ways to control costs. As they evaluate workloads, they find some data transformation jobs are consuming a lot of performance and a lot of capacity," said Armando Acosta, Hadoop product and planning manager at Dell.

Acosta said Hadoop running on Dell hardware can provide an alternative to existing enterprise data warehouses, but that moving jobs to Hadoop can require advanced programming skills. The Syncsort tools help address such skill issues, he said.

Transforms of the corporate kind

In other developments, both Dell and Syncsort have been involved in corporate buyouts. The former, notably, agreed to purchase storage giant EMC for $67 billion. The deal, at this time, is without definitive details on future plans for EMC's Pivotal operations, including former Greenplum data management properties, which are a relatively small part of EMC's overall portfolio.

Meanwhile, Syncsort will be acquired by Clearlake Capital Group in Santa Monica, Calif. Terms were not disclosed. In recent years, Syncsort has expanded its mainframe ETL software suite to include Hadoop tools for a variety of platforms, and it has formed numerous alliances -- such as the deal to work with Cloudera Hadoop products.

The company said Clearlake's investment would allow it to further those efforts. In an email message, Lonne Jaffe, CEO of Syncsort, said the acquisition and additional investment will allow it to fund new products. He also said the purchase will include funding to pursue a selected strategy of acquisition to accelerate growth.

Next Steps

GoldenGate 12c creates fast links to Hadoop data

Find out about Hortonworks' Hadoop management initiative

Discover MapR's plan for replication across data centers

Dig Deeper on Hadoop framework