SANTA CLARA, Calif. -- Talend and Hortonworks are teaming up to bring Talend’s open source data integration tools to the Apache Hadoop community -- and they weren’t the only software vendors making “big data” announcements at this week’s O’Reilly Strata Conference.
Under terms of the deal, Hortonworks will offer the Talend Open Studio for Big Data as part of its Hortonworks Data Platform. Hortonworks is a Yahoo! spin-off that distributes a version of the Hadoop file system.
The Talend Open Studio for Big Data is an open source data integration package that allows users to move data between Hadoop and enterprise systems such as relational databases or data warehouses. The platform also lets users create connectors to related open source data management technologies like HBase, Pig, Sqoop and Hive.
Talend is just one of many commercial software vendors -- including Oracle Corp., IBM and Microsoft -- that now offer Hadoop connector technology.
“Our vision is that it shouldn’t require a Ph.D in MapReduce to implement your transformations,” said Yves de Montcheuil, Talend's vice president of marketing. “We want to democratize big data. We want to help people deploy big data a lot more quickly and more efficiently so they don’t have to write complex programs in MapReduce.”
Talend’s de Montcheuil also had a tip for organizations considering an investment in big data management technologies.
“The beauty of big data is that it’s based on an open source stack [and you can] adopt it on your own terms and see if you’re getting value out of it,” he said. “That is how most people start with big data anyways. They experiment and play around and, if they are getting value, then it will be a lot easier to make the case for an investment in the technology.”
Some of the other software vendors making major announcements at the Strata Conference included Calpont Corp., Pentaho Corp., DataStax, VMware Inc. and Alpine Data Labs.
Calpont used the occasion as a launching pad for the latest version of its InfiniDB high performance analytic database. The company says InfiniDB 3 is a columnar database platform that integrates with Amazon’s EC2 deployment and offers new parallel data load options designed to speed the process of loading large data stores.
Calpont, which was on hand at the Strata Conference demonstrating its latest capabilities, said InfiniDB will be available beginning next month.
Pentaho and DataStax announced a joint deal designed to make it easier for organizations to move information into and out of the popular Cassandra NoSQL database. Under the partnership, Pentaho will add native support for Cassandra in its commercial distribution of Apache Kettle, an open source data integration software package.
DataStax says that Pentaho Kettle’s highly visual interface and interactive reporting and analysis capabilities provide Cassandra users with an easier way to explore and extract valuable information from big data stores.
Server virtualization pioneer VMware announced that it has combined its Spring Java application framework with Hadoop to create a new offering for developers called Spring Hadoop.
VMware says the latest addition to its Spring Data family of open source projects speeds up the process of building distributed processing deployments with Hadoop. Spring Hadoop is a free download available now under the open source Apache 2.0 license.
Predictive analytics software maker Apline Data Labs unveiled its new Alpine Illuminator, a web-based collaboration framework for building predictive models which can be applied to subsets of big data stores.
In addition to offering users “a single place to share and iterate on predictive models,” the company says Alpine Illuminator offers users version control, modeling history reports, auditing capabilities, email alerts and a scheduling engine.
“A collaborative, web-based approach is taken for granted in most modern business applications,” said Steven Hillion, Alpine’s chief product officer, said in a press release. “But until now, there has been no way to extend this functionality to something as complex as the predictive analytics workflow.”
Strata Conference speaker explains how to become a data-driven CEO
Forrester analyst James Kobielus talks about “big data” at last year’s Strata Conference