- January 14, 2020
Cloudera finds a new leader, pulling the former CEO of Hortonworks back into the fold to help set the direction for the big data Hadoop vendor as it moves forward in 2020.
- September 25, 2019
Cloudera released a big data platform combining its technologies and ones from Hortonworks, initially in the AWS cloud but with multi-cloud support to come.
- September 05, 2019
The interim CEO of Cloudera is cautiously optimistic about growth prospects as the big data vendor acquired Arcadia Data and planned to bolster its own cloud platform.
- August 06, 2019
Longtime independent big data vendor MapR goes out of business, selling technology and intellectual property to HPE. The move marks the continuing decline of the Hadoop market.
- July 26, 2019
Inspired by the IBM-Red Hat model, Cloudera goes the open source route to broaden its market as demand for Hadoop weakens and the vendor takes on big competitors like AWS.
- May 31, 2019
It's right there in a MapR letter to California's labor department: A leader in the Hadoop market is desperately seeking funding after poor sales of its promising data platform.
- April 29, 2019
Teams at Wayfair mix new open source tools to power customer-facing apps. In such shops, tech leaders like Ben Clark must deftly maneuver an obstacle course of data components.
- April 15, 2019
Events are as important as data in emerging applications underlying many e-commerce efforts. Streams of events tell a company what motivates customers to use online products.
- April 04, 2019
Tools such as Unravel and Pepperdata offer a way to measure performance of big data cloud applications, which may aid companies with on-premises configuration issues.
- February 14, 2019
The Presto engine arose as an alternative to Hive for big data queries. Now, the Presto Software Foundation has formed to promote the SQL query software's virtues.
- January 22, 2019
Cloud architecture, analytics and AI data processing are top innovation priorities for new Teradata CEO Oliver Ratzesberger. He talks about his goals in this Q&A.
- January 15, 2019
Two wunderkinds of Hadoop have formalized their merger. Cloudera and Hortonworks say they will place special focus on AI as they chart the stand-alone vendor's future.
- December 28, 2018
Better data governance, increased cloud use and wider DataOps adoption head the list of trends for data management teams to plan for in 2019, IT analysts say.
- November 06, 2018
The IBM acquisition of Red Hat marks a watershed in computer architecture. The duo says it can rebuild data applications in new ways. This news analysis explores what's coming.
- October 04, 2018
Hadoop users will have fewer choices as big data rivals Cloudera and Hortonworks unite. But the new company may be more competitive with AWS and Google.
- September 26, 2018
BI on Hadoop is still new, but moving BI to data is trending. A data scientist working with IoT data at Komatsu sees the importance of getting big data to the right people.
- September 13, 2018
Hortonworks is joining with Red Hat and IBM to work together on a hybrid big data architecture format that will run using containers both in the cloud and on premises.
- September 07, 2018
Hadoop data tooling is expanding. One view holds that Hadoop is evolving from an alternative to data warehousing into a full-fledged big data analytics offering.
- September 04, 2018
A graph database startup's parallel loading, processing and querying combine to deliver real-time data for fintech firms that offer fast credit evaluations online.
- August 08, 2018
Confluent Platform updates seek to bring data streaming with Apache Kafka to a wider audience. A new GUI and user-defined functions are part of the 5.0 release.
- July 02, 2018
GDPR influence is touching a Hadoop big data world that was immune to many privacy considerations until now. This podcast features the rise of Hadoop data governance for data lakes.
- June 25, 2018
Hortonworks users talk about building Hadoop data lakes to support new applications -- and the challenges their teams face on ingesting and refining data for end users.
- June 18, 2018
Hortonworks now supports Google Cloud Storage and has also broadened cloud deals with Microsoft and IBM, aiming to increase cloud uses of its big data platform.
- March 21, 2018
Big data vendors and users are looking to Kubernetes-managed containers to help accelerate deployments and enable more flexible use of computing resources.
- March 15, 2018
StreamSets software for inspecting big data brings governance to data in motion. Such capabilities may find more use as the European Union's GDPR deadline looms.
- February 22, 2018
In big data news, we find Google TPUs, or Tensor Processing Units, offered as a cloud service, while LinkedIn is open sourcing a Hadoop test simulator called Dynamometer.
- January 03, 2018
Is this the post-Hadoop era? Not in the eyes of Hadoop 3.0 backers, who see the latest update to the big data framework succeeding in machine learning applications and cloud systems.
- December 01, 2017
Born at Cloudera, the MPP query engine known as Apache Impala has become a top-level open source project. It's one of various tools bringing SQL-style interactivity to big data analytics.
- September 28, 2017
The Strata conference in New York saw big data platform vendor MapR Technologies update MapR-DB, its NoSQL database engine, to better perform in real-time analytics applications.
- September 26, 2017
ETL jobs -- once the sole province of IT -- take on a new form as data wrangling and self-service gain greater traction with business users of analytics.
- August 31, 2017
SQL on Hadoop arrived -- so did SQL on Spark. Now, SQL on Kafka is emerging to provide a different way to look at Kafka data as it streams through the enterprise.
- August 30, 2017
In this Talking Data podcast, TechTarget editors discuss Hadoop's future, IBM's decision to resell the Hortonworks distribution of the open source technology and other big data issues.
- July 31, 2017
Data management startup Dremio has aimed its Apache Arrow expertise at the problem of self-service data delivery. In-column caches and optimization speed queries across varied data stores.
- June 20, 2017
IBM pulled the plug on its distribution of Hadoop in favor of reselling Hortonworks' bundle of big data technologies, a decision that reduces the number of Hadoop vendors to four.
- May 12, 2017
Kafka is a linchpin in many on-premises big data pipelines. Now, software vendor Confluent is offering a Kafka cloud service to ease use of the messaging and data streaming system in the cloud.
- April 28, 2017
Data lakes offer a more expansive alternative to data warehouses for analytics uses. TDWI analyst Philip Russom offers advice on how to get things right in a data lake architecture.
- April 28, 2017
Systems of engagement represent a hotbed of activity in data management these days. Flexibility and scalability are watchwords.
- April 20, 2017
Corporate users are becoming more open to deploying big data systems with Apache Spark in the cloud, Databricks CEO Ali Ghodsi says in a Q&A on the open source processing platform.
- April 14, 2017
Software containers encapsulate complexity and ease deployment, two traits that are helping to elicit growing interest in using them as part of big data systems.
- March 31, 2017
Fitness company Beachbody set up a data lake system in the AWS cloud to support big data analytics applications after deciding that an on-premises deployment would be too complicated.
- March 08, 2017
Application profiling software from Pepperdata is built on LinkedIn's Dr. Elephant open source entry. A primary goal is to get more Hadoop and Spark applications into production.
- February 21, 2017
Moving custom Spark and Hadoop pilot projects into production use has proved daunting. But container technology eased the transition at the Advisory Board analytics service.
- February 16, 2017
The Spark Streaming architecture has so far focused largely on programming perks. Now, as a bit of a hedge against other streaming choices, Drizzle comes to bat to cut streaming latency.
- December 02, 2016
Amazon's Athena data engine brings interactive SQL queries to S3 data sets and lets users pay as they go. It's based on an open source framework called Presto that Teradata and others also employ.
- November 30, 2016
The Louisiana Department of Health responded to flooding with the help of GIS software that located trouble spots with at-risk hospitals. Ease of use was welcome, according to a preparedness manager.
- October 28, 2016
Big data is moving from its bare-metal roots, and data streaming is a driver. Containers and microservices may have a role to play in what's next. An e-commerce application shows the way.
- October 06, 2016
What's in your toolbox? October's issue of Business Information turns the tables and puts that burning question to Capital One and several other business intelligence and data analytics software users. As the burgeoning worlds of ...
- September 30, 2016
Users increasingly are eyeing the cloud for big data management and analytics applications, and IT vendors are moving to ease the process -- and the price -- of running Hadoop in the cloud.
- September 15, 2016
HPE is paring down its software holdings, including analytical database software in the Vertica line and other big data tools. A sale to Micro Focus is due to close next year, leaving users in some limbo for now.
- August 31, 2016
Vertica 8.0 expands the analytical database's support for Kafka, Spark and Hadoop. That's an important step, as the Hewlett Packard Enterprise technology tries to compete in a field of diverse data tools.
- July 06, 2016
Hadoop management is becoming a bigger priority for big data users and vendors alike as the distributed processing framework plays a more central role in the business operations of organizations.
- June 09, 2016
Spark 2.0, with structured streaming and SQL 2003 support, is taking shape, as indicated at Databricks' Spark Summit, where R-to-Spark interfaces also popped up.
- June 03, 2016
Startup vendor Confluent is looking to place a stake in the big data ecosystem with Kafka streaming and management tools meant to reduce complexity in applications that place data in motion.
- April 29, 2016
Open source data engineering has become a way of life at e-commerce leader eBay, says the company's Debashis Saha. Kylin is one of the tools that has resulted.
- April 22, 2016
A new view on hybrid data architectures, in which data lakes and warehouses coexist, emerged at EDW 2016. The hybrid approach has implications for data design, skills and planning.
- April 19, 2016
Running a Hadoop cluster in the data center isn't for the weak. But several new tools aim to give IT operations teams a closer look into what's going on inside Hadoop-based big data systems.
- April 13, 2016
Pivotal Software dropped out of the Hadoop distribution business in favor of reselling the Hortonworks version of the big data framework -- and the market consolidation moves may not be over.
- April 01, 2016
Moving streams of data is a must in many modern applications. As a result, streaming analytics systems with Spark Streaming, Kafka and other components are coming to the big data forefront.
- March 31, 2016
At Strata + Hadoop World 2016, Hadoop co-creator Doug Cutting said the core of the distributed processing framework is likely to see its position at the center of big data systems diminish.
- March 24, 2016
The Strata + Hadoop World conference focuses on big data management and analytics technologies, in particular the Hadoop distributed processing framework and Spark processing engine.
- March 02, 2016
Looking to better balance system stability and innovation, Hadoop distribution provider Hortonworks will follow two release 'cadences' for different component sets in its HDP package.
- February 29, 2016
Its collection of big-data processing features is priming the Apache Spark architecture for wider deployment. One key trait: Spark performance outpaces MapReduce in many Hadoop use cases.
- February 24, 2016
Numerous SQL-on-Hadoop engines are available for accessing data stored in HDFS using the familiar SQL language. They all look promising, they all support a rich SQL dialect, but which one is the ...
- February 24, 2016
Amid the buzz at Spark Summit East 2016 in New York was word that the Spark data processing engine's stream processing architecture will be overhauled in the upcoming version 2.0 of the open source software.
- February 04, 2016
Hadoop has been slowly plodding through the big data jungle, but SQL's integration may put a spring in the elephant's step.
- February 03, 2016
Attention's been placed on Spark running on Hadoop, but there are Spark connectors for NoSQL that usher in a new class of operational analytics.
- January 28, 2016
In a Q&A as Hadoop reaches a 10-year milestone in its development, co-creator Doug Cutting talks about the adoption of the big data framework and about its history and future.
- January 27, 2016
Software architect Mansour Raad is at the center of activity as geospatial data melds with Hadoop -- and soon, Spark.
- January 14, 2016
MapR's Hadoop distribution will add a message system to feed a streaming data pipeline. It takes a cue from open-source Kafka technology.
- December 21, 2015
This episode of the 'Talking Data' podcast looks at the word of the year in data analytics and management. In 2015, Spark joined Hadoop and MapReduce at the top of the list of trending big data technologies.
- December 09, 2015
If Hadoop and Spark are to sneak into the enterprise, they will need to be manageable. With Driven, Concurrent Inc. takes a stab at the problem.
- November 16, 2015
IBM's planned purchase of The Weather Co.'s data operations may be a bellwether event from which data professionals can learn.
- October 30, 2015
At its Insight 2015 conference, IBM featured Apache Spark, releasing a cloud-based Spark service to support analytics applications and detailing Spark use in some of its own tools.
- October 27, 2015
Dell and others have a new ETL reference architecture. Its purpose is to ease migrations to Cloudera Hadoop. Also: Dell buys EMC; Syncsort is acquired.
- October 19, 2015
We may have outlived the era of killer apps that was defined in part by Walmart, but Hadoop big data applications may help the giant's quest for more growth.
- October 13, 2015
MapR takes JSON format data into Hadoop, while Teradata places its flagship database on AWS.
- October 07, 2015
Tracking 'What is Hadoop?' is getting more complex as the potential components of Hadoop systems increase -- and core elements such as HDFS are augmented by possible alternatives.
- September 17, 2015
The latest episode of BizApps Today examines barriers to Hadoop technology adoption, SQL-on-Hadoop options and the new concept of data storytelling.
- September 09, 2015
In a Q&A, Clarity Solution Group CTO Tripp Smith says to base SQL-on-Hadoop software decisions on actual workloads. Some Hadoop tools target batch jobs, while others are intended for interactive ones.
- August 14, 2015
A new data ingestion and extraction tool supporting the Hadoop Distributed File System is at the heart of startup vendor DataTorrent's efforts to broaden its big data analytics engine's appeal.
- August 13, 2015
In a Q&A, data warehousing expert Joe Caserta explains why a new generation of developers building Hadoop clusters and other big data systems may need an introduction to some fundamental rules of ETL.
- August 07, 2015
RelayHealth's Raheem Daya described the path he took to deploy and expand a Hadoop cluster for distributed data processing during a presentation at the 2015 TDWI conference in Boston.
- June 16, 2015
In many organizations, Hadoop is still pushing to go beyond proof-of-concept projects. Some vendors hope new tools that enable familiar SQL querying will lead it to broader adoption.
- May 29, 2015
Big data technologies have become vital to online ad platform developer Altitude Digital. And as the tools have evolved, CTO Manny Puentes has learned some lessons about using them.
- May 28, 2015
While Power BI took center stage at Microsoft Convergence, many users are struggling with CRM basics. Also: insight on the Spark processing engine.
- April 29, 2015
Startup JethroData's namesake Version 1.0 software is an index-based, SQL-on-Hadoop engine that forgoes full scans of Hadoop data sources.
- March 25, 2015
New database updates address large-scale applications. Included are Couchbase Server 4.0 with multidimensional scaling support and VoltDB 5.0 with links to Hadoop data streaming tools.
- March 09, 2015
As companies get a better handle on how to use big data wisely, a new effort on interoperability for Hadoop projects gets a mixed reaction.
- March 02, 2015
Hadoop vendor MapR's latest release puts the focus on database replication across data centers. Also, Microsoft has built a Python-friendly Azure service for machine learning in the cloud.
- February 24, 2015
The Open Data Platform has arrived, but not all Hadoop vendors are on board. The initiative, aimed at boosting interoperability, formed a backdrop for discussion at the Strata + Hadoop World 2015 conference.
- February 20, 2015
Building and running enterprise Hadoop applications takes more than data crunching. First, Hadoop data must be absorbed into company processes, a Western Union IT manager says.
- February 17, 2015
In a Q&A, Forrester analyst Mike Gualtieri said Hadoop-based data lakes can become an alternative to enterprise data warehouses. But first, faster I/O and better data governance are needed.
- February 06, 2015
A new Gartner report says the storage repository isn’t the trouble-free panacea many observers hail it to be. New data governance practices -- and new skills -- are critical.
- January 22, 2015
An update to Oracle's GoldenGate replicator is on tap. And, Xplenty and Segment connect on Hadoop processing in the cloud.
- January 06, 2015
Hadoop clusters, NoSQL databases and other modern technologies have roles to play in business intelligence and analytics environments. But traditional data warehouses still do, too.
- November 26, 2014
Startup vendor Splice Machine has put SQL capabilities on top of Hadoop to create a hybrid database that it said can run both transaction processing and analytics applications.
- November 24, 2014
Data management product announcements by HP, SAP and NuoDB show greater support for SQL-on-Hadoop analytics, enhanced cloud deployment and mixed processing workloads.
- November 05, 2014
The Spark processing engine, which targets machine learning and a range of other big data analytics applications, was a big topic at the 2014 Strata + Hadoop World conference in New York.
- October 24, 2014
Hadoop may get most of the big data glory. But Strata + Hadoop World attendees said combining it with other technologies is what's really needed to power big data applications.
- September 25, 2014
News briefs describe MapR's latest Hadoop distribution, which includes Apache Drill for SQL analytics. Also, InfiniDB said it will shut down its business.