- July 02, 2018
GDPR influence is touching a Hadoop big data world that was immune to many privacy considerations until now. This podcast features the rise of Hadoop data governance for data lakes.
- June 25, 2018
Hortonworks users talk about building Hadoop data lakes to support new applications -- and the challenges their teams face on ingesting and refining data for end users.
- June 18, 2018
Hortonworks now supports Google Cloud Storage and has also broadened cloud deals with Microsoft and IBM, aiming to increase cloud uses of its big data platform.
- March 21, 2018
Big data vendors and users are looking to Kubernetes-managed containers to help accelerate system and application deployments and enable more flexible use of computing resources.
- March 15, 2018
StreamSets software for inspecting big data brings governance to data in motion. Such capabilities may find more use as the European Union's GDPR deadline looms.
- February 22, 2018
In big data news, we find Google TPUs, or Tensor Processing Units, offered as a cloud service, while LinkedIn is open sourcing a Hadoop test simulator called Dynamometer.
- January 03, 2018
Is this the post-Hadoop era? Not in the eyes of Hadoop 3.0 backers, who see the latest update to the big data framework succeeding in machine learning applications and cloud systems.
- December 01, 2017
Born at Cloudera, the MPP query engine known as Apache Impala has become a top-level open source project. It's one of various tools bringing SQL-style interactivity to big data analytics.
- September 28, 2017
The Strata conference in New York saw big data platform vendor MapR Technologies update MapR-DB, its NoSQL database engine, to better perform in real-time analytics applications.
- September 26, 2017
ETL jobs -- once the sole province of IT -- take on a new form as data wrangling and self-service gain greater traction with business users of analytics.
- August 31, 2017
SQL on Hadoop arrived -- so did SQL on Spark. Now, SQL on Kafka is emerging to provide a different way to look at Kafka data as it streams through the enterprise.
- August 30, 2017
In this Talking Data podcast, TechTarget editors discuss Hadoop's future, IBM's decision to resell the Hortonworks distribution of the open source technology and other big data issues.
- July 31, 2017
Data management startup Dremio has aimed its Apache Arrow expertise at the problem of self-service data delivery. In-column caches and optimization speed queries across varied data stores.
- June 20, 2017
IBM pulled the plug on its distribution of Hadoop in favor of reselling Hortonworks' bundle of big data technologies, a decision that reduces the number of Hadoop vendors to four.
- May 12, 2017
Kafka is a linchpin in many on-premises big data pipelines. Now, software vendor Confluent is offering a Kafka cloud service to ease use of the messaging and data streaming system in the cloud.