New & Notable
Hadoop framework News
September 18, 2020
WANdisco LiveData Migrator moves Hadoop data while avoiding downtime. Although niche, it addresses the problem of migrating Hadoop data that is still being actively changed.
July 23, 2020
Enterprise Presto SQL vendor Starburst updated its data query platform with expanded support for legacy Hadoop workloads as well as modern cloud data lake deployments.
January 14, 2020
Cloudera finds a new leader, pulling the former CEO of Hortonworks back into the fold to help set the direction for the big data Hadoop vendor as it moves forward in 2020.
September 25, 2019
Cloudera released a big data platform combining its technologies and ones from Hortonworks, initially in the AWS cloud but with multi-cloud support to come.
Hadoop framework Get Started
Bring yourself up to speed with our introductory content
Apache Hadoop YARN is the resource management and job scheduling technology in the open source Hadoop distributed processing framework.
Big data analytics is the often complex process of examining large and varied data sets, or big data, to uncover information -- such as hidden patterns, unknown correlations, market trends and customer preferences -- that can help organizations make...
Your data lake is full of sensitive information and securing that data is a top priority. These are the best practices to keep that information safe from hackers.
Evaluate Hadoop framework Vendors & Products
Weigh the pros and cons of technologies, products and projects you are considering.
There are many important parts to a data lake implementation, from technology to governance. Read on for the top factors to evaluate in your implementation strategy.
The latest release of the Apache Kafka open source event streaming platform adds improved replication and availability capabilities to help boost overall performance.
There are many ways to store big data, but the choice of data warehouse vs. data lake vs. data mart comes down to who uses the data and how. Use this cheat sheet to compare.
Manage Hadoop framework
Learn to apply best practices and optimize your operations.
On premises or in the cloud: What's the better place for your data lake? Here are some things to consider before deciding where to deploy a big data environment.
Many organizations are switching to cloud databases and big data platforms. But understanding which option best meets your data needs is an important first step.
Better data governance, increased cloud use and wider DataOps adoption head the list of trends for data management teams to plan for in 2019, IT analysts say.
Problem Solve Hadoop framework Issues
We’ve gathered up expert advice and tips from professionals like you so that the answers you need are always available.
Flooding a Hadoop cluster with data that isn't well organized and managed can stymie analytics efforts. Take these steps to help make your data lake accessible and usable.
The new thing in big data is Kubernetes container orchestration. Adoption is still in its early stages, but there are signs of activity, which are discussed in this edition of the Talking Data podcast.
In the era of more and more digital orders, Panera Bread encountered big data challenges that led the restaurant chain to deploy a new cluster architecture with Hadoop, Spark and other technologies.