New & Notable
Hadoop framework News
June 18, 2018
Hortonworks now supports Google Cloud Storage and has also broadened cloud deals with Microsoft and IBM, aiming to increase cloud uses of its big data platform.
March 21, 2018
Big data vendors and users are looking to Kubernetes-managed containers to help accelerate system and application deployments and enable more flexible use of computing resources.
March 15, 2018
StreamSets software for inspecting big data brings governance to data in motion. Such capabilities may find more use as the European Union's GDPR deadline looms.
February 22, 2018
In big data news, we find Google TPUs, or Tensor Processing Units, offered as a cloud service, while LinkedIn is open sourcing a Hadoop test simulator called Dynamometer.
Hadoop framework Get Started
Bring yourself up to speed with our introductory content
Hadoop is an open source distributed processing framework that manages data processing and storage for big data applications running in clustered systems. Continue Reading
Apache Hadoop YARN is the resource management and job scheduling technology in the open source Hadoop distributed processing framework. Continue Reading
The Hadoop Distributed File System (HDFS) is the primary data storage system used by Hadoop applications. Continue Reading
Evaluate Hadoop framework Vendors & Products
Weigh the pros and cons of technologies, products and projects you are considering.
Learn how the Spark DataFrame execution plan works and why its lazy evaluation model helps the processing engine to avoid the performance issues inherent in Hadoop MapReduce. Continue Reading
Relational databases may have hit a wall of late, but the SQL query engine seems poised for wider growth. Starburst, a retro startup of sorts, is among those looking to take it wider still. Continue Reading
Understanding the data integration process is central to self-service BI and data architecture design, consultant Rick Sherman says in an end-of-year look at data management trends. Continue Reading
Manage Hadoop framework
Learn to apply best practices and optimize your operations.
Data security needs to be addressed upfront in deployments of big data systems -- and users are likely to find they have to build some security capabilities themselves. Continue Reading
Data lakes pose technology deployment and data management challenges that can leave analytics users high and dry if the implementation process isn't handled properly. Continue Reading
Today, analytics work is about speed. That means rapidly building clusters and transforming and querying data. Learn how users are streamlining digital business. Continue Reading
Problem Solve Hadoop framework Issues
We’ve gathered up expert advice and tips from professionals like you so that the answers you need are always available.
The new thing in big data is Kubernetes container orchestration. While it's still early, there are signs of activity, which are cited in this edition of the Talking Data podcast. Continue Reading
Flooding a Hadoop cluster with data that isn't organized and managed properly can stymie analytics efforts. Take these steps to help make your data lake accessible and usable. Continue Reading
In the era of more and more digital orders, Panera Bread encountered big data challenges that led the restaurant chain to deploy a new cluster architecture with Hadoop, Spark and other technologies. Continue Reading