New & Notable
Big data management News
October 18, 2019
Databricks has found a new home at the Linux Foundation for its open source Delta Lake data lake project, in a bid to help grow a broader community and accelerate adoption.
September 25, 2019
Cloudera released a big data platform combining its technologies and ones from Hortonworks, initially in the AWS cloud but with multi-cloud support to come.
August 20, 2019
Self-service data preparation can duplicate work and slow down analytics. One possible fix: an internal marketplace where users can 'shop' for data assets.
August 6, 2019
Longtime independent big data vendor MapR goes out of business, selling technology and intellectual property to HPE. The move marks the continuing decline of the Hadoop market.
Big data management Get Started
Bring yourself up to speed with our introductory content.
Hadoop is an open source distributed processing framework that manages data processing and storage for big data applications running in clustered systems.
An entity relationship diagram (ERD), also known as an entity relationship model, is a graphical representation that depicts relationships among people, objects, places, concepts or events within an information technology (IT) system.
Big data analytics is the often complex process of examining large and varied data sets, or big data, to uncover information -- such as hidden patterns, unknown correlations, market trends and customer preferences -- that can help organizations make...
Evaluate Big data management Vendors & Products
Weigh the pros and cons of technologies, products and projects you are considering.
There are many ways to store big data, but the choice of data warehouse vs. data lake vs. data mart comes down to who uses the data and how. Use this cheat sheet to compare.
Enterprises in need of a big data platform must run some analytics of their own to choose a vendor. AWS' integration between services can't be beat, but is Cloudera a better choice?
Veteran data professional Michael Bowers differentiates between key data management positions, including their salaries and which ones can add the most business value.
Manage Big data management
Learn to apply best practices and optimize your operations.
Most data being created today is unstructured, and storage pros often find themselves struggling to keep up. Luckily, efficient unstructured data storage is still possible.
As business intelligence analysis and reporting platforms become increasingly important in the enterprise, so does the data that feeds them. Are your BI data sources up to par?
As automation grows, data scientists will focus more on business needs, strategic oversight and deep learning and less on model creation and other routine tasks.
Problem Solve Big data management Issues
We’ve gathered up expert advice and tips from professionals like you so that the answers you need are always available.
Flooding a Hadoop cluster with data that isn't well organized and managed can stymie analytics efforts. Take these steps to help make your data lake accessible and usable.
The new thing in big data is Kubernetes container orchestration. While it's still early, there are signs of activity, which are cited in this edition of the Talking Data podcast.
In the era of more and more digital orders, Panera Bread encountered big data challenges that led the restaurant chain to deploy a new cluster architecture with Hadoop, Spark and other technologies.