Hadoop framework
New & Notable
Hadoop framework News
-
February 24, 2021
24
Feb'21
Next Pathway update targets Hadoop migrations
Next Pathway's data scanning and migration tools can now analyze Hadoop ecosystems and translate its codes to cloud-based data warehouses such as Snowflake and Amazon RedShift.
-
January 15, 2021
15
Jan'21
Informatica takes Customer 360 master data management to cloud
Updated MDM service benefits from integrations with the broader cloud-native Informatica platform that is built on top of a microservices Kubernetes-based architecture.
-
September 18, 2020
18
Sep'20
WANdisco launches automated Hadoop data migrator for AWS
WANdisco LiveData Migrator moves Hadoop data while avoiding downtime. Although niche, it solves the hurdle of migrating Hadoop data while it's going through active changes.
-
July 23, 2020
23
Jul'20
Starburst advances Presto to handle Hadoop data better
Enterprise Presto SQL vendor Starburst updated its data query platform with expanded support for legacy Hadoop workloads as well as modern cloud data lake deployments.
Hadoop framework Get Started
Bring yourself up to speed with our introductory content
-
Hadoop Distributed File System (HDFS)
The Hadoop Distributed File System (HDFS) is the primary data storage system used by Hadoop applications. Continue Reading
-
big data analytics
Big data analytics is the often complex process of examining big data to uncover information -- such as hidden patterns, correlations, market trends and customer preferences -- that can help organizations make informed business decisions. Continue Reading
-
Apache Hadoop YARN
Apache Hadoop YARN is the resource management and job scheduling technology in the open source Hadoop distributed processing framework. Continue Reading
Evaluate Hadoop framework Vendors & Products
Weigh the pros and cons of technologies, products and projects you are considering.
-
Data warehouse vs. data lake: Key differences
Data warehouses and data lakes are both data repositories common in the enterprise, but what are the main differences between the two and which is best for your data? Continue Reading
-
Key factors for successful data lake implementation
There are many important parts to a data lake implementation, from technology to governance. Read on for the top factors to evaluate in your implementation strategy. Continue Reading
-
Apache Kafka version 2.4 improves streaming data performance
The latest release of the Apache Kafka open source event streaming platform adds improved replication and availability capabilities to help boost overall performance. Continue Reading
Manage Hadoop framework
Learn to apply best practices and optimize your operations.
-
Should you host your data lake in the cloud?
On premises or in the cloud: What's the better place for your data lake? Here are some things to consider before deciding where to deploy a big data environment. Continue Reading
-
Top database cloud migration considerations for enterprises
Many organizations are switching to cloud databases and big data platforms. But understanding what option best meets your data needs is an important first step. Continue Reading
-
Data management trends for 2019: Governance, DataOps, cloud
Better data governance, increased cloud use and wider DataOps adoption head the list of trends for data management teams to plan for in 2019, IT analysts say. Continue Reading
Problem Solve Hadoop framework Issues
We’ve gathered up expert advice and tips from professionals like you so that the answers you need are always available.
-
7 steps to a successful data lake implementation
Flooding a Hadoop cluster with data that isn't well organized and managed can stymie analytics efforts. Take these steps to help make your data lake accessible and usable. Continue Reading
-
Kubernetes container orchestration gets big data star turn
The new thing in big data is Kubernetes container orchestration. While it's still early, there are signs of activity, which are cited in this edition of the Talking Data podcast. Continue Reading
-
Panera meets big data challenge of lunchtime operations
In the era of more and more digital orders, Panera Bread encountered big data challenges that led the restaurant chain to deploy a new cluster architecture with Hadoop, Spark and other technologies. Continue Reading