Hadoop framework
New & Notable



Machine Learning Meets Data Quality
Read an exclusive interview with Andrew Burt, chief privacy offer and legal engineer at Immuta Inc., on data governance and machine learning integration. Plus, uncover steps IT managers are taking to improve data quality in their big data environments in order to ensure analytics accuracy.
Hadoop framework News
-
February 14, 2019
14
Feb'19
Originators form group to boost Presto SQL query engine
The Presto engine arose as an alternative to Hive for big data queries. Now, the Presto Software Foundation has formed to promote the SQL query software's virtues.
-
January 22, 2019
22
Jan'19
New Teradata CEO pursues cloud-based architecture
Cloud architecture, analytics and AI data processing are top innovation priorities for new Teradata CEO Oliver Ratzesberger. He talks about his goals in this Q&A.
-
January 15, 2019
15
Jan'19
Cloudera and Hortonworks combo to push CDP, machine learning
Two wunderkinds of Hadoop have formalized their merger. Cloudera and Hortonworks say they will place special focus on AI as they chart the stand-alone vendor's future.
-
December 28, 2018
28
Dec'18
Data management trends for 2019: Governance, DataOps, cloud
Better data governance, increased cloud use and wider DataOps adoption head the list of trends for data management teams to plan for in 2019, IT analysts say.
Hadoop framework Get Started
Bring yourself up to speed with our introductory content
-
big data analytics
Big data analytics is the often complex process of examining large and varied data sets -- or big data -- to uncover information including hidden patterns, unknown correlations, market trends and customer preferences that can help organizations make... Continue Reading
-
Apache Hive
Apache Hive is an open source data warehouse system for querying and analyzing large data sets that are principally stored in Hadoop files. Continue Reading
-
Hadoop
Hadoop is an open source distributed processing framework that manages data processing and storage for big data applications running in clustered systems. Continue Reading
Evaluate Hadoop framework Vendors & Products
Weigh the pros and cons of technologies, products and projects you are considering.
-
Open source support was central to 2018 data deals
Mergers and acquisitions unsettled the big data status quo in 2018. Open source support made these couplings a bit different than those of the past, Talking Data podcasters said. Continue Reading
-
Neo4j graph database targets AI use cases, performance
The Neo4j graph database is poised for use in AI applications, in which understanding data can stymie efforts. Recent updates target AI needs and performance issues. Continue Reading
-
Cloud buoys data microservices -- for on-premises systems, too
Data in a microservices architecture is percolating anew. This news analysis looks at IBM Cloud Private for Data and other means to harmonize data in public and private locations. Continue Reading
Manage Hadoop framework
Learn to apply best practices and optimize your operations.
-
Big data platform broadens place in analytics architecture
Big data platforms stumbled a bit getting out of the prototyping stage. But a view from the Strata Data Conference in New York sees broader use in the offing. Continue Reading
-
Mining equipment maker uses BI on Hadoop to dig for data
BI on Hadoop is still new, but moving BI to data is trending. A data scientist working with IoT data at Komatsu sees the importance of getting big data to the right people. Continue Reading
-
IT teams take big data security issues into their own hands
Data security needs to be addressed upfront in deployments of big data systems -- and users are likely to find they have to build some security capabilities themselves. Continue Reading
Problem Solve Hadoop framework Issues
We’ve gathered up expert advice and tips from professionals like you so that the answers you need are always available.
-
Kubernetes container orchestration gets big data star turn
The new thing in big data is Kubernetes container orchestration. While it's still early, there are signs of activity, which are cited in this edition of the Talking Data podcast. Continue Reading
-
Seven steps to a successful data lake implementation
Flooding a Hadoop cluster with data that isn't organized and managed properly can stymie analytics efforts. Take these steps to help make your data lake accessible and usable. Continue Reading
-
Panera meets big data challenge of lunchtime operations
In the era of more and more digital orders, Panera Bread encountered big data challenges that led the restaurant chain to deploy a new cluster architecture with Hadoop, Spark and other technologies. Continue Reading