Enterprise data architecture best practices
- July 08, 2020
A new Linux Foundation-sponsored group will support open source data management and promote interoperability between applications and data sources, on premises or cloud-based.
- February 24, 2020
Credit Karma's vice president of engineering explains why and how the personal finance service is using the GraphQL data query technology to support its growing business.
- September 30, 2019
The list of vendors offering managed cloud database services is growing rapidly, which gives IT teams more opportunities to offload management of database systems.
- August 29, 2019
Data management consultant Donna Burbank outlines how effective data governance hinges on the deployment of a comprehensive enterprise data architecture.
- August 20, 2019
Self-service data preparation can duplicate work and slow down analytics. One possible fix: an internal marketplace where users can 'shop' for data assets.
- May 31, 2019
It's right there in a MapR letter to California's labor department: A leader in the Hadoop market is desperately seeking funding after poor sales of its promising data platform.
- May 01, 2019
Frank Slootman, who led ServiceNow and Data Domain through successful IPOs, is the new chairman and CEO of cloud data warehouse vendor Snowflake, replacing former CEO Bob Muglia.
- April 30, 2019
In this Q&A, now-former Snowflake CEO Bob Muglia discusses the vendor's decision to embrace cloud data warehousing and how the industry is changing as more data moves to the cloud.
- April 29, 2019
Teams at Wayfair mix new open source tools to power customer-facing apps. In such shops, tech leaders like Ben Clark must deftly maneuver an obstacle course of data components.
- April 04, 2019
Tools such as Unravel and Pepperdata offer a way to measure performance of big data cloud applications, which may aid companies with on-premises configuration issues.
- March 21, 2019
Machine learning will bring change to analytics and data management, said data luminary Michael Stonebraker. Others agree managing such change will take special effort.
- March 15, 2019
Many data professionals have yet to solidify traditional data management practices, but they have a new set of challenges to overcome to ensure data privacy and avoid misuse.
- March 12, 2019
Apache Kafka and Apache Spark connectors ease use of the Aerospike NoSQL data store in high-speed applications such as analytics that are becoming more broadly supported.
- January 22, 2019
Cloud architecture, analytics and AI data processing are top innovation priorities for new Teradata CEO Oliver Ratzesberger. He talks about his goals in this Q&A.
- December 28, 2018
Better data governance, increased cloud use and wider DataOps adoption head the list of trends for data management teams to plan for in 2019, IT analysts say.
- December 18, 2018
Datawatch plans to add analytics and data visualization tools to its Swarm data preparation platform, starting with integration capabilities in a Swarm 2.2 release.
- November 30, 2018
Purpose-built databases are taking aim at general-purpose ones, especially in the cloud. At re:Invent 2018, AWS added time-series and transaction ledger databases to expand its line.
- October 04, 2018
Hadoop users will have fewer choices as big data rivals Cloudera and Hortonworks unite. But the new company may be more competitive with AWS and Google.
- September 26, 2018
BI on Hadoop is still new, but moving BI to data is trending. A data scientist working with IoT data at Komatsu sees the importance of getting big data to the right people.
- September 13, 2018
Hortonworks is joining with Red Hat and IBM to work together on a hybrid big data architecture format that will run using containers both in the cloud and on premises.
- August 08, 2018
Confluent Platform updates seek to bring data streaming with Apache Kafka to a wider audience. A new GUI and user-defined functions are part of the 5.0 release.
- July 19, 2018
Chief data officers and experts see the CDO role as changing to a more strategic orientation -- especially finding key opportunities in vast troves of data.
- July 16, 2018
The chief data officer role is about many things -- regulations, innovation, AI and more. Consultant Randy Bean discussed the matter ahead of an MIT symposium on the topic.
- June 27, 2018
NoSQL vendor MongoDB upgraded its database software with ACID support, while also releasing a serverless platform intended to simplify application development.
- June 25, 2018
Hortonworks users talk about building Hadoop data lakes to support new applications -- and the challenges their teams face on ingesting and refining data for end users.
- June 18, 2018
Hortonworks now supports Google Cloud Storage and has also broadened cloud deals with Microsoft and IBM, aiming to increase cloud uses of its big data platform.
- March 21, 2018
Big data vendors and users are looking to Kubernetes-managed containers to help accelerate deployments and enable more flexible use of computing resources.
- January 03, 2018
Is this the post-Hadoop era? Not in the eyes of Hadoop 3.0 backers, who see the latest update to the big data framework succeeding in machine learning applications and cloud systems.
- August 30, 2017
In this Talking Data podcast, TechTarget editors discuss Hadoop's future, IBM's decision to resell the Hortonworks distribution of the open source technology and other big data issues.
- July 24, 2017
In many organizations, chief data officer jobs centered on defense against risk are giving way to ones emphasizing innovation. To make that shift, CDOs must nurture a data culture, MIT panelists said.
- June 30, 2017
The quest for the agile database is putting developers in the forefront and has some DBA tasks moving to the development groups, according to panelists at a conference in Boston.
- June 29, 2017
With the EU's new General Data Protection Regulation looming on the horizon, companies -- including many in the U.S. -- need to get going on required data governance upgrades.
- June 23, 2017
MongoDB has expanded cloud coverage for its Atlas hosted database service, with Azure and Google versions joining an initial AWS-based offering to give users a choice on cloud platforms.
- June 20, 2017
IBM pulled the plug on its distribution of Hadoop in favor of reselling Hortonworks' bundle of big data technologies, a decision that reduces the number of Hadoop vendors to four.
- May 31, 2017
Deep learning applications often require a mix of data, and assorted preprocessing techniques. That makes data preparation a priority, and conventional machine learning may have a role to play.
- May 12, 2017
Kafka is a linchpin in many on-premises big data pipelines. Now, software vendor Confluent is offering a Kafka cloud service to ease use of the messaging and data streaming system in the cloud.
- April 28, 2017
Data lakes offer a more expansive alternative to data warehouses for analytics uses. TDWI analyst Philip Russom offers advice on how to get things right in a data lake architecture.
- April 14, 2017
Software containers encapsulate complexity and ease deployment, two traits that are helping to elicit growing interest in using them as part of big data systems.
- March 31, 2017
Fitness company Beachbody set up a data lake system in the AWS cloud to support big data analytics applications after deciding that an on-premises deployment would be too complicated.
- February 21, 2017
Moving custom Spark and Hadoop pilot projects into production use has proved daunting. But container technology eased the transition at the Advisory Board analytics service.
- October 06, 2016
What's in your toolbox? October's issue of Business Information turns the tables and puts that burning question to Capital One and several other business intelligence and data analytics software users. As the burgeoning worlds of ...
- September 30, 2016
Users increasingly are eyeing the cloud for big data management and analytics applications, and IT vendors are moving to ease the process -- and the price -- of running Hadoop in the cloud.
- July 06, 2016
Hadoop management is becoming a bigger priority for big data users and vendors alike as the distributed processing framework plays a more central role in the business operations of organizations.
- June 01, 2016
Data wrapping -- in this case, bundling data and analytics services with products -- may entice more companies to become data businesses. A panel at an MIT symposium considered some best practices for doing so.
- April 01, 2016
Moving streams of data is a must in many modern applications. As a result, streaming analytics systems with Spark Streaming, Kafka and other components are coming to the big data forefront.
- March 31, 2016
At Strata + Hadoop World 2016, Hadoop co-creator Doug Cutting said the core of the distributed processing framework is likely to see its position at the center of big data systems diminish.
- January 28, 2016
In a Q&A as Hadoop reaches a 10-year milestone in its development, co-creator Doug Cutting talks about the adoption of the big data framework, and the history and future of Hadoop.
- January 14, 2016
MapR's Hadoop distribution will add a message system to feed a streaming data pipeline. It takes a cue from open-source Kafka technology.
- December 28, 2015
In 2015, APIs for IBM's Watson system were front and center as a means to bring cognitive computing applications to a broader corporate audience.
- December 21, 2015
This episode of the 'Talking Data' podcast looks at the word of the year in data analytics and management. In 2015, Spark joined Hadoop and MapReduce at the top of the list of trending big data technologies.
- December 21, 2015
In a Q&A, big data and data science expert Kirk Borne discusses new data processing and analytics technologies and the growing importance of data literacy in organizations.
- November 16, 2015
IBM's planned purchase of The Weather Co.'s data operations may be a bellwether event from which data professionals can learn.
- October 30, 2015
At its Insight 2015 conference, IBM featured Apache Spark, releasing a cloud-based Spark service to support analytics applications and detailing Spark use in some of its own tools.
- October 07, 2015
Tracking 'What is Hadoop?' is getting more complex as the potential components of Hadoop systems increase -- and core elements such as HDFS are augmented by possible alternatives.
- September 22, 2015
At a TDWI Boston Chapter meeting, Mark Madsen says some notions of information become outdated in the face of big data analytics. This is part one of two.
- September 22, 2015
Operations and big data analytics applications are beginning to blend, causing changes in data strategies, Mark Madsen tells a TDWI Boston Chapter meeting. This is part two of two.
- September 09, 2015
In a Q&A, Clarity Solution Group CTO Tripp Smith says to base SQL-on-Hadoop software decisions on actual workloads. Some Hadoop tools target batch jobs, while others are intended for interactive ones.
- August 13, 2015
In a Q&A, data warehousing expert Joe Caserta explains why a new generation of developers building Hadoop clusters and other big data systems may need an introduction to some fundamental rules of ETL.
- August 07, 2015
RelayHealth's Raheem Daya described the path he took to deploy and expand a Hadoop cluster for distributed data processing during a presentation at the 2015 TDWI conference in Boston.
- June 15, 2015
Sales intelligence is making brainiacs out of sales reps. As companies look for ways to capture the attention of customers new and old, many are turning to analytics tools and data to help seal the deal. Some use data to shape sales ...
- May 28, 2015
In this episode of the Talking Data podcast, TechTarget editors report on their experiences learning to do data journalism, including all the work needed to get data ready for analysis.
- May 28, 2015
While Power BI took center stage at Microsoft Convergence, many users are struggling with CRM basics. Also: insight on the Spark processing engine.
- February 06, 2015
A new Gartner report says the storage repository isn’t the trouble-free panacea many observers hail it to be. New data governance practices -- and new skills -- are critical.
- January 06, 2015
Hadoop clusters, NoSQL databases and other modern technologies have roles to play in business intelligence and analytics environments. But traditional data warehouses still do, too.
- July 29, 2014
At the TDWI Executive Summit in Boston, users talked about the benefits and challenges of incorporating Agile development methodologies into data warehouse and business intelligence projects.
- March 31, 2014
In a Q&A, author William McKnight discusses the importance of building a robust information architecture -- and having the right person in the lead.
- October 31, 2013
Requirements gathering shouldn't get short shrift in the database rush, says Michael J. Hernandez, author of 'Database Design for Mere Mortals.'
- October 15, 2013
Google's Jeromy Carriere spoke about the search engine giant's big data infrastructure at a recent TDWI meeting. Should it influence other efforts?
- May 07, 2013
In a podcast Q&A, consultant William McKnight offers advice on steering clear of common hazards that can puncture the performance of data warehouses.
- January 10, 2013
Data architects creating operational business intelligence applications may need to put streams of Hadoop data into a fast messaging infrastructure.
- August 08, 2012
Effective governance can help companies get the most out of their "big data" environments. But at this point, there's no formula for how to do that.
- February 29, 2012
Gartner says the onslaught of “big data” is turning data warehouses into distributed data processing platforms. But Hadoop and other big data tools aren’t as widespread as you might think -- yet.
- February 27, 2012
In the past, analytics applications typically were powered by relational databases. Now the options are more varied and less straightforward, says TechTarget's Wayne Eckerson.
- September 15, 2011
A longtime data management professional illustrates the synergies that exist between the Inmon and Kimball data warehousing methodologies.
- September 13, 2011
In an interview, data warehousing pioneer Bill Inmon states his case on the merits of his namesake methodology vs. Ralph Kimball's and discusses the "DW 2.0" version of the Inmon architecture.
- September 01, 2011
Data warehouse architect Bill Harrison explains how he got the go-ahead to purchase a new set of graphical data modeling tools.
- August 11, 2011
Forrester analyst James Kobielus discusses traditional data warehouse concepts, Hadoop and the three best ways to start making sense of "big data."
- July 28, 2011
IT pros and analysts offer guidance on how to cope with the complexities of "big data" installations in data warehouses and alternative data stores.
- April 21, 2011
The director of data warehousing and analytics at Walt Disney Parks and Resorts shares some important BI and data warehouse best practices.
- March 25, 2011
The truth is redundancy of data is absolutely a normal part of life.
- March 23, 2011
Capacity planning is essential for all data warehouse environments. Data warehouses grow at high rates and must be managed to keep budgets under control.
- December 06, 2010
Learn about key issues to consider as part of the database planning process, and get advice from consultant Mark Whitehorn on developing a successful database strategy.
- November 22, 2010
Database as a Service (DaaS) technology is becoming a more viable alternative to on-premises databases. Get guidance on the pros and cons of using cloud database software.
- June 09, 2009
Hear a definition of hub-and-spoke architecture and learn about the challenges surrounding it, including the issues to weigh before implementing a hub-and-spoke design and the role data governance plays in the model.
- December 11, 2007
Putting together a comprehensive data migration plan now can help firms avoid serious downtime later, according to experts.
- February 08, 2007
This article covers the stages of data warehousing from the data mart through the enterprise data warehouse.
- December 19, 2006
This article is a continuation in the series on enterprise architecture.
- August 18, 2005
There is a case to be made for management to process and react to data slowly, not quickly.