Features
Features
-
Users balance Spark support from vendors, access to new features
Apache Spark users are often faced with a quandary: continue with vendor support or break out on their own to a newer version of the fast-moving open source software with updated features? Continue Reading
-
Some early adopters take it easy on Spark cluster rollouts
Software companies Intuit and Novantas took deliberate approaches to deploying their first Spark clusters, limiting initial user access and looking for solid business uses in a 'surgical' way. Continue Reading
-
The Diaku Axon data governance platform addresses regulated industries
The Diaku Axon data governance platform is designed to assist financial institutions as well as other industries with complex regulatory compliance environments. Continue Reading
-
Examining the top data governance tools on the market
Expert John Ladley examines the leading data governance software products, comparing and contrasting their features to help you determine which tool will best meet your needs. Continue Reading
-
What to know about the Data3Sixty cloud-based data governance platform
With Data3Sixty's cloud-based grow-as-you-go subscription model, organizations don't have to invest in a major data governance implementation or worry about complex upgrades. Continue Reading
-
Alation Data Catalog helps provide collaborative data governance
Alation Data Catalog provides business analysts and data stewards with search and discovery, governance and collaboration functionality, and automates many data governance tasks. Continue Reading
-
Finding the way to different types of databases, big data tools
In a Q&A, EMA analyst John Myers advises IT teams to look at big data workloads when sorting through new and different types of databases and open source tools. His word on Spark? It's still young. Continue Reading
-
What you should know about Collibra Data Governance Center
Collibra Data Governance Center is a repository- and workflow-oriented data governance platform that also offers features for data management and data stewardship. Continue Reading
-
How to narrow down your choices for buying a data governance tool
Evaluating and selecting a data governance tool depends on not only features and functionality, but also how you will use the tool to add value to your organization. Continue Reading
-
Up-and-coming data engineers complement entrenched data scientists
Even before the hot title of data scientist was fully defined, a complementary role began to bubble up: data engineer. Here's how they differ and why companies may need one, the other or both. Continue Reading
-
Business intelligence app integration, self-service deemed must-haves
Businesses must either disrupt or be disrupted. One way to do the former is to integrate sales intelligence apps and make them self-service. Sound difficult? It doesn't have to be. Continue Reading
-
What to know about Information Builders' Omni-Gen data governance tool
Omni-Gen from Information Builders features a variety of tools for enterprise data management, data governance and data best practices, all in one centralized package. Continue Reading
-
An overview of Quest's database performance management tools
DBAs can use the Quest Toad product suite for managing database structural performance, and Quest Foglight to proactively monitor SQL, storage and virtualization performance. Continue Reading
-
A look inside the SolarWinds Database Performance Analyzer
SolarWinds provides DBAs and application teams with a database performance monitoring and analysis tool via a Java application that runs on a dedicated Windows, Unix or Linux server. Continue Reading
-
Inside IBM's Data Server Manager and other database performance tools
IBM offers several products and platforms to help DBAs proactively monitor, manage and optimize the performance of DB2 for z/OS and DB2 for Linux, UNIX and Windows databases. Continue Reading
-
Online music startup picks a recommendation engine
Amazon and others wrote the book on the recommendation engine. Rather than build its own, music startup PonoMusic turned to data prep and BI software maker Datameer to ramp up fast. Continue Reading
-
A look at the features of the SAS Data Management unified platform
SAS Data Management offers a suite of tools for handling access to data on legacy systems and Hadoop, as well as for integrating and cleansing data on-premises and in the cloud. Continue Reading
-
Adaptive Metadata Manager helps manage enterprise data governance
The Adaptive Metadata Manager data governance tool helps organizations institute enterprise-wide data policies and lets users access, manage and analyze important data assets. Continue Reading
-
Improve data management and quality with SAP Master Data Governance
SAP Master Data Governance offers a variety of features for managing master data assets and associated data terms and policies from a single, central location. Continue Reading
-
Exploring Bradmark Technologies' database performance monitoring tools
Bradmark Technologies provides a database performance monitor, an operating system monitor, and tools for implementing structural changes and performing database reorganizations. Continue Reading
-
Exploring the features of Oracle Enterprise Manager 13c
Cloud Control, a feature of Oracle Enterprise Manager 13c, can be used to monitor and manage databases, middleware, infrastructure and hardware running on-premises or in the cloud. Continue Reading
-
Inside IDERA's database performance management and optimization tools
IDERA offers tools for analyzing, monitoring and diagnosing database performance issues and increasing the performance of SQL code across the major relational databases. Continue Reading
-
Inside BMC's database performance tools for DB2 for z/OS
BMC offers several tools to help DBAs manage and prevent critical performance issues in DB2 for z/OS, as well as monitoring options for other vital mainframe subsystems. Continue Reading
-
Metadata injection marks Pentaho big data pipeline
The crush of big data leads some data pros to seek more automation of data integration processes. The Pentaho software platform now offers metadata injection capabilities to help meet such needs. Continue Reading
-
A look at IBM InfoSphere Information Server for Data Integration
IBM InfoSphere Information Server for Data Integration provides a single, unified platform for application integration, cloud integration, data quality and master data management. Continue Reading
-
Inside the SAP Data Services data integration tool
SAP Data Services can be used alone or with other SAP products to provide data integration, transformation, data quality, data profiling and text data processing. Continue Reading
-
An overview of the Pentaho Data Integration platform
The Pentaho Data Integration platform enables organizations to integrate, blend, convert and transform data from any data source across their entire enterprise. Continue Reading
-
Tips on selecting the right database performance management tools
Not every tool can resolve every problem, so understanding the cause of the bottleneck can help you determine which of these database performance management tools to consider. Continue Reading
-
How to know if your company is ready for data governance tools
Even if you can identify with the use cases presented here, it's important your organization is well-prepared to deploy the software it buys. Continue Reading
-
Examining the functions and features of database performance tools
To help you determine which tools your organization needs, it's important to review the primary features and functionality of the three database performance tools categories. Continue Reading
-
Examining the iWay Integration Suite for real-time data integration
The iWay Integration Suite from Information Builders enables organizations to integrate diverse data sources, including legacy hardware or software platforms. Continue Reading
-
Microsoft SSIS addresses data integration and data migration functions
Microsoft SQL Server Integration Services, which is built into SQL Server database, is an enterprise data integration, data transformation and data migration tool. Continue Reading
-
A look inside the Talend Enterprise Data Integration tool
Talend Enterprise Data Integration is available as open source for the budget conscious or as an enterprise version, which provides more extensive integration capabilities. Continue Reading
-
NoSQL performance management still an incomplete picture
The ability to monitor and manage the performance of NoSQL databases is all over the map, making it crucial for users to find the right technology for the applications they're looking to run. Continue Reading
-
Three indicators that could signal database performance issues
Database performance monitoring and management tools can be used to mitigate issues and help organizations be more proactive, so they can avoid performance problems and outages. Continue Reading
-
How data governance software helps ensure the integrity of your data
While a data governance initiative comprises people, processes and technology, data governance software supplements and automates the processes the organization implements. Continue Reading
-
Inside the Informatica PowerCenter data integration platform
The Informatica PowerCenter data integration platform helps organizations access, discover, cleanse and integrate data from disparate data sources. Continue Reading
-
What you need to know about Oracle Data Integrator 12c
Oracle Data Integrator 12c works with several other Oracle products to leverage the capabilities of RDBMSs for processing and transforming data. Continue Reading
-
Inside CA Technologies' database performance management products
CA Technologies offers an array of database performance management products for DB2 for z/OS and a scalable system and network monitoring product for distributed databases. Continue Reading
-
SAS Data Governance helps support enterprise data tasks
SAS Data Governance provides organizations with data governance tools for organizing, managing and accessing their data assets, and establishing enterprise-wide data policies. Continue Reading
-
New data landscape augurs discovery-based architectures
In an interview, consultant Lakshmi Randall foresees changes in how data management is organized and executed as the overall data landscape shifts due to the adoption of big data systems. Continue Reading
-
What you need to know about database performance software
Database performance software identifies bottlenecks and points of contention, monitors workload and throughput, and manages system and DBMS resource usage. Continue Reading
-
GPU database serves up analysis of tweets, other data feeds
As a student, Todd Mostak took on large-scale tweet analysis of historic events in the Middle East. Today, he leads startup MapD, which offers a database built on graphics processing units. Continue Reading
-
EBay helps drive new style of data engineering
Open source data engineering has become a way of life at e-commerce leader eBay, says the company's Debashis Saha. Kylin is one of the tools that has resulted. Continue Reading
-
Learn more about the Cloudera Hadoop distribution
Cloudera distribution including Apache Hadoop provides an analytics platform and the latest open source technologies to store, process, discover, model and serve large amounts of data. Continue Reading
-
Inside the IBM BigInsights platform for big data management
The latest version of IBM BigInsights offers several value-add services that can be used with its core distribution of open source Hadoop for managing big data. Continue Reading
-
Inside the MapR Hadoop distribution for managing big data
The MapR Hadoop distribution replaces HDFS with its proprietary file system, MapR-FS, which is designed to provide more efficient management of data, reliability and ease of use. Continue Reading
-
What to know about the IBM Information Governance Catalog
IBM's Information Governance Catalog provides organizations with a workflow-oriented data governance tool for organizing, managing and accessing their data assets. Continue Reading
-
Inside the Hortonworks open enterprise Hadoop distribution
The Hortonworks Data Platform consists entirely of projects built through the Apache Software Foundation and provides an open source environment for data collection, processing and analysis. Continue Reading
-
Analytics and BI in the cloud get mixed reaction
Managing and analyzing data in the cloud can reduce IT costs and simplify technology deployments and upgrades. But adoption levels remain relatively low, despite the potential benefits. Continue Reading
-
A look at Amazon Elastic MapReduce cloud-based Hadoop
The Amazon Elastic MapReduce Web service offers a managed Hadoop framework that enables users to distribute and process big data across dynamically scalable Amazon EC2 instances. Continue Reading
-
Inside the Microsoft Azure HDInsight cloud infrastructure
Azure HDInsight is a cloud implementation of Apache Hadoop that provides a software framework designed for processing, analyzing and reporting on big data. Continue Reading
-
Data virtualization ushers in unified view of data
A member-owned supply chain management company melded siloed data sets to gain a unified view of diverse data feeds. Continue Reading
-
IBM's dashDB forges data warehouse in the cloud
Amazon's Redshift led the way in cloud data warehouses. Now IBM hopes to catch up on the wings of dashDB. Continue Reading
-
Big data platforms pose structural issues for new users
Big data systems require the same kind of data partitioning and setup steps as conventional ones do. But first, users have to learn how to make that process work in Hadoop and Spark. Continue Reading
-
Five factors to help select the right data warehouse product
How big is your company, and what resources does it have? What are your performance needs? Answering these questions and others can help you select the right data warehouse platform. Continue Reading
-
Q&A: Dinsmore sees open source Apache Spark moving to new stage
Analytics vet Thomas Dinsmore says Apache Spark is entering a new phase of adoption, one in which hype gives way to clearer assessment. He also discusses the ascent of the R programming language for analytics. Continue Reading
-
Pivotal Greenplum streamlines big data query optimization
The Pivotal Greenplum open source shared-nothing data warehouse delivers high query performance and throughput, and provides rapid analytics on big data. Continue Reading
-
A look at the upcoming Microsoft Azure SQL Data Warehouse
The Microsoft Azure SQL Data Warehouse lets you scale compute and storage independently based on your performance needs, so you pay for query performance only when you need it. Continue Reading
-
A look inside the SAP IQ column-oriented database
The SAP IQ column-oriented database is designed for large data warehouses that require high scalability, rapid data loading and optimal query performance. Continue Reading
-
Examining the Teradata Data Warehouse
With both relational and columnar options, the Teradata Active Enterprise Data Warehouse gives companies an efficient, scalable appliance they can deploy in-house or in the cloud. Continue Reading
-
What to consider when evaluating Hadoop vendors
Before you evaluate specific Hadoop software or subscriptions, examine what features the vendor distributions provide and how they match your big data management needs. Continue Reading
-
Evaluating the key features of data warehouse platforms
Choosing between the different types of data warehouse platforms can be simplified once you know which deployment option best meets your project requirements. Continue Reading
-
How a Hadoop distribution can help you manage big data
To help you determine if a commercial Hadoop distribution could benefit your organization, consultant David Loshin examines big data use cases and applications that Hadoop can support. Continue Reading
-
What to expect from Oracle Exadata Database Machine
Oracle Exadata Database Machine combines hardware and software to enable analytics, batch, reporting and other tasks to run simultaneously within and across databases. Continue Reading
-
IBM dashDB delivers with cloud data warehouse
Available through IBM's Bluemix platform, IBM dashDB is a data warehouse as a service that includes IBM BLU Acceleration technology and embedded Netezza in-database analytics. Continue Reading
-
Examining the IBM PureData System for Analytics appliance
Powered by Netezza technology, the IBM PureData System for Analytics data warehouse appliance enables users to execute complex queries and get results quickly. Continue Reading
-
Future database stew to include NoSQL
Future database plans will include NoSQL, but there are elements to consider before making the leap, according to Mike Bowers, set to discuss this at Enterprise Data World 2016. Continue Reading
-
Exploring the Actian Analytics Platform appliance
With the Actian Analytics Platform data warehouse appliance, both Hadoop and non-Hadoop clients can use their existing SQL applications and investments on new data sources. Continue Reading
-
Exploring the HPE Vertica Analytics Platform
The Vertica Analytics Platform from Hewlett Packard Enterprise is designed to be used for data warehouses and other complex, query-intensive applications. Continue Reading
-
Exploring Amazon Redshift cloud data warehouse as a service
The Amazon Redshift petabyte-scale cloud data warehouse as a service enables organizations to analyze data in a cost-effective way, using their existing business intelligence tools. Continue Reading
-
Geospatial data is on the map for Hadoop, Spark
Software architect Mansour Raad is at the center of activity as geospatial data melds with Hadoop -- and soon, Spark. Continue Reading
-
JSON format coexists with XML in association's data strategy
The JSON data-interchange format has increasingly found a home in Web applications. But XML is keeping its place in a publishing system at the American Psychological Association. Continue Reading
-
Mobile gaming company plays new Hadoop cluster management card
Chartboost, which operates a platform for mobile games, turned to new cluster management software in an effort to overcome problems in controlling the use of its Hadoop processing resources. Continue Reading
-
NoSQL, Hadoop data engines shifted into new gears in 2015
To say that the core engines in big data platforms were in flux in 2015 may be an understatement. Users considering NoSQL and Hadoop deployments faced an array of new technologies. Continue Reading
-
Kirk Borne on data science and big data analytics, data literacy
In a Q&A, big data and data science expert Kirk Borne discusses new data processing and analytics technologies and the growing importance of data literacy in organizations. Continue Reading
-
Spark vs. Hadoop: Is big data engine a replacement part?
How the relationship between Spark and Hadoop will play out is an open question. We asked IT pros whether they see Spark more as a Hadoop companion or competitor. Continue Reading
-
DataStax Enterprise operational database tames Cassandra
The need for geographic data distribution has driven the DataStax Enterprise operational database into prominence. New version 4.8 offers more immediate indexing of data. Continue Reading
-
Medical technologist drives semantic data lake development
A pivotal magazine article helped point medical doctor Parsa Mirhaji along a path to a semantic data lake for healthcare analytics applications, using Hadoop, RDF, graph databases and more. Continue Reading
-
The benefits of deploying a data warehouse platform
Big data may be all the rage, but data warehouse platforms are still being utilized by companies of all sizes. Expert Craig S. Mullins takes a look at the technology. Continue Reading
-
IT pros talk top enterprise NoSQL architecture challenges
We asked attendees at a conference on NoSQL databases about the challenges faced by users of the software. Their responses cited issues such as scalability, data modeling and analytics. Continue Reading
-
SQL-Hadoop duo looks to ease programming in big data apps
An emerging crop of SQL-on-Hadoop query engines are enabling users to pair up the database programming language and big data framework, letting SQL developers query Hadoop data. Continue Reading
-
Three ways to build a big data system
In a book excerpt, author Dale Neef outlines and compares different approaches organizations can take when trying to bring a big data system into their IT environments. Continue Reading
-
Madsen looks at shift in big data analytics applications
At a TDWI Boston Chapter meeting, Mark Madsen says some notions of information become outdated in the face of big data analytics. This is part one of two. Continue Reading
-
New models of processing stalk big data analytics applications
Operations and big data analytics applications are beginning to blend, causing changes in data strategies, Mark Madsen tells a TDWI Boston Chapter meeting. This is part two of two. Continue Reading
-
Evaluating SQL-on-Hadoop tools? Start with the use case
In a Q&A, Clarity Solution Group CTO Tripp Smith says to base SQL-on-Hadoop software decisions on actual workloads. Some Hadoop tools target batch jobs, while others are intended for interactive ones. Continue Reading
-
Swim fast with a Hadoop data lake architecture -- or sink
The Hadoop data lake concept presents plenty of challenges for organizations. But the experiences of early adopters point the way toward successful data lake architecture deployments. Continue Reading
-
Types of NoSQL databases and key criteria for choosing them
In a book excerpt, consultant Dan Sullivan offers insights into how to select the right type of NoSQL database for the right application in your organization. Continue Reading
-
How to identify master data in a multi-domain MDM program
In an excerpt from their book on managing multi-domain master data management programs, Mark Allen and Dalton Cervo explain how to identify MDM domains and your master data. Continue Reading
-
Don't throw out design principles when jumping in Hadoop data lake
In a Q&A, data warehousing expert Joe Caserta explains why a new generation of developers building Hadoop clusters and other big data systems may need an introduction to some fundamental rules of ETL. Continue Reading
-
SAP HANA in-memory DBMS overview
SAP HANA is an in-memory DBMS and application platform designed to handle high transaction rates and complex queries using one data copy. Continue Reading
-
Solid data integration techniques pave clear path for info
Implementing an effective data integration strategy is becoming increasingly important as organizations look to collect and analyze information from a diverse set of data sources. Continue Reading
-
Redis open source DBMS overview
The Redis open source DBMS provides a highly scalable data store that can be shared by multiple processes, applications or servers. Continue Reading
-
SQL-on-Hadoop tools help users navigate enterprise Hadoop course
Hadoop may be a technology in waiting, unless SQL-on-Hadoop tools turn it into an enterprise mainstay. Continue Reading
-
Riak KV NoSQL DBMS overview
Riak NoSQL DBMS is designed to enable storage of and access to various types of unstructured data that require continuous availability. Continue Reading
-
MySQL open source RDBMS overview
Developers, database administrators and DevOps teams use MySQL open source RDBMS to more easily operate next-generation applications in the cloud. Continue Reading
-
Neo4j graph DBMS overview
The Neo4j graph DBMS delivers high performance and availability, with its native graph capabilities for data storage and access. Continue Reading
-
MongoDB NoSQL DBMS overview
MongoDB database management system is designed for running modern applications that rely on structured and unstructured data and support rapidly changing data. Continue Reading
-
MarkLogic Server NoSQL DBMS overview
The MarkLogic Server NoSQL DBMS is designed to make heterogeneous data integration easier and faster using an array of enterprise features. Continue Reading
-
Data lake concept needs more big data use cases to flourish
Hadoop data lakes offer an enticing location for large data sets. But consultant Andy Hayler says more examples of successful big data projects are needed to help boost their adoption. Continue Reading