Definition

data analytics (DA)

Contributor(s): Craig Stedman

Data analytics (DA) is the process of examining data sets in order to find trends and draw conclusions about the information they contain. Increasingly data analytics is used with the aid of specialized systems and software. Data analytics technologies and techniques are widely used in commercial industries to enable organizations to make more-informed business decisions. It is also used scientists and researchers to verify or disprove scientific models, theories and hypotheses.

As a term, data analytics predominantly refers to an assortment of applications, from basic business intelligence (BI), reporting and online analytical processing (OLAP) to various forms of advanced analytics. In that sense, it's similar in nature to business analytics, another umbrella term for approaches to analyzing data. The difference is that the latter is oriented to business uses, while data analytics has a broader focus. The expansive view of the term isn't universal, though: In some cases, people use data analytics specifically to mean advanced analytics, treating BI as a separate category.

Data analytics initiatives can help businesses increase revenues, improve operational efficiency, optimize marketing campaigns and customer service efforts. It can also be used to respond quickly to emerging market trends and gain a competitive edge over rivals. The ultimate goal of data analytics, however, is boosting business performance. Depending on the particular application, the data that's analyzed can consist of either historical records or new information that have been processed for real-time analytics. In addition, it can come from a mix of internal systems and external data sources.

Types of data analytics applications

At a high level, data analytics methodologies include exploratory data analysis (EDA), and confirmatory data analysis (CDA). EDA aims to find patterns and relationships in data, while CDA applies statistical techniques to determine whether hypotheses about a data set are true or false. EDA is often compared to detective work, while CDA is akin to the work of a judge or jury during a court trial -- a distinction first drawn by statistician John W. Tukey in his 1977 book Exploratory Data Analysis.

Data analytics can also be separated into quantitative data analysis and qualitative data analysis. The former involves the analysis of numerical data with quantifiable variables. These variables can be compared or measured statistically. The qualitative approach is more interpretive -- it focuses on understanding the content of non-numerical data like text, images, audio and video, common phrases, themes and points of view.

At the application level, BI and reporting provide business executives and corporate workers with actionable information about key performance indicators, business operations, customers and more. In the past, data queries and reports typically were created for end users by BI developers who worked in IT. Now, more organizations will use self-service BI tools that let executives, business analysts and operational workers run their own ad hoc queries and build reports themselves.

Consultant Claudia Imhoff on analytics-driven organizations

An advanced type of data analytics include data mining, which involves sorting through large data sets to identify trends, patterns and relationships. Another type is called predictive analytics, which seeks to predict customer behavior, equipment failures and other future events. Machine learning can also be used for data analytics, using automated algorithms to churn through data sets more quickly than data scientists can do via conventional analytical modeling. Big data analytics applies data mining, predictive analytics and machine learning tools. Text mining provides a means of analyzing documents, emails and other text-based content.  

Data analytics initiatives support a wide variety of business uses. For example, banks and credit card companies analyze withdrawal and spending patterns to prevent fraud and identity theft. E-commerce companies and marketing services providers will use clickstream analysis to identify website visitors who are likely to buy a particular product or service -- based on navigation and page-viewing patterns. Healthcare organizations mine patient data to evaluate the effectiveness of treatments for cancer and other diseases. Mobile network operators also examine customer data to forecast churn. This allows mobile companies to take steps to prevent defections to business rivals. To boost customer relationship management efforts, other companies can also engage in CRM analytics to segment customers for marketing campaigns and equip call center workers with up-to-date information about callers.

Inside the data analytics process

Data analytics applications involve more than just analyzing data. Particularly on advanced analytics projects. Much of the required work takes place upfront, in collecting, integrating and preparing data and then developing, testing and revising analytical models to ensure that they produce accurate results. In addition to data scientists and other data analysts, analytics teams often include data engineers, whose job is to help get data sets ready for analysis.

The analytics process starts with data collection. Data scientists identify the information they need for a particular analytics application, and then work on their own or with data engineers and IT staff to assemble it for use. Data from different source systems may need to be combined via data integration routines, transformed into a common format and loaded into an analytics system, such as a Hadoop clusterNoSQL database or data warehouse.

Who's who on the data analytics team

In other cases, the collection process may consist of pulling a relevant subset out of a stream of data that flows into, for example, Hadoop. This data is then moved to a separate partition in the system so it can be analyzed without affecting the overall data set.

Once the data that's needed is in place, the next step is to find and fix data quality problems that could affect the accuracy of analytics applications. That includes running data profiling and data cleansing tasks to ensure the information in a data set is consistent and that errors and duplicate entries are eliminated. Additional data preparation work is then done to manipulate and organize the data for the planned analytics use. Data governance policies are then applied to ensure that the data follows corporate standards and is being used properly.

From here, a data scientist builds an analytical model, using predictive modeling tools or other analytics software -- using languages such as Python, Scala, R and SQL. The model is initially run against a partial data set to test its accuracy. Typically, it's then revised and tested again. This process is known as "training" the model until it functions as intended. Finally, the model is run in production mode against the full data set, something that can be done once to address a specific information need or on an ongoing basis as the data is updated.

In some cases, analytics applications can be set to automatically trigger business actions. For example, stock trades by a financial services firm. Otherwise, the last step in the data analytics process is communicating the results generated by analytical models to business executives and other end users. Charts and other infographics can be designed to make findings easier to understand. Data visualizations often are incorporated into BI dashboard applications that display data on a single screen and can be updated in real-time as new information becomes available.

Data analytics vs. data science

As automation grows, data scientists will focus more on business needs, strategic oversight and deep learning. Data analysts who work in business intelligence will focus more on model creation and other routine tasks. In general, data scientists concentrate efforts on producing broad insights, while data analysts focus on answering specific questions. In terms of technical skills, future data scientists will need to focus more on the machine learning operations process, also called MLOps.

This was last updated in September 2020

Next Steps

Consultant David Loshin explains what big data analytics tools can do for companies

Corporate lawyers increasingly play a role in customer data analytics programs

CIO Celso Mello on why human curiosity is a key to effective data analytics

Continue Reading About data analytics (DA)

Dig Deeper on Enterprise data architecture best practices

Join the conversation

13 comments

Send me notifications when other members comment.

Please create a username to comment.

What's your top tip for making the data analytics process work effectively?
Cancel
Our expertise in the Data and Analytics domain has catered to businesses’ need for actionable insights from their online and offline data sources. We are a vendor-neutral player, we audit, consult and implement an optimized data strategy to gain competitive edge.
Cancel
Very good. If you want to learn data analytics then you have to start with "Data Analysis" then gain knowledge in statistical concepts. I have spent almost 10+ years in the area of Statistics, Quality, Statistical process control and that helped to become expert in this field. Remember on thing, Just by reading books one can not become an expert. you have to make your hands dirty in terms of analyzing data, all kinds of data, learn how to present it to business stakeholders. Just by reading books no one can learn swimming or cycling. So Data analytics is also the same.
Regards
Jiten
Data Analytics Expert
Cancel
Sir, can you explain what are the areas we have to work to be a DATA analyst. The only statistical tools are fine enough to be a data analyst.
Cancel
Jitendranath

Thank you for your comment above.

How can one get their hands dirty? 

Can you suggest several ways I can get my hands on experience even with the basic simple data i.e. websites that need assistance, contacts who need a hand even if for free... etc

Thanks 
OA37
Cancel
by touching dirty things
Cancel
Hi
I have a question - is there a role for Data Architect? What responsibilities would that entail?
Cancel
I find this very informative , and further to it, i have taken up data analytics  and a change of my career .

Cancel
It's a comprehensive article about data analytics. I think the best is the exploratory data analysis. It's a good way to explore your datasets and find patterns meanings and relations. 
Cancel
data science vs machine learning it't to old vision. Try to see that Data is the new oil. Make it work. The Data Science umbrella sustains digitally driven strategies like datarob.com to transform your business.
Cancel
My self  thanks for sharing this information I am so very happy to read this content 
Cancel
I’m a Professional and experienced Data Analyst. I will do EDA(Exploratory Data Analysis) for businesses to take important decision. contact me here :- https://www.fiverr.com/biren_karena/do-data-analysis-visualization-and-generate-report 
Cancel
A good article on Data Analytics. I read another article on data analytics and how it helps in combating Covid-19 recently. You can read the article here.
Cancel

SearchBusinessAnalytics

SearchAWS

SearchContentManagement

SearchOracle

SearchSAP

SearchSQLServer

Close