Boris Paskhaver

Data is the most valuable resource of the 21st century. Across a breadth of industries, the world’s top companies are adapting their business models to collect and analyze more data. Governments and civil institutions are following suit. United Nations Secretary-General António Guterres called accurate data “the lifeblood of good policy and decision-making.”

Today, data influences everything from our movie recommendations to our medical treatments, from online advertisements to poverty reduction initiatives. More data has been generated in the last two years than in all of the previous years of human history combined. But here’s a shocking statistic: only an estimated 0.5% of the world’s data has been analyzed.

The ability to work with data is becoming a crucial skill in the modern world. But what exactly is data analysis, and how can one get started with it? In this article, we’ll explore all the details.

What is data analysis?

The meaning of “data analysis” can vary from company to company, from team to team, and even from project to project. It’s better to think about the purpose of data analysis.

The goal of data analysis is to extract key insights from datasets. A data analyst aims to discover useful information that helps stakeholders make informed decisions. It’s not just about crunching numbers. Data analysis aims to find the story in the data: the underlying trend, the hidden message. The best data analysts may even appear to be fortune-tellers, using existing data to predict what the future holds.

It might sound challenging, but data analysis is a lot less complex than it appears! Most of us think like data analysts on a daily basis. For example, we may:

In these situations, we’re using our past and current observations to influence our future behavior. Data analysis seeks the same result.

What types of data are there?

There are many different types of data. Some common examples include:

Data types fall into two broader categories: quantitative and qualitative. Quantitative data involves a numeric measurement. Qualitative data describes a characteristic or trait.

Imagine we’re analyzing a dataset of houses sold in the United States. Examples of quantitative data might include the selling price, the number of bedrooms, or the square footage. Examples of qualitative data might include the house’s address, listing agent, or type (e.g., ranch or colonial).
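
The distinction can be sketched in a few lines of Python. This is only an illustration: the records and field names below are invented, and treating every numeric value as quantitative is a simplifying assumption rather than a general rule.

```python
# Hypothetical house-sale records (invented values, for illustration only).
houses = [
    {"price": 350000, "bedrooms": 3, "square_feet": 1800, "agent": "A. Lee", "style": "ranch"},
    {"price": 525000, "bedrooms": 4, "square_feet": 2400, "agent": "J. Ortiz", "style": "colonial"},
]

def split_fields(record):
    """Split a record's field names into quantitative and qualitative lists."""
    quantitative, qualitative = [], []
    for name, value in record.items():
        # bool is a subclass of int in Python, so exclude it explicitly.
        is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
        (quantitative if is_number else qualitative).append(name)
    return quantitative, qualitative

quantitative, qualitative = split_fields(houses[0])
print("Quantitative:", quantitative)
print("Qualitative:", qualitative)
```

The type check is only a heuristic. A ZIP code, for instance, is written with digits but identifies a location, so it would still count as qualitative data.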

The process of data analysis

The data analysis process begins with an inquiry. What would you like to figure out? What information would help you or the business unit make a decision? What data needs to be acquired? Do you already have a data collection that holds the answers?

Once data has been collected, the next step is extracting it from its source. A data analyst may query a database to pull out the relevant tables of information. Data is also stored in a variety of file types, including CSV (comma-separated values), JSON (JavaScript Object Notation), and HDF (Hierarchical Data Format).
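
As a rough sketch of the extraction step, the snippet below reads the same invented records from CSV and JSON text using Python’s standard library. In-memory strings stand in for files on disk, and the column names are assumptions made for the example. HDF is left out because reading it requires a third-party library.

```python
import csv
import io
import json

# In-memory stand-ins for files on disk (invented sample data).
csv_text = "city,price\nAustin,350000\nBoston,525000\n"
json_text = '[{"city": "Austin", "price": 350000}, {"city": "Boston", "price": 525000}]'

# csv.DictReader yields one dict per row; every value arrives as a string.
csv_rows = list(csv.DictReader(io.StringIO(csv_text)))

# json.loads keeps the types encoded in the JSON (numbers stay numbers).
json_rows = json.loads(json_text)

# Convert the CSV prices to integers so both sources have the same shape.
for row in csv_rows:
    row["price"] = int(row["price"])

print(csv_rows == json_rows)
```

The comparison at the end shows why the file type matters: CSV carries no type information, so numeric columns have to be converted by hand, while JSON preserves them.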

Raw data describes data in its original format, the way that it is stored before it is manipulated. Raw data often arrives with imperfections that the analyst needs to fix before analysis can occur. For example:

The process of cleaning data is called data wrangling or data munging.
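
A minimal sketch of data wrangling in Python follows. The flaws shown (stray whitespace, inconsistent capitalization, a missing value, a duplicate row, and a thousands separator) are assumed examples of imperfections, and the rows themselves are invented.

```python
# Hypothetical raw rows with typical flaws (invented for illustration).
raw_rows = [
    {"city": " austin ", "price": "350000"},   # stray whitespace, lowercase
    {"city": "Boston", "price": ""},           # missing price
    {"city": "AUSTIN", "price": "350000"},     # duplicate once normalized
    {"city": "Denver", "price": "410,000"},    # thousands separator
]

def clean(rows):
    """Normalize text, parse prices, and drop incomplete or duplicate rows."""
    seen = set()
    cleaned = []
    for row in rows:
        city = row["city"].strip().title()
        price_text = row["price"].replace(",", "").strip()
        if not price_text:
            continue  # skip rows with no price
        record = (city, int(price_text))
        if record in seen:
            continue  # skip rows already seen after normalization
        seen.add(record)
        cleaned.append({"city": city, "price": record[1]})
    return cleaned

cleaned_rows = clean(raw_rows)
print(cleaned_rows)
```

Dropping rows with missing values is one design choice among several. Depending on the question being asked, an analyst might instead fill the gap with a default or an estimate.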

Once the data is cleaned, we apply a sequence of manipulations to the dataset. The goal with every step is to move closer to an answer for the original business inquiry. Data analysis can feel a bit like detective work. Over time, a skilled analyst builds a dynamic toolkit of data analysis techniques and learns the right ones to apply to solve the problem at hand. Some examples of everyday manipulations include:
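
Filtering, sorting, and grouping are three such manipulations, picked here purely for illustration. The sketch below applies each one to a small invented dataset using only Python’s standard library; the 400,000 threshold is an arbitrary assumption.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical cleaned sales records (invented values).
sales = [
    {"city": "Austin", "price": 350000},
    {"city": "Boston", "price": 525000},
    {"city": "Austin", "price": 410000},
    {"city": "Boston", "price": 475000},
]

# Filter: keep only the sales above a price threshold.
expensive = [s for s in sales if s["price"] > 400000]

# Sort: order the sales from highest to lowest price.
by_price = sorted(sales, key=lambda s: s["price"], reverse=True)

# Group and aggregate: compute the average price per city.
prices_by_city = defaultdict(list)
for s in sales:
    prices_by_city[s["city"]].append(s["price"])
average_by_city = {city: mean(prices) for city, prices in prices_by_city.items()}

print(len(expensive), by_price[0]["city"], average_by_city)
```

Each step narrows or reshapes the data toward an answer, for example "which city has the higher average selling price?"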


Once an analyst has reached a conclusion, the final step is presenting it to stakeholders. The deliverable can vary from assignment to assignment. One stakeholder may ask for a single conclusive number, such as a count or a sum. Another may request a collection of all relevant data points. A third may like to see a written summary.

It is often easier to describe an idea with a picture than with a paragraph of text. Visual representations of data are popular in data analysis reports. Some common visualizations include bar charts, pie charts, scatter plots, and histograms.
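
To make the idea concrete without any charting software, here is a small sketch that draws a text-based bar chart in Python. The categories and counts are invented, and the `text_bar_chart` helper is a hypothetical function written for this example.

```python
# Hypothetical counts of houses sold per style (invented values).
sales_by_style = {"ranch": 12, "colonial": 7, "cape cod": 4}

def text_bar_chart(counts, symbol="#"):
    """Return one line per category, with a bar as long as its count."""
    width = max(len(label) for label in counts)
    lines = []
    for label, count in counts.items():
        # Pad labels to a common width so the bars line up.
        lines.append(f"{label.ljust(width)} | {symbol * count} {count}")
    return "\n".join(lines)

chart = text_bar_chart(sales_by_style)
print(chart)
```

Even in this crude form, the relative sizes of the categories are easier to see than in a table of numbers. A dedicated charting library would produce a graphical version of the same picture.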

Data analysis tools

Both free and commercial data analysis tools are available for analysts of all skill levels. Some are optimized for speed and efficiency. Others focus on user experience and convenience. Rarely is there only one technology for a job. Rather, a data analyst learns to choose the best tool for their current assignment.

Microsoft Excel is the most popular data analysis tool available today. It is estimated to be installed on 750 million computers worldwide. Similar spreadsheet software options include Google Sheets and Apple Numbers. Excel is a great option for those getting started with data analysis. Its visual interface offers a gentler learning curve than its competitors.

Once datasets start to grow in size and complexity, it makes sense to move to a more heavy-duty solution. Other data analysis tools include:

These tools open the door to more advanced analysis and other benefits, including automation.

The demand for data analysts is forecast to keep growing as companies and institutions collect more and more data. Luckily, the barrier to entry for learning data analysis has never been lower. Many in-depth online courses are available for a wide spectrum of technologies. Data analysis is a worthwhile investment for those looking to pick up a versatile skill that can prove helpful both at home and at work.

Page Last Updated: July 2020
