狠狠撸

狠狠撸Share a Scribd company logo
Manish Gupta
DATA ANALYSIS STEP BY STEP
DATA, DATA, DATA.. there is a lot of data being gathered by all sort of channels and any organization’s
success, to an great extent, depends on how well they analyse such data and at what time.
So what is the Data Analysis –
“Data analysis is a process of
? inspecting,
? cleansing,
? transforming, and
? modeling data
with the goal of discovering useful information, informing conclusions, hidden patterns, unknown
correlations and supporting decision-making. ”
So in simple terms Data analysis is a process for obtaining raw data and converting it into information
useful for decision-making by users. Data is collected and analyzed to answer questions, test hypotheses
or disprove theories.
Lets understanding it from an example assume an IOT device located on a Toll Gate is collecting number
of cars passing by that specific Toll Gate . It collects the registration number of the car and time of passing
the traffic light. Now further assume, that there were 100,000 cars passed in a day.
Now !! what information this data can give, it depends on the user or management objective but there
are following possibilities.
? No of cars passed during certain time slots of the day to understand traffic behavior
? Linkup the plate number of owner attributes and analyse further
o What was the ratio of taxi to owned cars (if high taxi may be public transport can be
planned)
o How many were private cars and how many were company cars for business and
personal.
o If similar device and data is available at toll exit, can analyse the data to understand
average speed by type of cars.
Lets Look at another example of an e-commerce business which sells 100s of products each day on its
website. There will be atleast 3,000 transactions in a month and whopping 36,000 transactions in a year.
Following can be achieved through data analytics.
1. What are most sold products on the platform.
2. What are most profitable products on platform and if they are amongst the most sold products.
3. Are there any products which are most sold but not so profitable?
4. Are there any products which are selling at loss?
5. Is there any product for whose sales is limited due to any controllable constraints?
6. What type of customers are buying product? What are opportunities !
Manish Gupta
DATA ANALYSIS STEP BY STEP
7. What is the average price per order !
8. What products mostly brought together or by same customers?
And much more depending on business development needs
Ok!! Now that we understand how data analysis can be powerful for business lets look at various steps
and techniques performed for data analytics.
Before we understand what is data analytics, Lets try to Understand the Decision making Process in
brief related to data. For this part of book we will look into only decision making needs related to data.
Strategic decision making process will be discussed another time.
Decision Making points from a data set can be grouped into following
Answer How
? How Much or How Many of sales, products, customers, etc
? How long (delivery time, period sales, shelf life, sale time from purchase time)
Answer WH Family
? Who (customer profiles and segments),
? What (product segments),
? Where (Geo Segments) and
? When (date, timestamp month)
Now that we understand what various possible decision-making points are, lets look at the process of
making such decisions starting from collection till interpretation of data into information.
Key Steps of Data Analysis
a. Assess Data requirements
The data is necessary as inputs to the analysis, which is specified based upon the requirements of
those directing the analysis or customers (who will use the finished product of the analysis).
Following must be kept in mind while assessing data requirements.
1. Specific data variables (decision points) which can be
a. numerical quantity of products, revenue, costing or
b. categorical i.e category of products, regions, type of customer
2. Data can be collected to the basic transaction level
3. Data must be segmented and referenced to common sets for example customer profile can
be collected separately and should be referenced in transactions so that repetitive data can
be reduced.
Manish Gupta
DATA ANALYSIS STEP BY STEP
b. Data collection
Data is collected from a variety of sources. The requirements may be communicated by analysts
to custodians of the data, such as information technology personnel within an organization.
The data may also be collected from
1. sensors in the environment, such as traffic cameras, satellites, recording devices, etc.
2. obtained through interviews,
3. downloads from online sources or reading documentation.
c. Data processing
Data initially obtained must be processed or organised for analysis. For instance, these may
involve placing data into rows and columns in a table format (i.e., structured data) for further
analysis, such as within a spreadsheet or statistical software.
d. Data cleaning
Once processed and organised, the data may be
? incomplete,
? contain duplicates, or
? contain errors.
Data cleaning is the process of preventing and correcting these errors.
Following processes and checks are usually applied while cleaning data.
a. record matching,
b. deduplication, and
c. column segmentation.
d. Such data problems can also be identified through a variety of analytical techniques.
e. For example, with financial information, the totals for revenue can be matched against
total revenue reported in financial statements.
f. Textual data spell checkers can be used to lessen the amount of mistyped words, but it is
harder to tell if the words themselves are correct.
e. Perform Data Analysis methodologies
There are two main type of Data Analysis methods.
Quantitative Analysis
1. Descriptive Analysis - Describe the main features of a large collection of data.
2. Confirmatory Analysis - Confirm or negate a hypothesis.
3. Exploratory Analysis - Find previously unknown relationships in the data.
Manish Gupta
DATA ANALYSIS STEP BY STEP
4. Inferential Analysis - Use a smaller sample of data to learn something about a bigger
population.
5. Causal Analysis - Find out what happens to one variable when you change another.
Event Series Analysis
facilitate searching for patterns across multiple event records and datasets
f. Communication & Evaluation
Data visualization to understand the results of a data analysis.
Once the data is analyzed, it may be reported in many formats to the users of the analysis to
support their requirements.
The users may have feedback, which results in additional analysis. As such, much of the
analytical cycle is iterative.
Now that we get the brief understanding of data analysis concepts and steps, we will continue to learn
detailed methodologies and techniques using Excel.
To Learn data analysis in more details with lots of examples and apply techniques subscribe to my
detailed e-book
Pre-order @ USD 2 only (80% introductory discount), after publishing it will be priced at USD 10.
I will sweeten the deal by adding a free access to webinar If you subscribe now !!
We will be launching a online course on data analysis, which I will be announcing soon.

More Related Content

Similar to Data analysis step by step guide (20)

wholeness of data analytics in cyber security.ppt
wholeness of data analytics in cyber security.pptwholeness of data analytics in cyber security.ppt
wholeness of data analytics in cyber security.ppt
hannahroseline2
?
Comprehensive Notes on Big Data Concepts and Applications Based on University...
Comprehensive Notes on Big Data Concepts and Applications Based on University...Comprehensive Notes on Big Data Concepts and Applications Based on University...
Comprehensive Notes on Big Data Concepts and Applications Based on University...
RahulRaj17831
?
Regression and correlation
Regression and correlationRegression and correlation
Regression and correlation
VrushaliSolanke
?
Data Analysis Methods 101 - Turning Raw Data Into Actionable Insights
Data Analysis Methods 101 - Turning Raw Data Into Actionable InsightsData Analysis Methods 101 - Turning Raw Data Into Actionable Insights
Data Analysis Methods 101 - Turning Raw Data Into Actionable Insights
DataSpace Academy
?
Data Science in Python.pptx
Data Science in Python.pptxData Science in Python.pptx
Data Science in Python.pptx
Ramakrishna Reddy Bijjam
?
Data_analyst_types of data, Structured, Unstructured and Semi-structured Data
Data_analyst_types of data, Structured, Unstructured and Semi-structured DataData_analyst_types of data, Structured, Unstructured and Semi-structured Data
Data_analyst_types of data, Structured, Unstructured and Semi-structured Data
grsssyw24
?
IRJET - Big Data: Evolution Cum Revolution
IRJET - Big Data: Evolution Cum RevolutionIRJET - Big Data: Evolution Cum Revolution
IRJET - Big Data: Evolution Cum Revolution
IRJET Journal
?
Data mining-basic
Data mining-basicData mining-basic
Data mining-basic
gufranresearcher
?
Introductions to Business Analytics
Introductions to Business Analytics Introductions to Business Analytics
Introductions to Business Analytics
Venkat .P
?
Big data overview
Big data overviewBig data overview
Big data overview
Shyam Sunder Budhwar
?
Bigdata Hadoop introduction
Bigdata Hadoop introductionBigdata Hadoop introduction
Bigdata Hadoop introduction
Sunitha Mutchintala
?
Data mining
Data miningData mining
Data mining
hardavishah56
?
Data Analyst Interview Questions & Answers
Data Analyst Interview Questions & AnswersData Analyst Interview Questions & Answers
Data Analyst Interview Questions & Answers
Satyam Jaiswal
?
Business Intelligence and decision support system
Business Intelligence and decision support system Business Intelligence and decision support system
Business Intelligence and decision support system
Shrihari Shrihari
?
DATA ANALYSIS Presentation Computing Fundamentals.pptx
DATA ANALYSIS Presentation Computing Fundamentals.pptxDATA ANALYSIS Presentation Computing Fundamentals.pptx
DATA ANALYSIS Presentation Computing Fundamentals.pptx
AmarAbbasShah1
?
Data analytics
Data analyticsData analytics
Data analytics
Bhanu Pratap
?
ii mca juno
ii mca junoii mca juno
ii mca juno
Ramya Sasi
?
What is Data Mining? Key Concepts Explained
What is Data Mining? Key Concepts ExplainedWhat is Data Mining? Key Concepts Explained
What is Data Mining? Key Concepts Explained
Julie Bowie
?
Top 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdfTop 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdf
ShaikSikindar1
?
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
IJSCAI Journal
?
wholeness of data analytics in cyber security.ppt
wholeness of data analytics in cyber security.pptwholeness of data analytics in cyber security.ppt
wholeness of data analytics in cyber security.ppt
hannahroseline2
?
Comprehensive Notes on Big Data Concepts and Applications Based on University...
Comprehensive Notes on Big Data Concepts and Applications Based on University...Comprehensive Notes on Big Data Concepts and Applications Based on University...
Comprehensive Notes on Big Data Concepts and Applications Based on University...
RahulRaj17831
?
Data Analysis Methods 101 - Turning Raw Data Into Actionable Insights
Data Analysis Methods 101 - Turning Raw Data Into Actionable InsightsData Analysis Methods 101 - Turning Raw Data Into Actionable Insights
Data Analysis Methods 101 - Turning Raw Data Into Actionable Insights
DataSpace Academy
?
Data_analyst_types of data, Structured, Unstructured and Semi-structured Data
Data_analyst_types of data, Structured, Unstructured and Semi-structured DataData_analyst_types of data, Structured, Unstructured and Semi-structured Data
Data_analyst_types of data, Structured, Unstructured and Semi-structured Data
grsssyw24
?
IRJET - Big Data: Evolution Cum Revolution
IRJET - Big Data: Evolution Cum RevolutionIRJET - Big Data: Evolution Cum Revolution
IRJET - Big Data: Evolution Cum Revolution
IRJET Journal
?
Introductions to Business Analytics
Introductions to Business Analytics Introductions to Business Analytics
Introductions to Business Analytics
Venkat .P
?
Data Analyst Interview Questions & Answers
Data Analyst Interview Questions & AnswersData Analyst Interview Questions & Answers
Data Analyst Interview Questions & Answers
Satyam Jaiswal
?
Business Intelligence and decision support system
Business Intelligence and decision support system Business Intelligence and decision support system
Business Intelligence and decision support system
Shrihari Shrihari
?
DATA ANALYSIS Presentation Computing Fundamentals.pptx
DATA ANALYSIS Presentation Computing Fundamentals.pptxDATA ANALYSIS Presentation Computing Fundamentals.pptx
DATA ANALYSIS Presentation Computing Fundamentals.pptx
AmarAbbasShah1
?
What is Data Mining? Key Concepts Explained
What is Data Mining? Key Concepts ExplainedWhat is Data Mining? Key Concepts Explained
What is Data Mining? Key Concepts Explained
Julie Bowie
?
Top 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdfTop 30 Data Analyst Interview Questions.pdf
Top 30 Data Analyst Interview Questions.pdf
ShaikSikindar1
?
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
IJSCAI Journal
?

More from Manish Gupta (8)

Hotel cost control - master class
Hotel cost control - master classHotel cost control - master class
Hotel cost control - master class
Manish Gupta
?
Hotel Management Course - Revenue management Concepts
Hotel Management Course - Revenue management Concepts Hotel Management Course - Revenue management Concepts
Hotel Management Course - Revenue management Concepts
Manish Gupta
?
Hotel Management Course - Analyse & Maximize restaurant profitability
Hotel Management Course - Analyse & Maximize restaurant profitability Hotel Management Course - Analyse & Maximize restaurant profitability
Hotel Management Course - Analyse & Maximize restaurant profitability
Manish Gupta
?
Hotel Management course - Profit & Loss Statement rooms
Hotel Management course - Profit & Loss Statement roomsHotel Management course - Profit & Loss Statement rooms
Hotel Management course - Profit & Loss Statement rooms
Manish Gupta
?
Hospitality operation & financial budgeting
Hospitality operation & financial budgetingHospitality operation & financial budgeting
Hospitality operation & financial budgeting
Manish Gupta
?
how to read & analyse hotel income statement
how to read & analyse hotel income statementhow to read & analyse hotel income statement
how to read & analyse hotel income statement
Manish Gupta
?
Hotel Management - Understand Financial Statements and Analyse to gain useful...
Hotel Management - Understand Financial Statements and Analyse to gain useful...Hotel Management - Understand Financial Statements and Analyse to gain useful...
Hotel Management - Understand Financial Statements and Analyse to gain useful...
Manish Gupta
?
Essentials of financial reporting
Essentials of financial reportingEssentials of financial reporting
Essentials of financial reporting
Manish Gupta
?
Hotel cost control - master class
Hotel cost control - master classHotel cost control - master class
Hotel cost control - master class
Manish Gupta
?
Hotel Management Course - Revenue management Concepts
Hotel Management Course - Revenue management Concepts Hotel Management Course - Revenue management Concepts
Hotel Management Course - Revenue management Concepts
Manish Gupta
?
Hotel Management Course - Analyse & Maximize restaurant profitability
Hotel Management Course - Analyse & Maximize restaurant profitability Hotel Management Course - Analyse & Maximize restaurant profitability
Hotel Management Course - Analyse & Maximize restaurant profitability
Manish Gupta
?
Hotel Management course - Profit & Loss Statement rooms
Hotel Management course - Profit & Loss Statement roomsHotel Management course - Profit & Loss Statement rooms
Hotel Management course - Profit & Loss Statement rooms
Manish Gupta
?
Hospitality operation & financial budgeting
Hospitality operation & financial budgetingHospitality operation & financial budgeting
Hospitality operation & financial budgeting
Manish Gupta
?
how to read & analyse hotel income statement
how to read & analyse hotel income statementhow to read & analyse hotel income statement
how to read & analyse hotel income statement
Manish Gupta
?
Hotel Management - Understand Financial Statements and Analyse to gain useful...
Hotel Management - Understand Financial Statements and Analyse to gain useful...Hotel Management - Understand Financial Statements and Analyse to gain useful...
Hotel Management - Understand Financial Statements and Analyse to gain useful...
Manish Gupta
?
Essentials of financial reporting
Essentials of financial reportingEssentials of financial reporting
Essentials of financial reporting
Manish Gupta
?

Recently uploaded (20)

The Role of Christopher Campos Orlando in Sustainability Analytics
The Role of Christopher Campos Orlando in Sustainability AnalyticsThe Role of Christopher Campos Orlando in Sustainability Analytics
The Role of Christopher Campos Orlando in Sustainability Analytics
christophercamposus1
?
Presentation1.pptx for data and table analysis
Presentation1.pptx for data and table analysisPresentation1.pptx for data and table analysis
Presentation1.pptx for data and table analysis
vatsalsingla4
?
IFRS Finance Powerpoint ppt Finance D.pptx
IFRS Finance Powerpoint  ppt Finance D.pptxIFRS Finance Powerpoint  ppt Finance D.pptx
IFRS Finance Powerpoint ppt Finance D.pptx
amantiwari2091
?
iam free indeed.pptxiam free indeed.pptx
iam free indeed.pptxiam free indeed.pptxiam free indeed.pptxiam free indeed.pptx
iam free indeed.pptxiam free indeed.pptx
muhweziart
?
The truth behind the numbers: spotting statistical misuse.pptx
The truth behind the numbers: spotting statistical misuse.pptxThe truth behind the numbers: spotting statistical misuse.pptx
The truth behind the numbers: spotting statistical misuse.pptx
andyprosser3
?
办理魁北克大学成绩单触购买加拿大鲍蚕础惭成绩单文凭定制
办理魁北克大学成绩单触购买加拿大鲍蚕础惭成绩单文凭定制办理魁北克大学成绩单触购买加拿大鲍蚕础惭成绩单文凭定制
办理魁北克大学成绩单触购买加拿大鲍蚕础惭成绩单文凭定制
taqyed
?
exampleexampleexampleexampleexampleexampleexampleexample
exampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexample
exampleexampleexampleexampleexampleexampleexampleexample
lembiczkat
?
Introduction Lecture 01 Data Science.pdf
Introduction Lecture 01 Data Science.pdfIntroduction Lecture 01 Data Science.pdf
Introduction Lecture 01 Data Science.pdf
messagetome133
?
加拿大成绩单购买原版(鲍颁毕业证书)卡尔加里大学毕业证文凭
加拿大成绩单购买原版(鲍颁毕业证书)卡尔加里大学毕业证文凭加拿大成绩单购买原版(鲍颁毕业证书)卡尔加里大学毕业证文凭
加拿大成绩单购买原版(鲍颁毕业证书)卡尔加里大学毕业证文凭
taqyed
?
Stasiun kernel pengolahan kelapa sawit indonesia
Stasiun kernel pengolahan kelapa sawit indonesiaStasiun kernel pengolahan kelapa sawit indonesia
Stasiun kernel pengolahan kelapa sawit indonesia
fikrimanurung1
?
april 2024 paper 2 ms. english non fiction
april 2024 paper 2 ms. english non fictionapril 2024 paper 2 ms. english non fiction
april 2024 paper 2 ms. english non fiction
omokoredeolasunbomi
?
Optimizing Common Table Expressions in Apache Hive with Calcite
Optimizing Common Table Expressions in Apache Hive with CalciteOptimizing Common Table Expressions in Apache Hive with Calcite
Optimizing Common Table Expressions in Apache Hive with Calcite
Stamatis Zampetakis
?
Presentation.2 .reversal. reversal. pptx
Presentation.2 .reversal. reversal. pptxPresentation.2 .reversal. reversal. pptx
Presentation.2 .reversal. reversal. pptx
siliaselim87
?
5.17 - IntroductionToNeo4j-all狠狠撸s_1_2022_DanMc.pdf
5.17 - IntroductionToNeo4j-all狠狠撸s_1_2022_DanMc.pdf5.17 - IntroductionToNeo4j-all狠狠撸s_1_2022_DanMc.pdf
5.17 - IntroductionToNeo4j-all狠狠撸s_1_2022_DanMc.pdf
javiertec21
?
RAGing Against the Literature: LLM-Powered Dataset Mention Extraction-present...
RAGing Against the Literature: LLM-Powered Dataset Mention Extraction-present...RAGing Against the Literature: LLM-Powered Dataset Mention Extraction-present...
RAGing Against the Literature: LLM-Powered Dataset Mention Extraction-present...
suchanadatta3
?
Relationship between Happiness & LifeQuality .pdf
Relationship between Happiness & LifeQuality .pdfRelationship between Happiness & LifeQuality .pdf
Relationship between Happiness & LifeQuality .pdf
wrachelsong
?
Valkey 101 - SCaLE 22x March 2025 Stokes.pdf
Valkey 101 - SCaLE 22x March 2025 Stokes.pdfValkey 101 - SCaLE 22x March 2025 Stokes.pdf
Valkey 101 - SCaLE 22x March 2025 Stokes.pdf
Dave Stokes
?
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
Timothy Spann
?
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
taqyed
?
Kaggle & Datathons: A Practical Guide to AI Competitions
Kaggle & Datathons: A Practical Guide to AI CompetitionsKaggle & Datathons: A Practical Guide to AI Competitions
Kaggle & Datathons: A Practical Guide to AI Competitions
rasheedsrq
?
The Role of Christopher Campos Orlando in Sustainability Analytics
The Role of Christopher Campos Orlando in Sustainability AnalyticsThe Role of Christopher Campos Orlando in Sustainability Analytics
The Role of Christopher Campos Orlando in Sustainability Analytics
christophercamposus1
?
Presentation1.pptx for data and table analysis
Presentation1.pptx for data and table analysisPresentation1.pptx for data and table analysis
Presentation1.pptx for data and table analysis
vatsalsingla4
?
IFRS Finance Powerpoint ppt Finance D.pptx
IFRS Finance Powerpoint  ppt Finance D.pptxIFRS Finance Powerpoint  ppt Finance D.pptx
IFRS Finance Powerpoint ppt Finance D.pptx
amantiwari2091
?
iam free indeed.pptxiam free indeed.pptx
iam free indeed.pptxiam free indeed.pptxiam free indeed.pptxiam free indeed.pptx
iam free indeed.pptxiam free indeed.pptx
muhweziart
?
The truth behind the numbers: spotting statistical misuse.pptx
The truth behind the numbers: spotting statistical misuse.pptxThe truth behind the numbers: spotting statistical misuse.pptx
The truth behind the numbers: spotting statistical misuse.pptx
andyprosser3
?
办理魁北克大学成绩单触购买加拿大鲍蚕础惭成绩单文凭定制
办理魁北克大学成绩单触购买加拿大鲍蚕础惭成绩单文凭定制办理魁北克大学成绩单触购买加拿大鲍蚕础惭成绩单文凭定制
办理魁北克大学成绩单触购买加拿大鲍蚕础惭成绩单文凭定制
taqyed
?
exampleexampleexampleexampleexampleexampleexampleexample
exampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexampleexample
exampleexampleexampleexampleexampleexampleexampleexample
lembiczkat
?
Introduction Lecture 01 Data Science.pdf
Introduction Lecture 01 Data Science.pdfIntroduction Lecture 01 Data Science.pdf
Introduction Lecture 01 Data Science.pdf
messagetome133
?
加拿大成绩单购买原版(鲍颁毕业证书)卡尔加里大学毕业证文凭
加拿大成绩单购买原版(鲍颁毕业证书)卡尔加里大学毕业证文凭加拿大成绩单购买原版(鲍颁毕业证书)卡尔加里大学毕业证文凭
加拿大成绩单购买原版(鲍颁毕业证书)卡尔加里大学毕业证文凭
taqyed
?
Stasiun kernel pengolahan kelapa sawit indonesia
Stasiun kernel pengolahan kelapa sawit indonesiaStasiun kernel pengolahan kelapa sawit indonesia
Stasiun kernel pengolahan kelapa sawit indonesia
fikrimanurung1
?
april 2024 paper 2 ms. english non fiction
april 2024 paper 2 ms. english non fictionapril 2024 paper 2 ms. english non fiction
april 2024 paper 2 ms. english non fiction
omokoredeolasunbomi
?
Optimizing Common Table Expressions in Apache Hive with Calcite
Optimizing Common Table Expressions in Apache Hive with CalciteOptimizing Common Table Expressions in Apache Hive with Calcite
Optimizing Common Table Expressions in Apache Hive with Calcite
Stamatis Zampetakis
?
Presentation.2 .reversal. reversal. pptx
Presentation.2 .reversal. reversal. pptxPresentation.2 .reversal. reversal. pptx
Presentation.2 .reversal. reversal. pptx
siliaselim87
?
5.17 - IntroductionToNeo4j-all狠狠撸s_1_2022_DanMc.pdf
5.17 - IntroductionToNeo4j-all狠狠撸s_1_2022_DanMc.pdf5.17 - IntroductionToNeo4j-all狠狠撸s_1_2022_DanMc.pdf
5.17 - IntroductionToNeo4j-all狠狠撸s_1_2022_DanMc.pdf
javiertec21
?
RAGing Against the Literature: LLM-Powered Dataset Mention Extraction-present...
RAGing Against the Literature: LLM-Powered Dataset Mention Extraction-present...RAGing Against the Literature: LLM-Powered Dataset Mention Extraction-present...
RAGing Against the Literature: LLM-Powered Dataset Mention Extraction-present...
suchanadatta3
?
Relationship between Happiness & LifeQuality .pdf
Relationship between Happiness & LifeQuality .pdfRelationship between Happiness & LifeQuality .pdf
Relationship between Happiness & LifeQuality .pdf
wrachelsong
?
Valkey 101 - SCaLE 22x March 2025 Stokes.pdf
Valkey 101 - SCaLE 22x March 2025 Stokes.pdfValkey 101 - SCaLE 22x March 2025 Stokes.pdf
Valkey 101 - SCaLE 22x March 2025 Stokes.pdf
Dave Stokes
?
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
Timothy Spann
?
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
原版复刻加拿大多伦多大学成绩单(UTSG毕业证书) 文凭
taqyed
?
Kaggle & Datathons: A Practical Guide to AI Competitions
Kaggle & Datathons: A Practical Guide to AI CompetitionsKaggle & Datathons: A Practical Guide to AI Competitions
Kaggle & Datathons: A Practical Guide to AI Competitions
rasheedsrq
?

Data analysis step by step guide

  • 1. Manish Gupta DATA ANALYSIS STEP BY STEP DATA, DATA, DATA.. there is a lot of data being gathered by all sort of channels and any organization’s success, to an great extent, depends on how well they analyse such data and at what time. So what is the Data Analysis – “Data analysis is a process of ? inspecting, ? cleansing, ? transforming, and ? modeling data with the goal of discovering useful information, informing conclusions, hidden patterns, unknown correlations and supporting decision-making. ” So in simple terms Data analysis is a process for obtaining raw data and converting it into information useful for decision-making by users. Data is collected and analyzed to answer questions, test hypotheses or disprove theories. Lets understanding it from an example assume an IOT device located on a Toll Gate is collecting number of cars passing by that specific Toll Gate . It collects the registration number of the car and time of passing the traffic light. Now further assume, that there were 100,000 cars passed in a day. Now !! what information this data can give, it depends on the user or management objective but there are following possibilities. ? No of cars passed during certain time slots of the day to understand traffic behavior ? Linkup the plate number of owner attributes and analyse further o What was the ratio of taxi to owned cars (if high taxi may be public transport can be planned) o How many were private cars and how many were company cars for business and personal. o If similar device and data is available at toll exit, can analyse the data to understand average speed by type of cars. Lets Look at another example of an e-commerce business which sells 100s of products each day on its website. There will be atleast 3,000 transactions in a month and whopping 36,000 transactions in a year. Following can be achieved through data analytics. 1. What are most sold products on the platform. 2. What are most profitable products on platform and if they are amongst the most sold products. 3. Are there any products which are most sold but not so profitable? 4. Are there any products which are selling at loss? 5. Is there any product for whose sales is limited due to any controllable constraints? 6. What type of customers are buying product? What are opportunities !
  • 2. Manish Gupta DATA ANALYSIS STEP BY STEP 7. What is the average price per order ! 8. What products mostly brought together or by same customers? And much more depending on business development needs Ok!! Now that we understand how data analysis can be powerful for business lets look at various steps and techniques performed for data analytics. Before we understand what is data analytics, Lets try to Understand the Decision making Process in brief related to data. For this part of book we will look into only decision making needs related to data. Strategic decision making process will be discussed another time. Decision Making points from a data set can be grouped into following Answer How ? How Much or How Many of sales, products, customers, etc ? How long (delivery time, period sales, shelf life, sale time from purchase time) Answer WH Family ? Who (customer profiles and segments), ? What (product segments), ? Where (Geo Segments) and ? When (date, timestamp month) Now that we understand what various possible decision-making points are, lets look at the process of making such decisions starting from collection till interpretation of data into information. Key Steps of Data Analysis a. Assess Data requirements The data is necessary as inputs to the analysis, which is specified based upon the requirements of those directing the analysis or customers (who will use the finished product of the analysis). Following must be kept in mind while assessing data requirements. 1. Specific data variables (decision points) which can be a. numerical quantity of products, revenue, costing or b. categorical i.e category of products, regions, type of customer 2. Data can be collected to the basic transaction level 3. Data must be segmented and referenced to common sets for example customer profile can be collected separately and should be referenced in transactions so that repetitive data can be reduced.
  • 3. Manish Gupta DATA ANALYSIS STEP BY STEP b. Data collection Data is collected from a variety of sources. The requirements may be communicated by analysts to custodians of the data, such as information technology personnel within an organization. The data may also be collected from 1. sensors in the environment, such as traffic cameras, satellites, recording devices, etc. 2. obtained through interviews, 3. downloads from online sources or reading documentation. c. Data processing Data initially obtained must be processed or organised for analysis. For instance, these may involve placing data into rows and columns in a table format (i.e., structured data) for further analysis, such as within a spreadsheet or statistical software. d. Data cleaning Once processed and organised, the data may be ? incomplete, ? contain duplicates, or ? contain errors. Data cleaning is the process of preventing and correcting these errors. Following processes and checks are usually applied while cleaning data. a. record matching, b. deduplication, and c. column segmentation. d. Such data problems can also be identified through a variety of analytical techniques. e. For example, with financial information, the totals for revenue can be matched against total revenue reported in financial statements. f. Textual data spell checkers can be used to lessen the amount of mistyped words, but it is harder to tell if the words themselves are correct. e. Perform Data Analysis methodologies There are two main type of Data Analysis methods. Quantitative Analysis 1. Descriptive Analysis - Describe the main features of a large collection of data. 2. Confirmatory Analysis - Confirm or negate a hypothesis. 3. Exploratory Analysis - Find previously unknown relationships in the data.
  • 4. Manish Gupta DATA ANALYSIS STEP BY STEP 4. Inferential Analysis - Use a smaller sample of data to learn something about a bigger population. 5. Causal Analysis - Find out what happens to one variable when you change another. Event Series Analysis facilitate searching for patterns across multiple event records and datasets f. Communication & Evaluation Data visualization to understand the results of a data analysis. Once the data is analyzed, it may be reported in many formats to the users of the analysis to support their requirements. The users may have feedback, which results in additional analysis. As such, much of the analytical cycle is iterative. Now that we get the brief understanding of data analysis concepts and steps, we will continue to learn detailed methodologies and techniques using Excel. To Learn data analysis in more details with lots of examples and apply techniques subscribe to my detailed e-book Pre-order @ USD 2 only (80% introductory discount), after publishing it will be priced at USD 10. I will sweeten the deal by adding a free access to webinar If you subscribe now !! We will be launching a online course on data analysis, which I will be announcing soon.