狠狠撸

狠狠撸Share a Scribd company logo
Data Science Talk
AMOL SAHASRABUDHE
ANKIT JAIN
WHY THIS PRESENTATION
DEMOCRATIZE
BRAINSTORMING
2
? Why and What of Data Science
? Data Science @Roadrunnr
? Multi Drop Logic
? Demand Prediction
? Future Work
Agenda
3
What is Data Science ?
4
Data is Everywhere
5
HOUSE OF CARDS
MAKING SENSE OF DATA
6
HOUSE OF CARDS
WHO USES DATA SCIENCE
7
HOUSE OF CARDS
WHO IS A DATA SCIENTIST
8
HOUSE OF CARDS
WHAT DO THEY DO
9
HOUSE OF CARDS
WHAT DO THEY DO
IDEA EXPERIMENT VALIDATE SPEC DEPLOY
MODELING
SKILLS : Math, Statistics, Programming and Domain Knowledge
10
HOUSE OF CARDS
DATA SCIENCE@ROADRUNNR
11
Use data as a raw input to develop products to:
? Minimize Estimated Time of Arrival (ETA)
? Improve Reliability
at minimal Costs
VALUES
12
Multi Drop Clubbing Logic
? Improve the multi drop grouping logic to:
? Reduce number of touchpoints/order (5-15%)
? Reduce distance travelled per touch point
? Increase driver satisfaction
13
Existing Multi Drop Logic
HUB
Red Cluster can be done away with in this scenario
14
Ideal Clubbing
HUB
DB should be able to cover points on his way
15
How to Achieve Ideal Clubbing?
HUB
D1 < D2
Join two points which have minimum distance between them
? Joining two points
16
Joining Two Groups
? Join on the basis of shortest distances and not center distances to
achieve ideal clubbing
HUB
? S1, S2 shortest distances
? C1, C2 center distances
17
Delhi West Hub Orders (Existing Clubbing)
18
Delhi West Hub Orders (New Clubbing)
19
Algorithm Limitations
? Sub optimal solution due to accuracy vs complexity trade off
? Performance limited by accuracy of Lat,Long of drop points
? Not optimized for size and weight of shipments
? Routing not included as a part of this version (V2)
? Considers only bike as a carrier (V2)
20
Demand Prediction
21
Demand Prediction (Impact)
?30% stock-outs during peak hours
?70% demand during peak hours
?Supply Planning
?Driver Placement
?Surge Pricing
22
Demand Prediction(Actual Demand)
Koramangala
High Variance in hourly demand
23
Demand Prediction(Method)
24
Demand Prediction(Actual and Predicted)
Koramangala
25
Demand Prediction(drill down)
Koramangala 8PM
Predictability in number of orders in this hour
26
Demand Prediction(Model Failure)
Electronic City
Demand is too erratic to be captured by modeling
Bad predictions
27
Demand Prediction(Future Work)
?Category specific demand
?Prediction for smaller clusters rather than localities
?Prediction over smaller intervals
?Stock-out incorporation
28
Data Science Future Projects
?Surge pricing
?Wait time prediction for food orders (deployment pending)
?Carrier Selection (Extension of multi drop logic)
?Driver Fraud detection
?Product size estimation using image
29
THANK YOU
30
Ad

Recommended

Walkalytics - Reachability Analysis for your business
Walkalytics - Reachability Analysis for your business
Stephan Heuel
?
Life Lessons
Life Lessons
Ankit Jain
?
Data analytics in fraud detection and customer feedback
Data analytics in fraud detection and customer feedback
Ankit Jain
?
Data analytics workshop @IIIT Bangalore
Data analytics workshop @IIIT Bangalore
Ankit Jain
?
Data Science in Ecommerce
Data Science in Ecommerce
Ankit Jain
?
Advanced regression and model selection
Advanced regression and model selection
Ankit Jain
?
Operationalizing Machine Learning Using GPU-accelerated, In-database Analytics
Operationalizing Machine Learning Using GPU-accelerated, In-database Analytics
Kinetica
?
Hybrid Transactional/Analytics Processing with Spark and IMDGs
Hybrid Transactional/Analytics Processing with Spark and IMDGs
Ali Hodroj
?
Uzair's CV
Uzair's CV
uzair rasheed
?
Performance Models for Apache Accumulo
Performance Models for Apache Accumulo
Sqrrl
?
Webinar: SQL for Machine Data?
Webinar: SQL for Machine Data?
Crate.io
?
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit
?
Research Methodology Presentation - Research in Supply Chain Digital Twins
Research Methodology Presentation - Research in Supply Chain Digital Twins
Arwa Abougharib
?
Large-scaled telematics analytics
Large-scaled telematics analytics
DataWorks Summit
?
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
Juan Sequeda
?
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Caserta
?
Hardware Accelerated Machine Learning Solution for Detecting Fraud and Money ...
Hardware Accelerated Machine Learning Solution for Detecting Fraud and Money ...
TigerGraph
?
Scylla Summit 2016: ScyllaDB, Present and Future
Scylla Summit 2016: ScyllaDB, Present and Future
ScyllaDB
?
PLNOG 3: John Evans - Best Practices in Network Planning
PLNOG 3: John Evans - Best Practices in Network Planning
PROIDEA
?
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
Mathieu Dumoulin
?
Big Data 2.0 - Milwaukee Big Data User Group Presentation
Big Data 2.0 - Milwaukee Big Data User Group Presentation
NVISIA
?
OPTIMIZING THE TICK STACK
OPTIMIZING THE TICK STACK
InfluxData
?
Customer Story: Elastic Stack? ??? ?? ??? ?? ?? ???
Customer Story: Elastic Stack? ??? ?? ??? ?? ?? ???
Elasticsearch
?
Approximate shortest distance computing
Approximate shortest distance computing
LeMeniz Infotech
?
Bandwidth distributed denial of service attacks and defenses
Bandwidth distributed denial of service attacks and defenses
LeMeniz Infotech
?
Real-Time Streaming: Move IMS Data to Your Cloud Data Warehouse
Real-Time Streaming: Move IMS Data to Your Cloud Data Warehouse
Precisely
?
Data Scotland - Migrating Mapping Dataflows by Johan Kangasniemi.pdf
Data Scotland - Migrating Mapping Dataflows by Johan Kangasniemi.pdf
kangasniemi
?
An Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDB
MongoDB
?
美国毕业证范本中华盛顿大学学位证书颁奥鲍学生卡购买
美国毕业证范本中华盛顿大学学位证书颁奥鲍学生卡购买
Taqyea
?
UPS and Big Data intro to Business Analytics.pptx
UPS and Big Data intro to Business Analytics.pptx
sanjum5582
?

More Related Content

Similar to Data Science Projects @ Runnr (20)

Uzair's CV
Uzair's CV
uzair rasheed
?
Performance Models for Apache Accumulo
Performance Models for Apache Accumulo
Sqrrl
?
Webinar: SQL for Machine Data?
Webinar: SQL for Machine Data?
Crate.io
?
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit
?
Research Methodology Presentation - Research in Supply Chain Digital Twins
Research Methodology Presentation - Research in Supply Chain Digital Twins
Arwa Abougharib
?
Large-scaled telematics analytics
Large-scaled telematics analytics
DataWorks Summit
?
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
Juan Sequeda
?
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Caserta
?
Hardware Accelerated Machine Learning Solution for Detecting Fraud and Money ...
Hardware Accelerated Machine Learning Solution for Detecting Fraud and Money ...
TigerGraph
?
Scylla Summit 2016: ScyllaDB, Present and Future
Scylla Summit 2016: ScyllaDB, Present and Future
ScyllaDB
?
PLNOG 3: John Evans - Best Practices in Network Planning
PLNOG 3: John Evans - Best Practices in Network Planning
PROIDEA
?
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
Mathieu Dumoulin
?
Big Data 2.0 - Milwaukee Big Data User Group Presentation
Big Data 2.0 - Milwaukee Big Data User Group Presentation
NVISIA
?
OPTIMIZING THE TICK STACK
OPTIMIZING THE TICK STACK
InfluxData
?
Customer Story: Elastic Stack? ??? ?? ??? ?? ?? ???
Customer Story: Elastic Stack? ??? ?? ??? ?? ?? ???
Elasticsearch
?
Approximate shortest distance computing
Approximate shortest distance computing
LeMeniz Infotech
?
Bandwidth distributed denial of service attacks and defenses
Bandwidth distributed denial of service attacks and defenses
LeMeniz Infotech
?
Real-Time Streaming: Move IMS Data to Your Cloud Data Warehouse
Real-Time Streaming: Move IMS Data to Your Cloud Data Warehouse
Precisely
?
Data Scotland - Migrating Mapping Dataflows by Johan Kangasniemi.pdf
Data Scotland - Migrating Mapping Dataflows by Johan Kangasniemi.pdf
kangasniemi
?
An Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDB
MongoDB
?
Performance Models for Apache Accumulo
Performance Models for Apache Accumulo
Sqrrl
?
Webinar: SQL for Machine Data?
Webinar: SQL for Machine Data?
Crate.io
?
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit 2015: Performance Models for Apache Accumulo: The Heavy Tail ...
Accumulo Summit
?
Research Methodology Presentation - Research in Supply Chain Digital Twins
Research Methodology Presentation - Research in Supply Chain Digital Twins
Arwa Abougharib
?
Large-scaled telematics analytics
Large-scaled telematics analytics
DataWorks Summit
?
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
Integrating Semantic Web with the Real World - A Journey between Two Cities ...
Juan Sequeda
?
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Stinger Initiative: Leveraging Hive & Yarn for High-Performance/Interactive Q...
Caserta
?
Hardware Accelerated Machine Learning Solution for Detecting Fraud and Money ...
Hardware Accelerated Machine Learning Solution for Detecting Fraud and Money ...
TigerGraph
?
Scylla Summit 2016: ScyllaDB, Present and Future
Scylla Summit 2016: ScyllaDB, Present and Future
ScyllaDB
?
PLNOG 3: John Evans - Best Practices in Network Planning
PLNOG 3: John Evans - Best Practices in Network Planning
PROIDEA
?
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
Mathieu Dumoulin
?
Big Data 2.0 - Milwaukee Big Data User Group Presentation
Big Data 2.0 - Milwaukee Big Data User Group Presentation
NVISIA
?
OPTIMIZING THE TICK STACK
OPTIMIZING THE TICK STACK
InfluxData
?
Customer Story: Elastic Stack? ??? ?? ??? ?? ?? ???
Customer Story: Elastic Stack? ??? ?? ??? ?? ?? ???
Elasticsearch
?
Approximate shortest distance computing
Approximate shortest distance computing
LeMeniz Infotech
?
Bandwidth distributed denial of service attacks and defenses
Bandwidth distributed denial of service attacks and defenses
LeMeniz Infotech
?
Real-Time Streaming: Move IMS Data to Your Cloud Data Warehouse
Real-Time Streaming: Move IMS Data to Your Cloud Data Warehouse
Precisely
?
Data Scotland - Migrating Mapping Dataflows by Johan Kangasniemi.pdf
Data Scotland - Migrating Mapping Dataflows by Johan Kangasniemi.pdf
kangasniemi
?
An Enterprise Architect's View of MongoDB
An Enterprise Architect's View of MongoDB
MongoDB
?

Recently uploaded (20)

美国毕业证范本中华盛顿大学学位证书颁奥鲍学生卡购买
美国毕业证范本中华盛顿大学学位证书颁奥鲍学生卡购买
Taqyea
?
UPS and Big Data intro to Business Analytics.pptx
UPS and Big Data intro to Business Analytics.pptx
sanjum5582
?
presentation4.pdf Intro to mcmc methodss
presentation4.pdf Intro to mcmc methodss
SergeyTsygankov6
?
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Prasenjit Debnath
?
NASA ESE Study Results v4 05.29.2020.pptx
NASA ESE Study Results v4 05.29.2020.pptx
CiroAlejandroCamacho
?
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
Tamanna36
?
25 items quiz for practical research 1 in grade 11
25 items quiz for practical research 1 in grade 11
leamaydayaganon81
?
Indigo dyeing Presentation (2).pptx as dye
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
?
Data Visualisation in data science for students
Data Visualisation in data science for students
confidenceascend
?
MRI Pulse Sequence in radiology physics.pptx
MRI Pulse Sequence in radiology physics.pptx
BelaynehBishaw
?
Indigo_Airlines_Strategy_Presentation.pptx
Indigo_Airlines_Strategy_Presentation.pptx
mukeshpurohit991
?
Camuflaje Tipos Características Militar 2025.ppt
Camuflaje Tipos Características Militar 2025.ppt
e58650738
?
最新版美国威斯康星大学河城分校毕业证(鲍奥搁贵毕业证书)原版定制
最新版美国威斯康星大学河城分校毕业证(鲍奥搁贵毕业证书)原版定制
taqyea
?
Crafting-Research-Recommendations Grade 12.pptx
Crafting-Research-Recommendations Grade 12.pptx
DaryllWhere
?
The Influence off Flexible Work Policies
The Influence off Flexible Work Policies
sales480687
?
最新版美国芝加哥大学毕业证(鲍颁丑颈肠补驳辞毕业证书)原版定制
最新版美国芝加哥大学毕业证(鲍颁丑颈肠补驳辞毕业证书)原版定制
taqyea
?
英国毕业证范本利物浦约翰摩尔斯大学成绩单底纹防伪尝闯惭鲍学生证办理学历认证
英国毕业证范本利物浦约翰摩尔斯大学成绩单底纹防伪尝闯惭鲍学生证办理学历认证
taqyed
?
YEAP !NOT WHAT YOU THINK aakshdjdncnkenfj
YEAP !NOT WHAT YOU THINK aakshdjdncnkenfj
payalmistryb
?
Allotted-MBBS-Student-list-batch-2021.pdf
Allotted-MBBS-Student-list-batch-2021.pdf
subhansaifi0603
?
Presentation by Tariq & Mohammed (1).pptx
Presentation by Tariq & Mohammed (1).pptx
AbooddSandoqaa
?
美国毕业证范本中华盛顿大学学位证书颁奥鲍学生卡购买
美国毕业证范本中华盛顿大学学位证书颁奥鲍学生卡购买
Taqyea
?
UPS and Big Data intro to Business Analytics.pptx
UPS and Big Data intro to Business Analytics.pptx
sanjum5582
?
presentation4.pdf Intro to mcmc methodss
presentation4.pdf Intro to mcmc methodss
SergeyTsygankov6
?
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Microsoft Power BI - Advanced Certificate for Business Intelligence using Pow...
Prasenjit Debnath
?
NASA ESE Study Results v4 05.29.2020.pptx
NASA ESE Study Results v4 05.29.2020.pptx
CiroAlejandroCamacho
?
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
NVIDIA Triton Inference Server, a game-changing platform for deploying AI mod...
Tamanna36
?
25 items quiz for practical research 1 in grade 11
25 items quiz for practical research 1 in grade 11
leamaydayaganon81
?
Indigo dyeing Presentation (2).pptx as dye
Indigo dyeing Presentation (2).pptx as dye
shreeroop1335
?
Data Visualisation in data science for students
Data Visualisation in data science for students
confidenceascend
?
MRI Pulse Sequence in radiology physics.pptx
MRI Pulse Sequence in radiology physics.pptx
BelaynehBishaw
?
Indigo_Airlines_Strategy_Presentation.pptx
Indigo_Airlines_Strategy_Presentation.pptx
mukeshpurohit991
?
Camuflaje Tipos Características Militar 2025.ppt
Camuflaje Tipos Características Militar 2025.ppt
e58650738
?
最新版美国威斯康星大学河城分校毕业证(鲍奥搁贵毕业证书)原版定制
最新版美国威斯康星大学河城分校毕业证(鲍奥搁贵毕业证书)原版定制
taqyea
?
Crafting-Research-Recommendations Grade 12.pptx
Crafting-Research-Recommendations Grade 12.pptx
DaryllWhere
?
The Influence off Flexible Work Policies
The Influence off Flexible Work Policies
sales480687
?
最新版美国芝加哥大学毕业证(鲍颁丑颈肠补驳辞毕业证书)原版定制
最新版美国芝加哥大学毕业证(鲍颁丑颈肠补驳辞毕业证书)原版定制
taqyea
?
英国毕业证范本利物浦约翰摩尔斯大学成绩单底纹防伪尝闯惭鲍学生证办理学历认证
英国毕业证范本利物浦约翰摩尔斯大学成绩单底纹防伪尝闯惭鲍学生证办理学历认证
taqyed
?
YEAP !NOT WHAT YOU THINK aakshdjdncnkenfj
YEAP !NOT WHAT YOU THINK aakshdjdncnkenfj
payalmistryb
?
Allotted-MBBS-Student-list-batch-2021.pdf
Allotted-MBBS-Student-list-batch-2021.pdf
subhansaifi0603
?
Presentation by Tariq & Mohammed (1).pptx
Presentation by Tariq & Mohammed (1).pptx
AbooddSandoqaa
?
Ad

Data Science Projects @ Runnr