This document summarizes a research project analyzing travel demand data from GoGet, a car sharing service, to develop a predictive model of customer trip generation. The researchers used GoGet customer data and transportation demand modeling fundamentals to create a multinomial logistic regression model predicting the number of monthly GoGet trips based on a person's age, income, and car ownership. The results provide GoGet useful information about expected customer demand in different locations to improve service planning and expansion.
Autonomous smart traffic control is proposed to relieve traffic congestion for bus scheduling, to intelligently accomplish tasks such as on-demand dynamic passenger pickup or drop-off.
The Workplace Newsletter - February 2017Taylor Newton
Ìý
Welcome to the first edition of "The Workplace" for 2017. In this edition: * Worker unfairly dismissed after leaving work due to panic attack * Gambling addiction and theft - can the two be linked? * Intoxicated worker unsuccessfully sues for employer's request for alcohol screening * Meet our team *
Megan Pollins is currently studying for a CILEX Legal Secretarial Diploma Level 3 at Palmer's College. She has gained legal work experience at King & Wood Mallesons, McDermott Will & Emery, and Numis Securities Limited. Her roles have included providing secretarial and administrative support, file management, updating contacts, creating presentations, and typing documents. Megan also works part-time at House of Fraser in menswear where she assists customers and balances cash registers.
Este documento describe los pasos para redactar un ensayo y define los diferentes tipos de ensayos. Explica que la redacción de un ensayo implica expresar ideas sobre un tema especÃfico. Luego, enumera siete pasos para redactar un ensayo, como identificar el tema, ordenar las ideas y realizar correcciones. Además, detalla seis etapas para la elaboración de ensayos como buscar información y organizarla. Finalmente, define cuatro tipos de ensayos como filosóficos, crÃticos, descriptivos
The document provides a detailed wardrobe outline for various scenes in a film or show. It lists outfits for multiple main characters for over 30 scenes, specifying clothing items down to brands and styles. Outfits range from formal wear, casual daily clothes, pajamas, and underwear. Location and time of day are provided to help determine appropriate attire for each scene.
Hybrid iterated local search algorithm for optimization route of airplane tr...IJECEIAES
Ìý
The traveling salesman problem (TSP) is a very popular combinatorics problem. This problem has been widely applied to various real problems. The TSP problem has been classified as a Non-deterministic Polynomial Hard (NP-Hard), so a non-deterministic algorithm is needed to solve this problem. However, a non-deterministic algorithm can only produce a fairly good solution but does not guarantee an optimal solution. Therefore, there are still opportunities to develop new algorithms with better optimization results. This research develops a new algorithm by hybridizing three local search algorithms, namely, iterated local search (ILS) with simulated annealing (SA) and hill climbing (HC), to get a better optimization result. This algorithm aimed to solve TSP problems in the transportation sector, using a case study from the Traveling Salesman Challenge 2.0 (TSC 2.0). The test results show that the developed algorithm can optimize better by 15.7% on average and 11.4% based on the best results compared to previous studies using the TabuSA algorithm.
Airlines are concerned for route development, but if there is no business for certain routes/city, there will be no operation to that city/airport. While airports acts as facilitators for airlines to encourage them to operates for new routes or increase their frequencies. also airports offered a good service at a most convenient cost for airlines. That is why each airport in the world is always concerned about the route development, they are publishing the airlines traffic/statistics on monthly bases, exploring the future business potential of existing/operating routes.
Predictive Analysis of Bike Sharing System Using Machine Learning Algorithmssushantparte
Ìý
Provided business solutions based on the ethical aspects of data collection and shortcomings of business by visualizing data and forecasting the demand using Ensemble Learning Technique (Random Forest) with an RMSE of 89.09%.
This document is a project report describing the use of a genetic algorithm to solve a traveling salesman problem. It details how the genetic algorithm was implemented to find the optimal route for a honeymoon couple visiting multiple European countries, given the starting and ending countries. Key steps included image processing to obtain country coordinates, initializing a population of routes, evaluating routes using a fitness function, and applying genetic operators like selection, crossover and mutation over multiple iterations to converge on the shortest route. The genetic algorithm approach was found to be well-suited for the traveling salesman problem by providing good solutions efficiently.
In a marketing analytics class, we were responsible for mimicking a company, coming up with a questions to be solved, making an analytical model to answer the question, and determine if we asked the right question.
Bulldozer price prediction using regression model (Research Ethics).pptxHaxiKhan1
Ìý
The document discusses several research papers related to predicting vehicle prices using machine learning models. It summarizes three papers in particular. The first paper aims to build a model to predict vehicle value based on attributes using KNN and CART algorithms, finding CART to be more accurate. The second paper proposes using machine learning on optimal features to predict used vehicle prices, obtaining a 90% accuracy. The third paper evaluates models for predicting used car prices, finding the random forest regressor performed best with the lowest error.
IRJET- A Hybrid Approach for Travelling Service by using Data Parsing and Enh...IRJET Journal
Ìý
This document describes a hybrid approach for an online travelling service that provides car rentals for short periods of time using data parsing and enhanced prediction techniques. It aims to suggest places for users to travel to using artificial intelligence and provide a cost comparison of major online car booking services. The system is intended to make travel planning easier and more cost-effective for users. It works by learning user preferences from multiple data sources to generate personalized travel recommendations and filter unwanted information.
The document is a student's report on analyzing Uber trip data using Python. It introduces the data exploration process, including importing the data file into Jupyter Notebook. Various analyses are performed to determine the most popular trip purpose/distance, unique destinations, popular starting points, and trips by category in a bar graph. The conclusion summarizes the key insights from visualizing the data on trip purposes, destinations, starting points, and categories using Python techniques. It emphasizes that exploratory data analysis helps understand business data better.
Analysis on Bike Rental Data to Predict Future UseKimberly Nguyen
Ìý
This document summarizes a student project analyzing bike rental data to build predictive models for casual and registered bike users. The students created separate linear regression models for casual and registered users. For casual users, they found the original model violated assumptions, so they took the response variable to the power of 0.4 and saw improved linearity and constant variance. Their best casual user model included variables for weather, season, month, and temperature. For registered users, they applied a mean shift which showed unbiasedness. The students' predictive models were better at predicting bike use away from holidays and extreme weather.
Using Gamification For Stimulating Safe And Good Driving BehaviorLucas Machado
Ìý
The document proposes a gamification system to stimulate safe and good driving behavior. The system would use sensors in vehicles to track driving data like speed, braking distance, and route taken. Drivers would earn points for good behavior like following traffic laws and driving efficiently. Points could be redeemed for discounts on taxes, fuel, and insurance. By making driving a game and rewarding safe habits, the system aims to improve road safety and reduce accidents and traffic congestion.
A Novel Feature Engineering Framework in Digital Advertising Platformijaia
Ìý
Digital advertising is growing massively all over the world, and, nowadays, is the best way to reach potential customers, where they spend the vast majority of their time on the Internet. While an advertisement is an announcement online about something such as a product or service, predicting the probability that a user do any action on the ads, is critical to many web applications. Due to over billions daily active users, and millions daily active advertisers, a typical model should provide predictions on billions events per day. So, the main challenge lies in the large design space to address issues of scale, where we need to rely on a subset of well-designed features. In this paper, we propose a novel feature engineering framework, specialized in feature selection using the efficient statistical approaches, which significantly outperform the state-of-the-art ones. To justify our claim, a large dataset of a running marketing campaign is used to evaluate the efficiency of the proposed approaches, where the results illustrate their benefits.
A Novel Feature Engineering Framework in Digital Advertising Platformgerogepatton
Ìý
Digital advertising is growing massively all over the world, and, nowadays, is the best way to reach potential customers, where they spend the vast majority of their time on the Internet. While an advertisement is an announcement online about something such as a product or service, predicting the probability that a user do any action on the ads, is critical to many web applications. Due to over billions daily active users, and millions daily active advertisers, a typical model should provide predictions on billions events per day. So, the main challenge lies in the large design space to address issues of scale, where we need to rely on a subset of well-designed features. In this paper, we propose a novel feature engineering framework, specialized in feature selection using the efficient statistical approaches, which significantly outperform the state-of-the-art ones. To justify our claim, a large dataset of a running marketing campaign is used to evaluate the efficiency of the proposed approaches, where the results illustrate their benefits.
Hybrid Genetic Algorithms and Simulated Annealing for Multi-trip Vehicle Rout...IJECEIAES
Ìý
Vehicle routing problem with time windows (VRPTW) is one of NP-hard problem. Multi-trip is approach to solve the VRPTW that looking trip scheduling for gets best result. Even though there are various algorithms for the problem, there is opportunity to improve the existing algorithms in order gaining a better result. In this research, genetic algoritm is hybridized with simulated annealing algoritm to solve the problem. Genetic algoritm is employed to explore global search area and simulated annealing is employed to exploit local search area. Four combination types of genetic algorithm and simulated annealing (GA-SA) are tested to get the best solution. The computational experiment shows that GA-SA1 and GA-SA4 can produced the most optimal fitness average values with each value was 1.0888 and 1.0887. However GA-SA4 can found the best fitness chromosome faster than GA-SA1.
Prediction of Used Car Prices using Machine Learning TechniquesIRJET Journal
Ìý
This document describes research on using machine learning techniques to predict used car prices. It discusses collecting a dataset from online sources and preprocessing the data, which includes features like initial price, mileage, fuel type, etc. Various regression algorithms are then applied to build models for predicting car prices, including linear regression, lasso regression, ridge regression, and ensemble methods like random forest and gradient boosting. The models are evaluated and the best performing techniques are identified. Finally, a web application is created to allow users to input vehicle details and receive predicted price values.
This document proposes reallocating the Georgia Tech Police Department's (GTPD) patrol zones based on analyzing past crime patterns to make the campus safer. The research team has analyzed four years of crime data using clustering algorithms and time series analysis. They found crime clusters, relationships between certain crimes, and were able to predict future crime locations and types. The current 4 patrol zones are inefficient as Zone 2 has noticeably more crimes. The team aims to strategically define new zones incorporating their findings to suggest more reasonable patrol routes and make crime occurrences more uniform across zones. Their goal is to increase patrol efficiency and encounter more criminals to improve campus safety. They will clean the data, analyze crime patterns, predict future crimes, optimize patrol zone
A new hybrid approach for solving travelling salesman problem using ordered c...eSAT Journals
Ìý
Abstract Travelling Salesman Problem is a well known NP problem. It is an optimization problem. Genetic Algorithms are the evolution techniques to solve optimization problems. In this paper a new hybrid technique using ordered cross over 1 (OX1) and greedy approach has been proposed. Experiment results shows that the proposed hybrid cross over is better than the existing cross over operator as the new operator provide a better path when executed for the same number of iterations. Keywords:- Travelling Salesman Problem, ordered cross over 1 (OX1)
The Optimizing Multiple Travelling Salesman Problem Using Genetic Algorithmijsrd.com
Ìý
The traveling salesman problem (TSP) supports the idea of a single salesperson traveling in a continuous trip visiting all n cities exactly once and returning to the starting point. The multiple traveling salesman problems (mTSP) is complex combinatorial optimization problem, which is a generalization of the well-known Travelling Salesman Problem (TSP), where one or more salesman can be used in the path. In this paper mTSP has also been studied and solved with GA in the form of the vehicle scheduling problem. The existing model is new models are compared to both mathematically and experimentally. This work presents a chromosome methodology setting up the MTSP to be solved using a GA.
PREDICTING ROAD ACCIDENT RISK USING GOOGLE MAPS IMAGES AND ACONVOLUTIONAL NEU...gerogepatton
Ìý
Location specific characteristics of a road segment such as road geometry as well as surrounding road features can contribute significantly to road accident risk. A Google Maps image of a road segment provides a comprehensive visual of its complex geometry and the surrounding features. This paper proposes a novel machine learning approach using Convolutional Neural Networks (CNN) to accident risk prediction by unlocking the precise interaction of these many small road features that work in combination to contribute to a greater accident risk. The model has worldwide applicability and a very low cost/time effort to implement for a new city since Google Maps are available in most places across the globe. It also significantly contributes to existing research on accident prevention by allowing for the inclusion of highly detailed road geometry to weigh in on the prediction as well as the new locationbased attributes like proximity to schools and businesses.
PREDICTING ROAD ACCIDENT RISK USING GOOGLE MAPS IMAGES AND ACONVOLUTIONAL NEU...ijaia
Ìý
Location specific characteristics of a road segment such as road geometry as well as surrounding road features can contribute significantly to road accident risk. A Google Maps image of a road segment provides a comprehensive visual of its complex geometry and the surrounding features. This paper proposes a novel machine learning approach using Convolutional Neural Networks (CNN) to accident risk prediction by unlocking the precise interaction of these many small road features that work in combination to contribute to a greater accident risk. The model has worldwide applicability and a very low cost/time effort to implement for a new city since Google Maps are available in most places across the globe. It also significantly contributes to existing research on accident prevention by allowing for the inclusion of highly detailed road geometry to weigh in on the prediction as well as the new locationbased attributes like proximity to schools and businesses.
Predicting Road Accident Risk Using Google Maps Images and A Convolutional Ne...gerogepatton
Ìý
This document describes a study that used convolutional neural networks and Google Maps images to predict road accident risk. The model was trained on past accident data and images of accident locations from cities like New York, Chicago and Austin. It achieved prediction accuracies of 85-86% on test data from those cities. The model provides a low-cost way to identify potentially risky road segments that is applicable worldwide since Google Maps coverage is extensive. It also considers detailed road geometry and nearby features that may contribute to accident risk, unlike some previous approaches.
Hybrid iterated local search algorithm for optimization route of airplane tr...IJECEIAES
Ìý
The traveling salesman problem (TSP) is a very popular combinatorics problem. This problem has been widely applied to various real problems. The TSP problem has been classified as a Non-deterministic Polynomial Hard (NP-Hard), so a non-deterministic algorithm is needed to solve this problem. However, a non-deterministic algorithm can only produce a fairly good solution but does not guarantee an optimal solution. Therefore, there are still opportunities to develop new algorithms with better optimization results. This research develops a new algorithm by hybridizing three local search algorithms, namely, iterated local search (ILS) with simulated annealing (SA) and hill climbing (HC), to get a better optimization result. This algorithm aimed to solve TSP problems in the transportation sector, using a case study from the Traveling Salesman Challenge 2.0 (TSC 2.0). The test results show that the developed algorithm can optimize better by 15.7% on average and 11.4% based on the best results compared to previous studies using the TabuSA algorithm.
Airlines are concerned for route development, but if there is no business for certain routes/city, there will be no operation to that city/airport. While airports acts as facilitators for airlines to encourage them to operates for new routes or increase their frequencies. also airports offered a good service at a most convenient cost for airlines. That is why each airport in the world is always concerned about the route development, they are publishing the airlines traffic/statistics on monthly bases, exploring the future business potential of existing/operating routes.
Predictive Analysis of Bike Sharing System Using Machine Learning Algorithmssushantparte
Ìý
Provided business solutions based on the ethical aspects of data collection and shortcomings of business by visualizing data and forecasting the demand using Ensemble Learning Technique (Random Forest) with an RMSE of 89.09%.
This document is a project report describing the use of a genetic algorithm to solve a traveling salesman problem. It details how the genetic algorithm was implemented to find the optimal route for a honeymoon couple visiting multiple European countries, given the starting and ending countries. Key steps included image processing to obtain country coordinates, initializing a population of routes, evaluating routes using a fitness function, and applying genetic operators like selection, crossover and mutation over multiple iterations to converge on the shortest route. The genetic algorithm approach was found to be well-suited for the traveling salesman problem by providing good solutions efficiently.
In a marketing analytics class, we were responsible for mimicking a company, coming up with a questions to be solved, making an analytical model to answer the question, and determine if we asked the right question.
Bulldozer price prediction using regression model (Research Ethics).pptxHaxiKhan1
Ìý
The document discusses several research papers related to predicting vehicle prices using machine learning models. It summarizes three papers in particular. The first paper aims to build a model to predict vehicle value based on attributes using KNN and CART algorithms, finding CART to be more accurate. The second paper proposes using machine learning on optimal features to predict used vehicle prices, obtaining a 90% accuracy. The third paper evaluates models for predicting used car prices, finding the random forest regressor performed best with the lowest error.
IRJET- A Hybrid Approach for Travelling Service by using Data Parsing and Enh...IRJET Journal
Ìý
This document describes a hybrid approach for an online travelling service that provides car rentals for short periods of time using data parsing and enhanced prediction techniques. It aims to suggest places for users to travel to using artificial intelligence and provide a cost comparison of major online car booking services. The system is intended to make travel planning easier and more cost-effective for users. It works by learning user preferences from multiple data sources to generate personalized travel recommendations and filter unwanted information.
The document is a student's report on analyzing Uber trip data using Python. It introduces the data exploration process, including importing the data file into Jupyter Notebook. Various analyses are performed to determine the most popular trip purpose/distance, unique destinations, popular starting points, and trips by category in a bar graph. The conclusion summarizes the key insights from visualizing the data on trip purposes, destinations, starting points, and categories using Python techniques. It emphasizes that exploratory data analysis helps understand business data better.
Analysis on Bike Rental Data to Predict Future UseKimberly Nguyen
Ìý
This document summarizes a student project analyzing bike rental data to build predictive models for casual and registered bike users. The students created separate linear regression models for casual and registered users. For casual users, they found the original model violated assumptions, so they took the response variable to the power of 0.4 and saw improved linearity and constant variance. Their best casual user model included variables for weather, season, month, and temperature. For registered users, they applied a mean shift which showed unbiasedness. The students' predictive models were better at predicting bike use away from holidays and extreme weather.
Using Gamification For Stimulating Safe And Good Driving BehaviorLucas Machado
Ìý
The document proposes a gamification system to stimulate safe and good driving behavior. The system would use sensors in vehicles to track driving data like speed, braking distance, and route taken. Drivers would earn points for good behavior like following traffic laws and driving efficiently. Points could be redeemed for discounts on taxes, fuel, and insurance. By making driving a game and rewarding safe habits, the system aims to improve road safety and reduce accidents and traffic congestion.
A Novel Feature Engineering Framework in Digital Advertising Platformijaia
Ìý
Digital advertising is growing massively all over the world, and, nowadays, is the best way to reach potential customers, where they spend the vast majority of their time on the Internet. While an advertisement is an announcement online about something such as a product or service, predicting the probability that a user do any action on the ads, is critical to many web applications. Due to over billions daily active users, and millions daily active advertisers, a typical model should provide predictions on billions events per day. So, the main challenge lies in the large design space to address issues of scale, where we need to rely on a subset of well-designed features. In this paper, we propose a novel feature engineering framework, specialized in feature selection using the efficient statistical approaches, which significantly outperform the state-of-the-art ones. To justify our claim, a large dataset of a running marketing campaign is used to evaluate the efficiency of the proposed approaches, where the results illustrate their benefits.
A Novel Feature Engineering Framework in Digital Advertising Platformgerogepatton
Ìý
Digital advertising is growing massively all over the world, and, nowadays, is the best way to reach potential customers, where they spend the vast majority of their time on the Internet. While an advertisement is an announcement online about something such as a product or service, predicting the probability that a user do any action on the ads, is critical to many web applications. Due to over billions daily active users, and millions daily active advertisers, a typical model should provide predictions on billions events per day. So, the main challenge lies in the large design space to address issues of scale, where we need to rely on a subset of well-designed features. In this paper, we propose a novel feature engineering framework, specialized in feature selection using the efficient statistical approaches, which significantly outperform the state-of-the-art ones. To justify our claim, a large dataset of a running marketing campaign is used to evaluate the efficiency of the proposed approaches, where the results illustrate their benefits.
Hybrid Genetic Algorithms and Simulated Annealing for Multi-trip Vehicle Rout...IJECEIAES
Ìý
Vehicle routing problem with time windows (VRPTW) is one of NP-hard problem. Multi-trip is approach to solve the VRPTW that looking trip scheduling for gets best result. Even though there are various algorithms for the problem, there is opportunity to improve the existing algorithms in order gaining a better result. In this research, genetic algoritm is hybridized with simulated annealing algoritm to solve the problem. Genetic algoritm is employed to explore global search area and simulated annealing is employed to exploit local search area. Four combination types of genetic algorithm and simulated annealing (GA-SA) are tested to get the best solution. The computational experiment shows that GA-SA1 and GA-SA4 can produced the most optimal fitness average values with each value was 1.0888 and 1.0887. However GA-SA4 can found the best fitness chromosome faster than GA-SA1.
Prediction of Used Car Prices using Machine Learning TechniquesIRJET Journal
Ìý
This document describes research on using machine learning techniques to predict used car prices. It discusses collecting a dataset from online sources and preprocessing the data, which includes features like initial price, mileage, fuel type, etc. Various regression algorithms are then applied to build models for predicting car prices, including linear regression, lasso regression, ridge regression, and ensemble methods like random forest and gradient boosting. The models are evaluated and the best performing techniques are identified. Finally, a web application is created to allow users to input vehicle details and receive predicted price values.
This document proposes reallocating the Georgia Tech Police Department's (GTPD) patrol zones based on analyzing past crime patterns to make the campus safer. The research team has analyzed four years of crime data using clustering algorithms and time series analysis. They found crime clusters, relationships between certain crimes, and were able to predict future crime locations and types. The current 4 patrol zones are inefficient as Zone 2 has noticeably more crimes. The team aims to strategically define new zones incorporating their findings to suggest more reasonable patrol routes and make crime occurrences more uniform across zones. Their goal is to increase patrol efficiency and encounter more criminals to improve campus safety. They will clean the data, analyze crime patterns, predict future crimes, optimize patrol zone
A new hybrid approach for solving travelling salesman problem using ordered c...eSAT Journals
Ìý
Abstract Travelling Salesman Problem is a well known NP problem. It is an optimization problem. Genetic Algorithms are the evolution techniques to solve optimization problems. In this paper a new hybrid technique using ordered cross over 1 (OX1) and greedy approach has been proposed. Experiment results shows that the proposed hybrid cross over is better than the existing cross over operator as the new operator provide a better path when executed for the same number of iterations. Keywords:- Travelling Salesman Problem, ordered cross over 1 (OX1)
The Optimizing Multiple Travelling Salesman Problem Using Genetic Algorithmijsrd.com
Ìý
The traveling salesman problem (TSP) supports the idea of a single salesperson traveling in a continuous trip visiting all n cities exactly once and returning to the starting point. The multiple traveling salesman problems (mTSP) is complex combinatorial optimization problem, which is a generalization of the well-known Travelling Salesman Problem (TSP), where one or more salesman can be used in the path. In this paper mTSP has also been studied and solved with GA in the form of the vehicle scheduling problem. The existing model is new models are compared to both mathematically and experimentally. This work presents a chromosome methodology setting up the MTSP to be solved using a GA.
PREDICTING ROAD ACCIDENT RISK USING GOOGLE MAPS IMAGES AND ACONVOLUTIONAL NEU...gerogepatton
Ìý
Location specific characteristics of a road segment such as road geometry as well as surrounding road features can contribute significantly to road accident risk. A Google Maps image of a road segment provides a comprehensive visual of its complex geometry and the surrounding features. This paper proposes a novel machine learning approach using Convolutional Neural Networks (CNN) to accident risk prediction by unlocking the precise interaction of these many small road features that work in combination to contribute to a greater accident risk. The model has worldwide applicability and a very low cost/time effort to implement for a new city since Google Maps are available in most places across the globe. It also significantly contributes to existing research on accident prevention by allowing for the inclusion of highly detailed road geometry to weigh in on the prediction as well as the new locationbased attributes like proximity to schools and businesses.
PREDICTING ROAD ACCIDENT RISK USING GOOGLE MAPS IMAGES AND ACONVOLUTIONAL NEU...ijaia
Ìý
Location specific characteristics of a road segment such as road geometry as well as surrounding road features can contribute significantly to road accident risk. A Google Maps image of a road segment provides a comprehensive visual of its complex geometry and the surrounding features. This paper proposes a novel machine learning approach using Convolutional Neural Networks (CNN) to accident risk prediction by unlocking the precise interaction of these many small road features that work in combination to contribute to a greater accident risk. The model has worldwide applicability and a very low cost/time effort to implement for a new city since Google Maps are available in most places across the globe. It also significantly contributes to existing research on accident prevention by allowing for the inclusion of highly detailed road geometry to weigh in on the prediction as well as the new locationbased attributes like proximity to schools and businesses.
Predicting Road Accident Risk Using Google Maps Images and A Convolutional Ne...gerogepatton
Ìý
This document describes a study that used convolutional neural networks and Google Maps images to predict road accident risk. The model was trained on past accident data and images of accident locations from cities like New York, Chicago and Austin. It achieved prediction accuracies of 85-86% on test data from those cities. The model provides a low-cost way to identify potentially risky road segments that is applicable worldwide since Google Maps coverage is extensive. It also considers detailed road geometry and nearby features that may contribute to accident risk, unlike some previous approaches.
Predicting Road Accident Risk Using Google Maps Images and A Convolutional Ne...gerogepatton
Ìý
Final_Report.docx (2)
1. Summer Research Program UNSW
Data-driven Approach to Demand Modelling in Transport
Professor: Chen Cai
Collaborator: Hoang Nguyen
Authors: Luiza Anselmo Olinto Pavao Xavier and Marinna Pereira Pivatto
20 February 2015
2. Abstract
We studied the travel demand forecast fundamentals so we could apply this
knowledge in provide GoGet improvement. Our work was based in analyse the
GoGet data to make it useful to improve their services. Finding the correct
mathematical model gave us the number of GoGet trips that a person with some
characteristics will do in a month. Using this the company can know how many cars
they should have available.
1. Problem of the research
Now a days, we can realise a lack in travel demand forecast, because they are
usually wrong and this could happen because the data used is not correct, or even
the way of calculation is not right.
However, travel demand is an important information for all transport
companies, because it is a way to estimate the number of trips that will be made in
an area at some future time point. It starts with the calculation of trip generation
that is the number of trips that will be made. This will be influenced by factors as:
number of cars, workers and number of households, for example.
Using this knowledge of travel forecasting we will apply it on GoGet analysis
to study the demand for it. GoGet is a car sharing service that begun in 2003 in
Australia.
The company needs to improve the calculation of how many spots should
have in each area. For this study we will use the data base collected from the
registration forms about their customers. So there are some data available but just
the data does not mean anything for the company. Therefore, the challenge will be
transform this data in information that will be useful for GoGet to improve their
service.
3. 2. Solution
In order to solve our problem, we used the trip generation formula that we extracted
from the document provided by Chen Cai. The document was an important method to
present all the definitions around Transport Area.
We select the example of a household-based model to calculate the number of trips
that one person with a specific job category can generate.
Y = 0.91 + 1.44X​1​+ 1.07X​2
Where Y is the number of trips per household
X​1​ is the number of workers per household
X​2 ​is the number of cars per household
This linear regression model assumes that there is a relationship between the
independent variables (workers and car ownership) and the dependent variable (trips per
household).
We adapted this model to our problem. Assuming Y as a trips per Job Category. So,
our input was:
X​1​= 1
X​2​= Average of car ownership in which Job Category
For example:
A director has an average of 1.2 cars, so the number of trips for this category will be:
Y = 0.91 + 1.44*1 + 1.07*1.2
Y = 3.66
One director will generate on average of 3.7 almost 4 trips per day.
Knowing that, we analysed the GoGet data and we did a relationship between them.
The data showed us how often directors (we will continue using Directors as an
example) use GoGet and also the quantity of trips made by them per month. Using this
inputs we were able to calculate the probability of directors to choose GoGet.
Y​total​= Y*F*30
Where Y​total​is the total monthly trips per category
Y is the number of trips per job category
F is the frequency (how often one job category use GoGet)
4. 30 is the total days per month
So the probability to choose GoGet is:
P (x) = Y / Y​total
By identifying the probability we were able to calculate how many trips one director
will make using GoGet.
G = Y*30*P(x)
Where G is the number of trips using GoGet
P(x) is the probability to choose GoGet
For example:
After that we should create a model to identify how many trips using GoGet a person
with some characteristics like age, income and car ownership will probably make in one
month. To solve this problem we start to analyse the relationship between our independent
variable (age, car ownership and income) with our dependent variable (Trips using GoGet).
The analyse ended up with a multinomial logistic regression model because our dependent
variable has a limited number of possible values.
5. 3. Results
Using the code provide by MatLab for multinomial logistic regression:
X = [Avgage CarOwnership IncoDay];
prob = ordinal(Y,{'1','2','3','4'},[],[0 1 2 3 4]);
[B,dev,stats] = mnrfit(X,prob,'model','ordinal','Interactions','on')
i = 1;
x = exp(B(1,1) + B(2,1)*Avgage(i) + B(3,1)* CarOwnership(i) + B(4,1)*IncoDay(i))
y = exp(B(1,2) + B(2,2)*Avgage(i) + B(3,2)* CarOwnership(i) + B(4,2)*IncoDay(i))
z = exp(B(1,3) + B(2,3)*Avgage(i) + B(3,3)* CarOwnership(i) + B(4,3)*IncoDay(i))
We found the following results:
B =
12.8314 10.3285 14.3284
-0.1505 -0.1821 -0.0366
-17.6728 -4.7472 -19.1064
0.0237 0.0045 0.0215
x =
0.1091
y =
0.2579
z =
4.5673
It means that the three equations of our model will be:
The code is an example of how many trips a director can make in one month.
Analysing the results for ​x​, ​y and ​z ​we can conclude that this category will travel using GoGet
more than 3 times per month because ​x​, ​y ​are less than 1. Also, we can analyse that our
result is reliable using the ​t​and ​p​statistical methods, as we show above.
>> stats.t
ans =
3.1928 3.0076 2.4068
-1.6417 -2.4583 -0.3832
-3.2159 -0.9729 -2.0562
2.8027 0.6066 1.7588
>> stats.p
ans =
0.0014 0.0026 0.0161
6. 0.1007 0.0140 0.7016
0.0013 0.3306 0.0398
0.0051 0.5441 0.0786
To help the interpretation of the results, we made a program using Visual Basic.
The code:
The dashboard to input the independent variables and check the results:
7. Our results can ended up with four options for the number of trips made by month.
The analysed category can make ​zero, one, two or more than thre​e trips. The result depends
on age, income per day and car ownership. The range of results we stipulated by analysing
the data that we had. The maximum trips were four and the minimum were zero. We did
not use four as our maximum value because in the data just a few cases ended up with four
trips. In the future, the data can change and maybe more than five trips can be made, so to
refine the model is necessary increase the number of boundaries in the MatLab code.
4. Implications
The results of the model will be important to improve GoGet services. Knowing how
many trips should a person with some characteristics (age, income per day and car
ownership) use in a month, it will be easy to calculate the demand and also the number of
spots that will be necessary in some area.
Furthermore, this results could also be improved to analyse the data of one specific
zone, so they can calculate the number of trips in each zone and the opportunity of apply
GoGet in new regions. So, the model will provide important demand information for the
company and this will be able to manage it to better serve customers.