You are on page 1of 7

IETE Journal of Research

ISSN: 0377-2063 (Print) 0974-780X (Online) Journal homepage: http://www.tandfonline.com/loi/tijr20

Spatio-temporal Network-Constrained Trajectory


Data Model and Service Reliability Assessment

Muhammad Burhan Khalid, Muhammad Haris, Qudsia Gulzar, Muhammad


Hamid Ch & Nida Samad

To cite this article: Muhammad Burhan Khalid, Muhammad Haris, Qudsia Gulzar, Muhammad
Hamid Ch & Nida Samad (2017): Spatio-temporal Network-Constrained Trajectory Data Model and
Service Reliability Assessment, IETE Journal of Research, DOI: 10.1080/03772063.2017.1369372

To link to this article: https://doi.org/10.1080/03772063.2017.1369372

Published online: 21 Sep 2017.

Submit your article to this journal

Article views: 116

View related articles

View Crossmark data

Full Terms & Conditions of access and use can be found at


http://www.tandfonline.com/action/journalInformation?journalCode=tijr20
IETE JOURNAL OF RESEARCH, 2017
https://doi.org/10.1080/03772063.2017.1369372

Spatio-temporal Network-Constrained Trajectory Data Model and Service


Reliability Assessment
Muhammad Burhan Khalid1, Muhammad Haris2, Qudsia Gulzar2, Muhammad Hamid Ch2 and Nida Samad1,2
1
Sw3 Solutions, Lahore, Pakistan; 2GIS Centre, University of the Punjab, Lahore, Pakistan

ABSTRACT KEYWORDS
Due to urbanization and increase in traffic density, provision of efficient public transport facility is a Data cleaning; Map
major component of planning and development. The spatio-temporal data of vehicles’ movement matching; Mobility data;
have mainly led to mobility pattern analysis, which eventually reveal new information about the Spatial entities; Trajectory
efficiency of transport service. This research focuses on extracting mobility patterns of bus service
for students in Lahore City. Trajectory data collected through GPS survey have been processed by
using MYSQL spatial database applications and HTML to identify areas in entire route of bus service
causing delay. Three different types of delay have been calculated, namely stop delay, simple path
delay, and path repeated delay. The entire analysis results reveal that mainly the delay is caused at
start segments of planned route or the last segment of stop route. The major cause of delay is bus
movement on tertiary roads through congested commercial areas.

1. INTRODUCTION network-constrained trajectories in which objects move


on a predefined path and are partitioned on the bases of
With rapid and unplanned growth of cities, traffic vol-
stops. With advances in location-aware technologies
ume and density have increased manifold resulting in
such as Global Positioning System (GPS), radio-
traffic congestion and jams. As a result, efficient route
frequency identification (RFID), and autonomous
planning is the most important research domain in
agents, we can track moving objects with increasingly
order to enhance transportation system efficiency in
finer spatial resolutions for longer periods of time. These
terms of time, fuel, and cost management. Such manage-
have led to the accumulation of large repositories of tra-
ment system needs continuous mobility data of moving
jectories in many real-world applications [4].
vehicles. Recent advances in wireless communication,
sensor network, and location determination technologies
GPS is a Global Navigation Satellite System for location
make it possible to gather continuous spatio-temporal
determination. From the last three decades, GPS has been
data which represent positions and movements of mov-
widely used for trajectory data collection. The GPS has
ing objects [1].
emerged for civilian use in the 1990s as the space geodetic
technique being accurate and affordable [5]. Trajectory
A mobility data-set normally consists of a set of moving
data acquired with the help of GPS are X and Y locations
objects (e.g. people, cars, or animals) and their trajecto-
in sequence with time stamp. As these data are in digital
ries over space and time. Mobility data analysis is of
format, it can be directly handled in GIS applications.
great importance to advance our understanding of com-
There are many advantages: under-reporting of trips is
plex space–time dynamics in a variety of domains,
less likely, the data are immediately available in a digital
including urban planning, transportation, demography,
form and can be analysed in a Geographical Information
and emergency management. There are great challenges,
System (GIS) environment, and, in general, more data are
however, for the analysis of mobility data due to their
available at a finer level of resolution [6–8].
massive data volume and complexity. For example, a
data-set may have millions of points/movements and
unknown structures/patterns over space and time [2].
2. DATA PROCESSING
A trajectory is a sequence of sampled locations and time Lahore is the second largest metropolitan area of Pakistan
stamps along the route of a moving object [3]. Trajecto- and the largest metropolitan area in Punjab province.
ries are normally of two types: the first type is of trajecto- Due to rapid urbanization, increased traffic density results
ries in which objects are moving freely, while the other is in travel time loss. This research aims at developing a
© 2017 IETE
2 M. B. KHALID ET AL.: SPATIO-TEMPORAL NETWORK-CONSTRAINED TRAJECTORY DATA MODEL AND SERVICE RELIABILITY ASSESSMENT

Figure 1: Logical schema

network-constrained trajectory data along a route of bus preparation step, a line shapefile was created by digitizing
service in Lahore city provided for university students to the selected route in ArcGIS software, and a point shape-
move from one campus to another campus. The total file of 11 stops was created from Arc Blue images. The
length of trajectory is 12.8 km, planned for 30 minutes coordinates of running bus were collected by using
and partitioned by 11 stops to provide on-the-way pick android-based GPS tracker application after every two sec-
and drop facility to students. Trajectory data collected six onds and sent it to online server with timestamp (Figure 2
times through GPS survey identify time delay in service (a)).
by using GIS analysis techniques. The main advantage of
GIS, the possibility of integrating spatial and alphanu- After the collection of trajectory data was data modelling
meric data, has made it widely applicable to a variety of stage. GISs can be integrated successfully with modern
fields. GIS allows new processing methods to be used and database management systems. Database management is
provides high-quality presentation of the processed data. described as linking topology data and attributes to geo-
These characteristics make it an unavoidable decision- graphical elements (a point, line, polygon) [9]. In the
making tool in a situation when data relevant to a deci- next step to define entities, their attributes and relation-
sion include a spatial component [9]. ship between entities conceptual, logical, and physical
data models were developed in visual paradigm. Visual
To analyse the bus service efficiency spatial and non- paradigm supports a number of database types such as
spatial data is processed by using Arc Map for spatial MySQL, MS SQL Server, Oracle, PostgreSQL (www.
data development, Visual paradigm for database model- visual-paradigm.com). Through this tool, conceptual,
ling, MYSQL for database implementation, php for data logical, and physical schemas are designed and reverse it
manipulation and analysis and QGIS for visualization of into MySQL database type as given in Figure 1.
data during processing and display of analysis results.
Afterwards, route shapefile and stop shapefile were con-
The GIS uses complex structures of data in order to be verted into database format for further calculations and
able to describe the physical objects and their interrela- analysis. ESRI shapefile is being given as an input data-
tions [10]. In the entire research project, three different set in Shp2mysql program and outputs SQL command
spatial components have been prepared, namely bus route files which is used in the MySQL spatial database. This
from origin to destination, stops on the route, and GPS application uses functionality from shapelib 1.2.9 by
survey points from moving buses. As the first data Frank Warmerdam to read from ESRI Shape files
M. B. KHALID ET AL.: SPATIO-TEMPORAL NETWORK-CONSTRAINED TRAJECTORY DATA MODEL AND SERVICE RELIABILITY ASSESSMENT 3

Figure 2: Raw and processed trajectories

(http://shapelib.maptools.org). While live-info table was In the first step of data cleaning, clusters were generalized
populated by survey points collected through GPS. from raw trajectory. Cluster reduction reduces the sys-
tematic noise in the data. As trajectory data of moving
The next important step was reconstruction of raw tra- object are collected through tracking devices, due to its
jectory data by using trajectory generalization (data inherent errors, it cannot be used directly for analysis.
cleaning) and map-matching techniques. GPS systems Even if the vehicle is not moving or moving slowly due
are routinely used for navigation and geodetic measure- to traffic congestion, the tracking device records a minor
ments. The results from the measurements in real time deviation in true position resulting in clusters of points at
are not needed, and therefore, a post-processing proce- the same location. Due to their unpredictability and falsi-
dure could be applied to improve the accuracy of the col- fying characteristics, the GPS point clusters could be
lected measurements [11]. extremely misleading to map-matching procedures and
are the greatest cause of overshoots and mismatch [13].
The GPS device may generate (infrequent) errors In order to remove clusters, two different filters were gen-
due to the interferences with the surroundings erated by using php commands. The first filter removed
such as trees and built-ups in a non-obvious way the points having one-second time interval (Figure 2(b)),
[12]. Therefore, raw trajectory data always are whereas the second filter removed all points lying within
cleaned through different techniques and recon- 20 m radius of each other (Figure 2(c)). The resultant
structed trajectory is used for analysis. The pur- points were saved in the database along their id, longi-
pose of trajectory reconstruction and partition is tude, latitude, time, and calculated time.
to extract all characteristic points and get rid of
useless location data. It can reduce the storage, Whereas the analysis of raw trajectory revealed that sur-
usage, and improve the efficiency for further simi- vey points were not completely lying on the route line
larity measurement [1]. and it needed to be aligned with road network. The

Figure 3: Processed trajectories and bus schedule


4 M. B. KHALID ET AL.: SPATIO-TEMPORAL NETWORK-CONSTRAINED TRAJECTORY DATA MODEL AND SERVICE RELIABILITY ASSESSMENT

Figure 4: Spatial data and efficiency assessment

alignment process of GPS Survey points with road net- To calculate stops delay, 200-m radius buffer was gener-
work is called map matching. Map matching is a funda- ated on all 11 stops on the route in database. Time dif-
mental pre-processing step for many trajectory-based ference of all survey points lying within buffer zone was
applications, such as moving object management, traffic accumulated and subtracted from a threshold value of
flow analysis, and driving directions [14]. So, to over- 15 seconds (the proposed time for each stop to pick and
come this problem, snapping algorithm is designed that drop). The resultant time was considered to be extra
helps to snap all points on the line. In this algorithm, time spent on each stop. As an analysis of six trajectories
line slope was calculated for route data. Then the dis- stops, “B”, “F”, and “C” are more critical. On stop “B”,
tance of each survey points was calculated from line the average delay is 29 seconds; on stop “F”, the average
nodes and the survey point was snapped to line node delay is 26 seconds; and on stop “C”, the average delay is
having minimum distance between 0 and 1 (Figure 2 23 seconds. As both stops “B” and “C” lie within the
(d)). As a final output, we have a sample point table with campus apart from the main road, but due to more
sample point ID, latitude (X), longitude (Y), time, dis- number of students, delay in bus departure is caused.
tance between successive points, and time difference While on Stop “F”, the number of lanes is reduced due
between successive points (Figure 1). to road turnings and fly overs. As a result, bottleneck is
being created in this area resulting in traffic problems.

Second, the PSD was calculated on the basis of GPS sur-


3. ANALYSIS
vey points. Time difference between all successive survey
After complete data processing, it was ready to calculate points was calculated and line segments between those
the delay time to analyse the service efficiency. The qual- points were selected, where the time difference was
ity of public transport is determined by many aspects, greater than five seconds. The most critical line segment
such as availability, comfort, travel time, and cost. Reli- was between stops “J” and “K”, where the average delay
ability of travel time has become increasingly important is 141 seconds during the week. It is mainly due to the
over the last decade [15,16]. Nowadays, in the majority leaving of main road and entering in a narrow tertiary
of the cities, chronic traffic jam happens and traffic con- road of a congested commercial area. The secondmost
gestions lose billions of hours and money [7,17]. Figure 3 critical line segment is between stops “C”and “D” with
shows the plot of all six trajectories against the scheduled an average delay of 87 seconds.
trajectory. It shows that each trajectory has a different
temporal behaviour with respect to scheduled time, and The third type of delay calculated was PRD. In PRD, sur-
overall, there is a delay in bus service ever. vey points with more than three times repeated maxi-
mum value of time difference between successive survey
To evaluate the efficiency of route, three types of delay points were selected. On each point, a buffer of 100 m
are calculated, namely stops delay, path repeated delay was generated. Time difference of survey points lying
(PRD), and path simple delay (PSD). within the buffer zone was accumulated with the most
M. B. KHALID ET AL.: SPATIO-TEMPORAL NETWORK-CONSTRAINED TRAJECTORY DATA MODEL AND SERVICE RELIABILITY ASSESSMENT 5

repeated maximum value of time difference to identify 5. D. Guo, X. Zhu, H. Jin, P. Gao, and C. Andris, “Discover-
route segments with repeated delay time. The most criti- ing spatial patterns in origin-destination mobility data,”
cal route segments identified was between stops “J” and Transp. GIS., Vol. ED-16, pp. 411–29, Jun. 2012.
“K” with an average time delay of 68.6 seconds. The 6. G. Draijer, N. Kalfs, and J. Perdok, “Global position-
other one was between stops “C” and “D” with an aver- ing system as data collection method for travel
age time delay of 20.4 seconds. While both segments are research,” J. Transp. Res. Board, Vol. ED-1719,
main commercial areas along the route. The results of all pp. 147–53, 2000.
three types of delay in terms of critical segments and
7. K. Hashimoto, “Monitoring road traffic congestion in
stops have been shown in Figure 4.
Japan,” Transp. Rev., Vol. ED-10, pp. 171–86, Apr.
1990.
After delay analysis, an alternate route was identified to
avoid the most critical segment between stops “J” and 8. J. Zhou and R. Golledge, “A three-step map matching
“K” as shown in Figure 4. As the alternate route is on method in the GIS environment,” Int. J. Geogr. Inform.
the main one-way road, which may be to further facili- Syst., Vol. ED-X, Oct. 2006.
tate extra commuters along efficient time management. 9. M. Modsching, R. Kramer, and K. T. Hagen, “Field trial
on GPS accuracy in a medium size city: The influence of
built-up,” in 3rd Workshop on Positioning, Navigation
4. CONCLUSION and Communication, Hannover, Germany, Mar. 2006.
This research aims at using spatio-temporal dynamic
10. N. Trubint, “A GIS application to explore postal retail out-
data, data modelling techniques, certain data processing
let locations,” Promet – Traffic Transp., Vol. ED-24,
algorithms of data cleaning, map matching, and analysis pp. 161–8, 2012.
for efficiency assessment of transport service in urban
areas. The results calculate the average as well as the total 11. N. V. Oort, J. W. Boterman, and R. V. Nes, “The impact of
delay in bus reaching from the origin to the destination, scheduling on service reliability: trip-time determination
and identification of areas causing maximum delay. and holding points in long-headway services,” Public
Transp., Vol. ED-4, pp. 39–56, Jun. 2012.

Analysis reveals practical valuable results for managers 12. N. V. Oort, “Service reliability and urban public transport
and decision-makers in order to reduce the extra time design,” PhD thesis, Delft University of Technology, Delft,
spent in a trip by changing route. An alternate route Netherlands, 2011. ISBN: 9789055841424.
with an extra mileage of 396 m has been identified to
13. T. Apeltauer, J. Macur, P. Holcner, and M. Radimsky,
avoid the maximum delay area and assure the timely
“Validation of microscopic traffic models based on
availability of transport service for commuters. GPS precise measurement of vehicle dynamics”
Promet – Traffic Transp., Vol. ED-25, pp. 157–67,
Mar. 2013.
DISCLOSURE STATEMENT
No potential conflict of interest was reported by the authors. 14. J. Wolf, “Using GPS data loggers to replace travel diaries
in the collection of travel data,” Thesis (PhD), Georgia
Institute of Technology, Atlanta, GA, 2000.
REFERENCES
15. Y. Lou, X. Xie, C. Zhang, W. Wang, Y. Zheng, and Y.
1. S. Bricka and C. Bhat, “Comparative analysis of global Huang, “Map-matching for low-sampling-rate GPS trajec-
positioning system-based and travel survey-based data,” J. tories,” ACM GIS, Nov., 2009. ISBN 978-1-60558-649-6/
Transp. Res. Board, Vol. ED-1972, pp. 9–20, Jan. 2006. 09/11

2. D. Eurepiæ and Z. Tos, “Technical documentation in traf- 16. Y. Xia, G. Wang, X. Zhang, G. Kim, and H. Bae, “Spatio-
fic technology,” Promet – Traffic Transp., Vol. ED-21, temporal similarity measure for network constrained tra-
pp. 359–65, Sep. 2009. jectory data,” Int. J. Comput. Intell. Syst. Vol. ED-4,
pp. 1070–9, Sep. 2011.
3. D. Patel, “Incorporating duration and region association
information in trajectory classification,” J. Location Based 17. J. F. Zumberge, M. B. Heflin, D. C. Jefferson, M. M. Wat-
Ser., Vol. ED-7, pp. 246–71, Sep. 2013. kins, and F. H. Webb, “Precise point positioning for the
efficient and robust analysis of GPS data from large net-
4. D. Guo, S. Liu, and H. Jin, “A graph-based approach to works,” J. Geophys. Res., Vol. ED-102, pp. 5005–17, Mar.
vehicle trajectory analysis,” J. Location Based Ser., 1997.
Vol. ED-4, pp. 183–99, Dec. 2010.
6 M. B. KHALID ET AL.: SPATIO-TEMPORAL NETWORK-CONSTRAINED TRAJECTORY DATA MODEL AND SERVICE RELIABILITY ASSESSMENT

Authors
Muhammad Burhan Khalid received the Muhammad Hamid Ch received the master’s degree in
MSc degree in remote sensing and GIS geography from Punjab University in 2000 and MSc degree
from Punjab University College of Infor- in remote sensing and GIS from Punjab University College
mation Technology in 2012. He com- of Information Technology in 2003. He completed his
pleted his MPhil RSGIS in 2015 from MPhil RSGIS in 2012 from Punjab University College of
Punjab University College of Informa- Information Technology. At present, he is working as a lec-
tion Technology. At present, he is work- turer in GIS Centre, University of the Punjab, Lahore, Paki-
ing as a software developer in Sw3 stan. His main interests are digital image processing,
Solution. He is also teaching Web Engi- remote–sensing, and spatial data visualization techniques.
neering module as a visiting faculty member in Punjab Univer-
sity College of Information Technology. His main interests are E-mail: hamid.ch@pucit.edu.pk
web-based GIS data development and spatial data modelling.
Nida Samad received the MSc degree in
E-mail: Burhan.gis@gmail.com remote sensing and GIS from Punjab
University College of Information Tech-
Muhammad Haris received his BS nology in 2008. She completed his MPhil
(hons) degree in computer science from RSGIS in 2013 from Punjab University
Punjab University College of Informa- College of Information Technology. At
tion Technology in 2005, MSc degree in present, she is working as a lecturer in
remote sensing and GIS from Punjab GIS Centre, University of the Punjab,
University College of Information Tech- Lahore, Pakistan. Her main interests are
nology in 2009. He completed his MPhil spatial data modelling, database development, and
RSGIS in 2012 from Punjab University geostatistics.
College of Information Technology. At
present, he is working as a lecturer in GIS Centre, University E-mail: nida.samad@pucit.edu.pk
of the Punjab, Lahore Pakistan. His main interests are web-
based Web GIS, spatial data modelling, and social media
networking.

E-mail: haris@pucit.edu.pk

Qudsia Gulzar received the master’s


degree in geography from Punjab Uni-
versity in 1996, and MSc degree in
remote sensing and GIS from Punjab
University College of Information Tech-
nology in 2003. She completed her MPhil
RSGIS in 2012 from Punjab University
College of Information Technology. At
present, she is working as a lecturer in
GIS Centre, University of the Punjab, Lahore, Pakistan. Her
main interests are GIS applications in urban planning, spatial
data modelling, and spatial data visualization techniques.

E-mail: qudsia@pucit.edu.pk

You might also like