Professional Documents
Culture Documents
To cite this article: Muhammad Burhan Khalid, Muhammad Haris, Qudsia Gulzar, Muhammad
Hamid Ch & Nida Samad (2017): Spatio-temporal Network-Constrained Trajectory Data Model and
Service Reliability Assessment, IETE Journal of Research, DOI: 10.1080/03772063.2017.1369372
ABSTRACT KEYWORDS
Due to urbanization and increase in traffic density, provision of efficient public transport facility is a Data cleaning; Map
major component of planning and development. The spatio-temporal data of vehicles’ movement matching; Mobility data;
have mainly led to mobility pattern analysis, which eventually reveal new information about the Spatial entities; Trajectory
efficiency of transport service. This research focuses on extracting mobility patterns of bus service
for students in Lahore City. Trajectory data collected through GPS survey have been processed by
using MYSQL spatial database applications and HTML to identify areas in entire route of bus service
causing delay. Three different types of delay have been calculated, namely stop delay, simple path
delay, and path repeated delay. The entire analysis results reveal that mainly the delay is caused at
start segments of planned route or the last segment of stop route. The major cause of delay is bus
movement on tertiary roads through congested commercial areas.
network-constrained trajectory data along a route of bus preparation step, a line shapefile was created by digitizing
service in Lahore city provided for university students to the selected route in ArcGIS software, and a point shape-
move from one campus to another campus. The total file of 11 stops was created from Arc Blue images. The
length of trajectory is 12.8 km, planned for 30 minutes coordinates of running bus were collected by using
and partitioned by 11 stops to provide on-the-way pick android-based GPS tracker application after every two sec-
and drop facility to students. Trajectory data collected six onds and sent it to online server with timestamp (Figure 2
times through GPS survey identify time delay in service (a)).
by using GIS analysis techniques. The main advantage of
GIS, the possibility of integrating spatial and alphanu- After the collection of trajectory data was data modelling
meric data, has made it widely applicable to a variety of stage. GISs can be integrated successfully with modern
fields. GIS allows new processing methods to be used and database management systems. Database management is
provides high-quality presentation of the processed data. described as linking topology data and attributes to geo-
These characteristics make it an unavoidable decision- graphical elements (a point, line, polygon) [9]. In the
making tool in a situation when data relevant to a deci- next step to define entities, their attributes and relation-
sion include a spatial component [9]. ship between entities conceptual, logical, and physical
data models were developed in visual paradigm. Visual
To analyse the bus service efficiency spatial and non- paradigm supports a number of database types such as
spatial data is processed by using Arc Map for spatial MySQL, MS SQL Server, Oracle, PostgreSQL (www.
data development, Visual paradigm for database model- visual-paradigm.com). Through this tool, conceptual,
ling, MYSQL for database implementation, php for data logical, and physical schemas are designed and reverse it
manipulation and analysis and QGIS for visualization of into MySQL database type as given in Figure 1.
data during processing and display of analysis results.
Afterwards, route shapefile and stop shapefile were con-
The GIS uses complex structures of data in order to be verted into database format for further calculations and
able to describe the physical objects and their interrela- analysis. ESRI shapefile is being given as an input data-
tions [10]. In the entire research project, three different set in Shp2mysql program and outputs SQL command
spatial components have been prepared, namely bus route files which is used in the MySQL spatial database. This
from origin to destination, stops on the route, and GPS application uses functionality from shapelib 1.2.9 by
survey points from moving buses. As the first data Frank Warmerdam to read from ESRI Shape files
M. B. KHALID ET AL.: SPATIO-TEMPORAL NETWORK-CONSTRAINED TRAJECTORY DATA MODEL AND SERVICE RELIABILITY ASSESSMENT 3
(http://shapelib.maptools.org). While live-info table was In the first step of data cleaning, clusters were generalized
populated by survey points collected through GPS. from raw trajectory. Cluster reduction reduces the sys-
tematic noise in the data. As trajectory data of moving
The next important step was reconstruction of raw tra- object are collected through tracking devices, due to its
jectory data by using trajectory generalization (data inherent errors, it cannot be used directly for analysis.
cleaning) and map-matching techniques. GPS systems Even if the vehicle is not moving or moving slowly due
are routinely used for navigation and geodetic measure- to traffic congestion, the tracking device records a minor
ments. The results from the measurements in real time deviation in true position resulting in clusters of points at
are not needed, and therefore, a post-processing proce- the same location. Due to their unpredictability and falsi-
dure could be applied to improve the accuracy of the col- fying characteristics, the GPS point clusters could be
lected measurements [11]. extremely misleading to map-matching procedures and
are the greatest cause of overshoots and mismatch [13].
The GPS device may generate (infrequent) errors In order to remove clusters, two different filters were gen-
due to the interferences with the surroundings erated by using php commands. The first filter removed
such as trees and built-ups in a non-obvious way the points having one-second time interval (Figure 2(b)),
[12]. Therefore, raw trajectory data always are whereas the second filter removed all points lying within
cleaned through different techniques and recon- 20 m radius of each other (Figure 2(c)). The resultant
structed trajectory is used for analysis. The pur- points were saved in the database along their id, longi-
pose of trajectory reconstruction and partition is tude, latitude, time, and calculated time.
to extract all characteristic points and get rid of
useless location data. It can reduce the storage, Whereas the analysis of raw trajectory revealed that sur-
usage, and improve the efficiency for further simi- vey points were not completely lying on the route line
larity measurement [1]. and it needed to be aligned with road network. The
alignment process of GPS Survey points with road net- To calculate stops delay, 200-m radius buffer was gener-
work is called map matching. Map matching is a funda- ated on all 11 stops on the route in database. Time dif-
mental pre-processing step for many trajectory-based ference of all survey points lying within buffer zone was
applications, such as moving object management, traffic accumulated and subtracted from a threshold value of
flow analysis, and driving directions [14]. So, to over- 15 seconds (the proposed time for each stop to pick and
come this problem, snapping algorithm is designed that drop). The resultant time was considered to be extra
helps to snap all points on the line. In this algorithm, time spent on each stop. As an analysis of six trajectories
line slope was calculated for route data. Then the dis- stops, “B”, “F”, and “C” are more critical. On stop “B”,
tance of each survey points was calculated from line the average delay is 29 seconds; on stop “F”, the average
nodes and the survey point was snapped to line node delay is 26 seconds; and on stop “C”, the average delay is
having minimum distance between 0 and 1 (Figure 2 23 seconds. As both stops “B” and “C” lie within the
(d)). As a final output, we have a sample point table with campus apart from the main road, but due to more
sample point ID, latitude (X), longitude (Y), time, dis- number of students, delay in bus departure is caused.
tance between successive points, and time difference While on Stop “F”, the number of lanes is reduced due
between successive points (Figure 1). to road turnings and fly overs. As a result, bottleneck is
being created in this area resulting in traffic problems.
repeated maximum value of time difference to identify 5. D. Guo, X. Zhu, H. Jin, P. Gao, and C. Andris, “Discover-
route segments with repeated delay time. The most criti- ing spatial patterns in origin-destination mobility data,”
cal route segments identified was between stops “J” and Transp. GIS., Vol. ED-16, pp. 411–29, Jun. 2012.
“K” with an average time delay of 68.6 seconds. The 6. G. Draijer, N. Kalfs, and J. Perdok, “Global position-
other one was between stops “C” and “D” with an aver- ing system as data collection method for travel
age time delay of 20.4 seconds. While both segments are research,” J. Transp. Res. Board, Vol. ED-1719,
main commercial areas along the route. The results of all pp. 147–53, 2000.
three types of delay in terms of critical segments and
7. K. Hashimoto, “Monitoring road traffic congestion in
stops have been shown in Figure 4.
Japan,” Transp. Rev., Vol. ED-10, pp. 171–86, Apr.
1990.
After delay analysis, an alternate route was identified to
avoid the most critical segment between stops “J” and 8. J. Zhou and R. Golledge, “A three-step map matching
“K” as shown in Figure 4. As the alternate route is on method in the GIS environment,” Int. J. Geogr. Inform.
the main one-way road, which may be to further facili- Syst., Vol. ED-X, Oct. 2006.
tate extra commuters along efficient time management. 9. M. Modsching, R. Kramer, and K. T. Hagen, “Field trial
on GPS accuracy in a medium size city: The influence of
built-up,” in 3rd Workshop on Positioning, Navigation
4. CONCLUSION and Communication, Hannover, Germany, Mar. 2006.
This research aims at using spatio-temporal dynamic
10. N. Trubint, “A GIS application to explore postal retail out-
data, data modelling techniques, certain data processing
let locations,” Promet – Traffic Transp., Vol. ED-24,
algorithms of data cleaning, map matching, and analysis pp. 161–8, 2012.
for efficiency assessment of transport service in urban
areas. The results calculate the average as well as the total 11. N. V. Oort, J. W. Boterman, and R. V. Nes, “The impact of
delay in bus reaching from the origin to the destination, scheduling on service reliability: trip-time determination
and identification of areas causing maximum delay. and holding points in long-headway services,” Public
Transp., Vol. ED-4, pp. 39–56, Jun. 2012.
Analysis reveals practical valuable results for managers 12. N. V. Oort, “Service reliability and urban public transport
and decision-makers in order to reduce the extra time design,” PhD thesis, Delft University of Technology, Delft,
spent in a trip by changing route. An alternate route Netherlands, 2011. ISBN: 9789055841424.
with an extra mileage of 396 m has been identified to
13. T. Apeltauer, J. Macur, P. Holcner, and M. Radimsky,
avoid the maximum delay area and assure the timely
“Validation of microscopic traffic models based on
availability of transport service for commuters. GPS precise measurement of vehicle dynamics”
Promet – Traffic Transp., Vol. ED-25, pp. 157–67,
Mar. 2013.
DISCLOSURE STATEMENT
No potential conflict of interest was reported by the authors. 14. J. Wolf, “Using GPS data loggers to replace travel diaries
in the collection of travel data,” Thesis (PhD), Georgia
Institute of Technology, Atlanta, GA, 2000.
REFERENCES
15. Y. Lou, X. Xie, C. Zhang, W. Wang, Y. Zheng, and Y.
1. S. Bricka and C. Bhat, “Comparative analysis of global Huang, “Map-matching for low-sampling-rate GPS trajec-
positioning system-based and travel survey-based data,” J. tories,” ACM GIS, Nov., 2009. ISBN 978-1-60558-649-6/
Transp. Res. Board, Vol. ED-1972, pp. 9–20, Jan. 2006. 09/11
2. D. Eurepiæ and Z. Tos, “Technical documentation in traf- 16. Y. Xia, G. Wang, X. Zhang, G. Kim, and H. Bae, “Spatio-
fic technology,” Promet – Traffic Transp., Vol. ED-21, temporal similarity measure for network constrained tra-
pp. 359–65, Sep. 2009. jectory data,” Int. J. Comput. Intell. Syst. Vol. ED-4,
pp. 1070–9, Sep. 2011.
3. D. Patel, “Incorporating duration and region association
information in trajectory classification,” J. Location Based 17. J. F. Zumberge, M. B. Heflin, D. C. Jefferson, M. M. Wat-
Ser., Vol. ED-7, pp. 246–71, Sep. 2013. kins, and F. H. Webb, “Precise point positioning for the
efficient and robust analysis of GPS data from large net-
4. D. Guo, S. Liu, and H. Jin, “A graph-based approach to works,” J. Geophys. Res., Vol. ED-102, pp. 5005–17, Mar.
vehicle trajectory analysis,” J. Location Based Ser., 1997.
Vol. ED-4, pp. 183–99, Dec. 2010.
6 M. B. KHALID ET AL.: SPATIO-TEMPORAL NETWORK-CONSTRAINED TRAJECTORY DATA MODEL AND SERVICE RELIABILITY ASSESSMENT
Authors
Muhammad Burhan Khalid received the Muhammad Hamid Ch received the master’s degree in
MSc degree in remote sensing and GIS geography from Punjab University in 2000 and MSc degree
from Punjab University College of Infor- in remote sensing and GIS from Punjab University College
mation Technology in 2012. He com- of Information Technology in 2003. He completed his
pleted his MPhil RSGIS in 2015 from MPhil RSGIS in 2012 from Punjab University College of
Punjab University College of Informa- Information Technology. At present, he is working as a lec-
tion Technology. At present, he is work- turer in GIS Centre, University of the Punjab, Lahore, Paki-
ing as a software developer in Sw3 stan. His main interests are digital image processing,
Solution. He is also teaching Web Engi- remote–sensing, and spatial data visualization techniques.
neering module as a visiting faculty member in Punjab Univer-
sity College of Information Technology. His main interests are E-mail: hamid.ch@pucit.edu.pk
web-based GIS data development and spatial data modelling.
Nida Samad received the MSc degree in
E-mail: Burhan.gis@gmail.com remote sensing and GIS from Punjab
University College of Information Tech-
Muhammad Haris received his BS nology in 2008. She completed his MPhil
(hons) degree in computer science from RSGIS in 2013 from Punjab University
Punjab University College of Informa- College of Information Technology. At
tion Technology in 2005, MSc degree in present, she is working as a lecturer in
remote sensing and GIS from Punjab GIS Centre, University of the Punjab,
University College of Information Tech- Lahore, Pakistan. Her main interests are
nology in 2009. He completed his MPhil spatial data modelling, database development, and
RSGIS in 2012 from Punjab University geostatistics.
College of Information Technology. At
present, he is working as a lecturer in GIS Centre, University E-mail: nida.samad@pucit.edu.pk
of the Punjab, Lahore Pakistan. His main interests are web-
based Web GIS, spatial data modelling, and social media
networking.
E-mail: haris@pucit.edu.pk
E-mail: qudsia@pucit.edu.pk