You are on page 1of 41

GEOSTATISTICAL

ANALYSIS OF GLENDALE
CAMPUS HUMIDITY
GISC9318 DELIVERABLE 4B

Tetiana Yurchenko
tatianayu20@gmail.com
Mireya Mosquera Patino
mmosquerapatino@gmail.com
614 Lake Street, St. Catharines, ON, L2N 6P6
(289) 969 6448 | mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B
Ian Smith
GISC9318 Course Instructor
Niagara College
135 Taylor Road,
Niagara-on-the-Lake, ON
L0S 1J0

Dear Mr. Smith,

RE: GISC9318 Deliverable 4B Geostatistical Analysis of Glendale Campus Humidity

Please accept this document as an official submission for Deliverable 4B: Geostatistical Analysis
of Glendale Campus Humidity.
This assignment describes the procedures undertaken in order to accomplish the main deliverable,
which is to create a predicted surface of a previously collected humidity data in the vicinity of
Glendale Campus using ArcGIS Geostatistical Analyst tool in ArcMap.
In this report, you will find section 2 that describes the methodology of data collecting and
processing and the tools that were used. In section 3 of this report, it can be found the procedures
undertaken for the completion of the surface using two methods: Kriging and Inverse Distance
Weighted (IDW). It can be also found two maps showing these predicted surfaces in Appendix B
and C of the report.
All these steps and procedure will be explained with tables and figures for better understanding
and visualization.
If you will have any questions regarding the Deliverable 4B for Spatial Analysis/Statistics, please
contact me at your convenience at (289) 969-6448 or by e-mail at mmosquerapatino@gmail.com.
Geospatial Environment Group and GIS & Environmental Solutions are looking forward working
with you again in the future.

Best Regards,

Mireya Mosquera Patino, B.Sc


M.M.P/t.y.

i
EXECUTIVE SUMMARY

The Geostatistical Analysis Final Report provides a complex analysis of humidity data within the
Niagara College Glendale Campus territory. In order to complete this assignment, data was
collected, preprocessed and analyzed. First, the area of interest was selected and randomly
sampled. Humidity data in 77 points across the territory were collected on 8 of February 2019.
Weather meter was used to measure the humidity and GPS to collect XY data.

Histograms, normal QQ plots, and trends 3-dimensional graph were analyzed in order to
understand the data distribution, skewness, and check for any existing trends. Two geostatistical
methods (Kriging and Inverse Distance Weighted) were used to create a predictive surface that
would cover the whole area of the Glendale campus showing the humidity prediction using raster
layer.

ii
TABLE OF CONTENTS

Executive summary ....................................................................................................................................................... ii


1. Introduction ...............................................................................................................................................................1
2. Methodology..............................................................................................................................................................2
2.1 Data collection .....................................................................................................................................................2
2.2 Documentation of data collection ........................................................................................................................4
2.3 Data Processing ...................................................................................................................................................6
2.4 Distribution of the data ......................................................................................................................................10
3. Geostatistical Models ..............................................................................................................................................12
3.1 Kriging ...............................................................................................................................................................13
3.2 Inverse Distance Weighted (IDW) ....................................................................................................................17
4. Data Analysis ...........................................................................................................................................................21
4.1. Histograms ........................................................................................................................................................21
4.2. QQ Plots ...........................................................................................................................................................23
4.3. Trend analysis ...................................................................................................................................................25
4.4. Semivariogram/Covariance Cloud ....................................................................................................................26
5. Best Fitted method selection ....................................................................................................................................28
6. Findings and Conclusions ........................................................................................................................................29
Bibliography ................................................................................................................................................................30
Appendix .....................................................................................................................................................................31
Appendix A .............................................................................................................................................................32
Appendix B ..............................................................................................................................................................34
Appendix C ..............................................................................................................................................................35

iii
LIST OF FIGURES
Figure 1 Area of interest ................................................................................................................................................1
Figure 2 Raosoft Sample Size Calculator ........................................................................................................................2
Figure 3 Random sampling in R .....................................................................................................................................2
Figure 4 Sampling Location............................................................................................................................................3
Figure 5 Weather meter and GPS equipment (Photo credit: Tetiana Yurchenko) .........................................................4
Figure 6 Field book (Photo credit: Tetiana Yurchenko) ..................................................................................................5
Figure 7 Easting Histogram (X value).............................................................................................................................7
Figure 8 Northing Histogram (Y value) ..........................................................................................................................8
Figure 9 Humidity Histogram (Z value) ..........................................................................................................................9
Figure 10 X, Y, and Z variability....................................................................................................................................10
Figure 11 Correlation between Humidity and Easting .................................................................................................11
Figure 12 Correlation between Humidity and Northing ..............................................................................................11
Figure 13 Boxplot of humidity data Distribution .........................................................................................................12
Figure 14 Semivariogram of Kriging Modeling ............................................................................................................13
Figure 15 Kriging. Search Neighborhood .....................................................................................................................14
Figure 16 Kriging. Cross Validation ..............................................................................................................................15
Figure 17 Kriging Method Report ................................................................................................................................16
Figure 18 Kriging method's surface clipped to the limits .............................................................................................16
Figure 19 Attribute table of predicted points ..............................................................................................................17
Figure 20 Humidity Histogram in ArcMap ...................................................................................................................21
Figure 21 Easting Histogram in ArcMap ......................................................................................................................22
Figure 22 Northing Histogram in ArcMap ...................................................................................................................22
Figure 23 Humidity QQ plot .........................................................................................................................................23
Figure 24 Easting QQ plot ............................................................................................................................................24
Figure 25 Northing QQ plot .........................................................................................................................................24
Figure 26 Trend Analysis ..............................................................................................................................................25
Figure 27 Semivariogram/Covariance Cloud of the Humidity data .............................................................................26

LIST OF TABLES
Table 1 Humidity data in sample locations....................................................................................................................6

iv
April 18, 2019
GISC9318-D4B

1. INTRODUCTION

The purpose of this assignment is to describe and analyze the previously collected data (humidity
percentage in the vicinity of the Niagara College Glendale Campus) both geostatistically as well
as practically. Also, it is needed to predict geospatial coverage by interpolation (creation of a
continuous surface using both IDW and kriging techniques based upon the sample points measured
in the area (Smith, Terms of Reference for Deliverable4B, Retrieved 2019).

The humidity percentage is based on a 100% scale. Humidity is the water vapor mass contained
within the total mass of dry air inside a specified volume of air at a specific temperature.
Essentially, the hotter the air is, the more water the air can contain. Relative humidity becomes the
ratio of highest absolute humidity against the current absolute humidity, which is dependent on
current air temperature (What Does Humidity Percentage Mean?, 2016).

Figure 1 illustrates the area of interest (AOI), that is NOTL Glendale Campus vicinity.

Figure 1 Area of interest

1
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

The AOI is delimited in yellow and gives a general idea of the size of the area (52.41 ha) where
the measures of humidity were taken. Also, shows that most of the study area lies within the natural
environment (vineyard, forest, lagoon, grass) and just a ¼ is parking lots and buildings.

2. METHODOLOGY

2.1 DATA COLLECTION

Firstly, using ArcMap the grid of 50x50m (fishnet tool) and centroids for each parcel were created.
This was done in order to divide the area in equal-size parcels. Sample Size Calculator by Raosoft
was used to define how many points are needed for accurate sampling (Sample Size Calculator by
Raosoft, 2004). Figure 2 below shows the factors selected in the Sample Size Calculator.

Figure 2 Raosoft Sample Size Calculator

It can be seen that for the area of 52.41 ha with 210 centroids, error 8% and 92% confidence there
were 77 points recommended by the software.

To select the 77 points randomly R studio software was used. Figure 3 below shows in blue, the
random sampling command and the result of the procedure in black.

Figure 3 Random sampling in R

2
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

In order to obtain these points, it was necessary to use the command ‘sample.int’. It was also
needed to add the total number of points, the recommended sample size and if a chance of iteration
was allowed (for this specific case it was not).

After getting the points randomly, it was proceeded to elaborate a map with all samples and printed
coordinates to facilitate work before going to the field. Coordinates were needed for checking with
GPS and map for a visual representation of the area. Figure 4 shows the sample locations on the
map of Glendale Campus territory.

Figure 4 Sampling Location

The sample points were symbolized as red circles on the map. Each dot represented a centroid of
the 50x50 m grid parcel.

After leaving the building it was necessary to wait between 2 to 3 minutes in order to the GPS
hooks up the satellite signal and weather meter stables and shows accurate values of humidity
given that the values inside and outside are completely different.

3
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

The area of the escarpment was problematic because of the steep slope, therefore some of the
locations might be not that accurate. There also were a couple of samples located in the lagoon or
inside the building, so the closest possible locations were used.

The data were collected within 5 hours because of the broad area and the weather conditions.

2.2 DOCUMENTATION OF DATA COLLECTION

It is important and useful to document the data collection of any project. It is necessary to
demonstrate what types of tools were used for the collection of information. It also allows to
ensuring transparency in the handling of the data.

Figure 5 shows the two types of equipment used for the measurements.

Figure 5 Weather meter and GPS equipment (Photo credit: Tetiana Yurchenko)

The weather meter (on the left) was used to collect the percentage of the humidity and temperature
in the surveyed locations and GPS (on the right) was needed to verify the coordinates of each point.
In order to record the exact time of the field work in each location digital watches were used.

Figure 6 shows a sample of the data collected in the field.

4
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Figure 6 Field book (Photo credit: Tetiana Yurchenko)

This field book was one of the main sources of further analysis since it kept all the data and field
observations.

5
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

2.3 DATA PROCESSING

Once the humidity measurements were made in the field, the data was analyzed and processed. An
excel table of the data from humidity was elaborated. Table 1 below shows an extract of it. The
table in its entirety can be found as Appendix A of this report.

ID Easting (m) Northing (m) Humidity (%)


1 649264.0405 4778771.214 53.7
2 649314.0405 4778771.214 53.2
4 649314.0405 4778821.214 55.2
12 649264.0405 4778921.214 55.2
15 649414.0405 4778921.214 56.3
16 649464.0405 4778921.214 56
17 649514.0405 4778921.214 55.2
21 649314.0405 4778971.214 53.9
22 649364.0405 4778971.214 55.2
23 649414.0405 4778971.214 56.5
26 649564.0405 4778971.214 55.1
27 649614.0405 4778971.214 53.8
28 649164.0405 4779021.214 54.6
30 649264.0405 4779021.214 55.2
31 649314.0405 4779021.214 54.7
33 649414.0405 4779021.214 56.7
Table 1 Humidity data in sample locations

Table 1 represents collected humidity data and its UTM coordinates. The ID column shows the
actual random numbers obtained in RStudio. Easting and Northing were taken from ArcMap
Attribute Table and checked by GPS equipment in the field.

Likewise, as part of Deliverable 4A, it was required to produce histograms for the x, y, and z values
of the data. This was done by using the R studio software. Figure 7 shows the graph of Easting
data and its distribution.

6
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Figure 7 Easting Histogram (X value)

The bars represent the frequency of occurrence of Easting. The X - axis shows the range of Easting
values. Since the area of the Glendale Campus is quite small, the Easting values do not vary that
much. The minimum value of Easting (m) is roughly around 649000 and the maximum – 649800.
The histograms have two peaks that have a bigger amount of collected points, while the lowest
number of points was surveyed in the area with the highest value of Easting (symbolized by the
last bar of the histogram).

7
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Figure 8 shows the graph of northing data and its distribution.

Figure 8 Northing Histogram (Y value)

The bars represent the frequency of occurrence of Northing. As mentioned before, the area of
interest is not large, therefore Easting and Northing values won’t change significantly. The
minimum value of Northing (m) is roughly around 4778800 and the maximum – 4779800. A major
amount of points were collected in the range from 4778900 to 4779600 as shown in Figure 8.

8
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Figure 9 shows the graph of humidity data and its distribution.

Figure 9 Humidity Histogram (Z value)

The bars represent the frequency of occurrence of humidity. The histogram is right-skewed,
however, first two bars show that only 10 points had values less than 52%. The majority of points
were concentrated in the range from 52% to 58%.

Also, it was required to calculate the mean, median and standard deviation for all the data.
Variability can be checked either in RStudio or in Excel. For this case, R Studio software was
used. Figure 10 below illustrates the variability of the data.

9
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Figure 10 X, Y, and Z variability

According to Figure 10, the mean value for Easting is 649404; median is 649414; SD is 212.5.

The mean value for Northing is 4779243; median is 4779221; SD is 228.2.

Finally, the mean value for Humidity is 55.28; median is 54.8; SD is 3.35.

2.4 DISTRIBUTION OF THE DATA

To begin to talk about the distribution of the data, it was first needed to create scatter diagrams in
RStudio of the correlation of the variables. Figure 11 shows that

10
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Figure 11 Correlation between Humidity and Easting

Figure 12 Correlation between Humidity and Northing

11
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Both QQ plots that have been shown in Figure 11 and Figure 12 had nonlinear relationships
between the variables that were obtained in the field.

To analyze the distribution of the data, the calculation of the Pearson coefficient was also made.
This coefficient was calculated in Excel for both correlations (Easting, and Northing). It indicates
that the correlation between Easting and Humidity is moderate with a value of 0.397 and
correlation between Northing and Humidity is weak with a value of -0.181.

Finally, it can be concluded that the data is not normally distributed because there are outliers (see
Figure 13), but those outliers exist in reality (in forest escarpment area and lagoon area the
humidity is higher).

Figure 13 Boxplot of humidity data Distribution

3. GEOSTATISTICAL MODELS

Geostatistical models can handle several components, in order to be able to examine the data
(distribution, trends, directional components, outliers), calculate the semivariogram or covariance
values, fitting a model, and to obtain a predicted value and the error (uncertainty) associated with
it for each location in the output surface (Components of geostatistical models, 2019).

12
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

As part of this report, two geostatistical methods will be analyzed and carried out in order to
compare them and establish which is the most suitable for the data collected in the field. The IDW
and Kriging methods are explained in sections 3.1 and 3.2 below.

3.1 KRIGING

“Kriging is an advanced geostatistical procedure that generates an estimated surface from a


scattered set of points with z-values. Unlike other interpolation methods, to use the Kriging tool
effectively involves an interactive investigation of the spatial behavior of the phenomenon
represented by the z-values before you select the best estimation method for generating the output
surface. Kriging assumes that the distance or direction between sample points reflects a spatial
correlation that can be used to explain variation in the surface. The Kriging tool fits a mathematical
function to a specified number of points, or all points within a specified radius, to determine the
output value for each location” (How Kriging works, 2016).

Using Geostatistical Wizard, the Kriging method was selected to predict the surface of Glendale
Campus territory based on surveyed humidity points (layer was set as Z value). First, the ordinary
kriging (default) was completed. Figure 14 illustrates the semivariogram of Kriging modeling.

Figure 14 Semivariogram of Kriging Modeling

13
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Semivariogram shows spatial relationships between the points. All the values for semivariogram
were set to default and “Model Nugget” was enabled in order to reduce the nugget effect that is
characterized by the high variability between samples that are closely located.

In Figure 14 it can be seen that there is a spatial correlation between the points on the distance
between ‘0’ to ‘0.03’ meters, which means that those points in that range are closely related.

Figure 15 shows the Search Neighborhood option performed for the one point prediction.

Figure 15 Kriging. Search Neighborhood

The crosshairs show a location that has no measured value. To predict a value at the crosshairs,
the values at the measured locations are used. The values of the closest measured locations are
similar to the value of the unmeasured location that one wants to predict. The red points in the
image below are going to be weighted (or influence the unknown value) more than the green points
since they are closer to the location that is going to be predicted (Exercise 1: Creating a surface
using default parameters, 2017).

In this case, a maximum neighbor was set to 5, minimum – to 2 (in each of 4 sectors). As can be
seen, there are from two to five points that were weighted and only one of them is red, which
means that this point will affect the prediction the most.

14
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Cross validation of the Kriging method is shown in Figure 16.

Figure 16 Kriging. Cross Validation

It can be seen that the root-mean-square standardized is 1.11, while the perfect number for this
would be 1. The average standard error is 1.99, the perfect number of error would be close to 0.
The prediction is unbiased as the value of the mean error is close to 0. However, the value of the
root-mean-square indicates that the predicted humidity values might deviate from the surveyed
points (Exercise 4: Comparing models, 2017)

15
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Figure 17 shows the Kriging Method Report.

Figure 17 Kriging Method Report

All the parameters and options used for the creation of the Kriging raster were listed in the method
report.

After creating the Kriging model it was clipped to the Niagara College campus shapefile area
(Figure 18).

Figure 18 Kriging method's surface clipped to the limits

16
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Using Validation/Prediction (Right-click the Kriging layer) the predicted humidity points were
created with proper humidity percentage noted in the attribute table. Figure 19 shows the attribute
table of the predicted humidity points.

Figure 19 Attribute table of predicted points

The selected points in Figure 19 above were the points that were randomly surveyed. The rest of
the 133 points (210 in total – 77 surveyed) had the minimum standard error at 1.50 and maximum
– 2.86. The predicted humidity values were in the range from 49.63% to 62.33%.

3.2 INVERSE DISTANCE WEIGHTED (IDW)

Inverse distance weighted (IDW) interpolation explicitly makes the assumption that things that are
close to one another are more alike than those that are farther apart. To predict a value for any
unmeasured location, IDW uses the measured values surrounding the prediction location (How
inverse distance weighted interpolation works, 2018).

In order to execute the IDW methodology, the IDW spatial analysis tool was used (ArcMap). Z
values and input point features were added to the tool. In this case, Z values represent the humidity
percentage collected in the field and feature points are X and Y coordinates of the samples in total.

17
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Figure 20 shows the Search Neighborhood option for IDW.

Figure 20 IDW. Search Neighborhood

Following the same logic as in section 3.1, the area was divided into four sectors and from 2 to 5
points were selected as neighbors that influence the value of the predicted point. The red points
symbolize the neighbors that have more weight and green points as those that has less weight. Blue
points are not taken into account.

18
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Below, Figure 21 demonstrates the cross validation for the IDW method.

Figure 21 IDW. Cross Validation

The ideal value for root mean square should be close to 1. Using the IDW method the result of the
root-mean-square is 2.18. Also, it was noticed that the value of the mean error is close to 0, which
indicates that the prediction is unbiased. For this data, as the root-mean-square value is not close
to 0 it can be said that the predicted humidity values might deviate from the surveyed points
(Exercise 4: Comparing models, 2017)

19
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Once the process was completed, a raster of IDW and a report with specifications was generated.
Figure 22 shows the IDW Method Report.

Figure 22 IDW Method Report

All the parameters and options used for the creation of the IDW raster were listed in the method
report.

Analogically to the procedure described in section 3.1 (refer to page 16), the raster was clipped to
the AOI extent and then predicted humidity points shapefile were created using the
Validation/Prediction option.

The result for IDW Validation/Prediction was in the range from 50.76% to 61.18% of humidity

20
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

4. DATA ANALYSIS

4.1. HISTOGRAMS

Using the Geostatistical Analyst tool in ArcMap, it was possible to explore the humidity data by
creating histograms, QQ plots, and performing trend analysis. Below, Figure 23 shows the
histogram for humidity data and its distribution.

Figure 23 Humidity Histogram in ArcMap

The bars represent the frequency of occurrence of humidity. As seen in Figure 23, the mean and
the median values are not the same, which proves that the data is not normally distributed, and it
is right-skewed. It can be also clearly seen that the selected blue part has a small number of sample
points that contains the bigger percentage of humidity (from 57.8% to 66.3%).

Before starting to analyze the Easting and Northing data it should be mention that the area of
interest (AOI) is 52.41 ha, which will not reflect a big change in the range of the values of Easting
and Northing.

21
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Figure 24 shows the Easting distribution of the humidity data.

Figure 24 Easting Histogram in ArcMap

It can be seen that the ‘Y’ axis of the graphic is the frequency of the data and the ‘X’ axis is the
coordinate value. By examining the data it can be observed that the minimum values
(649014.040486) are located on the North-West part of the AOI, while the maximum
(649814.040486) – on the South-East. The points that are located in the same column of the grid
have the same Easting.

Figure 25 shows the Northing distribution of the humidity data.

Figure 25 Northing Histogram in ArcMap

22
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Following the logic of the Easting histogram analysis, it can be said that the minimum values
(4778771.21371) are located on the South-West part of the AOI, while the maximum
(4779671.21371) – on the North-East. The points that are located in the same row of the grid have
the same Northing.

4.2. QQ PLOTS

“Quantile-quantile (QQ) plots are graphs on which quantiles from two distributions are plotted
relative to each other. If the data is normally distributed, the points will fall on the 45-degree
reference line.” (Normal QQ plot and general QQ plot, 2019).

In the QQ plot of the humidity data, it can be seen that the more normally distributed data are
located closer to the reference line as shown in Figure 26.

Figure 26 Humidity QQ plot

The selected points are the most normally distributed and they basically represent the average of
humidity data, while the points with the highest humidity percentage are distant from the reference
line.

Figure 27 illustrates the Easting QQ plot for the collected points.

23
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Figure 27 Easting QQ plot

Generally, the Easting data is normally distributed because all the points are located close to the
reference line. The blue points that are crossing the reference line indicate the standard normal
value and located in the center vertical line of the study area. The number of the points selected in
the QQ plot corresponds with the number of the points in the grid column in the map.

Figure 28 illustrates the Northing QQ plot for the collected points.

Figure 28 Northing QQ plot

24
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

Likewise, the Northing QQ plot indicates that the data is normally distributed as they are located
close to the reference line. The blue points are located in the center horizontal line of the study
area and represent the average values of the Northing data.

4.3. TREND ANALYSIS

The Trend is a pattern of gradual change in a variable. It also can be said that is the average or
general tendency of the points collection to move spatially (Smith, Lecture 08, Variography & the
ArcGIS Geostatistical Analyst Extension, Retrieved 2019). Figure 29 shows the trend analysis of
the humidity data within the Glendale Campus area.

Figure 29 Trend Analysis

It can be seen that the data has an upward linear trend in one of the dimensions (green line on the
graph) and a curved trend in another dimension (blue line in the graph). When rotating the graph
the green line becomes a curve and the blue becomes a line with an upward trend. The Z values

25
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

represent the humidity values. The trend model projects the Z values on the Easting and Northing
graphs (the XZ and YZ planes).

4.4. SEMIVARIOGRAM/COVARIANCE CLOUD

Semivariogram cloud is created to measure the distance between two Z values and plot the distance
versus the (difference in Z values)2 (Smith, Lecture 08, Variography & the ArcGIS Geostatistical
Analyst Extension, Retrieved 2019).

Figure 30 shows the Semivariogram/Covariance Cloud of the Humidity data.

Figure 30 Semivariogram/Covariance Cloud of the Humidity data

The samples of humidity locations that are selected in the semivariogram are also shown on the
map, and the links between them indicate the pairing. It was established that the sample pairs that
are located closer to the beginning of the X and Y axes have higher correlation than the ones that
are farther.

26
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

For example, in Figure 31 it can be seen that these two pairs of Easting locations are not correlated
due to the distance between them.

Figure 31 Semivariogram/Covariance Cloud of the Easting

Analogically, there is a similar correlation in the Northing data.

27
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

5. BEST FITTED METHOD SELECTION

The best fitted method will allow to predict the humidity of the whole surface more precisely and
give a more realistic sense of the spatial distribution of the data. Therefore, it was decided to
perform the cross-validation comparison between Kriging and IDW methods shown in Figure 32.

Figure 32 Comparison of Kriging and IDW methods

After having made two geostatistical methods for the humidity data, it was possible to define that
the most adjusted and real method is the Ordinary Kriging Method. This is concluded because data
were obtained on the mean of 0.018 and the root mean square of 1.198, while the IDW result was
0.054 and 2.18 respectively. In the most optimal case, the value closest to 0 for the mean is the
most accurate, while for the root square mean it is the one that is closest to 1.

28
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

6. FINDINGS AND CONCLUSIONS

This report was describing the procedure for the second part of Deliverable 4. Within Deliverable
4A, the data was collected, and the initial analysis was conducted. It was determined that for
analyzing the data it is more convenient to use ArcMap rather than RStudio. RStudio shows the
histogram or QQ plot, but there is no way to analyze the data spatially, while in ArcMap the
point(s)/line(s)/bar(s) on the graph can be selected and the result will appear on the map or in the
attribute table, that gives better understanding the data.

In this deliverable, the steps and procedures to accomplish two surface interpolations were
described using the geostatistical methods of Kriging and IDW. Both interpolation methods
produced surfaces that at the end were analyzed and revised with the purpose of knowing which
of the two was the most convenient for the humidity data.

There are few ways to conduct Kriging and IDW operations in ArcMap. It was observed that
Geostatistical Wizard was more appropriate than the Kriging or IDW tools in Spatial Analyst
toolbox, because not only does it show the result of the analysis, it also allows to set the parameters
in each of the steps, that makes the result more meaningful and precise.

Both, Kriging and IDW are methods that allowed to model the humidity surface. The Kriging
method assumes that the data closest to a known point have greater weight or influence over
interpolation. This influence decreases when the point of interest moves away. In the other hand,
the IDW method predicts missing data and estimates an unknown value at a location using some
known values with corresponding weighted values.

After finalizing the analysis, it was concluded that the most convenient interpolation surface for
the data was the Kriging method since the collection of humidity data was done in a relatively
small area where there were no gaps in the data or unknown values.

29
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

BIBLIOGRAPHY
Components of geostatistical models. (2019). Retrieved from ESRI:
http://desktop.arcgis.com/en/arcmap/latest/extensions/geostatistical-analyst/components-of-
geostatistical-models.htm

Exercise 1: Creating a surface using default parameters. (2017). Retrieved from ESRI :
http://desktop.arcgis.com/en/arcmap/10.5/extensions/geostatistical-analyst/exercise-1-creating-a-
surface-using-default-parameters.htm

Exercise 4: Comparing models. (2017). Retrieved from ESRI:


http://desktop.arcgis.com/en/arcmap/10.5/extensions/geostatistical-analyst/exercise-4-comparing-
models.htm

How inverse distance weighted interpolation works. (2018). Retrieved from ESRI: https://pro.arcgis.com/en/pro-
app/help/analysis/geostatistical-analyst/how-inverse-distance-weighted-interpolation-works.htm

How Kriging works. (2016). Retrieved from ESRI: http://desktop.arcgis.com/en/arcmap/10.3/tools/3d-analyst-


toolbox/how-kriging-works.htm

Normal QQ plot and general QQ plot. (2019). Retrieved from ESRI:


http://desktop.arcgis.com/en/arcmap/latest/extensions/geostatistical-analyst/normal-qq-plot-and-
general-qq-plot.htm

Sample Size Calculator by Raosoft. (2004). Retrieved from Raosoft: www.raosoft.com/samplesize.html

Smith, I. (Retrieved 2019). Lecture 08, Variography & the ArcGIS Geostatical Analyst Extension. Niagara-on-the-
Lake, Ontario, Canada.

Smith, I. (Retrieved 2019). Terms of Reference for Deliverable4B. Niagara-on-the-Lake.

What Does Humidity Percentage Mean? (2016, November 14). Retrieved from Smart Fog:
https://www.smartfog.com/humidity-percentage-mean.html

30
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

APPENDIX

31
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

APPENDIX A

ID Easting (m) Northing (m) Humidity (%)


1 649264.0405 4778771.214 53.7
2 649314.0405 4778771.214 53.2
4 649314.0405 4778821.214 55.2
12 649264.0405 4778921.214 55.2
15 649414.0405 4778921.214 56.3
16 649464.0405 4778921.214 56
17 649514.0405 4778921.214 55.2
21 649314.0405 4778971.214 53.9
22 649364.0405 4778971.214 55.2
23 649414.0405 4778971.214 56.5
26 649564.0405 4778971.214 55.1
27 649614.0405 4778971.214 53.8
28 649164.0405 4779021.214 54.6
30 649264.0405 4779021.214 55.2
31 649314.0405 4779021.214 54.7
33 649414.0405 4779021.214 56.7
38 649664.0405 4779021.214 51.8
39 649714.0405 4779021.214 63.6
41 649214.0405 4779071.214 54.8
44 649364.0405 4779071.214 53.5
48 649564.0405 4779071.214 57.7
49 649614.0405 4779071.214 57.3
52 649764.0405 4779071.214 54
55 649164.0405 4779121.214 58.7
57 649264.0405 4779121.214 53.7
61 649464.0405 4779121.214 57.2
65 649664.0405 4779121.214 55.2
67 649764.0405 4779121.214 54.4
68 649814.0405 4779121.214 54.4
69 649114.0405 4779171.214 51.6
71 649214.0405 4779171.214 57.9
75 649414.0405 4779171.214 58.9
78 649564.0405 4779171.214 56.5
86 649164.0405 4779221.214 51.3
87 649214.0405 4779221.214 57.1

32
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

88 649264.0405 4779221.214 54.3


94 649564.0405 4779221.214 56.6
97 649714.0405 4779221.214 55.9
98 649764.0405 4779221.214 56.9
99 649064.0405 4779271.214 49.3
100 649114.0405 4779271.214 49.4
102 649214.0405 4779271.214 54.2
103 649264.0405 4779271.214 56.7
106 649414.0405 4779271.214 62.2
107 649464.0405 4779271.214 61.6
112 649714.0405 4779271.214 57.9
119 649264.0405 4779321.214 55.9
126 649614.0405 4779321.214 66.3
127 649664.0405 4779321.214 64.6
130 649064.0405 4779371.214 52.4
131 649114.0405 4779371.214 50.5
140 649564.0405 4779371.214 59.7
141 649614.0405 4779371.214 61.3
143 649714.0405 4779371.214 61.2
144 649014.0405 4779421.214 54.5
145 649064.0405 4779421.214 54.4
147 649164.0405 4779421.214 54.6
150 649314.0405 4779421.214 54.1
154 649514.0405 4779421.214 54.7
160 649064.0405 4779471.214 54.9
166 649364.0405 4779471.214 49.8
169 649514.0405 4779471.214 54
170 649564.0405 4779471.214 55.2
171 649614.0405 4779471.214 56.3
172 649664.0405 4779471.214 56.7
173 649064.0405 4779521.214 55.2
174 649114.0405 4779521.214 54
175 649164.0405 4779521.214 53.4
179 649364.0405 4779521.214 49.5
182 649514.0405 4779521.214 53.5
183 649564.0405 4779521.214 53.5
186 649214.0405 4779571.214 53.3

33
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

193 649564.0405 4779571.214 52.8


198 649414.0405 4779621.214 49.3
205 649464.0405 4779671.214 52.2
206 649514.0405 4779671.214 53.1
208 649414.0405 4779721.214 50.9

34
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

APPENDIX B

34
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com
April 18, 2019
GISC9318-D4B

APPENDIX C

35
Tetiana Yurchenko Mireya Mosquera
tatianayu20@gmail.com mmosquerapatino@gmail.com

You might also like