Classificaon methods are affected by the spectral mixture due to the multude of features observed in high resoluon images, especially in urban environments. InterIMAGE is highlighted as an open source system developed by the Computer Vision Laboratory in the Department of Electrical Engineering of the Ponfical Catholic University of Rio de Janeiro. Interimage is integrated with data mining to generate more accurate classifications.
Original Description:
Original Title
Comparison of the Accuracy of Classifications Generated by Interimage and by Interimage Integrated With Data Mining
Classificaon methods are affected by the spectral mixture due to the multude of features observed in high resoluon images, especially in urban environments. InterIMAGE is highlighted as an open source system developed by the Computer Vision Laboratory in the Department of Electrical Engineering of the Ponfical Catholic University of Rio de Janeiro. Interimage is integrated with data mining to generate more accurate classifications.
Classificaon methods are affected by the spectral mixture due to the multude of features observed in high resoluon images, especially in urban environments. InterIMAGE is highlighted as an open source system developed by the Computer Vision Laboratory in the Department of Electrical Engineering of the Ponfical Catholic University of Rio de Janeiro. Interimage is integrated with data mining to generate more accurate classifications.
93 Aristotle University of Thessaloniki, Greece Published online May 2014
Comparison of the accuracy of classicaons generated by interimage and by interimage integrated with data mining
Denilson Passo a , Edilson Bias a,* , Ricardo Brites a , Gilson Costa b a Instute of Geosciences , University of Braslia , Brazil b Department of Electrical Engineering , Poncal Catholic University of Rio de Janeiro, Brazil *Corresponding author: edbias@unb.br, +556131076977, +556181317202
Abstract: The technological development in remote sensing has been providing a large increase in the quality of satellite imagery. A notable example is the WorldView II satellite that has a spaal resoluon of 46 cm. Despite this breakthrough, the classicaon methods are sll very aected by the spectral mixture due to the multude of features observed in high resoluon images, especially in urban environments. In the search for a soluon, soware packages that enable imagery classicaon based on objects, facilitang the representaon and processing of human knowledge about the specic characteriscs of targets of interest, have appeared. These systems are based on rules and decision trees, where they seek to represent the mode of reasoning and knowledge used by experts in solving problems within its specialty. Among the classicaon systems based on objects, the InterIMAGE is highlighted as an open source system developed by the Computer Vision Laboratory in the Department of Electrical Engineering of the Poncal Catholic University of Rio de Janeiro and the divisions of Image Processing and Remote Sensing in the Naonal Instute for Space Research. The soware has a specic graphical user interface to support the denion of decision rules. These rules dene the structured and explicit knowledge of the user / analyst and are used by the system in the process of interpretaon, where the values to be inserted in the structure of the rule can be obtained by visual analysis of aributes histograms calculated for image segments. Once appropriately constructed, its use is fast and computaonally ecient in separang the classes. In order to avoid errors in interpretaon of visual aributes and to acquire greater eciency and speed in construcng rules, authors have used data mining techniques, which help to check which aributes, from those available in the data set, are more relevant to describe classes automacally and to provide the thresholds to separate them . Finally, in order to evaluate the classicaons, the accuracy assessment was conducted making use of three indices: Global, Kappa and Tau. Keywords: WorldViewII, InterIMAGE,data mining, accuracy assessment.
1. Introducon Over the past few years there was a signicant increase in the use of satellite remote sensing imagery, which has contributed to new research, mainly targeng improvements in classiers methodologies. Among these alternaves, the methodology for spaal imagery analysis based on objects GEOBIA (Geographic Object - Based Image Analysis) (Hay and Caslla, 2008) has gained importance in recent years. It emerges as a signicant advance in the process of automac classicaon of satellite imagery, as it enables the modeling of human knowledge related to specic features of the targets, such as brightness, texture, shape, context, size, neighborhood relaons, among others, with the focus on the classicaon of image segments instead of isolated pixels. South-Eastern European Journal of Earth Observaon and Geomacs Issue Vo3, 2S, 2014
94 Aristotle University of Thessaloniki, Greece Published online May 2014
With respect to object-based soware, currently one can nd proprietary and open source systems, as is the case of InterIMAGE developed by the Computer Vision Laboratory, Department of Electrical Engineering, Poncal Catholic University of Rio de Janeiro and the divisions of Image Processing and Remote Sensing of the Naonal Instute for Space Research (Costa et al., 2008). This study was conducted at the satellite city of Sobradinho - DF, distant about 30 miles from downtown Brasilia - DF, Figure 1, and aimed to compare the potenal and accuracy of a classicaon generated by InterIMAGE with decision rules constructed through visual analysis of histograms and classicaon using decision rules built automacally by data mining and evaluated ulizing accuracy assessment techniques.
Figure 1. Locaon of test-area.
2. Methodological procedures Figure 2 depicts the methodological procedures for the analysis.
Figure 2. Methodological procedures.
3. Results and discussion The use of Analysis Manager resorces, available in the InterIMAGE soware, allowed the renement of the results by the analysis of histograms. The separaon of classes in InterIMAGE, with decision rules generated automacally by data mining, was possible based South-Eastern European Journal of Earth Observaon and Geomacs Issue Vo3, 2S, 2014
95 Aristotle University of Thessaloniki, Greece Published online May 2014
on the informaon of the decision tree created by the WEKA soware (Tab. 1) using the J48 algorithm that redeploys, in the soware, the C4.5 algorithm, proposed by Quinlan (1993). Figure 3 shows an example of arboreal vegetaon class generated by each of the methods: rule created in InterIMAGEby visual interpretaon of histograms (Fig. 3a) and rule created in InterIMAGEusing data mining (Fig. 3b). Besides assisng in the construcon of rules, simplifying their construcon, data mining also assisted in verifying which aributes among those available in the data set are the most relevant to describe classes automacally, they were: image brightness; standard deviaon of bands 1, 2, 6 and 7, rao using the bands 3 and 1; entropy of bands 2 and 5; rao of the bands 1, 3 and 8; maximum value of the bands 1, 2, 3 and 4; minimum value of bands 5 and 6 and, nally, two results of the Normalized Dierence Vegetaon Index - NDVI using the bands 7 and 8 as the infrared, since WorldView II has two bands in the infrared. These parameters were calculated using, always, the average value of the aribute within the object (segment).
South-Eastern European Journal of Earth Observaon and Geomacs Issue Vo3, 2S, 2014
96 Aristotle University of Thessaloniki, Greece Published online May 2014
Figure 2. Samples of decision rules created by visual interpretaon of histogram (a) and by data mining (b).
Visually, one can observe a lile dierence between the two classicaons (Fig. 4). However, when looking at their confusion matrix (Tab. 2 and 3), it is percepble that in the classicaon that used the rules created by visual interpretaon of histograms, the class that got beer separaon was the Metal Tile, explained by its lile mixture with the others, despite the reduced number of reference samples. The coverage classes Clear Cement Tile and Dark Ciment Tile also had good separaon, contribung signicantly to percentage increase of accuracy indices. By analyzing the confusion matrix of the classicaon that used decision rules created by data mining (Tab. 3), it is noced that, in general, there was lile dispersion of values , i.e., the values were concentrated in the main diagonal, meaning lile confusion between classes. However, there were sll mixing between classes Caly Tile and Bare Soil, and the laer, in turn, with the Undergrowth vegetaon class, which may have occurred due to some areas experiencing sparse distribuon or dry leaves, due to the season, exposing soil and hindering the spectral separaon of the classes. In both classicaons the Asphalt class, got good accuracy indices due to the use of a shapele of roads which served as a mask.
Figure 3. Themac map resulng from the classicaon created by visual interpretaon of histograms (a) and datamining (b).
South-Eastern European Journal of Earth Observaon and Geomacs Issue Vo3, 2S, 2014
97 Aristotle University of Thessaloniki, Greece Published online May 2014
Table 2. Confusion matrix of the classicaon created by InterIMAGE using interpretaon of histograms. Table 3. Confusion matrix of the classicaon created by InterIMAGEusing data mining.
4. Conclusions The results showed that the classicaon using data mining to build the decision rules was more ecient to map the coverage of urban areas, than the classicaon performed solely for the visual analysis of histograms generated in InterIMAGE itself. The percentages of overall accuracy (global) for the classicaon generated by InterIMAGEwith rules generated by visual interpretaon of histograms achieved overall index, or gobal, (Hellden et al., 1980), Kappa index (Cohen, 1960) and Tau Index (Kleka, 1980) of 90.6%, 89.4% and 89.5%, respecvely, while the result of classicaon rules generated by data mining reached values of 92.6%, 91.6% and 91.7% respecvely.
References Cohen J., 1960, Coecient of Agreement for Nominal Scales. Educaonal and Measurment. Volume XX, Issue 1: 37-46. Costa G., Pinho C., Feitosa R., Almeida C., Kux H., Fonseca L., Oliveira D., 2008, INTERIMAGE: Uma Plataforma Cogniva Open Source para a Interpretao Automca de Imagens Digitais. Brazilian Journal of Cartography, Volume 60: 331-337. Hay G., Caslla G., 2008, Geographic Object-based Image Analysis (GEOBIA):a new name for a new discipline. In Object-based image analysis-spaal concepts for knowledge- driven remote sensing applicaons, Blaschke T., Lang S., Hay G. (editors), Springer Verlag, Berlin. Hellden U., Stern M., 1980, Evaluaon of Landsat Imagery and Digital Data for Monitoring Desercaon Indicators in Tunisia. Proceedings 14 th Int. Symp. on Rem. Sens. of Environ., pp. 1601-1611. Klecka W., 1980, Discriminant Analysis. SAGE Publicaons, Inc., Beverly Hills, California, pp. 71. Quinlan R., 1993, C4.5: Programs for Machine Learning Morgan Kaufmann Publishers. Inc., San Francisco, CA, USA.