Have a personal or library account? Click to login
On the Importance of 3D Surface Information for Remote Sensing Classification Tasks Cover

On the Importance of 3D Surface Information for Remote Sensing Classification Tasks

Open Access
|May 2021

Figures & Tables

dsj-20-1329-g1.png
Figure 1

Feature extraction for SVM classification using a 5 × 5 neighborhood and a 1 channel (DSM only), 3 channel (RGB-only), or 4 channel (DSM & RGB) representation.

dsj-20-1329-g2.png
Figure 2

SegNet architecture from (Audebert, Le Saux, and Lefèvre, 2018; Badrinarayanan, Kendall, and Cipolla, 2017) utilizing a deep encoder-decoder structure for image segmentation and object classification. To perform nonlinear up-sampling, the SegNet decoder leverages pooling indices computed in the encoder layers of the network and connected from the encoder to the decoder via skip connections (Badrinarayanan, Kendall, and Cipolla, 2017).

dsj-20-1329-g3.png
Figure 3

Comparison of Segnet and Segnet Lite architectures by number of indices per layer, number of input and output channels, weights per layer, and total weights. Note that the Segnet Lite architecture limits the number of layers per block to two and reduces the output channels for each layer by 75 percent.

Table 1

Comparative summary between our two Fully Convolutional Neural Network architectures, SegNet and SegNet Lite. These metrics are based off of Figure 3.

NEURAL ARCHITECTURETOTAL PARAMETERSCHANNELS (RELATIVE TO SEGNET)KERNEL SIZE
SegNet29,422,6561.0×3
SegNet Lite1,176,3360.25×3
dsj-20-1329-g4.png
Figure 4

Sample tile from USSOCOM Urban 3D Challenge dataset for Jacksonville, FL showing RGB imagery (left), nDSM info (center), and annotated ground truth for building footprints (right).

dsj-20-1329-g5.png
Figure 5

Another view of a sample nDSM (Jacksonville Tile 23) from the USSOCOM dataset.

dsj-20-1329-g6.png
Figure 6

Sample tile from the ISPRS dataset for Vaihingen, Germany showing IRRG imagery (left), nDSM information (center), and color-coded ground truth for six object classes of interest (right).

Table 2

USSOCOM training and testing (in-sample and out-of-sample) procedures for SVM and SegNet. For evaluating the SegNet classifier on the USSOCOM dataset, we only test out-of-sample performance.

CLASSIFIER ARCHITECTURE
TYPE OF DATASETSVMSEGNET
TrainingJacksonville, FLTampa, FLTampa, FL
In-Sample TestingJacksonville, FL
Out-of-Sample TestingTampa, FLRichmond, VAJacksonville, FL
Table 3

ISPRS training and testing (in-sample and out-of-sample) procedures for our classification architectures: SVM, SegNet Lite, and SegNet.

CLASSIFIER ARCHITECTURE
TYPE OF DATASETSVMSEGNET LITESEGNET
TrainingVaihingen tiles 1–12Vaihingen tiles 1–12Vaihingen tiles 1–12
In-Sample TestingVaihingen tiles 13–16Vaihingen tiles 13–16Vaihingen tiles 13–16
Out-of-Sample TestingPotsdam*Potsdam*Potsdam*
Table 4

SegNet – Classification performance by object type (accuracy only) for ISPRS in-sample (Vaihingen) and out-of-sample (Potsdam) validation using three training cases.

OBJECTS OF INTERESTSEGNET (ISPRS)
VAIHINGENPOTSDAM (9 CM)
NDSMIRRGNDSM & IRRGNDSMIRRGNDSM & IRRG
Impervious surfaces0.87270.95200.95310.71270.75020.8374
Buildings0.95490.97380.97220.68280.45710.7886
Low vegetation0.84860.92990.92430.73200.78290.8589
Trees0.91590.94880.94730.88460.85680.8643
Cars0.99220.99690.99590.98650.98790.9912
Clutter0.99950.99930.99960.95180.95980.9522
Total0.79190.90030.89620.47520.39740.6463
Table 5

SegNet Lite – Classification performance by object (accuracy only) for ISPRS in-sample (Vaihingen) and out-of-sample (Potsdam) validation using three training cases.

OBJECTS OF INTERESTSEGNET LITE (ISPRS)
VAIHINGENPOTSDAM (9 CM)
NDSMIRRGNDSM & IRRGNDSMIRRGnDSM & IRRG
Impervious surfaces0.87060.95190.95590.71230.79500.7827
Buildings0.95390.97260.97350.85590.55540.6016
Low vegetation0.84170.93220.92760.60770.76510.8182
Trees0.91620.94900.94860.86870.83840.8669
Cars0.99220.99690.99590.98640.98870.9871
Clutter0.99920.99920.99960.95220.95510.9495
Total0.78690.90090.90060.49160.44880.5030
Table 6

SVM – Classification performance by object (accuracy only) for in-sample (Vaihingen) and out-of-sample (Potsdam) validation using three training cases.

OBJECTS OF INTEREST5 × 5 SVM CLASSIFIER (ISPRS)
VAIHINGENPOTSDAM (9 CM)
NDSMIRRGNDSM & IRRGNDSMIRRGNDSM & IRRG
Impervious surfaces0.78120.87330.93200.68470.76650.8352
Buildings0.79310.89140.95670.75500.52570.6913
Low vegetation0.83090.87150.89780.72460.77680.8147
Trees0.75370.91010.93170.74640.83250.8214
Cars0.96880.99150.99280.85300.98620.9832
Clutter0.99220.99970.99970.94120.94360.9429
Total0.56000.76870.85530.32660.41570.5444
dsj-20-1329-g7.png
Figure 7

Qualitative out-of-sample classification performance for SegNet classifier applied to ISPRS Potsdam data. From left to right, the top row shows IRRG imagery, nDSM information, color-coded ground truth annotations. From left to right, bottom row display predictions when trained with (i) IRRG info only, (ii) nDSM info only, and (iii) combined IIRG & nDSM info.

dsj-20-1329-g8.png
Figure 8

Qualitative out-of-sample classification performance for SegNet Lite classifier for the same tile as used in Figure 7 from the ISPRS Potsdam data, display predictions when trained with (i) IRRG info only, (ii) nDSM info only, and (iii) combined IIRG & nDSM info.

dsj-20-1329-g9.png
Figure 9

Cross-city building classification performance for the USSOCOM dataset using SegNet classifiers. Classifiers are color-coded: nDSM & RGB in blue, RGB-only in green, and nDSM-only in yellow. Note that JAX corresponds to out-of-sample testing with tiles from Jacksonville, and RIC corresponds to out-of-sample testing with tiles from Richmond.

dsj-20-1329-g10.png
Figure 10

In-sample and out-of-sample building classification performance for the USSOCOM dataset using SVM classifiers. Classifiers are color-coded: nDSM & RGB in blue, RGB-only in green, and nDSM-only in yellow.

dsj-20-1329-g11.png
Figure 11

Qualitative out-of-sample classification performance for SVM classifiers applied to USSOCOM data. From left to right, the upper row shows RGB imagery, nDSM (DSM-DTM) information, and ground truth, i.e. annotated building footprints, for Tampa tile #014. From left to right, the lower row shows predicted building footprints when training on (i) nDSM information only, (ii) RGB imagery only, and (iii) combined RGB & nDSM information.

Table 7

SegNet – Balanced building classification performance metrics for cross-city (out-of-sample) validation following procedures 1 and 2 in table 2. In procedure 1 (left three columns), SegNet was trained on tiles from Tampa, Florida, and tested on tiles from Jacksonville, Florida. In procedure 2 (right three columns), SegNet was trained on tiles from Tampa, Florida, and tested on tiles from Richmond, Virginia.

CLASSIFICATION METRICSSEGNET (US SOCOM)
TRAIN TAM
TEST JAX
TRAIN TAM
TEST RIC
NDSMRGBNDSM & RGBNDSMRGBNDSM & RGB
Accuracy0.91640.92980.93670.86900.93390.9386
Precision0.92450.94120.94510.94250.94160.9512
Recall0.91050.92450.93410.81220.93070.9298
F1 Score0.91750.93280.93960.87250.93610.9404
False Negative Rate0.08950.07550.06590.18780.06930.0702
False Positive Rate0.08290.06430.06040.06100.06260.0518
Table 8

SVM – Balanced building classification performance metrics for in-sample and out-of-sample testing on the USSOCOM dataset.

CLASSIFICATION METRICS5 × 5 SVM CLASSIFIER (USSOCOM)
IN-SAMPLE TESTINGOUT-OF-SAMPLE TESTING
NDSMRGBNDSM & RGBNDSMRGBNDSM & RGB
Accuracy0.87630.89780.91780.87630.72120.8931
Precision0.84380.88500.92140.84380.74670.9003
Recall0.90470.89960.90230.80000.71260.8963
F1 Score0.87320.89220.91170.82000.72920.8983
False Negative Rate0.09530.10040.09770.20000.28740.1037
False Positive Rate0.14880.10390.06840.20000.26930.1105
Table 9

Average training times (in seconds) for SVM classifiers when using smaller sample proportions for training on USSOCOM data.

SAMPLE PROPORTION FOR TRAINING0.0001%0.001%0.01%0.1%
SVM Training Times (sec)0.11.518024,000
dsj-20-1329-g12.png
Figure 12

Impact of sample proportion on in-sample (dotted lines) and out-of-sample (solid lines) SVM classification performance on the USSOCOM Jacksonville, FL dataset. The study compares three input data scenarios, (a) RGB & nDSM (black), (b) RGB-only (red), and (c) nDSM-only (blue). From left to right, the individual plots show accuracy, F1-score, and error rate as a function of sample proportion.

dsj-20-1329-g13.png
Figure 13

Impact of sample proportion on classification performance using SegNet (left) and SegNet Lite (right) on ISPRS data.

Language: English
Submitted on: Feb 4, 2021
|
Accepted on: Apr 20, 2021
|
Published on: May 10, 2021
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2021 Jan Petrich, Ryan Sander, Eliza Bradley, Adam Dawood, Shawn Hough, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.