MARE: Self-Supervised Multi-Attention REsu-Net for Semantic Segmentation in Remote Sensing
by
Valerio Marsocci, Simone Scardapane, Nikos Komodakis
Abstract
Scene understanding of satellite and aerial images is a pivotal task in many remote sensing (RS) applications, such as land cover and urban development monitoring. In recent years, neural networks have become the de facto standard in many of these applications. However, semantic segmentation remains a challenging task. Compared with other computer vision (CV) areas, large labeled datasets are rarely available in RS because of their high cost and the manpower they require. Meanwhile, self-supervised learning (SSL) is attracting growing interest in CV, reaching state-of-the-art results in several tasks. Despite this, most SSL models, pretrained on huge datasets like ImageNet, do not perform particularly well on RS data. For this reason, we propose combining an SSL algorithm (specifically, Online Bag of Words) with a semantic segmentation algorithm designed for aerial images (namely, Multistage Attention ResU-Net), and report encouraging new results (i.e., 81.76% mIoU with a ResNet-18 backbone) on the ISPRS Vaihingen dataset.
Web Captures
https://www.mdpi.com/2072-4292/13/16/3275/htm
Open Access Publication
In DOAJ
In ISSN ROAD
In Keepers Registry
ISSN-L:
2072-4292