Template-based automatic search of compact semantic segmentation architectures

Nekrasov, V.; Shen, C.; Reid, I.D.

Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/131561

Scopus	Web of Science®	Altmetric
Citations
?	?

Type:	Conference paper
Title:	Template-based automatic search of compact semantic segmentation architectures
Author:	Nekrasov, V. Shen, C. Reid, I.D.
Citation:	Proceedings of the IEEE Winter Conferene on Applications of Computer Vision (WACV 2020), 2020, vol.abs/1904.02365, pp.1969-1978
Publisher:	IEEE
Publisher Place:	online
Issue Date:	2020
Series/Report no.:	IEEE Winter Conference on Applications of Computer Vision
ISBN:	9781728165547
ISSN:	2472-6737 2642-9381
Conference Name:	IEEE Winter Conference on Applications of Computer Vision (WACV) (1 Mar 2020 - 5 Mar 2020 : Snowmass Village, USA)
Statement of Responsibility:	Vladimir Nekrasov, Chunhua Shen, Ian Reid
Abstract:	Automatic search of neural architectures for various vision and natural language tasks is becoming a prominent tool as it allows to discover high-performing structures on any dataset of interest. Nevertheless, on more difficult domains, such as dense per-pixel classification, current automatic approaches are limited in their scope - due to their strong reliance on existing image classifiers they tend to search only for a handful of additional layers with discovered architectures still containing a large number of parameters. In contrast, in this work we propose a novel solution able to find light-weight and accurate segmentation architectures starting from only few blocks of a pre-trained classification network. To this end, we progressively build up a methodology that relies on templates of sets of operations, predicts which template and how many times should be applied at each step, while also generating the connectivity structure and downsampling factors. All these decisions are being made by a recurrent neural network that is rewarded based on the score of the emitted architecture on the holdout set and trained using reinforcement learning. One discovered architecture achieves 63.2% mean IoU on CamVid and 67.8% on CityScapes having only 270K parameters.
Rights:	©2020 IEEE
DOI:	10.1109/WACV45572.2020.9093567
Grant ID:	ARC
Published version:	https://ieeexplore.ieee.org/xpl/conhome/9087828/proceeding
Appears in Collections:	Aurora harvest 4 Computer Science publications

Files in This Item:

There are no files associated with this item.

Show full item record

Adelaide Research & Scholarship