RoseSegNet: An attention-based deep learning architecture for organ segmentation of plants

TURGUT, KAYA; DUTAĞACI, HELİN; Rousseau, David

doi:10.1016/j.biosystemseng.2022.06.016

RoseSegNet: An attention-based deep learning architecture for organ segmentation of plants

Atıf İçin Kopyala

TURGUT K., DUTAĞACI H., Rousseau D.

BIOSYSTEMS ENGINEERING, cilt.221, ss.138-153, 2022 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 221
Basım Tarihi: 2022
Doi Numarası: 10.1016/j.biosystemseng.2022.06.016
Dergi Adı: BIOSYSTEMS ENGINEERING
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, Aerospace Database, BIOSIS, Biotechnology Research Abstracts, CAB Abstracts, Communication Abstracts, Compendex, Food Science & Technology Abstracts, INSPEC, Metadex, Veterinary Science Database, DIALNET, Civil Engineering Abstracts
Sayfa Sayıları: ss.138-153
Anahtar Kelimeler: 3D phenotyping, Plant segmentation, Point -based deep learning, Surface descriptor, Self-attention, PART SEGMENTATION, POINT CLOUDS, CLASSIFICATION
Eskişehir Osmangazi Üniversitesi Adresli: Evet

Özet

An important component for the advancement of plant breeding, genetics, and genomics research is the rapid and accurate measurement of phenotypic traits of large plant populations. The phenotypic data that are of interest can be at multiple levels of plant organization including organ-level geometric characteristics as well as the spatial organization of the organs. 3D computer vision enabling 3D geometry acquisition and processing promises to supply fast, automated phenotypic data collection. One important component of the processing pipeline is the segmentation of the plant into its structural components, such as leaves, stems, and flowers. In this paper, a novel 3D point-based deep learning network, namely RoseSegnet, is proposed for segmentation of point clouds of rosebush plants to their organs. The network is equipped with two attention-based modules, one for extracting contextual features at the encoder phase, another for feature propagation at the decoder phase. The network processes regions of points in a hierarchical manner, where at each level, point features are aggregated using attention-based operators. The aggregation is performed by incorporating point relations both within and between the receptive fields, defined by the hierarchical organization of points. RoseSegNet outperforms the widely-used architecture PointNet++ by 4% in terms of MIoU on the publicly available ROSE-X data set. Also, it is demonstrated that introducing local surface features together with the spatial coordinates of each 3D point at the input level boosts the segmentation performance of both networks by 9% in terms of MIoU. (c) 2022 IAgrE. Published by Elsevier Ltd. All rights reserved.