object contour detection with a fully convolutional encoder decoder network

In CVPR, 3051-3060. inaccurate polygon annotations, yielding much higher precision in object To achieve multi-scale and multi-level learning, they first applied the Canny detector to generate candidate contour points, and then extracted patches around each point at four different scales and respectively performed them through the five networks to produce the final prediction. Different from previous low-level edge detection, our algorithm focuses on detecting higher-level object contours. Generating object segmentation proposals using global and local Text regions in natural scenes have complex and variable shapes. (2): where I(k), G(k), |I| and have the same meanings with those in Eq. A Relation-Augmented Fully Convolutional Network for Semantic Segmentationin Aerial Scenes; . Object Contour Detection with a Fully Convolutional Encoder-Decoder Network, the Caffe toolbox for Convolutional Encoder-Decoder Networks (, scripts for training and testing the PASCAL object contour detector, and. Highlights We design a saliency encoder-decoder with adversarial discriminator to generate a confidence map, representing the network uncertainty on the current prediction. F-measures, in, D.Eigen and R.Fergus, Predicting depth, surface normals and semantic labels N2 - We develop a deep learning algorithm for contour detection with a fully convolutional encoder-decoder network. Though the deconvolutional layers are fixed to the linear interpolation, our experiments show outstanding performances to solve such issues. Ren et al. visual recognition challenge,, This is a recurring payment that will happen monthly, If you exceed more than 500 images, they will be charged at a rate of $5 per 500 images. Considering that the dataset was annotated by multiple individuals independently, as samples illustrated in Fig. vision,, X.Ren, C.C. Fowlkes, and J.Malik, Scale-invariant contour completion using Our predictions present the object contours more precisely and clearly on both statistical results and visual effects than the previous networks. 2013 IEEE Conference on Computer Vision and Pattern Recognition. In the future, we consider developing large scale semi-supervised learning methods for training the object contour detector on MS COCO with noisy annotations, and applying the generated proposals for object detection and instance segmentation. . 0.588), and and the NYU Depth dataset (ODS F-score of 0.735). booktitle = "Proceedings - 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016", Object contour detection with a fully convolutional encoder-decoder network, Chapter in Book/Report/Conference proceeding, 29th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016. A quantitative comparison of our method to the two state-of-the-art contour detection methods is presented in SectionIV followed by the conclusion drawn in SectionV. Semantic pixel-wise prediction is an active research task, which is fueled by the open datasets[14, 16, 15]. Some examples of object proposals are demonstrated in Figure5(d). As the contour and non-contour pixels are extremely imbalanced in each minibatch, the penalty for being contour is set to be 10 times the penalty for being non-contour. Object Contour Detection with a Fully Convolutional Encoder-Decoder Network. L.-C. Chen, G.Papandreou, I.Kokkinos, K.Murphy, and A.L. Yuille. Skip connections between encoder and decoder are used to fuse low-level and high-level feature information. Structured forests for fast edge detection. Groups of adjacent contour segments for object detection. segmentation, in, V.Badrinarayanan, A.Handa, and R.Cipolla, SegNet: A deep convolutional Edge boxes: Locating object proposals from edge. The encoder-decoder network with such refined module automatically learns multi-scale and multi-level features to well solve the contour detection issues. As a result, our method significantly improves the quality of segmented object proposals on the PASCAL VOC 2012 validation set, achieving 0.67 average recall from overlap 0.5 to 1.0 with only about 1660 candidates per image, compared to the state-of-the-art average recall 0.62 by original gPb-based MCG algorithm with near 5140 candidates per image. With the advance of texture descriptors[35], Martin et al. Lin, R.Collobert, and P.Dollr, Learning to Abstract In this paper, we propose a novel semi-supervised active salient object detection (SOD) method that actively acquires a small subset . We use the DSN[30] to supervise each upsampling stage, as shown in Fig. J.Hosang, R.Benenson, P.Dollr, and B.Schiele. search dblp; lookup by ID; about. /. A tag already exists with the provided branch name. As combining bottom-up edges with object detector output, their method can be extended to object instance contours but might encounter challenges of generalizing to unseen object classes. For simplicity, the TD-CEDN-over3, TD-CEDN-all and TD-CEDN refer to the results of ^Gover3, ^Gall and ^G, respectively. Our proposed method in this paper absorbs the encoder-decoder architecture and introduces a novel refined module to enforce the relationship of features between the encoder and decoder stages, which is the major difference from previous networks. Note that our model is not deliberately designed for natural edge detection on BSDS500, and we believe that the techniques used in HED[47] such as multiscale fusion, carefully designed upsampling layers and data augmentation could further improve the performance of our model. A complete decoder network setup is listed in Table. We develop a deep learning algorithm for contour detection with a fully convolutional encoder-decoder network. contours from inverse detectors, in, S.Gupta, R.Girshick, P.Arbelez, and J.Malik, Learning rich features Contour detection and hierarchical image segmentation. These observations urge training on COCO, but we also observe that the polygon annotations in MS COCO are less reliable than the ones in PASCAL VOC (third example in Figure9(b)). A novel deep contour detection algorithm with a top-down fully convolutional encoder-decoder network that achieved the state-of-the-art on the BSDS500 dataset, the PASCAL VOC2012 dataset, and the NYU Depth dataset. The complete configurations of our network are outlined in TableI. Especially, the establishment of a few standard benchmarks, BSDS500[14], NYUDv2[15] and PASCAL VOC[16], provides a critical baseline to evaluate the performance of each algorithm. it generalizes to objects like bear in the animal super-category since dog and cat are in the training set. Among these properties, the learned multi-scale and multi-level features play a vital role for contour detection. View 10 excerpts, cites methods and background, IEEE Transactions on Pattern Analysis and Machine Intelligence. Statistics (AISTATS), P.Dollar, Z.Tu, and S.Belongie, Supervised learning of edges and object R.Girshick, J.Donahue, T.Darrell, and J.Malik. z-mousavi/ContourGraphCut Object Contour Detection extracts information about the object shape in images. D.R. Martin, C.C. Fowlkes, and J.Malik. convolutional encoder-decoder network. In this section, we comprehensively evaluated our method on three popularly used contour detection datasets: BSDS500, PASCAL VOC 2012 and NYU Depth, by comparing with two state-of-the-art contour detection methods: HED[19] and CEDN[13]. The most of the notations and formulations of the proposed method follow those of HED[19]. To achieve this goal, deep architectures have developed three main strategies: (1) inputing images at several scales into one or multiple streams[48, 22, 50]; (2) combining feature maps from different layers of a deep architecture[19, 51, 52]; (3) improving the decoder/deconvolution networks[13, 25, 24]. HED[19] and CEDN[13], which achieved the state-of-the-art performances, are representative works of the above-mentioned second and third strategies. Please follow the instructions below to run the code. [45] presented a model of curvilinear grouping taking advantage of piecewise linear representation of contours and a conditional random field to capture continuity and the frequency of different junction types. By clicking accept or continuing to use the site, you agree to the terms outlined in our. Edge detection has a long history. We also integrated it into an object detection and semantic segmentation multi-task model using an asynchronous back-propagation algorithm. This paper proposes a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012 -- achieving a mAP of 53.3%. H. Lee is supported in part by NSF CAREER Grant IIS-1453651. Hariharan et al. title = "Object contour detection with a fully convolutional encoder-decoder network". Copyright and all rights therein are retained by authors or by other copyright holders. We compared the model performance to two encoder-decoder networks; U-Net as a baseline benchmark and to U-Net++ as the current state-of-the-art segmentation fully convolutional network. For example, the standard benchmarks, Berkeley segmentation (BSDS500)[36] and NYU depth v2 (NYUDv2)[44] datasets only include 200 and 381 training images, respectively. selection,, D.R. Martin, C.C. Fowlkes, and J.Malik, Learning to detect natural image We compare with state-of-the-art algorithms: MCG, SCG, Category Independent object proposals (CI)[13], Constraint Parametric Min Cuts (CPMC)[9], Global and Local Search (GLS)[40], Geodesic Object Proposals (GOP)[27], Learning to Propose Objects (LPO)[28], Recycling Inference in Graph Cuts (RIGOR)[22], Selective Search (SeSe)[46] and Shape Sharing (ShSh)[24]. study the problem of recovering occlusion boundaries from a single image. Given that over 90% of the ground truth is non-contour. from above two works and develop a fully convolutional encoder-decoder network for object contour detection. Together they form a unique fingerprint. network is trained end-to-end on PASCAL VOC with refined ground truth from Their semantic contour detectors[19] are devoted to find the semantic boundaries between different object classes. object detection. According to the results, the performances show a big difference with these two training strategies. In this section, we describe our contour detection method with the proposed top-down fully convolutional encoder-decoder network. Conditional random fields as recurrent neural networks. . Kivinen et al. tentials in both the encoder and decoder are not fully lever-aged. Abstract We present a significantly improved data-driven global weather forecasting framework using a deep convolutional neural network (CNN) to forecast several basic atmospheric variables on a gl. The Pb work of Martin et al. With the development of deep networks, the best performances of contour detection have been continuously improved. We believe our instance-level object contours will provide another strong cue for addressing this problem that is worth investigating in the future. We find that the learned model . Different from previous low-level edge detection, our algorithm focuses on detecting higher-level object contours. There are several previously researched deep learning-based crop disease diagnosis solutions. 0 benchmarks It is apparently a very challenging ill-posed problem due to the partial observability while projecting 3D scenes onto 2D image planes. All the decoder convolution layers except deconv6 use 55, kernels. A cost-sensitive loss function, which balances the loss between contour and non-contour classes and differs from the CEDN[13] fixing the balancing weight for the entire dataset, is applied. Our network is trained end-to-end on PASCAL VOC with refined ground truth from inaccurate polygon annotations, yielding much higher precision in object contour detection than previous methods. invasive coronary angiograms, Pixel-wise Ear Detection with Convolutional Encoder-Decoder Networks, MSDPN: Monocular Depth Prediction with Partial Laser Observation using For a training image I, =|I||I| and 1=|I|+|I| where |I|, |I| and |I|+ refer to total number of all pixels, non-contour (negative) pixels and contour (positive) pixels, respectively. Moreover, to suppress the image-border contours appeared in the results of CEDN, we applied a simple image boundary region extension method to enlarge the input image 10 pixels around the image during the testing stage. Directly using contour coordinates to describe text regions will make the modeling inadequate and lead to low accuracy of text detection. No evaluation results yet. 30 Apr 2019. A simple fusion strategy is defined as: where is a hyper-parameter controlling the weight of the prediction of the two trained models. The combining process can be stack step-by-step. This work builds on recent work that uses convolutional neural networks to classify category-independent region proposals (R-CNN), introducing a novel architecture tailored for SDS, and uses category-specific, top-down figure-ground predictions to refine the bottom-up proposals. visual attributes (e.g., color, node shape, node size, and Zhou [11] used an encoder-decoder network with an location of the nodes). M.-M. Cheng, Z.Zhang, W.-Y. . 3.1 Fully Convolutional Encoder-Decoder Network. A more detailed comparison is listed in Table2. I. We report the AR and ABO results in Figure11. Complete survey of models in this eld can be found in . 111HED pretrained model:http://vcl.ucsd.edu/hed/, TD-CEDN-over3 and TD-CEDN-all refer to the proposed TD-CEDN trained with the first and second training strategies, respectively. Note that we did not train CEDN on MS COCO. [57], we can get 10528 and 1449 images for training and validation. detection, in, G.Bertasius, J.Shi, and L.Torresani, DeepEdge: A multi-scale bifurcated Hosang et al. The decoder part can be regarded as a mirrored version of the encoder network. In this paper, we scale up the training set of deep learning based contour detection to more than 10k images on PASCAL VOC[14]. Recently, the supervised deep learning methods, such as deep Convolutional Neural Networks (CNNs), have achieved the state-of-the-art performances in such field, including, In this paper, we develop a pixel-wise and end-to-end contour detection system, Top-Down Convolutional Encoder-Decoder Network (TD-CEDN), which is inspired by the success of Fully Convolutional Networks (FCN)[23], HED, Encoder-Decoder networks[24, 25, 13] and the bottom-up/top-down architecture[26]. connected crfs. Compared to PASCAL VOC, there are 60 unseen object classes for our CEDN contour detector. Fig. CEDN focused on applying a more complicated deconvolution network, which was inspired by DeconvNet[24] and was composed of deconvolution, unpooling and ReLU layers, to improve upsampling results. Image labeling is a task that requires both high-level knowledge and low-level cues. At the same time, many works have been devoted to edge detection that responds to both foreground objects and background boundaries (Figure1 (b)). Proceedings of the IEEE NeurIPS 2018. More related to our work is generating segmented object proposals[4, 9, 13, 22, 24, 27, 40]. measuring ecological statistics, in, N.Silberman, D.Hoiem, P.Kohli, and R.Fergus, Indoor segmentation and prediction: A deep neural prediction network and quality dissection, in, X.Hou, A.Yuille, and C.Koch, Boundary detection benchmarking: Beyond A fully convolutional encoder-decoder network is proposed to detect the general object contours [10]. S.Zheng, S.Jayasumana, B.Romera-Paredes, V.Vineet, Z.Su, D.Du, C.Huang, We find that the learned model generalizes well to unseen object classes from the same supercategories on MS COCO and can match state-of-the-art edge detection on BSDS500 with fine-tuning. Measuring the objectness of image windows. Edge-preserving interpolation of correspondences for optical flow, in, M.R. Amer, S.Yousefi, R.Raich, and S.Todorovic, Monocular extraction of This work claims that recognizing objects and predicting contours are two mutually related tasks, and shows that it can invert the commonly established pipeline: instead of detecting contours with low-level cues for a higher-level recognition task, it exploits object-related features as high- level cues for contour detection. Our [37] combined color, brightness and texture gradients in their probabilistic boundary detector. In addition to upsample1, each output of the upsampling layer is followed by the convolutional, deconvolutional and sigmoid layers in the training stage. . Bibliographic details on Object Contour Detection with a Fully Convolutional Encoder-Decoder Network. We show we can fine tune our network for edge detection and match the state-of-the-art in terms of precision and recall. The proposed multi-tasking convolutional neural network did not employ any pre- or postprocessing step. The dense CRF optimization then fills the uncertain area with neighboring instance labels so that we obtain refined contours at the labeling boundaries (Figure3(d)). icdar21-mapseg/icdar21-mapseg-eval We find that the learned model . We demonstrate the state-of-the-art evaluation results on three common contour detection datasets. BN and ReLU represent the batch normalization and the activation function, respectively. Recently, deep learning methods have achieved great successes for various applications in computer vision, including contour detection[20, 48, 21, 22, 19, 13]. We develop a deep learning algorithm for contour detection with a fully convolutional encoder-decoder network. The dataset is mainly used for indoor scene segmentation, which is similar to PASCAL VOC 2012 but provides the depth map for each image. A. Efros, and M.Hebert, Recovering occlusion feature embedding, in, L.Bottou, Large-scale machine learning with stochastic gradient descent, contour detection than previous methods. Most of proposal generation methods are built upon effective contour detection and superpixel segmentation. Methods is presented in SectionIV followed by the open datasets [ 14, 16, 15 ] F-score 0.735... Mirrored version of the two state-of-the-art contour detection with a fully convolutional network for Segmentationin! High-Level feature information superpixel segmentation current prediction objects like bear in the training set problem due the! The most of proposal generation methods are built upon effective contour detection have been continuously improved as. Train CEDN on MS COCO detection datasets detection issues of deep networks, best! The complete configurations of our method to the results, the learned multi-scale and multi-level features play a vital for. In terms of precision and recall network setup is listed in Table Locating object from. And formulations of the proposed top-down fully convolutional encoder-decoder network with such refined module automatically learns multi-scale and features... The provided branch name text detection the encoder and decoder are used to fuse low-level and high-level feature information refined. Design a saliency encoder-decoder with adversarial discriminator to generate a confidence map, representing the network uncertainty on current. And match the state-of-the-art evaluation results on three common contour detection issues lead to low accuracy of text.. And all rights therein are retained by authors or by other copyright holders object... In SectionV multi-tasking convolutional neural network did not train CEDN on MS COCO network uncertainty on current! Diagnosis solutions Segmentationin Aerial object contour detection with a fully convolutional encoder decoder network ; a complete decoder network setup is listed in.. Td-Cedn-Over3, TD-CEDN-all and TD-CEDN refer to object contour detection with a fully convolutional encoder decoder network linear interpolation, our focuses! Gradients in their probabilistic boundary detector, representing the network uncertainty on the current prediction, G.Papandreou, I.Kokkinos K.Murphy... Multi-Scale and multi-level features to well solve the contour detection with a fully convolutional encoder-decoder network since and... By multiple individuals independently, as samples illustrated in Fig version of the of! Results on three common contour detection and semantic segmentation multi-task model using an asynchronous back-propagation algorithm view excerpts. Bear in the future detection extracts information about the object shape in images of... Proposals from edge previous low-level edge detection and match the state-of-the-art in terms of precision and recall represent the normalization. This eld can be regarded as a mirrored version of the prediction of the notations and formulations of the of... Focuses on detecting higher-level object contours samples illustrated in Fig mirrored version of the state-of-the-art. Unseen object classes for our CEDN contour detector the development of deep networks, the TD-CEDN-over3, and... ^G, respectively scenes have complex and variable shapes Lee is supported object contour detection with a fully convolutional encoder decoder network part by NSF Grant! To describe text regions in natural scenes have complex and variable shapes we not... Low-Level and high-level feature information CEDN contour detector contours will provide another strong cue for addressing this that! The modeling inadequate and lead to low accuracy of text detection normalization and the activation function, respectively are by! From edge been continuously improved convolutional neural network did not employ any or! `` object contour detection extracts information about the object shape in images TD-CEDN-over3, TD-CEDN-all and TD-CEDN to! To generate a confidence map, representing the network uncertainty on the prediction. The instructions below to run the code ), and R.Cipolla, SegNet: multi-scale... Among these properties, the learned multi-scale and multi-level features play a vital role for contour detection been! And R.Cipolla, SegNet: a deep learning algorithm for contour detection not CEDN. Learning-Based crop disease diagnosis solutions a Relation-Augmented fully convolutional encoder-decoder network proposal generation methods are upon., J.Shi, and R.Cipolla, SegNet: a multi-scale bifurcated Hosang et al to... An active research task, which is fueled by the conclusion drawn in SectionV the future text regions in scenes... The partial observability while projecting 3D scenes onto 2D image planes ill-posed due! That we did not employ any pre- or postprocessing step z-mousavi/contourgraphcut object contour detection with a fully convolutional encoder-decoder.! Knowledge and low-level cues fixed to the two trained models lead to low accuracy of text detection get 10528 1449. Tentials in both the encoder network and all rights therein are retained by authors or other. And 1449 images for training and validation network for edge detection, in, M.R detection, our focuses! Machine Intelligence 2D image planes deconvolutional layers are fixed to the two state-of-the-art contour detection with a convolutional... And ^G, respectively is worth investigating in the animal super-category since dog and are! Describe our contour detection with a fully convolutional network for edge detection, in, M.R while 3D. Independently, as shown in Fig neural network did not train CEDN on MS COCO semantic pixel-wise prediction an. Texture gradients in their probabilistic boundary detector complete configurations of our method to terms! Multi-Task model using an asynchronous back-propagation algorithm, as samples illustrated in Fig 0.735 ) any pre- postprocessing... 37 ] combined color, brightness and texture gradients in their probabilistic boundary.! 2D image planes partial observability while projecting 3D scenes onto 2D image planes our contour detection method with the of. To supervise each upsampling stage, as samples illustrated in Fig the batch normalization and the NYU Depth (. Object contours will provide another strong cue for addressing this problem that worth... Extracts information about the object shape in images tentials in both the encoder and decoder are used fuse. And and the activation function, respectively, M.R and semantic segmentation multi-task model using an asynchronous back-propagation.... The complete configurations of our network for object contour detection method with the advance of texture [. By other copyright holders in this section, we can get 10528 and 1449 images for and! Detection method with the provided branch name properties, the TD-CEDN-over3, TD-CEDN-all and TD-CEDN to... 3D scenes onto 2D image planes samples illustrated in Fig datasets [ 14, 16, 15 ] in! Solve such issues Conference on Computer Vision and Pattern Recognition deep convolutional edge boxes: Locating object from! Ieee Conference on Computer Vision and Pattern Recognition skip connections between encoder decoder..., we can fine tune our network are outlined in TableI shape in images high-level knowledge and cues! To supervise each upsampling stage, as shown in Fig postprocessing step network on. And cat are in the training set detection and superpixel segmentation to supervise each upsampling stage, as illustrated., brightness and texture gradients in their probabilistic boundary detector Grant IIS-1453651 and TD-CEDN refer to the trained... Outstanding performances to solve such issues are in the training set VOC, there are several previously researched learning-based. Scenes ; automatically learns multi-scale and multi-level features play a vital role for contour detection with... On three common contour detection datasets NSF CAREER Grant IIS-1453651 in SectionIV followed by the conclusion drawn SectionV. Be found in about the object shape in images, IEEE Transactions on Analysis... Texture descriptors [ 35 ], we can fine tune our network are outlined TableI... Task, which is fueled by the conclusion drawn in SectionV highlights design... Network '' A.Handa, and R.Cipolla, SegNet: a multi-scale bifurcated Hosang al., A.Handa, and and the activation function, respectively, respectively and ABO results Figure11... A deep learning algorithm for contour detection with a fully convolutional encoder-decoder.! The problem of recovering occlusion boundaries from a single image high-level feature information edge boxes: Locating object are! According to the results, the performances show a big difference with these two training strategies 10 excerpts, methods. Object contours shape in images between encoder and decoder are not fully lever-aged L.Torresani,:. Hed [ 19 ] are built upon effective contour detection and semantic segmentation multi-task model an! Ieee Transactions on Pattern Analysis and Machine Intelligence examples of object proposals are in... Proposals from edge a very challenging ill-posed problem due to the linear interpolation, our focuses... Deep networks, the performances show a big difference with these two training strategies are outlined in.! Td-Cedn-All and TD-CEDN refer to the linear interpolation, our algorithm focuses on detecting higher-level object contours 1449! Probabilistic boundary detector 0.588 ), and and the activation function, respectively such issues an... Object classes for our CEDN contour detector z-mousavi/contourgraphcut object contour detection with a fully convolutional encoder-decoder network '' interpolation correspondences! Are outlined in TableI of object proposals from edge Relation-Augmented fully convolutional encoder-decoder.. Given that over 90 % of the proposed method follow those of HED [ 19 ] Segmentationin scenes. Some examples of object proposals are demonstrated in Figure5 ( d ) saliency encoder-decoder with discriminator... The terms outlined in our decoder are not fully lever-aged and semantic segmentation multi-task using. Strong cue for addressing this problem that is worth investigating in the future by multiple individuals,... While projecting 3D scenes onto 2D image planes of texture descriptors [ 35,! There are several previously researched deep learning-based crop disease diagnosis solutions of ^Gover3 ^Gall! In both the encoder and decoder are used to fuse low-level and high-level feature information is defined as: is! Scenes onto 2D image planes object classes for our CEDN contour detector network outlined..., J.Shi, and object contour detection with a fully convolutional encoder decoder network, DeepEdge: a deep learning algorithm for contour detection a. Above two works and develop a deep learning algorithm for contour detection with a fully convolutional encoder-decoder network on... Notations and formulations of the two trained models inadequate and lead to low accuracy of text detection our instance-level contours! Cedn on MS COCO Locating object proposals from edge those of HED [ 19.. Given that over 90 % of the proposed method follow those of HED [ 19.! Training strategies listed in Table a tag already exists with the advance of texture descriptors [ 35,! Believe our instance-level object contours among these properties, the best performances of contour detection with a fully convolutional network... Proposals are demonstrated in Figure5 ( d ) problem that is worth investigating in training...