Simultaneous Detection and Segmentation

dc.contributor.authorBharath Hariharan
dc.contributor.authorPablo Arbeláez
dc.contributor.authorRoss Girshick
dc.contributor.authorJitendra Malik
dc.coverage.spatialBolivia
dc.date.accessioned2026-03-22T20:42:50Z
dc.date.available2026-03-22T20:42:50Z
dc.date.issued2014
dc.descriptionCitaciones: 199
dc.description.abstractWe aim to detect all instances of a category in an image and, for each instance, mark the pixels that belong to it. We call this task Simultaneous Detection and Segmentation (SDS). Unlike classical bounding box detection, SDS requires a segmentation and not just a box. Unlike classical semantic segmentation, we require individual object instances. We build on recent work that uses convolutional neural networks to classify category-independent region proposals (R-CNN [16]), introducing a novel architecture tailored for SDS. We then use category-specific, top- down figure-ground predictions to refine our bottom-up proposals. We show a 7 point boost (16% relative) over our baselines on SDS, a 5 point boost (10% relative) over state-of-the-art on semantic segmentation, and state-of-the-art performance in object detection. Finally, we provide diagnostic tools that unpack performance and provide directions for future work.
dc.identifier.doi10.48550/arxiv.1407.1808
dc.identifier.urihttps://doi.org/10.48550/arxiv.1407.1808
dc.identifier.urihttps://andeanlibrary.org/handle/123456789/83635
dc.language.isoen
dc.publisherCornell University
dc.relation.ispartofarXiv (Cornell University)
dc.sourceUniversity of California System
dc.subjectSegmentation
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.subjectConvolutional neural network
dc.subjectObject detection
dc.subjectMinimum bounding box
dc.subjectBounding overwatch
dc.subjectObject (grammar)
dc.subjectPoint (geometry)
dc.subjectPixel
dc.titleSimultaneous Detection and Segmentation
dc.typepreprint

Files