tf.raw_ops.GenerateBoundingBoxProposals
Stay organized with collections
Save and categorize content based on your preferences.
This op produces Region of Interests from given bounding boxes(bbox_deltas) encoded wrt anchors according to eq.2 in arXiv:1506.01497
tf.raw_ops.GenerateBoundingBoxProposals(
scores,
bbox_deltas,
image_info,
anchors,
nms_threshold,
pre_nms_topn,
min_size,
post_nms_topn=300,
name=None
)
The op selects top `pre_nms_topn` scoring boxes, decodes them with respect to anchors,
applies non-maximal suppression on overlapping boxes with higher than
`nms_threshold` intersection-over-union (iou) value, discarding boxes where shorter
side is less than `min_size`.
Inputs:
`scores`: A 4D tensor of shape [Batch, Height, Width, Num Anchors] containing the scores per anchor at given position
`bbox_deltas`: is a tensor of shape [Batch, Height, Width, 4 x Num Anchors] boxes encoded to each anchor
`anchors`: A 1D tensor of shape [4 x Num Anchors], representing the anchors.
Outputs:
`rois`: output RoIs, a 3D tensor of shape [Batch, post_nms_topn, 4], padded by 0 if less than post_nms_topn candidates found.
`roi_probabilities`: probability scores of each roi in 'rois', a 2D tensor of shape [Batch,post_nms_topn], padded with 0 if needed, sorted by scores.
Args |
scores
|
A Tensor of type float32 .
A 4-D float tensor of shape [num_images, height, width, num_achors] containing scores of the boxes for given anchors, can be unsorted.
|
bbox_deltas
|
A Tensor of type float32 .
A 4-D float tensor of shape [num_images, height, width, 4 x num_anchors] . encoding boxes with respec to each anchor.
Coordinates are given in the form [dy, dx, dh, dw].
|
image_info
|
A Tensor of type float32 .
A 2-D float tensor of shape [num_images, 5] containing image information Height, Width, Scale.
|
anchors
|
A Tensor of type float32 .
A 2-D float tensor of shape [num_anchors, 4] describing the anchor boxes. Boxes are formatted in the form [y1, x1, y2, x2].
|
nms_threshold
|
A Tensor of type float32 .
A scalar float tensor for non-maximal-suppression threshold.
|
pre_nms_topn
|
A Tensor of type int32 .
A scalar int tensor for the number of top scoring boxes to be used as input.
|
min_size
|
A Tensor of type float32 .
A scalar float tensor. Any box that has a smaller size than min_size will be discarded.
|
post_nms_topn
|
An optional int . Defaults to 300 .
An integer. Maximum number of rois in the output.
|
name
|
A name for the operation (optional).
|
Returns |
A tuple of Tensor objects (rois, roi_probabilities).
|
rois
|
A Tensor of type float32 .
|
roi_probabilities
|
A Tensor of type float32 .
|
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates. Some content is licensed under the numpy license.
Last updated 2022-10-27 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2022-10-27 UTC."],[],[]]