draw_bounding_boxes¶

torchvision.utils.draw_bounding_boxes(image: Tensor, boxes: Tensor, labels: Optional[list[str]] = None, colors: Optional[Union[list[Union[str, tuple[int, int, int]]], str, tuple[int, int, int]]] = None, fill: Optional[bool] = False, width: int = 1, font: Optional[str] = None, font_size: Optional[int] = None, label_colors: Optional[Union[list[Union[str, tuple[int, int, int]]], str, tuple[int, int, int]]] = None, label_background_colors: Optional[Union[list[Union[str, tuple[int, int, int]]], str, tuple[int, int, int]]] = None, fill_labels: bool = False) → Tensor[源代码]¶

在给定的 RGB 图像上绘制边界框。图像值应为 uint8（范围 [0, 255]）或 float（范围 [0, 1]）。如果 fill 为 True，则最终张量应保存为 PNG 图像。

参数:

image (Tensor) – 形状为 (C, H, W) 且 dtype 为 uint8 或 float 的张量。
boxes (Tensor) – 形状为 (N, 4) 或 (N, 8) 的张量，包含边界框。对于 (N, 4)，格式为 (xmin, ymin, xmax, ymax)，边界框是相对于图像的绝对坐标。换句话说：0 <= xmin < xmax < W 且 0 <= ymin < ymax < H。对于 (N, 8)，格式为 (x1, y1, x2, y2, x3, y3, x4, y4)，边界框是相对于底层对象的绝对坐标，因此无需验证后两个不等式。
labels (List[str]) – 包含边界框标签的列表。
colors (颜色或颜色列表, 可选) – 包含边界框颜色的列表，或所有边界框的单一颜色。颜色可以表示为 PIL 字符串，例如“red”或“#FF00FF”，或表示为 RGB 元组，例如 (240, 10, 157)。默认情况下，会为边界框生成随机颜色。
fill (bool) – 如果为 True，则用指定的颜色填充边界框。
width (int) – 边界框的宽度。
font (str) – 包含 TrueType 字体的文件名。如果在此文件名中找不到文件，加载器也可能在其他目录中搜索，例如 Windows 上的 fonts/ 目录，或 macOS 上的 /Library/Fonts/、/System/Library/Fonts/ 和 ~/Library/Fonts/。
font_size (int) – 所需的字体大小（以磅为单位）。
label_colors (颜色或颜色列表, 可选) – 标签文本的颜色。有关详细信息，请参阅 colors 参数的说明。默认情况下，使用与边界框相同的颜色，如果 fill_labels 为 True，则为黑色。
label_background_colors (颜色或颜色列表, 可选) – 标签文本框填充的颜色。默认情况下，使用与边界框相同的颜色。当 fill_labels 为 False 时忽略。
fill_labels (bool) – 如果为 True，则用指定的颜色（来自 label_background_colors 参数，或来自 colors 参数，如果未指定）填充标签背景。默认值：False。

返回:

绘制了边界框的图像张量，dtype 为 uint8。

返回类型:

img (Tensor[C, H, W])

使用 draw_bounding_boxes 的示例

将掩码重新用作边界框

可视化工具

draw_bounding_boxes¶

文档

教程

资源