
Figure 2: Simultaneous LiDAR object detection and segmentation network with polar pillars. We adopt the same backbone as in PointPillars and add a semantic segmentation head in parallel with the detection heads. The input wedge-shaped pillars are unfolded into a rectangular feature map for convolution. The object (green box) is distorted because the end near the sensor looks bigger and the end far from the sensor looks smaller. Feature Undistortion is applied to the classification head to mimic bilinear sampling and interpolate cartesian pillar features from polar pillar features. Range Stratified Convolution & Normalization is applied to the center offset regression head.
only one pillar along the height dimension. Following MVF [37], we adopt dynamic voxelization to sample all points within each pillar, instead of randomly sampling a fixed number of points per pillar.
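The following is a minimal sketch of dynamic voxelization over polar pillars, assuming a fixed polar grid resolution and mean pooling inside each pillar; the function name, grid size, and pooling choice are our own illustration, not the released implementation.

```python
import torch

def dynamic_polar_voxelize(points, r_max=50.0, num_r=512, num_theta=512):
    """points: (N, C) tensor whose first two channels are x, y in the sensor frame."""
    x, y = points[:, 0], points[:, 1]
    r = torch.sqrt(x ** 2 + y ** 2)                       # range of each point
    theta = torch.atan2(y, x)                             # azimuth in (-pi, pi]
    r_idx = torch.clamp((r / r_max * num_r).long(), max=num_r - 1)
    t_idx = torch.clamp(((theta + torch.pi) / (2 * torch.pi) * num_theta).long(),
                        max=num_theta - 1)
    pillar_idx = r_idx * num_theta + t_idx                # flat pillar id for every point

    # Scatter all point features into their pillars; each pillar keeps every point
    # it contains (dynamic), rather than a fixed random sample per pillar.
    num_pillars = num_r * num_theta
    feat_sum = torch.zeros(num_pillars, points.shape[1]).index_add_(0, pillar_idx, points)
    counts = torch.zeros(num_pillars).index_add_(0, pillar_idx, torch.ones_like(r)).clamp(min=1)
    return (feat_sum / counts.unsqueeze(1)).view(num_r, num_theta, -1), pillar_idx
```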
3.3 Simultaneous Detection and Segmentation
We design PolarStream, a simultaneous object detection and segmentation network, by extending PointPillars [15], one of the most widely used 3D object detectors balancing accuracy and speed. As shown in Fig. 2, PolarStream consists of a Pillar Feature Encoder, followed by a 2D CNN backbone and a U-Net [24]-like structure. On top are the detection and segmentation heads.
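A minimal structural sketch of this composition is shown below; the module names and interfaces are our own assumptions for illustration, not the released code.

```python
import torch.nn as nn

class PolarStreamSketch(nn.Module):
    """Pillar feature encoder -> 2D backbone with U-Net-like decoder -> parallel heads."""
    def __init__(self, encoder, backbone, det_heads, seg_head):
        super().__init__()
        self.encoder = encoder          # polar pillar feature encoder
        self.backbone = backbone        # 2D CNN + U-Net-like upsampling
        self.det_heads = det_heads      # CenterPoint-style detection heads
        self.seg_head = seg_head        # semantic segmentation head

    def forward(self, points):
        pillar_feat = self.encoder(points)        # (B, C, R, Theta) pillar feature map
        bev_feat = self.backbone(pillar_feat)     # multi-scale BEV features
        return self.det_heads(bev_feat), self.seg_head(pillar_feat, bev_feat)
```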
Detection Heads
We adopt CenterPoint [34] heads with modifications to make them compatible with polar pillars. To assign targets to the 10-class heatmap indicating the objects, the Gaussian radius of the object center is computed using the span of range and azimuth of the object bounding box, instead of the length and width of the box. Following CenterPoint, we also regress the center offset as $d_x, d_y$, the bounding box size $l, w, h$ as $\log l, \log w, \log h$, and predict the bounding box height $z$. We regress the relative bounding box orientation $\phi$ as $\cos\phi, \sin\phi$ and the relative velocity as $v_x, v_y$, similar to [22]. Unlike most methods, which use multi-group detection heads that partition object classes into several groups according to their size, we use single-group detection heads to balance accuracy and speed. A comparison against multi-group detection heads is shown in the Supplementary. For streaming data with $n > 1$, we apply the stateful-NMS proposed in Han et al. [13].
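A minimal sketch of this target encoding follows; the array layouts, names, and the pillar-center argument are our own assumptions for illustration. The heatmap radius would be derived from the range/azimuth span computed here rather than from the box length and width.

```python
import numpy as np

def polar_span(corners_xy):
    """corners_xy: (4, 2) BEV corners of a box; returns its span in range and azimuth.
    (Handling of the azimuth wrap-around at +/- pi is omitted for brevity.)"""
    r = np.linalg.norm(corners_xy, axis=1)
    theta = np.arctan2(corners_xy[:, 1], corners_xy[:, 0])
    return r.max() - r.min(), theta.max() - theta.min()

def regression_targets(box, pillar_center):
    """box: dict with cx, cy, z, l, w, h, yaw, vx, vy; pillar_center: (2,) array."""
    dx = box["cx"] - pillar_center[0]                       # center offset d_x
    dy = box["cy"] - pillar_center[1]                       # center offset d_y
    log_size = np.log([box["l"], box["w"], box["h"]])       # log l, log w, log h
    orient = [np.cos(box["yaw"]), np.sin(box["yaw"])]       # cos(phi), sin(phi)
    return np.concatenate([[dx, dy, box["z"]], log_size, orient,
                           [box["vx"], box["vy"]]])         # offsets, z, size, yaw, velocity
```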
Segmentation Head
To extend PointPillars for segmentation, we add a semantic segmentation head in parallel with the detection heads. The segmentation head consists of a single 1x1 convolution layer. Its input is the concatenation of the output of the pillar feature encoder and bilinearly upsampled features from the 2D backbone.
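A minimal PyTorch sketch of this head is given below, with channel counts and the number of classes as assumptions rather than the paper's configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SegHead(nn.Module):
    def __init__(self, enc_channels=64, backbone_channels=384, num_classes=16):
        super().__init__()
        # Single 1x1 convolution over the concatenated feature map.
        self.cls = nn.Conv2d(enc_channels + backbone_channels, num_classes, kernel_size=1)

    def forward(self, enc_feat, backbone_feat):
        # Bilinearly upsample backbone features to the full pillar-grid resolution.
        up = F.interpolate(backbone_feat, size=enc_feat.shape[-2:],
                           mode="bilinear", align_corners=False)
        return self.cls(torch.cat([enc_feat, up], dim=1))   # per-pillar class logits
```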
Panoptic Fusion
Similar to Panoptic-PolarNet [39], for each point belonging to things, we predict the instance id as the id of the box whose category is the same and whose center is the nearest. For streaming data with $n > 1$, the panoptic segmentation task is not well defined. For example, the points in the $i$-th sector may belong to a box in the $(i+1)$-th sector if the majority of that box lies in the $(i+1)$-th sector. However, when we perform panoptic fusion for the $i$-th sector, we do not yet have information from the $(i+1)$-th sector. Therefore we choose global panoptic fusion for streaming point clouds, i.e., we assign instance ids according to the boxes from all sectors of the same sweep.
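A minimal sketch of this fusion rule follows; the unassigned-id convention and array layouts are our own assumptions for illustration.

```python
import numpy as np

def panoptic_fusion(point_xy, point_cls, boxes_xy, boxes_cls, boxes_id):
    """point_xy: (2,) point location; boxes_xy: (M, 2) box centers from the whole sweep;
    boxes_cls, boxes_id: (M,) predicted class and instance id per box."""
    same_cls = np.where(boxes_cls == point_cls)[0]          # candidate boxes of the same class
    if same_cls.size == 0:
        return 0                                            # no matching box: leave unassigned
    dists = np.linalg.norm(boxes_xy[same_cls] - point_xy, axis=1)
    return boxes_id[same_cls[dists.argmin()]]               # id of the nearest same-class box
```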
Multi-Task Learning
We adopt Focal Loss [16] for classification and the L1 loss for bounding box regression, orientation, and velocity estimation. For segmentation, we use the weighted cross-entropy loss and the Lovász-softmax loss [2]. The total loss is the weighted sum of the losses for each component.
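As a schematic illustration only (the weight symbols $\lambda$ are our own notation, not values from the paper), the objective has the form
$$\mathcal{L} = \lambda_{\mathrm{cls}}\,\mathcal{L}_{\mathrm{focal}} + \lambda_{\mathrm{reg}}\,\mathcal{L}_{\mathrm{L1}} + \lambda_{\mathrm{seg}}\,\mathcal{L}_{\mathrm{wce}} + \lambda_{\mathrm{ls}}\,\mathcal{L}_{\mathrm{ls}},$$
where $\mathcal{L}_{\mathrm{L1}}$ collects the box, orientation, and velocity regression terms.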
Feature Undistortion
As mentioned in Sec. 1, objects have distorted appearances with polar pillars, so we propose Feature Undistortion to undistort the features. As shown on the top right of Fig. 2, the idea of undistortion is to interpolate features at cartesian pillar locations from the original polar pillar locations so that the translation-invariant property of convolution applies. We find the connection