How 3D Object Detection Helps AI Avoid Obstacles | HackerNoon

News Room
Last updated: 2025/03/03 at 12:50 AM
News Room Published 3 March 2025

Table of links

ABSTRACT

1 INTRODUCTION

2 BACKGROUND: OMNIDIRECTIONAL 3D OBJECT DETECTION

3 PRELIMINARY EXPERIMENT

3.1 Experiment Setup

3.2 Observations

3.3 Summary and Challenges

4 OVERVIEW OF PANOPTICUS

5 MULTI-BRANCH OMNIDIRECTIONAL 3D OBJECT DETECTION

5.1 Model Design

5.2 Model Adaptation

6 SPATIAL-ADAPTIVE EXECUTION

6.1 Performance Prediction

6.2 Execution Scheduling

7 IMPLEMENTATION

8 EVALUATION

8.1 Testbed and Dataset

8.2 Experiment Setup

8.3 Performance

8.4 Robustness

8.5 Component Analysis

8.6 Overhead

9 RELATED WORK

10 DISCUSSION AND FUTURE WORK

11 CONCLUSION AND REFERENCES

2 BACKGROUND: OMNIDIRECTIONAL 3D OBJECT DETECTION

3D object detection aims to identify objects in space and predict their properties, such as 3D location, size, and velocity. Applications use the predicted object information for functions such as obstacle avoidance in robot navigation. Simultaneous Localization and Mapping (SLAM) alone cannot ensure safe navigation, because it lacks the ability to model object sizes or movements in real time. To prevent collisions in advance, a robot must plan its navigation path based on obstacles’ location and size, or even their predicted trajectories. Moreover, in complex outdoor environments where objects can approach from multiple directions, the ability to detect surrounding objects becomes essential.
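To make the role of predicted location, size, and velocity concrete, the following is a minimal sketch of a collision check that a planner might run on detector output. It is not taken from the paper: the `DetectedObject` container, the `time_to_collision` helper, the circular footprints, and the constant-velocity assumption are all illustrative simplifications.

```python
import math
from dataclasses import dataclass


@dataclass
class DetectedObject:
    """Hypothetical detector output, expressed in a robot-centred frame."""
    x: float       # position in metres
    y: float
    radius: float  # coarse circular footprint derived from the predicted size
    vx: float      # predicted velocity in m/s
    vy: float


def time_to_collision(obj, robot_radius, robot_vx=0.0, robot_vy=0.0,
                      horizon_s=3.0, dt=0.1):
    """Return the first sampled time (s) at which footprints overlap, else None.

    Assumes both the robot and the object keep a constant velocity
    over the planning horizon.
    """
    steps = int(round(horizon_s / dt))
    for k in range(steps + 1):
        t = k * dt
        # Object position relative to the robot after t seconds.
        rel_x = obj.x + (obj.vx - robot_vx) * t
        rel_y = obj.y + (obj.vy - robot_vy) * t
        if math.hypot(rel_x, rel_y) <= obj.radius + robot_radius:
            return t
    return None
```

A planner could call `time_to_collision` for every detected object and re-plan whenever any result falls below a safety margin. Without the size and velocity estimates, neither the footprint test nor the extrapolation would be possible, which is the gap the paragraph above attributes to SLAM alone.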

Existing methods for omnidirectional 3D object detection utilize LiDAR sensors or multiple cameras providing a 360° perception range. While LiDAR sensors offer accurate object localization based on depth measurements, camera-based solutions have recently drawn attention due to their cost-effectiveness. Recent camera-based detectors aggregate information from multiple camera images into a bird’s-eye-view (BEV) space, providing a top-down representation of the surrounding 3D space. Early work [38] proposed an end-to-end trainable method to extract BEV features directly from multi-view images. Building upon [38], BEVDet [24] enables the detection of surrounding 3D objects using the extracted BEV features. Due to its simplified and scalable architecture, many of the latest BEV-based 3D detectors [23, 29, 30, 52] follow BEVDet’s inference pipeline, as shown in Figure 2. Such detectors have overcome the monocular ambiguity of camera-based approaches by introducing enhanced methods for each stage of the baseline BEVDet pipeline, achieving accuracy comparable to that of LiDAR-based counterparts [54]. Table 1 lists these methods, which are described below.

The first stage extracts 2D feature maps from multi-view images using backbone neural networks, such as ResNet [21], that are widely used in vision tasks. It is well known that increasing the backbone capacity (e.g., the number of layers) or the input image resolution improves accuracy. For example, combining a 152-layer ResNet with a 720×1,280 (height×width) resolution yields a more detailed feature map than a 34-layer ResNet with a 256×448 resolution, thereby enhancing the detection of small and distant objects.

The second stage transforms the extracted image features into 3D space using predicted depth. For this stage, prior work [30] employed a depth estimation neural network, i.e., DepthNet, supervised by dense depth data generated from 3D point clouds. Accurate metric depth predictions (in meters) from images allow the detector to better distinguish objects from the background.

The third stage projects the features scattered in each camera’s 3D coordinate system into a unified BEV grid using camera parameters, generating a BEV feature map. Recent works [23, 31] have proposed techniques that fuse the BEV feature map from a previous frame with the feature map of the current frame. By exploiting temporal cues, this fusion improves perception robustness, enabling the detection of temporarily occluded objects and the accurate prediction of object velocities.

Lastly, the neural networks in the BEV head generate 3D bounding boxes and their properties, e.g., location and velocity, using the BEV features.
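The second and third stages can be illustrated with a small, dependency-free sketch. It is a toy approximation rather than the implementation used by BEVDet or the paper: the function name `lift_and_splat`, the dictionary-based data layout, the pinhole camera model, and the sum pooling per BEV cell are assumptions chosen for readability. Each pixel feature is weighted by a predicted probability for every depth bin, unprojected into the camera frame, moved into a shared ego frame with the camera extrinsics, and accumulated into the BEV cell it lands in.

```python
import math


def lift_and_splat(features, depth_probs, depth_bins, intrinsics, cam_to_ego,
                   cell_size=1.0, bev=None):
    """Accumulate one camera's image features into a sparse BEV grid.

    features:    {(u, v): [f0, f1, ...]} feature vector per feature-map pixel
    depth_probs: {(u, v): [p0, p1, ...]} predicted probability per depth bin
    depth_bins:  [d0, d1, ...] metric depth (metres) of each bin
    intrinsics:  (fx, fy, cx, cy) pinhole parameters (illustrative)
    cam_to_ego:  (R, t), a 3x3 rotation as nested lists and a 3-vector
    bev:         optional grid from other cameras to accumulate into
    """
    fx, fy, cx, cy = intrinsics
    rotation, translation = cam_to_ego
    if bev is None:
        bev = {}
    for (u, v), feat in features.items():
        for depth, prob in zip(depth_bins, depth_probs[(u, v)]):
            # Stage 2: lift the pixel to a 3D point in the camera frame
            # (x right, y down, z forward).
            cam_pt = ((u - cx) * depth / fx, (v - cy) * depth / fy, depth)
            # Stage 3: transform into the shared ego frame.
            ego_pt = [
                sum(rotation[i][j] * cam_pt[j] for j in range(3)) + translation[i]
                for i in range(3)
            ]
            # Collapse height and pick the BEV cell from ego x and y.
            cell = (math.floor(ego_pt[0] / cell_size),
                    math.floor(ego_pt[1] / cell_size))
            acc = bev.setdefault(cell, [0.0] * len(feat))
            for c, value in enumerate(feat):
                acc[c] += prob * value  # depth-weighted sum pooling
    return bev
```

Calling the function once per camera while passing the same `bev` dictionary merges all views into one top-down grid, which mirrors the idea of a unified BEV feature map. A sharper depth distribution concentrates a feature in fewer cells, which is one way to see why accurate metric depth helps separate objects from the background.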

This paper is available on arXiv under the CC BY 4.0 Deed (Attribution 4.0 International) license.
