Neuro Radiance Fields (NeRFs) - Lecture Notes

Jul 3, 2024

Neuro Radiance Fields (NeRFs) - Lecture Notes

Introduction

Speaker: Torsten
Topic: Introduction to Neuro Radiance Fields (NeRFs)
Background: Computer Vision, 3D reconstructions from images

Key Concepts

Explicit Scene Representations

Point Clouds & 3D Meshes: Traditionally used in multi-view stereo methods
Issues: Memory intensive, assume Lambertian surfaces (color consistency from various viewpoints)

Implicit Scene Representations

Signed Distance Functions (SDF): Used for 3D reconstructions, represent surfaces with zero-crossing isocontours
Voxel Grids: Use SDF stored in a voxelized manner, memory intensive

NeRFs Introduction

Input: Set of images with known camera calibrations and poses
Output: Faithful 3D reconstruction with implicit representation
Advantages: Models complex lighting effects, compact representation (~1MB for a room)

Volume Rendering

Concept: Compute color for each pixel by shooting a ray into the scene and sampling along it
Steps:
1. Evaluate volume at sampled points
2. Accumulate colors and volume densities
Result: Linear combination of colors based on volume density and visibility

NeRFs Detailed

Neural Network Input: Five parameters (3D position, 2 viewing angles)
Output: Color (RGB) and volume density
Training:
- Images and known poses
- Volume rendering to match predicted color to actual image color
Sample Generation: Randomized sampling within intervals to approximate surfaces
Position Encoding: Higher dimensional representation of coordinates improves fine detail recovery

Results

Synthetic Data: High detail and view-dependent effects
Real Scenes: Effective in handling complex real-world scenes

Issues and Solutions

Speed: Original NeRFs are slow (days for training, minutes for rendering)
PlenOctrees: Spherical harmonics for fast training and rendering
Multi-resolution Hash Encoding: Efficient training with hash tables for feature storage

Geometric Extraction from NeRF

Direct Extraction Issues: Noise in reconstructed geometry
Signed Distance Functions: Better definition of surface geometry
WSDF: Volume rendering using signed distance function for improved geometry

Challenges and Alternatives

Sparse Data Training: Difficult with limited viewpoint coverage
Monocular Cues: Using depth and normal predictions to aid training
Transients and Appearance Vectors: Training NeRF with varying illumination and transient objects

Scalability

Large Scenes: Training multiple NeRFs for large scenes (e.g., downtown San Francisco)
Run-Time and Memory: Trade-offs in run-time speed and memory usage

Tools and Software

Nvidia's Instant NGP: Efficient implementation of NeRFs
Nerf Studio: Tool for training and visualizing NeRFs

Conclusion

Current State: NeRFs are promising but still in infancy
Future Directions: Reduce training times, improve scalability, potential for real-time mobile applications
Resources: Blog posts by Frank Dard, survey papers

Full transcript