📜

Stochastic Optimal Control - LQG and Kalman Filters

Jul 16, 2024

Stochastic Optimal Control - LQG and Kalman Filters

Key Concepts

LQG (Linear Quadratic Gaussian) Control

  • Extends Linear Quadratic Regulator (LQR) to cases with stochastic (noisy) dynamics and partial state observations.
  • Two primary types of noise: dynamics noise (process noise) and sensor noise.
  • Observations are noisy and typically do not provide full state information.
  • Key Results: Separation Principle and Certainty Equivalence.
    • Even though these do not hold in the general nonlinear case, they are fundamental in LQG and widely used.

Control and Reinforcement Learning

  • Discussed reinforcement learning methods that do not rely on the separation principle, using a history of observations instead of a single state vector.
  • Recent work by Ben Recht (Berkeley) and Sarah Dean (Cornell) showed model-based RL is more data-efficient than model-free RL in the LQG setting.

Kalman Filter

  • Optimal Estimation: Using the best estimate of the current state for control using methods like LQR.
  • Filter Structure: Recursive, linear, MMSE (Minimum Mean Squared Error) estimator.
  • Estimation Steps: Prediction and Measurement Update.
    • Prediction: Uses the system's dynamics to predict the next state and state covariance.
    • Measurement Update: Correct the prediction using new measurements by calculating the Kalman gain and updating the state and state covariance.
  • Kalman Gain: Crucial for adjusting the estimate based on the measurement prediction error.
  • Innovation & Innovation Covariance: Determine the 'surprise' of measurements and their variability.
  • Joseph Form: Numerically stable way to update the state covariance to ensure it remains positive semi-definite.

Duality between Estimation and Control

  • Nonlinear MMSE estimation can be posed as an optimal control problem using a cost function that includes measurement errors and process noise.
  • Insights: Similarities with LQR in terms of solving the control problem to achieve optimal estimation.
  • Allows generalization to more complex, nonlinear systems using approaches like the Extended Kalman Filter (EKF), Unscented Kalman Filter (UKF), and particle filters.

Practical Considerations

  • Numerical Stability: Use Joseph Form and consider implementing square-root filters for better numerical properties.
  • Outlier Rejection: Use innovation and its covariance to discard unlikely measurements.
  • Nonlinear Extensions: Example approaches include EKF, UKF, iterated EKF, and particle filters for more complex, nonlinear systems.

Applications and Examples

  • Practical demonstrations using a simple double integrator system and LQR control.
  • Highlighted the importance of tuning and how Kalman filters can remain consistent despite noisy measurements.

Suggested Reading and Resources

  • The original Kalman paper (1960s) and works by Rudy Kalman and Dirk Luenberger.
  • Books: Linear Estimation by Kailath, Sayed, and Hassibi.
  • Blogs: UncommonLab.com by Tucker McClure for practical tips on implementing Kalman filters.