Long Short Term Memory (LSTM) Overview

Jul 10, 2024

Long Short Term Memory (LSTM) Overview

Concept Introduction

  • Murder Mystery Analogy: Initially described to explain the necessity of remembering details selectively — predicting outcomes better when remembering key information and forgetting irrelevant details.
  • Two Memory Problems:
    • Short-term memory issue: Difficulty in making sense of the story with limited memory.
    • Over-extended memory issue: Overwhelmed by too much information.

What is LSTM?

  • Long Short Term Memory (LSTM): A type of neural network that retains important information for long periods and forgets irrelevant details to maintain context.
  • Comparison with Standard Memory: LSTM combines remembering relevant details and forgetting the unnecessary ones, addressing both short-term and long-term memory issues.

How LSTM Works

  • Sequence Prediction Example: Predicting the next letter in a string requires context from previous letters (e.g.,