N8N Workflows: Context-Based Chunking in RAG Geek

Introduction

Topic: Enhancing retrieval accuracy in RAG systems using context-based chunking.
Objective: Explore how context-based chunking improves retrieval accuracy.

Definition: RAG is a technique that enhances the accuracy of generative AI by using relevant data sources.
Process:
- Provide an LLM with a specific data source.
- Retrieve and use relevant information before generating a response.
- Outcome: More precise and contextually accurate output.

Described by: Anthropic Contextual Retrieval.
Method: Attach each chunk to a broader context from the entire document.
Benefits:
- Maintain coherence.
- Improve retrieval accuracy.
- Help the model understand the bigger picture.

Retrieving the Source Document: From Google Drive.
Extracting Text Data: From the document.
Adding Boundary Lines:
- Mark different sections in the document.
- Split text into meaningful chunks.
Using Code Node:
- Write script to divide document into structured chunks.
Loop Through Chunks: Using loop node in N8n.
Agent Node:
- Generate contextual info for each chunk by referencing the entire document.
- Use OpenAI's GPT 4.0 Mini from OpenRouter as LLM.
Prepending Context:
- Enriched chunk sent to next node.
Creating Embeddings:
- Use Google's Gemini Text-Embedding Oak 04 model.
- Store in Pinecone vector database.
Recursive Text Splitter:
- Effect minimal due to large chunk size.

Outcome: Enhanced retrieval accuracy and efficiency.
Setup Components:
- Document retrieval and extraction.
- Context generation and embedding creation.
- Storage in vector database.
Result: More accurate and context-aware RAG system.