IICS Administration and Metadata Management

Jul 30, 2024

Lecture Notes on IICS Administration and Metadata Management

Overview

  • Focus: infrastructure administration and metadata management in IICS (Informatica Intelligent Cloud Services).

Hierarchical Structure of IICS

  1. Infrastructure Layer

    • Managed by Informatica Administrator
    • Equivalent to the on-premises EDC/IDQ administrator console.
    • Responsible for:
      • Infrastructure management
      • User management
      • Connection management
  2. Metadata Command Center

    • Manages metadata harvesting for the enterprise.
    • Supports:
      • Governance assets
      • Custom fields (similar to Axon)
      • Configuration of workflows
    • Typically used by a small group of privileged users.
  3. Data Governance and Catalog Application

    • Consumption layer, similar to the EDC application.
    • Utilizes the capabilities of the Metadata Command Center.

Informatica Administrator

  • SAML configuration for authentication.
  • User management:
    • Create/synchronize user objects.
    • Assign roles and permissions.
  • Supports native user roles and group management.

Runtime Environment

  • Critical for:
    • Metadata extraction
    • Data profiling
  • Can be used for multiple applications such as:
    • Cloud Data Integration (CDI)
    • Cloud Data Quality
    • Cloud MDM (Master Data Management)

Connection Management

  • Common connections can be defined and reused across applications (CDI, CM, CDGC).
  • Important for infrastructure administration.

Metadata Command Center Features

Catalog Sources

  • Custom catalog sources can be created to extend the existing metadata models.
  • JDBC is supported for metadata extraction.

Data Classification

  • Lookup tables help classify data (e.g., product codes).
  • Classifying data involves defining expressions and rules.
    • Example: Classifying a column as a US state.
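
The idea of combining a lookup table with a rule can be sketched in plain Python. This is only an illustration of the concept, not how Metadata Command Center implements classification: the function name `classify_us_state`, the abbreviated lookup table, the column-name pattern, and both thresholds are assumptions made for this example.

```python
import re

# Hypothetical lookup table: a small subset of US states, kept short for brevity.
US_STATES = {
    "CA": "California",
    "NY": "New York",
    "TX": "Texas",
    "WA": "Washington",
    "FL": "Florida",
}
LOOKUP = {code.lower() for code in US_STATES} | {
    name.lower() for name in US_STATES.values()
}

# Hypothetical metadata rule: the column name contains "state" as a whole token.
NAME_RULE = re.compile(r"(^|_)state($|_)", re.IGNORECASE)


def classify_us_state(column_name, values, threshold=0.8):
    """Return True if the column looks like a US state column.

    Assumed rule for this sketch: either at least `threshold` of the
    non-empty values are found in the lookup table, or the column name
    matches NAME_RULE and at least half of the values are found.
    """
    cleaned = [v.strip().lower() for v in values if v and v.strip()]
    if not cleaned:
        return False
    match_ratio = sum(v in LOOKUP for v in cleaned) / len(cleaned)
    name_matches = bool(NAME_RULE.search(column_name))
    return match_ratio >= threshold or (name_matches and match_ratio >= 0.5)


print(classify_us_state("ship_state", ["CA", "NY", "tx", ""]))
print(classify_us_state("product_code", ["A12", "B7", "CA"]))
```

Combining a data rule (values found in the lookup table) with a metadata rule (column name) reduces false positives on columns that only happen to contain a few state-like codes.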

Advanced Features

  • Automatic Association:
    • Intelligent glossary association using AI/ML.
  • Relationship Discovery:
    • Helps with schema matching and recommendations (currently in CDI).
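
To make the idea of glossary association concrete, the sketch below suggests glossary terms for a column by simple name similarity. It is a deliberately naive stand-in: the notes say the product uses AI/ML, and nothing here reflects that implementation. The function names, the sample glossary terms, and the `min_score` cutoff are assumptions for illustration.

```python
from difflib import SequenceMatcher


def normalize(name):
    """Lower-case a name and treat underscores as spaces."""
    return name.lower().replace("_", " ").strip()


def suggest_terms(column_name, glossary_terms, min_score=0.6):
    """Return (term, score) pairs whose name similarity is at least min_score.

    Scores come from difflib.SequenceMatcher.ratio(), a character-level
    similarity between 0.0 and 1.0. Results are sorted best match first.
    """
    col = normalize(column_name)
    scored = [
        (term, round(SequenceMatcher(None, col, normalize(term)).ratio(), 2))
        for term in glossary_terms
    ]
    matches = [pair for pair in scored if pair[1] >= min_score]
    return sorted(matches, key=lambda pair: pair[1], reverse=True)


# Hypothetical glossary terms, for illustration only.
glossary = ["Customer Email", "Order Date"]
print(suggest_terms("customer_email", glossary))
```

In practice, suggestions like these would be reviewed by a data steward before being accepted, since name similarity alone cannot tell apart columns that are named alike but mean different things.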

Filters and Scheduling

  • Initial Metadata Collection:
    • Start with smaller datasets to manage costs.
  • Can run jobs ad hoc or on a schedule.
  • Classification tasks can be run flexibly, without re-accessing the data sources.

Monitoring and Reporting

  • Job statistics are tracked for all jobs, including running jobs and their results.
  • Ability to download logs and perform offline analysis.
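
Offline analysis of downloaded job data could look something like the sketch below. The notes do not describe the log format, so the CSV layout used here (`job_name`, `status`, `duration_seconds`), the sample rows, and the `summarize` function are all hypothetical; a real export would need its own parsing.

```python
import csv
import io
from collections import Counter

# Hypothetical export: invented column names and rows, for illustration only.
SAMPLE = """job_name,status,duration_seconds
oracle_scan,COMPLETED,120
snowflake_scan,FAILED,45
oracle_profile,COMPLETED,300
"""


def summarize(csv_text):
    """Summarize job results: counts per status, total runtime, failed jobs."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    status_counts = Counter(row["status"] for row in rows)
    total_seconds = sum(int(row["duration_seconds"]) for row in rows)
    failed_jobs = [row["job_name"] for row in rows if row["status"] == "FAILED"]
    return {
        "status_counts": dict(status_counts),
        "total_duration_seconds": total_seconds,
        "failed_jobs": failed_jobs,
    }


print(summarize(SAMPLE))
```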

Workflows and Customization

  • Out-of-the-box workflows for asset management (currently limited; custom workflows are on the roadmap).
  • Capability to customize metadata models and add additional fields.
  • Tools for building and managing workflows are available.

Future Developments

  • Upcoming features for Cloud Data Marketplace and Cloud Data Prep.
  • Public APIs for custom application development.

Conclusion

  • IICS structures metadata management into layers (Informatica Administrator, Metadata Command Center, and the Data Governance and Catalog application) that together support data governance.