Let's start with why InfoSphere Data Architect (IDA) is needed in an ETL domain. ETL tools primarily load data into a warehouse. IDA is a visual tool for data modelling, that is, for defining how the data is represented in the destination.
In a top-down approach, we can use InfoSphere Data Architect to design a logical model and automatically generate a physical data model from it. Data definition language (DDL) scripts can then be generated from the data model to create a database schema based on its design.
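To make the top-down flow concrete, here is a minimal sketch of going from a logical entity to a DDL script. It does not use any InfoSphere Data Architect API. The `Entity` and `Attribute` classes, the `TYPE_MAP` type mapping, and the naming rule in `to_physical_name` are all hypothetical and only illustrate the idea of deriving physical names and types from a logical design.

```python
from dataclasses import dataclass, field

# Hypothetical mapping from logical data types to physical SQL types.
TYPE_MAP = {"integer": "INTEGER", "string": "VARCHAR(255)", "date": "DATE"}


@dataclass
class Attribute:
    name: str
    logical_type: str
    primary_key: bool = False


@dataclass
class Entity:
    name: str
    attributes: list = field(default_factory=list)


def to_physical_name(logical_name: str) -> str:
    """Assumed naming rule: upper case, spaces replaced by underscores."""
    return logical_name.strip().upper().replace(" ", "_")


def generate_ddl(entity: Entity) -> str:
    """Build a CREATE TABLE statement from a logical entity."""
    lines = [
        f"    {to_physical_name(a.name)} {TYPE_MAP[a.logical_type]}"
        for a in entity.attributes
    ]
    keys = [to_physical_name(a.name) for a in entity.attributes if a.primary_key]
    if keys:
        lines.append(f"    PRIMARY KEY ({', '.join(keys)})")
    table = to_physical_name(entity.name)
    return f"CREATE TABLE {table} (\n" + ",\n".join(lines) + "\n);"


customer = Entity(
    "Customer",
    [
        Attribute("Customer Id", "integer", primary_key=True),
        Attribute("Full Name", "string"),
        Attribute("Signup Date", "date"),
    ],
)
ddl = generate_ddl(customer)
print(ddl)
```

In the real tool this transformation is driven from the model editor rather than from code, so treat the sketch only as a picture of what "generate DDL from a model" means.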
Alternatively, InfoSphere Data Architect can connect to the RDBMS and instantiate the database schema directly from the physical data model. This generation facility works both ways: you can also reverse engineer an existing database into an InfoSphere Data Architect data model for modification, reuse, versioning, and so on.
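Reverse engineering can be pictured as reading the database catalog and rebuilding a model from it. The sketch below is an illustration only: it uses an in-memory SQLite database from the Python standard library as a stand-in for the RDBMS, and the `reverse_engineer` function and the dictionary layout of the resulting model are assumptions, not how InfoSphere Data Architect stores its models.

```python
import sqlite3


def reverse_engineer(conn):
    """Read table and column metadata from the catalog into a plain dict."""
    model = {}
    tables = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    for (table,) in tables:
        # PRAGMA table_info rows: (cid, name, type, notnull, dflt_value, pk)
        columns = conn.execute(f'PRAGMA table_info("{table}")').fetchall()
        model[table] = [
            {"name": c[1], "type": c[2], "primary_key": bool(c[5])}
            for c in columns
        ]
    return model


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE CUSTOMER ("
    "CUSTOMER_ID INTEGER PRIMARY KEY, "
    "FULL_NAME VARCHAR(255))"
)
model = reverse_engineer(conn)
print(model)
```

Once the schema is available as a model, it can be edited, versioned, or used to regenerate DDL.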
Here are some of the features of InfoSphere Data Architect:
- Supports multi-dimensional modeling in both logical and physical data models
- Provides automatic discovery and creation of data warehouse schemas
- Provides integration with IBM InfoSphere Warehouse by linking the logical design with the warehouse OLAP engine
- Enables data lineage across data warehouse models. This helps business and IT users understand the impact of changes to the data model.
- Integrates with IBM Cognos BI to ensure the delivery of consistent information to business users
- Provides a thorough understanding of the data sources by visually representing the relationships within them. IBM ILOG technology helps to provide flexible, high-performance visualization of data models. Diagrams that you create in InfoSphere Data Architect are themselves highly flexible and allow users to create multiple views of the same model without affecting the underlying metadata.
- Provides business user queries
- Improves productivity with comprehensive comparison and filter facilities. It can efficiently identify and filter detailed differences between elements such as models and database objects.
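The compare-and-filter idea from the last feature can be sketched in a few lines. This is not the tool's comparison engine. The `compare_models` function, the difference kinds such as `type_changed`, and the dictionary layout (table name to column name to type) are hypothetical and chosen only to show what identifying and filtering differences between a model and a database might look like.

```python
def compare_models(left, right, kinds=None):
    """List differences between two schemas, optionally filtered by kind."""
    differences = []
    for table in sorted(set(left) | set(right)):
        if table not in right:
            differences.append(("table_removed", table, None))
            continue
        if table not in left:
            differences.append(("table_added", table, None))
            continue
        left_cols, right_cols = left[table], right[table]
        for col in sorted(set(left_cols) | set(right_cols)):
            if col not in right_cols:
                differences.append(("column_removed", table, col))
            elif col not in left_cols:
                differences.append(("column_added", table, col))
            elif left_cols[col] != right_cols[col]:
                differences.append(("type_changed", table, col))
    if kinds is not None:
        # Keep only the kinds of difference the user asked for.
        differences = [d for d in differences if d[0] in kinds]
    return differences


design = {"CUSTOMER": {"CUSTOMER_ID": "INTEGER", "FULL_NAME": "VARCHAR(255)"}}
database = {
    "CUSTOMER": {
        "CUSTOMER_ID": "BIGINT",
        "FULL_NAME": "VARCHAR(255)",
        "EMAIL": "VARCHAR(255)",
    },
    "ORDERS": {"ORDER_ID": "INTEGER"},
}

all_diffs = compare_models(design, database)
type_diffs = compare_models(design, database, kinds={"type_changed"})
print(all_diffs)
print(type_diffs)
```

Filtering by kind is what keeps a comparison usable on large models, where most differences are usually not relevant to the change being reviewed.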
Businesses today own vast amounts of data, and decisions are made based on that data. The challenge for IT is to provide an understanding of the data, improve data quality and consistency, and keep data design aligned with business intent and requirements. IBM InfoSphere Data Architect is a collaborative data design solution that helps us discover, model, relate, and standardize diverse and distributed data assets.