In my previous post, I mentioned that need of Data Masking. A challenge that is faced is if the data is hidden, or contained within larger field. So do we understand our data? Someone once said, “You cannot govern what you do not first understand.” Few applications stand alone today. Rather, they are made up of systems, and “systems of systems,” with applications and databases strewn all over the enterprise, yet integrated, or at least interrelated. The relational database model actually makes matters worse by fragmenting business entities for storage. But how is everything related? The Data Governance team needs to discover the critical data relationships across the enterprise.
Data discovery may include simple and hard-to-find relationships, as well as the locations of sensitive data within the enterprise’s IT systems. We may choose to do a manual analysis of each data field or we may use a discovery tool which can enable the organization to identify all instances of confidential data across the environment—whether clearly visible or obscured from view.
A discovery tool (like IBM InfoSphere Discovery) delivers automated capabilities that offer greater accuracy and reliability than manual analysis. It examines data values across multiple sources to determine the complex rules and transformations that may hide sensitive content. It locates confidential data items that are contained within larger fields, or that are separated across multiple columns.
InfoSphere Discovery not only discovers hidden sensitive data, it also provides a full range of data analysis capabilities to discover hidden relationships and bring them clearly into view. By leveraging the combination of InfoSphere Discovery and the InfoSphere Optim Data Masking solution, all relationships will be uncovered and replacement values will be masked consistently and accurately across multiple data sources.