InfoSphere DataStage – XII (Handling Unstructured Data)

9.1 release of Information server has Native Excel Access – A New stage that provides direct Excel read capabilities on all platforms with a rich set of features to handle ranges, multiple worksheets, unstructured text and more.

UnStructuredAccess

Prior to Information Server 9.1 release, it was difficult to access an Excel work sheet directly from a DataStage job. One would have to access Excel file via ODBC – this approach requires creating an ODBC connection to the Excel file on a Datastage server machine and use an ODBC stage in Datastage. The main disadvantage is that it was impossible to do this on an Unix machine.

But the New Unstructured Stage provides….

  • 3-Step Configuration – enter file name, select data range and setup column name mapping
  • Native Excel File Support – reads data from any supported OS (Windows, Linux, Unix)
  • XLS Version Variants – includes both Excel 97-2003 OLE2 (.xls) and 2007 OOXML (.xlsx) file format
  • Simplified Access – no need to define ODBC DSN for each excel file
  • Password Protected Files – read data from password encrypted files
  • Multiple File Support – ability to simultaneously process multiple files in different nodes
  • Multiple Sheet Support – extract data from multiple sheets at a time
  • Runtime Column Propagation – define column metadata
  • Excel Field Extraction – ability to extract row numbers, comments, hyperlinks, formulas, document Properties, etc…
Here is the Unstructured Stage in action:
Excel Access

4 thoughts on “InfoSphere DataStage – XII (Handling Unstructured Data)

  1. what if I have to identify the start of the column by finding the column “order no” (Data driven). Like say one of file the order no starts at line 5 another it starts on 2 and another it starts at 1.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s