Book description
Describes the main tasks that you can perform in SAS Data Integration Studio, including: data access; data integration; metadata management; data cleansing and enrichment; extract, transform, and load (ETL); extract, load, and transform (ELT); and service-oriented architecture (SOA) and message queue integration.
Table of contents
- Contents (1/2)
- Contents (2/2)
-
What's New in SAS Data Integration Studio 4.9
- Overview
- New Source Designer Wizard for Cloudera Impala
- High Performance Analytics Transformations Are Now Production
- Enterprise Decision Management Transformation Is Now Production
- Support Added for SAS Lineage
- Code Generation Supports Credential Lookup for DataFlux Authentication Servers
- SAS Package Wizards Have Better Support for DataFlux Data Management Server
-
Introduction
-
Overview of SAS Data Integration
- About SAS Data Integration
- Advantages of SAS Data Integration
- A Basic Data Integration Environment
- How to Get Help for SAS Data Integration Studio
- Installing SAS Data Integration Studio
- Administrative Documentation for SAS Data Integration Studio
- Accessibility Features in SAS Data Integration Studio
-
Overview of SAS Data Integration
-
General User Tasks
-
Getting Started
- Security for SAS Data Integration Studio
- Main Tasks for Creating Process Flows
- Starting SAS Data Integration Studio
- Connecting to a SAS Metadata Server
- Working with the Folders Tree
- Selecting a Default SAS Application Server
- Registering SAS Libraries
- Working with User-Defined Formats
- Registering Tables and Cubes
- Overview of Transformations (1/3)
- Overview of Transformations (2/3)
-
Overview of Transformations (3/3)
- Introduction to Transformations
- Overview of the Transformations Tree
- Access Folder
- Analysis Folder
- Archived Folder
- Change Data Capture Folder
- Control Folder
- Data Folder
- Data Quality Folder
- Hadoop Folder
- High-Performance Analytics Folder
- Output Folder
- Publish Folder
- SPD Server Dynamic Cluster Folder
- SQL Folder
- Ungrouped Folder
- Working with Stored Processes
- Working with Web Services
- Specifying Global Options in SAS Data Integration Studio
- Working with Change Management
- Search Metadata
- Add a Note or Document to a Registered Object
- View the Content of Notes or Documents
-
Importing, Exporting, and Copying Metadata
- Metadata Import and Export in SAS Data Integration Studio
- Working with SAS Package Metadata
- Preparing to Import or Export SAS Package Metadata
- Exporting SAS Package Metadata
- Importing SAS Package Metadata
- Copying and Pasting Metadata Objects
- Working with SAS Metadata Bridges
- Usage Notes for Importing or Exporting with a SAS Metadata Bridge
- Preparing to Import or Export with a SAS Metadata Bridge
- Importing New Metadata with a SAS Metadata Bridge
- Importing Updated Metadata with a SAS Metadata Bridge
- Exporting Metadata with a SAS Metadata Bridge
-
Working with Tables
- About Tables
- Registering Existing Tables with the Register Tables Wizard
- Registering New Tables with the New Table Wizard
- Viewing or Updating Table Metadata
- Using a Physical Table to Update Table Metadata
- Specifying Options for Tables
- Supporting Case and Special Characters in Table and Column Names
- Maintaining Column Metadata (1/2)
- Maintaining Column Metadata (2/2)
- Standardizing Columns (1/2)
- Standardizing Columns (2/2)
- Maintaining Keys
- Maintaining Indexes
- Browsing Table Data
- Editing SAS Table Data
- Using the View Data Window to Create a SAS Table
- Specifying Browse and Edit Options for Tables and External Files
-
Working with External Files
- About External Files
- Registering a Delimited External File
- Registering a Fixed-Width External File
- Registering an External File with User-Written Code
- Viewing or Updating External File Metadata
- Overriding the Code Generated by the External File Wizards
- Specifying NLS Support for External Files
- Accessing an External File with an FTP Server or an HTTP Server
- Viewing Data in External Files
- Registering a COBOL Data File That Uses a COBOL Copybook
- Using an External File in the Process Flow for a Job
- Using a Format File to Register a Fixed-Width External File
-
Creating Jobs
- About Jobs
- Creating an Empty Job
- Creating a Process Flow for a Job
- Creating a Job That Contains Jobs
- Working with Default Temporary Output Tables
- Specifying Options for Jobs
- Documenting Process Flow Diagrams
- Accessing Local and Remote Data
- Viewing or Updating Job Metadata
- Displaying the SAS Code for a Job
- Common Code Generated for a Job
-
Managing Jobs
- About Managing Jobs
- Submitting a Job for Immediate Execution
- Meeting Prerequisites for Collecting Job Statistics
- Reviewing a Successful Job
- Diagnosing and Correcting an Unsuccessful Job
- Adding a Transformation to an Existing Job
- Understanding the Job Has Changed Warning
- Understanding the Crossed Versions in a Job Warning
- Displaying Run-Time Statistics in SAS Job Monitor
- Displaying Run-Time Statistics in SAS Web Report Studio or the SAS Stored Process Server
- Maintaining Column Mappings
- Managing the Scope of Column Changes in Jobs
- Managing Connections in Job Editor Windows
- Viewing the Code for a Transformation
- Specifying Options for Transformations
- Redirecting Temporary Output Tables
- Pushing ELT Job Code Down to a Database
- Using a Web Client to Orchestrate Jobs
- Restarting Jobs From Checkpoints
-
Managing the Status of Jobs and Transformations
- About Status Handling for Jobs and Transformations
- Default Conditions, Actions, and Conditional Action Sets
- Prerequisites for Actions
- Perform Actions Based on the Status of a Job
- Perform Actions Based on the Status of a Transformation
- Macro Variables for Status Handling (1/2)
- Macro Variables for Status Handling (2/2)
-
Deploying Jobs
- About Deploying Jobs
- About Deploying Jobs for Scheduling
- Prerequisites for Deploying a Job for Scheduling
- Deploying Jobs for Scheduling
- Using a Command Line to Deploy Jobs
- Redeploying Jobs for Scheduling
- Using Scheduling to Handle Complex Process Flows
- Using Deploy for Scheduling to Execute Jobs on a Remote Host
- About Deploying Jobs as Stored Processes
- Prerequisites for Deploying a Job as a Stored Process
- Deploying Jobs as Stored Processes
- Redeploying Jobs to Stored Processes
- Viewing or Updating Stored Process Metadata
- About Deploying Jobs as Web Services
- Prerequisites for Web Service Jobs
- Requirements for Web Service Jobs
- Creating a Web Service Job
- Deploying a Web Service Job as a Stored Process
- Deploying a Stored Process as a Web Service
- Working with Versions
- Working with Generated Code
-
Working with User-Written Code
- About User-Written Code
- Adding User-Written Code to the Precode and Postcode Tab
- Adding a User Written Code Transformation to a Job
- Creating and Using a Generated Transformation (1/2)
- Creating and Using a Generated Transformation (2/2)
- Updating a Generated Transformation
- Editing the Generated Code for a Job or Transformation
- Replacing the Generated Code for a Job or Transformation
- Converting a SAS Code File to a Job
- Optimizing Process Flows
- Working with Impact Analysis and Data Lineage
- Working with Reports
-
Working with Data Management Platform
- Integration with DataFlux Data Management Platform
- General Prerequisites for Data Quality Transformations
- Prerequisites for Running a DataFlux Job or Profile in a SAS Data Integration Studio Job
- Analyzing the Quality of Data Sources
- Standardizing Values with a Standardization Scheme
- Standardizing Values with a Definition
- Using Match Codes to Improve Record Matching
- Using a DataFlux Data Service in a SAS Data Integration Studio Job
- Using a DataFlux Job or Profile in a SAS Data Integration Studio Job
-
Getting Started
-
Working with Transformations
-
Working with Analysis Transformations
- About Analysis Transformations
- Creating a Correlation Analysis
- Creating a Distribution Analysis (1/2)
- Creating a Distribution Analysis (2/2)
- Generating Forecasts (1/2)
- Generating Forecasts (2/2)
- Frequency of Eye Color By Hair Color Crosstabulation (1/3)
- Frequency of Eye Color By Hair Color Crosstabulation (2/3)
- Frequency of Eye Color By Hair Color Crosstabulation (3/3)
- One-Way Frequency of Eye Color By Region (1/2)
- One-Way Frequency of Eye Color By Region (2/2)
- Creating Summary Statistics for a Table (1/2)
- Creating Summary Statistics for a Table (2/2)
- Creating a Summary Tables Report from Table Data (1/2)
- Creating a Summary Tables Report from Table Data (2/2)
-
Working with Loader Transformations
- About Loader Transformations
- About the SPD Server Table Loader Transformation
- Teradata Table Loader Transformation
- About the Table Loader Transformation
- About the Oracle Bulk Table Loader Transformation
- About the DB2 Bulk Table Loader
- Setting Table Loader Transformation Options
- Selecting a Load Technique in the Table Loader
- Removing Non-Essential Indexes and Constraints during a Load
- Considering a Bulk Load
- Working with SAS Sort Transformations
-
Working with SQL Join Transformations
- About Join Transformations
- Using the Designer Window
- Reviewing and Modifying Clauses, Joins, and Tables in an SQL Query
- Understanding Automatic Joins
- Selecting the Join Type
- Adding User-Written SQL Code
- Debugging an SQL Query
- Adding a Column to the Target Table
- Adding a Join to an SQL Query on the Designer Tab
- Creating a Simple SQL Query
- Configuring a SELECT Clause
- Adding a CASE Expression
- Creating or Configuring a WHERE Clause
- Adding a GROUP BY Clause and a HAVING Clause
- Adding an ORDER BY Clause
- Adding Subqueries
- Validating or Submitting an SQL Query
- Joining a Table to Itself
- Using Parameters with an SQL Join
- Constructing a SAS Scalable Performance Data Server Star Join
- Optimizing SQL Processing Performance
- Performing General Data Optimization
- Influencing the Join Algorithm
- Setting the Implicit Property for a Join
- Enabling Explicit Pass-Through Processing for SQL Join Transformations
- Using Properties Window Options to Optimize SQL Processing Performance
- Working with Other SQL Transformations
- Working with Iterative Jobs and Parallel Processing
-
Working with Slowly Changing Dimensions
- About Slowly Changing Dimensions
- About Dimension Tables
- About Fact Tables
- Usage Notes for Slowly Changing Dimensions
- Loading a Dimension Table with Type 1 Updates
- Loading a Dimension Table with Type 1 and 2 Updates
- Comparing Tables (1/2)
- Comparing Tables (2/2)
- Loading a Fact Table Using Dimension Table Lookup (1/2)
- Loading a Fact Table Using Dimension Table Lookup (2/2)
- Loading a Table and Adding a Surrogate Primary Key
- Tracking Changes in Source Datetime Values
- Closing Out Rows in Datetime Change Tracking
- Working with Change Data Capture
- Working with Message Queues
- Working with SPD Server Cluster Tables
-
Working with Hadoop and SAS LASR Analytic Server
- Overview of the Hadoop Transformations
- Prerequisites for the Hadoop Transformations
- Creating a Pig Job
- Creating a Hive Job
- Creating a Hadoop Container Job (1/2)
- Creating a Hadoop Container Job (2/2)
- Monitoring Hadoop Clusters
- Overview of the High-Performance Analytics Transformations
- Prerequisites for the High-Performance Analytics Transformations
- Loading a Table on the SAS LASR Analytic Server
- Usage Notes for HPA Software and Hadoop
-
Working with Analysis Transformations
-
Appendixes
- Main Windows and Wizards
-
Usage Notes
- General Usage Notes (1/2)
-
General Usage Notes (2/2)
- Avoid Double Quotation Marks in DBMS Table and Column Names
- Concurrent Queries to Teradata Tables Can Fail
- Create as View Option Works Only When It Is Possible to Create a View
- Data Transfer Does Not Work for DBMS Tables with Special Characters in Table Names
- DBMS-Specific Functions Work Only with Explicit Pass-Through
- Do Not Include the << and >> Signs in the Column Descriptions of a Table That Could Be Used in an Information Map
- Do Not Use MLE Library Tables as Targets in SAS Data Integration Studio Jobs
- Enhanced Validation for Generated Code Can Be Turned Off
- External File Wizard: Cannot Sort Displayed Data by Column
- Jobs with Implicit Data Transfers between Different Hosts Fail
- Limit Line Lengths in the Expression Builder to 128 Characters
- Maximum Integer Value for a Field in SAS Data Integration Studio
- Microsoft Queue Writer Transformation Does Not Transform Missing Date Values
- Migrating from SAS/Warehouse Administrator to SAS Data Integration Studio
- Out-Of-Date Generated Transformations Are Updated Automatically When Included in Jobs That Are Deployed or Redeployed
- SAS Data Integration Studio Jobs Cannot Output HP Neoview Tables That Have Keys
- SAS Metadata Bridges
- Saving Metadata Changes in the Corresponding Physical Target
- Saving Temporary Output Tables to a Permanent Location
- Sign-on Scripts for SAS/CONNECT Servers
- SQL Join Transformations: Adding Multiple Sources with Primary Keys to an SQL Join Transformation Job
- SQL Merge Transformations: Input and Output Issues
- SQL Merge Transformations: Use the SQLNCLI10 Driver to Access SQL Server 2008
- SQL Update and SQL Merge Transformations: Manual Updates Are Required If Subqueries Are Used
- Synchpoint Option on WebSphere Queue Reader Does Not Prevent All Data Commits to Target When an Error Occurs
- Transformations That Use PROC SQL Rename SAS Reserved Words Unless Case Sensitive Names and Special Characters Are Recognized
- Unrestricted Users Cannot Perform Tasks That Require Logins from the Metadata Server
- Update Table Metadata Cannot Be Used for Some Tables
- Update Table Metadata on z/OS Platforms
- Updates Might Be Needed to the Names of Primary Keys for Oracle Tables
- Very Large Tables or Files in WebSphere Message Queues Can Cause Memory Overruns
-
Usage Notes for Register Tables Wizards and the New Table Wizard
- Access to Data on z/OS Platforms
- Access to Tables Using ODBC DB2 z/OS Pass-through
- Case and Special Characters in DBMS Names
- Case and Special Characters in SAS Names
- Control Whether SAS Formats and Informats Are Automatically Applied to Table Columns
- Generic Register Tables Wizard: When to Use
- Importing Foreign Keys for DBMS Tables
- Importing Foreign Keys for SAS Tables
- Importing Keys and Indexes from SAS/SHARE Libraries
- LIBNAME Options Required for Support of Case and Special Characters in the Names for Keys and Indexes
- In a Register Tables Wizard, Limit Excel Connections to a Single User Name
- Limitations of Register Tables Wizards for MySQL and Informix
- Metadata for a Library and Its Tables Must Be Stored in the Same Metadata Repository
- Microsoft Windows Administrative Rights Required to Connect to OLE DB Data Sources
- ODBC Informix Library: Preserving Case in Table Names
- Registering SAS/SHARE Tables
- Registering a Sybase Table with a Clustered Index
- Separate Logon Credentials for Each Authentication Domain for Database Servers
- Setting Table Options in the New Table Wizard
- Table Names over 32 Characters Are Truncated in the New Table Wizard
- Teradata Register Tables Wizard Hangs Unless a User ID and Password Can Be Supplied
- Unrestricted Users Cannot Perform Tasks That Require Logon Credentials from the Metadata Server
-
Usage Notes for the View Data Window
- About Usage Notes for the View Data Window
- Cannot Display Tables in an HDFS Library or a LASR Library
- Cannot View Cubes in SAS Data Integration Studio
- Cannot View Tables in Libraries That Are Not Defined in a Current Repository
- Close the View Data Window to Unlock a SAS Table
- Default Parameter Values Are Used When Parameterized Tables Are Viewed
- Error When Viewing SAP R/3 Tables
- Libraries for Any User-Defined Formats Must Be Available
- Permanent Data Set Formats Are Unaffected by the Apply Metadata Formats Option in the View Data Window
- Setting Multiple Column Heading Label Options
- Tables Might Not Be Editable Due to a Referential Integrity Issue
- Table Options Will Be Ignored When You Create a Table with View Data
- Using Columns with Leading or Trailing Spaces
- View Data Queries Construct SELECT Statements
- Viewing DBMS Tables Immediately After a Job Executes
- Viewing Formatted Data in Fixed-Width External Files
- Viewing Tables in a SAS WORK Library
- Viewing Teradata Tables
-
Usage Notes for Iterative Jobs
- About Usage Notes for Iterative Jobs
- Iterative Jobs and Parameterized Jobs Behave as Completely Separate Jobs
- Iterative Processing Places Limits on the Number of Nesting Levels and Jobs
- Parameter Values That Include Special Characters Require Special Handling
- Control Table Jobs Display an Out of Order Warning
-
Prerequisites for Running a Job When a DataFlux Server Is Used for Authentication
- Overview
- Verify How Users Are Authenticated on the DataFlux Data Management Server
- Deploy the Job, Service, or Profile
- Register a User on the DataFlux Authentication Server
- Grant Privileges on the DataFlux Data Management Server
- Update or Add a Data Management Server Definition in SAS Management Console
- Update User Accounts in SAS Management Console
- How the Connections Work
- Next Tasks
- Usage Notes for Loaders
-
Miscellaneous Transformations
- Using a Business Rule Flow in a Job
- Creating a Table That Appends Two or More Source Tables
- Creating a Publish to Archive Report from Table Data (1/2)
- Creating a Publish to Archive Report from Table Data (2/2)
- Validating Product Data
- Creating a Publish to Email Report from Table Data (1/2)
- Creating a Publish to Email Report from Table Data (2/2)
- Integrating a SAS Enterprise Miner Model with Existing SAS Data (1/2)
- Integrating a SAS Enterprise Miner Model with Existing SAS Data (2/2)
- Creating a Publish to Queue Report from Table Data (1/2)
- Creating a Publish to Queue Report from Table Data (2/2)
- Extracting Data from a Source Table
- Creating Reports from Table Data (1/2)
- Creating Reports from Table Data (2/2)
- Create a Table That Ranks the Contents of a Source
- Create Two Tables That Are Subsets of a Source
- Moving Data Directly from One Machine to Another Machine
- Creating Standardized Statistics from Table Data
- Creating Transposed Data from Table Data
- Converting a SAS or DBMS Table to an XML Table
- Using ODS to Specify Output from the XML Writer
- Using SOAP to Access a Third-Party Web Service
- Using REST to Access a Third-Party Web Service
- Generating Enterprise Decision Management Output (1/2)
- Generating Enterprise Decision Management Output (2/2)
- Running Conditional Processes (1/2)
- Running Conditional Processes (2/2)
- Java Code and Methods for Report Plug-ins
- Glossary (1/2)
- Glossary (2/2)
- Index (1/3)
- Index (2/3)
- Index (3/3)
Product information
- Title: SAS Data Integration Studio 4.9
- Author(s):
- Release date: August 2014
- Publisher(s): SAS Institute
- ISBN: 9781629593104
You might also like
book
Learn Data Science Using SAS Studio: A Quick-Start Guide
Do you want to create data analysis reports without writing a line of code? This book …
book
A Gentle Introduction to Statistics Using SAS Studio in the Cloud
Point and click your way to performing statistics! Many people are intimidated by learning statistics, but …
book
SAS Encoding
Understanding the basic concepts of character encoding is necessary for creating, manipulating, and rendering any type …
book
Smart Data Discovery Using SAS Viya
Whether you are an executive, departmental decision maker, or analyst, the need to leverage data and …