Data File Comparison
A Self-Study or Classroom Session
  Table of Contents  v-14.03.28 - datcom01.htm 
  Introduction
  Objective of Session
  Requirements
  Materials Provided
  Criteria for Success
  What to do and How to do it
  Step-01, Data File Compare, Generate Source Code
  Step-02, Create HTML Documentation for Copy File
  Step-03, Data File Compare, View Source Code
  Step-04, Data File Compare, Compile Generated Source Code
  Step-05, Execute Data File Compare
  Additional Information
  Numeric Fields, BINARY or COMP Format
  Numeric Fields, Packed-Decimal or COMP-3 Format
  Numeric Fields, Zone Decimal Format
  Summary
  Software Agreement and Disclaimer
  Downloads and Links
  Current Server or Internet Access
  Internet Access Required
  Glossary of Terms
  Comments or Feedback
  Company Overview

Table of Contents Previous Section Next Section Introduction

The session will describe how to compare two data files that reside on a Windows (or UNIX) platform using Micro Focus COBOL and Micro Focus file formats. An exercise in the form of a sample Proof-of-Concept (POC) is provided that will use SimoTime technologies to generate the COBOL source code that may be compiled and executed using Micro Focus COBOL. Since the generated COBOL source code is COBOL/2 compliant it may be compiled and executed on a mainframe (either z/OS or VSE) to compare VSAM or QSAM files. For this exercise only the Windows operating system with Micro Focus COBOL will be used.

To perform this exercise requires a basic knowledge of the Windows environment and Micro Focus Net Express. Additional help is provided for individuals that may not be proficient in the use of Net Express or have forgotten how to perform a particular function.


We have made a significant effort to ensure the documents and software technologies are correct and accurate. We reserve the right to make changes without notice at any time. The function delivered in this version is based upon the enhancement requests from a specific group of users. The intent is to provide changes as the need arises and in a timeframe that is dependent upon the availability of resources.

Copyright © 1987-2019
SimoTime Technologies and Services
All Rights Reserved

Table of Contents Previous Section Next Section Objective of Session

The objective of this session is to define a practical process for comparing two data files on a Windows platform using Micro Focus Net Express. The following is a list of the minimum tasks to be completed for this Session.

Item Description
1 Do a record level compare of a sequential file that has been downloaded from a mainframe and converted from an EBCDIC-encoded format to an ASCII encoded format with a Micro Focus ASCII-encoded indexed file.
2 Explain what happens when a difference occurs.
3 Must be able to run the comparison process on a Mainframe, Windows or UNIX platform.
4 Document the Hardware and Software that is required to perform the Data File comparison process on a Windows platform.
5 Explain what happens with deleted or inserted records.
  Objectives of Session

Table of Contents Previous Section Next Section Requirements

If this session is being performed through self-study then it will be necessary to install the following software on a Windows system. Windows/XP or Windows Server is preferred.

Item Description
1 Either Micro Focus Net Express or Micro Focus Mainframe Express must be installed. Refer to the Micro Focus documentation for more information about these technologies. Both of these technologies are usually pre-installed for the classroom environment.
2 The SimoZAPS with SimoREC1 software that is available from SimoTime must be installed and configured. Both of these technologies are usually pre-installed for the classroom environment.
3 The DataMig1 directory that is available from SimoTime must be installed with the minimum data file and COBOL copy file. This directory is usually pre-installed for the classroom environment.
  Requirements

Table of Contents Previous Section Next Section Materials Provided

For the classroom environment two directories (c:\DataMig1 and c:\SimoLIBR) are provided that contain the following information.

Item Description
1 The c:\DataMig1\Expect\ItemFTP1.DAT is a test copy of an ASCII encoded Item Master File that includes PACKED and BINARY data. This file was transferred from the mainframe as a flat sequential file.
2 The c:\DataMig1\Actual\ItemASC1.DAT is a test copy of an ASCII encoded Item Master File that includes PACKED and BINARY data. This file was processed using Micro Focus COBOL on a Windows platform and is a Micro Focus indexed file.
3 The c:\DataMig1\STR\ItemMast.STR is a structure file used by the Micro Focus Data File Editor to display records in a formatted output based on a copy file.
4 The c:\DataMig1\CobCpy1\ItemCB01.CPY is the copy file for the item master file. This directory also contains CUSTCB01.CPY that is the copy file for the customer master file. The copy files are used to create the HTML documentation. They are not used to do the data file compare.
5 The c:\SimoLIBR directory contains the SimoTime utility programs and additional copy files used to generate the conversion programs.
  Materials Provided

Table of Contents Previous Section Next Section Criteria for Success

The following items must be completed to meet the minimum requirements for successfully completing the proof of concept.

Item Description
1 Provide a data file comparison process that may be executed on an IBM Mainframe, a Windows platform or a UNIX platform.
2 A copy of the output describing the differences in the two files.
3 A demonstration on a Windows platform that executes the data file comparison process and shows the correct data structure for the ASCII encoded Item Master File.
4 Provide HTML documentation for the record layout in order to relate the "position in record" of a difference with an actual field name based on a COBOL copy file.
  Criteria for Success

Table of Contents Previous Section Next Section What to do and How to do it

The objective of this project is to experience first-hand the issues that will be encountered and the solutions required when doing data file comparisons on a Windows platform. Because most mainframe files contain a mixture of alpha-numeric fields (i.e. "USAGE IS DISPLAY" in COBOL terminology) and numeric fields that may be packed (COMP-3) or binary (COMP) the data file comparison process will need to handle this format and properly display differences in a format that is easily understood by human beings.

For this project two directories will be used. The first directory (c:\SimoLIBR) contains the utility programs and copy files needed to generate the COBOL data file comparison program. The c:\SimoLIBR directory will need to be included in the PATH environment variable. The second directory (c:\DataMig1) contains the modules and data files specific to this learning session.

The following sections describe a series of tasks that must be accomplished to successfully complete this learning session.

Table of Contents Previous Section Next Section Step-01, Data File Compare, Generate Source Code

Generate the COBOL source member for a Data File Comparison program.

Refer to the document for Data File Compare, Generate Source Code for additional help.

Note:  If this procedure does not run correctly the problem is usually with the c:\SimoLIBR directory. Either the SimoZAPS or SimoREC1 utility programs were not installed correctly or the c:\SimoLIBR directory is not in the PATH.

Table of Contents Previous Section Next Section Step-02, Create HTML Documentation for Copy File

Create HTML Documentation based on a COBOL copy. This documentation will be used as reference material when a difference occurs during the compare process. This documentation is generated based on the information (or record layout) as defined in a COBOL copy file.

Refer to the Creating HTML Documentation for additional information.

Table of Contents Previous Section Next Section Step-03, Data File Compare, View Source Code

Verify that a COBOL source member was created (ItCompC1.CBL) and placed in the c:\DataMig1\COBOL directory.

Verify that an HTML document was created (itemcb01.htm) and placed in the c:\DataMig1\HTML directory.

Refer to the document for Data File Compare, View Source Code for additional help.

Note:  If step 1 is successful then the COBOL source member should be automatically displayed with NotePAD. You may view the source code and then close the NotePAD windows. Do not change the source code.

Table of Contents Previous Section Next Section Step-04, Data File Compare, Compile Generated Source Code

Compile the generated COBOL source member as a DLL using the command line interface for Micro Focus Net Express.

Refer to the Net Express Command Line Interface document for additional information about compiling a COBOL source member and creating a Load member.

Table of Contents Previous Section Next Section Step-05, Execute Data File Compare

Execute the data file compare program and examine the differences.

For help in executing the program refer to the Execute the Data File Compare Program and Examine the Differences document for additional information. This document also describes how to map data file names.

Table of Contents Previous Section Next Section Additional Information

This section provides additional information or insight into some of the tasks performed by this exercise. There are numerous things to consider when comparing data files. The comparison task is easy, in today's environment with possible ASCII or EBCDIC encoded files along with packed or binary numeric fields within a record it becomes a challenge as to how to properly display and interpret this information when a difference occurs.

Table of Contents Previous Section Next Section Numeric Fields, BINARY or COMP Format

This link provides additional information about Binary or COMP (i.e. COMPUTATIONAL) fields

Link to Internet   Link to Server   Explore The Binary or COMP format for numeric data strings. This numeric structure is supported by COBOL and may be explicitly defined with the "USAGE IS COMP" or "USAGE IS BINARY" clause.

Table of Contents Previous Section Next Section Numeric Fields, Packed-Decimal or COMP-3 Format

This link provides additional information about Packed Decimal or COMP-3 (i.e. COMPUTATIONAL-3) fields.

Link to Internet   Link to Server   Explore The Packed-Decimal or COMP-3 format for numeric data strings. This numeric structure is supported by COBOL and may be explicitly defined with the "USAGE IS COMP-3" clause.

Table of Contents Previous Section Next Section Numeric Fields, Zone Decimal Format

This link provides additional information about Zone Decimal fields. The unsigned, zone decimal fields are text fields and are assumed to be a simple, no risk conversion. However, it is best to treat unsigned fields as signed since it is possible for a signed zone value to end up in an unsigned field and as long as it is a positive value the mainframe will process the field.

Signed fields that are in a zone decimal format need special handling and should not be handled as text data.

Link to Internet   Link to Server   Explore The Zoned-Decimal format for numeric data strings. This numeric structure is the default numeric for COBOL and may be explicitly defined with the "USAGE IS DISPLAY" clause.

Table of Contents Previous Section Next Section Summary

The purpose of this session is to describe the techniques and effort involved to compare two data file on a Windows platform using Micro Focus COBOL and Micro Focus formatted files. This session is available as an on site class or an on line, self-study course.

Table of Contents Previous Section Next Section Software Agreement and Disclaimer

Permission to use, copy, modify and distribute this software, documentation or training material for any purpose requires a fee to be paid to SimoTime Technologies. Once the fee is received by SimoTime the latest version of the software, documentation or training material will be delivered and a license will be granted for use within an enterprise, provided the SimoTime copyright notice appear on all copies of the software. The SimoTime name or Logo may not be used in any advertising or publicity pertaining to the use of the software without the written permission of SimoTime Technologies.

SimoTime Technologies makes no warranty or representations about the suitability of the software, documentation or learning material for any purpose. It is provided "AS IS" without any expressed or implied warranty, including the implied warranties of merchantability, fitness for a particular purpose and non-infringement. SimoTime Technologies shall not be liable for any direct, indirect, special or consequential damages resulting from the loss of use, data or projects, whether in an action of contract or tort, arising out of or in connection with the use or performance of this software, documentation or training material.

Table of Contents Previous Section Next Section Downloads and Links

This section includes links to documents with additional information that are beyond the scope and purpose of this document. The first group of documents may be available from a local system or via an internet connection, the second group of documents will require an internet connection.

Note: A SimoTime License is required for the items to be made available on a local system or server.

Table of Contents Previous Section Next Section Current Server or Internet Access

The following links may be to the current server or to the Internet.

Note: The latest versions of the SimoTime Documents and Program Suites are available on the Internet and may be accessed using the Link to Internet icon. If a user has a SimoTime Enterprise License the Documents and Program Suites may be available on a local server and accessed using the Link to Server icon.

Link to Internet   Link to Server   Explore The SIMOPATH Series starting at the home page to view information about training sessions and learning materials available from SimoTime Enterprises.

Link to Internet   Link to Server   Explore the Principles of Data File Validation. This link includes guidelines for defining requirements and determining the scope of effort for a data validation effort. This effort includes a data file compare process, a technique for accumulating summary totals with a record count and a technique for reading a VSAM, KSDS and producing a hex-dump output based on a list of user-defined keys.

Link to Internet   Link to Server   Explore How to Generate a Data File Compare, Validate or Hex-Dump Program using simple specification statements in a Process Control File (PCF). This link to the User Guide includes the information necessary to create a Process Control File and generate the COBOL programs that will do a data file compare, accumulate summary totals with a record count or produce a Hex-Dump of records in a VSAM, KSDS based on a list of user-defined keys. The User Guide contains a list of the PCF statements that are used for the data file compare, validate or dump process.

Link to Internet   Link to Server   Explore the capabilities of the SimoZAPS Utility Program. This includes generating a COBOL program that will do the conversion of sequential and VSAM (KSDS) files between EBCDIC and ASCII while maintaining mainframe (or COBOL) numeric formats and integrity. SimoZAPS can also read a sequential file in EBCDIC format and create an ASCII/CRLF file or VSAM KSDS file in ASCII format. The conversion tables may be viewed or modified to meet unique requirements. The Hexcess/2 function provides the capability of viewing, finding or patching the contents of a file in hexadecimal.

Link to Internet   Link to Server   Explore How to Document Record Layouts or other Data Structures using a COBOL copy file. For reference or documentation purposes SimoREC1 creates an ASCII/Text file and an HTML document from a COBOL copy file. The utility provides the length of a field and the position of the start of the field within a record. Also, the actual physical length for group items, packed fields, binary fields and tables will be calculated.

Link to Internet   Link to Server   Explore The ASCII and EBCDIC Translation Tables. These tables are provided for individuals that need to better understand the bit structures and differences of the encoding formats.

Link to Internet   Link to Server   Explore The File Status Return Codes to interpret the results of accessing VSAM data sets and/or QSAM files.

Link to Internet   Link to Server   Explore The Binary or COMP format for numeric data strings. This numeric structure is supported by COBOL and may be explicitly defined with the "USAGE IS COMP" or "USAGE IS BINARY" clause.

Link to Internet   Link to Server   Explore The Edited for Display format for numeric data strings. This numeric structure is supported by COBOL and may be used with an edit-mask to prepare the presentation for readability by human beings.

Link to Internet   Link to Server   Explore The Packed-Decimal or COMP-3 format for numeric data strings. This numeric structure is supported by COBOL and may be explicitly defined with the "USAGE IS COMP-3" clause.

Link to Internet   Link to Server   Explore The Zoned-Decimal format for numeric data strings. This numeric structure is the default numeric for COBOL and may be explicitly defined with the "USAGE IS DISPLAY" clause.

Link to Internet   Link to Server   Explore commonly used formats and processing techniques for managing various numeric formats available on the mainframe.

Link to Internet   Link to Server   Explore the Numbers Connection for additional information about the structure and processing of numeric data items (or numeric fields).

Link to Internet   Link to Server   Explore the Hexadecimal Dump capabilites using COBOL to dump the content of a data string. This suite of programs includes a sample program that calls the SIMODUMP program to do the actual formatting for the hexadecimal dump information.

Table of Contents Previous Section Next Section Internet Access Required

The following links will require an internet connect.

A good place to start is The SimoTime Home Page for access to white papers, program examples and product information. This link requires an Internet Connection

Explore The Micro Focus Web Site for more information about products and services available from Micro Focus. This link requires an Internet Connection.

Table of Contents Previous Section Next Section Glossary of Terms

Link to Internet   Link to Server   Explore the Glossary of Terms for a list of terms and definitions used in this suite of documents and white papers.

Table of Contents Previous Section Next Section Comments or Feedback

This document was created and is maintained by SimoTime Technologies. If you have any questions, suggestions, comments or feedback please use the following contact information.

1. Send an e-mail to our helpdesk.
1.1. helpdesk@simotime.com.
2. Our telephone numbers are as follows.
2.1. 1 415 763-9430 office-helpdesk
2.2. 1 415 827-7045 mobile

We appreciate hearing from you.

Table of Contents Previous Section Next Section Company Overview

SimoTime Technologies was founded in 1987 and is a privately owned company. We specialize in the creation and deployment of business applications using new or existing technologies and services. We have a team of individuals that understand the broad range of technologies being used in today's environments. Our customers include small businesses using Internet technologies to corporations using very large mainframe systems.

Quite often, to reach larger markets or provide a higher level of service to existing customers it requires the newer Internet technologies to work in a complementary manner with existing corporate mainframe systems. We specialize in preparing applications and the associated data that are currently residing on a single platform to be distributed across a variety of platforms.

Preparing the application programs will require the transfer of source members that will be compiled and deployed on the target platform. The data will need to be transferred between the systems and may need to be converted and validated at various stages within the process. SimoTime has the technology, services and experience to assist in the application and data management tasks involved with doing business in a multi-system environment.

Whether you want to use the Internet to expand into new market segments or as a delivery vehicle for existing business functions simply give us a call or check the web site at http://www.simotime.com


Return-to-Top
Data Migration, A Data File Comparison Exercise
Copyright © 1987-2019
SimoTime Technologies and Services
All Rights Reserved
When technology complements business
http://www.simotime.com