MVSFORUMS.com Forum Index MVSFORUMS.com
A Community of and for MVS Professionals
 
 FAQFAQ   SearchSearch   Quick Manuals   RegisterRegister 
 ProfileProfile   Log in to check your private messagesLog in to check your private messages   Log inLog in 

Why SUPERCE does not show correct results - Pls help.

 
Post new topic   Reply to topic   printer-friendly view    MVSFORUMS.com Forum Index -> Utilities
View previous topic :: View next topic  
Author Message
mf_user
Intermediate


Joined: 01 Jun 2003
Posts: 372
Topics: 105

PostPosted: Fri Sep 12, 2008 10:11 am    Post subject: Why SUPERCE does not show correct results - Pls help. Reply with quote

Hi,

I have an input dataset with 23769 records in it. The length of this file is 10. All 10 characters becomes key in 1st file. I have another dataset with 9711 records in it with 10 bytes key field starting in column 73. I have compared these two using JOINKEYS feature of SORT as shown below:

Code:

//SYSIN    DD  *                                         
 JOINKEYS FILE=F1,FIELDS=(1,10,A)                         
 JOINKEYS FILE=F2,FIELDS=(73,10,A)                       
 REFORMAT FIELDS=(F2:73,10)                               
 SORT     FIELDS=COPY                   
/*


The job report shown that 474 matches were found which was correct. Now, to do a cross verification, I ran SUPERCE (3.13) on these two datasets with the column compare facility as shown below:

Code:

   Enter or change Process Statements in the EDIT window below:
****** ***************************** Top of Data **************
000001 CMPCOLMN 1:10                                           
000002 CMPCOLMO 73:82                                         
****** **************************** Bottom of Data ************


Surprisingly, it shown me only 405 matches between these two files Exclamation Exclamation Exclamation Even if I consider "FALSE MATCHE(S) CORRECTED" into consideration also it will come to 477.

The final statistics were as given below:

Code:

                    LINE COMPARE SUMMARY AND STATISTICS                 
                                                                       
  405 NUMBER OF LINE MATCHES           30381  TOTAL CHANGES (PAIRED+NONPAIRED CHNG)
    0 REFORMATTED LINES                 2289  PAIRED CHANGES (REFM+PAIRED INS/DEL)
23364 NEW FILE LINE INSERTIONS         21075  NON-PAIRED INSERTS       
 9306 OLD FILE LINE DELETIONS           7017  NON-PAIRED DELETES       
23769 NEW FILE LINES PROCESSED                                         
 9711 OLD FILE LINES PROCESSED                                         
   72 FALSE MATCH(S) CORRECTED                                         


I would like to know whether Syncsort was wrong or SUPERCE did not work properly Question Confused Shocked Mad Which one to believe? Do I need to run more tests on this data?

Please help me in this regard. Also, I beg you to show me the other ways of comparing. I mean using FILE-AID or Syncsort in a different way.

Thanks for your time.
_________________
MF
==
Any training that does not include the emotions, mind and body is incomplete; knowledge fades without feeling.
==
Back to top
View user's profile Send private message Send e-mail
hchinnam
Beginner


Joined: 04 Feb 2005
Posts: 31
Topics: 10

PostPosted: Fri Sep 12, 2008 2:46 pm    Post subject: Reply with quote

You can use file-aid option 3.10
Back to top
View user's profile Send private message
Dibakar
Advanced


Joined: 02 Dec 2002
Posts: 700
Topics: 63
Location: USA

PostPosted: Fri Sep 12, 2008 7:53 pm    Post subject: Reply with quote

I have never used JOINKEYS but after going over old posts it looks like JOINKEYS is supposed to give caretesian products.

If there are 2 matches in these two files and one of them is repeated twice in second file then I think the sort will show three joins whereas superce will show two matches.

Refer to example in this post -
http://www.mvsforums.com/helpboards/viewtopic.php?t=4690&highlight=joinkeys
Back to top
View user's profile Send private message Send e-mail
mf_user
Intermediate


Joined: 01 Jun 2003
Posts: 372
Topics: 105

PostPosted: Sat Sep 13, 2008 2:59 am    Post subject: Not that way Reply with quote

Dibakar, The second file has a single entry only. It does not duplicates at all.

Thanks.
_________________
MF
==
Any training that does not include the emotions, mind and body is incomplete; knowledge fades without feeling.
==
Back to top
View user's profile Send private message Send e-mail
Dibakar
Advanced


Joined: 02 Dec 2002
Posts: 700
Topics: 63
Location: USA

PostPosted: Sat Sep 13, 2008 8:20 pm    Post subject: Reply with quote

what about the first one?
Back to top
View user's profile Send private message Send e-mail
mf_user
Intermediate


Joined: 01 Jun 2003
Posts: 372
Topics: 105

PostPosted: Sun Sep 14, 2008 6:41 am    Post subject: single entry Reply with quote

First file has also got single entry.

Thanks.
_________________
MF
==
Any training that does not include the emotions, mind and body is incomplete; knowledge fades without feeling.
==
Back to top
View user's profile Send private message Send e-mail
Dibakar
Advanced


Joined: 02 Dec 2002
Posts: 700
Topics: 63
Location: USA

PostPosted: Sun Sep 14, 2008 11:21 am    Post subject: Reply with quote

To me it looks duplicate problem, or maybe I am missng something.

Try
Code:

  SORT     FIELDS=(1,10,CH,A)
  SUM      FIELDS=NONE


instead of
Code:

  SORT     FIELDS=COPY                 
 


Maybe you will get a different number.
Back to top
View user's profile Send private message Send e-mail
mf_user
Intermediate


Joined: 01 Jun 2003
Posts: 372
Topics: 105

PostPosted: Mon Sep 15, 2008 4:14 am    Post subject: look good idea Reply with quote

Ok Dibakar......that looks like a better idea......

Thanks.
_________________
MF
==
Any training that does not include the emotions, mind and body is incomplete; knowledge fades without feeling.
==
Back to top
View user's profile Send private message Send e-mail
mf_user
Intermediate


Joined: 01 Jun 2003
Posts: 372
Topics: 105

PostPosted: Mon Sep 15, 2008 10:27 am    Post subject: This is how I have done it. Reply with quote

Hi,

Per Dibakar's idea, this is how I've achieved it without using JOINKEYS feature.....

Code:

//SORT01  EXEC PGM=SORT               
//SORTIN  DD *                       
1111                                 
2222                                 
3333                                 
4444                                 
5555                                 
//SYSIN   DD *                       
 SORT FIELDS=COPY                     
 OUTREC FIELDS=(1,4,C'11')           
/*                                   
//SYSOUT  DD SYSOUT=*                 
//SORTOUT DD DSN=&&TEMP1,             
//           DISP=(NEW,PASS,DELETE), 
//           SPACE=(TRK,(1,1),RLSE)   
//SORT02  EXEC PGM=SORT             
//SORTIN  DD *                       
ABCD1111                             
BCDE4444                             
//SYSIN   DD *                       
 SORT FIELDS=COPY                   
 OUTREC FIELDS=(1:5,4,C'22')         
/*                                   
//SYSOUT  DD SYSOUT=*               
//SORTOUT DD DSN=&&TEMP2,           
//           DISP=(NEW,PASS,DELETE),
//           SPACE=(TRK,(1,1),RLSE) 
//SORT03  EXEC PGM=SORT               
//SORTIN  DD DSN=&&TEMP1,             
//           DISP=(OLD,DELETE,DELETE)
//        DD DSN=&&TEMP2,             
//           DISP=(OLD,DELETE,DELETE)
//SYSIN   DD *                       
 SORT FIELDS=(1,4,CH,A),EQUALS       
 INCLUDE COND=(5,2,CH,EQ,C'22')       
 OUTREC FIELDS=(1,4)                 
/*                                   
//SYSOUT  DD SYSOUT=*                 
//SORTOUT DD SYSOUT=*                 

Any other suggestions are most welcome Exclamation Smile

Thanks.
_________________
MF
==
Any training that does not include the emotions, mind and body is incomplete; knowledge fades without feeling.
==
Back to top
View user's profile Send private message Send e-mail
Dibakar
Advanced


Joined: 02 Dec 2002
Posts: 700
Topics: 63
Location: USA

PostPosted: Fri Sep 19, 2008 1:22 pm    Post subject: Reply with quote

I meant to use following sort instead of the original one to see if the numbers change. That would confirm if duplicates are causing any problem or not.

Code:
//SYSIN    DD  *                                         
 JOINKEYS FILE=F1,FIELDS=(1,10,A)                         
 JOINKEYS FILE=F2,FIELDS=(73,10,A)                       
 REFORMAT FIELDS=(F2:73,10)                               
 SORT     FIELDS=(1,10,CH,A)
 SUM      FIELDS=NONE
/*
Back to top
View user's profile Send private message Send e-mail
Display posts from previous:   
Post new topic   Reply to topic   printer-friendly view    MVSFORUMS.com Forum Index -> Utilities All times are GMT - 5 Hours
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum


MVSFORUMS
Powered by phpBB © 2001, 2005 phpBB Group