MVSFORUMS.com

bidpar

Hi All
I have a dataset having some duplicate records in them. Now I need to
1) sort the file
2) move part of the second duplicate record to first duplicate record
3) remove the second duplicate record from the file and put in another file.

I have to sort on SORT FIELDS=(1,20,A,CH,389,20,A,CH) .
In the second duplicate record the field having position (389,20) will contains some value but the first duplicate record will contain space. I want that value to be copied to the first duplicate record and then the second duplicate to be moved from this file and written in another file.

Trying doing some R&D with ICETOOL but not successfull so far.

Thanks

kolusu · Posted: Tue Dec 02, 2003 3:25 pm Post subject:

Bidpar,

Can you clarify the following.

1. what is the LRECL, RECFM of the file?
2. Can the file contain more than 1 duplicate? .i.e take a look at the following data.

Frank Yaeger · Posted: Tue Dec 02, 2003 3:40 pm Post subject:

Assuming you only have two dups per key, or you want to splice the first and last records for mulitple dups per key, here's a DFSORT/ICETOOL job that will do what you asked for. You'll need DFSORT R14 PTF UQ90053 (Feb, 2003) to use SPLICE:

bidpar · Posted: Tue Dec 02, 2003 3:47 pm Post subject:

Kolusu/Frank
Thanks for your answer and sorry for the confusion.

No the file cannot contain more than 1 duplicate and one of the duplicate will always contain blank and the other will have some data in the specified record area (389 - 408) .

LRECL = 1000 , RECFM = FB

Frank - Yes I want all the non-dup records in OUT1.

Let me know if you want more info.

Thanks

Frank Yaeger · Posted: Tue Dec 02, 2003 4:24 pm Post subject:

Bidpar,

You can use the DFSORT/ICETOOL job with KEEPNODUPS, that is:

bidpar · Posted: Tue Dec 02, 2003 4:26 pm Post subject:

Frank

Works great.

Thanks everybody.

Regards
Bidpar

kolusu · Posted: Wed Dec 03, 2003 9:50 am Post subject:

Frank,

I have a question regarding SELECT with using parm.Can I have something like this?

bidpar · Posted: Wed Dec 03, 2003 9:53 am Post subject:

There is a slight change of plan.

I want to sort the file on Sort will be on (1,20,CH,A,389,20,CH,A)
Then I want all the unique records as well as the second duplicate of the duplicate records to be in one file and the first duplicate of the dupilcate records to be in another file.
As told earlier , the file cannot contain more than 1 duplicate.

How can I do this.

Regards

Frank Yaeger · Posted: Wed Dec 03, 2003 11:19 am Post subject:

Kolusu,

For SELECT with USING, you can only use the INCLUDE, OMIT, OUTFIL and OPTION statements, not the others. DFSORT's ICETOOL generates statements to pass to DFSORT and specifying those other statements to override the generated statements can mess things up (unless you know EXACTLY what you're doing).
_________________
Frank Yaeger - DFSORT Development Team (IBM)
Specialties: JOINKEYS, FINDREP, WHEN=GROUP, ICETOOL, Symbols, Migration
DFSORT is on the Web at:
www.ibm.com/storage/dfsort

Frank Yaeger · Posted: Wed Dec 03, 2003 11:36 am Post subject:

Bidpar,

For this new variation, it's not clear to me what you want in each output file. Do you still want to join the dup1 and dup2 fields? Please show me an example of what the input records look like and what you want the output files to look like.
_________________
Frank Yaeger - DFSORT Development Team (IBM)
Specialties: JOINKEYS, FINDREP, WHEN=GROUP, ICETOOL, Symbols, Migration
DFSORT is on the Web at:
www.ibm.com/storage/dfsort

bidpar · Posted: Wed Dec 03, 2003 11:53 am Post subject:

Frank
No I dont need to join these 2 fileds anymore. This requirement is much simpler.

Let me give some more clarification to make things simpler.

The key for this file is first 20 bytes. Since my input file is unsorted , I am first sorting by this key.

Now this file can contain duplicates. and it will occure maximum of 1 time in a set. That means I will get either an unique record or a pair of duplicates for any keyvalue.

The only field which will be alway different between these 2 duplicates is (389,20). One of the records from the duplicate pair will always have a blank on this field and the other one will always have some value.
That's why I am sorting it on this filed so that I can get the record with blank on this field first.

Now I want all the unique records and the second record from the pair of duplicates in one file.

If I sort the field (389,20) on descending , then I will need the first record from the pair along with all the unique records.

Then I need rest of the duplicate records in another file.

Let me know if you need more clarification . I will give some example.

Thanks

kolusu · Posted: Wed Dec 03, 2003 12:24 pm Post subject:

Bidpar,
The following DFSORT/ICETOOL JCl will give you the desired results.

bidpar · Posted: Wed Dec 03, 2003 1:01 pm Post subject:

Kolusu
I guess I am not clear yet.

If I sort with the combination of keys (1,20 and 389,20) and select based on them , then all of my records will be unique. Right ?

Duplicate will occur only if I sort on (1,20) . Then from the pair of duplicates I have to select the duplicate for which the field at (389,20) will contain some value (no space). Add them to all the unique records and put them in one single file.

Put rest of the duplicates on another file.

Ex:-
Input file -
1----------20 389 408
1111111111
2222222222 rrrrrrrrrrrrrrr
1111111111 ggggggggggg
4444444444
3333333333 qqqqqqqqqqq
3333333333
7777777777
2222222222
9999999999 eeeeeeeeeeee

Out1 -

1111111111 ggggggggggg
2222222222 rrrrrrrrrrrrrrr
3333333333 qqqqqqqqqqq
4444444444
7777777777
9999999999 eeeeeeeeeeee

Out2 -
1111111111
2222222222
3333333333

All the records those will be in OUT2 file will have spaces in the field 389,20.

Regards

Frank Yaeger · Posted: Wed Dec 03, 2003 3:56 pm Post subject:

Bidpar,

This DFSORT/ICETOOL job will do what you asked for:

bidpar · Posted: Wed Dec 03, 2003 5:07 pm Post subject:

This is what I was looking for.

Thanks Frank and Kolusu

Regards
Bidpar