MVSFORUMS.com - A Community of and for MVS Professionals

remove duplicates from file with LRECL 3709

 
Martin
Beginner


Posted: Fri Feb 12, 2010 10:50 am    Post subject: remove duplicates from file with LRECL 3709

Hi All,

I have two files with an LRECL of 3709. I need to compare these two files and remove the duplicates.

My question is NOT how to remove the duplicates. I would like to know whether SORT can handle a file with an LRECL of 3709.

P.S.: The sort job keeps throwing the following error message:

"ON" LENGTH IS NOT 1 TO 1500

Any pointers are much appreciated.

Thanks,
Martin
kolusu
Site Admin
Posted: Fri Feb 12, 2010 11:24 am    Post subject: Re: remove duplicates from file with LRECL 3709

Martin,

The limitation of 1500 is for a single ON parm. You can have multiple ON parms, but the total key length for comparison should not exceed 4088 bytes. Try this DFSORT/ICETOOL job

Code:

//STEP0100 EXEC PGM=ICETOOL                                 
//TOOLMSG  DD SYSOUT=*                                       
//DFSMSG   DD SYSOUT=*                                       
//IN       DD DSN=your input fb 3709 file,DISP=SHR
//OUT      DD SYSOUT=*                                       
//TOOLIN   DD *                                             
  SELECT FROM(IN) TO(OUT) NODUPS -                           
  ON(1,1500,CH) ON(1501,1500,CH) ON(3001,709,CH)
//*

_________________
Kolusu
www.linkedin.com/in/kolusu
Martin
Beginner


Posted: Fri Feb 12, 2010 11:40 am

Thanks, Kolusu!!

I will try this out...

I have another question:

What if the LRECL exceeds 4088 bytes? How does SORT handle this?
kolusu
Site Admin
Posted: Fri Feb 12, 2010 11:54 am

Martin wrote:
What if the LRECL exceeds 4088 bytes? How does SORT handle this?


Well, there is a trick for handling it when the total key length exceeds 4088 bytes. However, it would involve multiple passes of the data.
Martin
Beginner


Posted: Fri Feb 12, 2010 12:07 pm

kolusu wrote:
Martin wrote:
What if the LRECL exceeds 4088 bytes? How does SORT handle this?


Well, there is a trick for handling it when the total key length exceeds 4088 bytes. However, it would involve multiple passes of the data.


Could you please share it with lesser mortals like us? :D

I have a file with an LRECL of 23200, and as mentioned in the original post, I need to compare the entire record between TWO files and remove the duplicates.

Help is much appreciated.

Thanks,
Martin
kolusu
Site Admin
Posted: Fri Feb 12, 2010 1:33 pm

Martin wrote:
I have a file with an LRECL of 23200, and as mentioned in the original post, I need to compare the entire record between TWO files and remove the duplicates.


Wow, 23,200 bytes, eh? Do any of these files have duplicates?
Martin
Beginner


Posted: Fri Feb 12, 2010 1:41 pm

kolusu wrote:
Martin wrote:
I have a file with an LRECL of 23200, and as mentioned in the original post, I need to compare the entire record between TWO files and remove the duplicates.


Wow, 23,200 bytes, eh? Do any of these files have duplicates?


Yes... there are duplicate records in both files.
kolusu
Site Admin
Posted: Fri Feb 12, 2010 1:42 pm

Martin,

How do you plan to compare the duplicates? Let's say file 1 has 4 duplicates and file 2 has 12 duplicates; what do you do in that case?
Martin
Beginner


Posted: Fri Feb 12, 2010 2:05 pm

Hi Kolusu,

In this case, I want all the duplicate records from File1 to be removed from the output file.

Ex:

File1
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111


File 2:
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
BBBBBBBB22222
CCCCCCC11111

Output file:

aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
aaaaaaaaa11111
BBBBBBBB22222
CCCCCCC11111

Note: File1 is always a subset of File2. As shown above, the first 4 records from File1 will also be present in File2. In addition, File2 can have 8 more such records, which should NOT be removed.
kolusu
Site Admin
Posted: Fri Feb 12, 2010 2:26 pm

Martin,

As it is, matching with longer keys is complicated, and now you have thrown a monkey wrench into it with the duplicates. I will see if I can come up with an elegant solution.
Martin
Beginner


Posted: Fri Feb 12, 2010 2:49 pm

kolusu wrote:
Martin,

As it is, matching with longer keys is complicated, and now you have thrown a monkey wrench into it with the duplicates. I will see if I can come up with an elegant solution.


Thanks!! If you are unable to come up with a solution, please show me how to match the long keys.
kolusu
Site Admin
Posted: Fri Feb 12, 2010 5:27 pm

Martin wrote:
kolusu wrote:
Martin,

As it is, matching with longer keys is complicated, and now you have thrown a monkey wrench into it with the duplicates. I will see if I can come up with an elegant solution.


Thanks!! If you are unable to come up with a solution, please show me how to match the long keys.



Martin,

I did not get a chance to work on your problem, and IMHO it is NOT possible to do it with SORT given the number of duplicates involved in each file.

Here is a sample DFSORT JCL which will remove duplicate records from a file with an LRECL of 4500.

A brief explanation of the job:

Step0100: Using INREC, we add a flag of "U" (for Unique) at the end of every record. Using the SELECT operator, we pick the LASTDUP on the first 4084 bytes (the maximum) and put those records in the DUP file (these are the potential duplicates); we still need to check the bytes from 4085 to the end of the record to see if they are indeed duplicates. We also override their flag to "D" for Duplicate.

If the first 4084 bytes aren't equal, then we don't have to perform any validation, as the records canNOT be duplicates, so we write them to the UNQ file.

Step0200: Now we concatenate the above files once again (the DUP file should be first in the list) and sort them on the first 4084 bytes with the EQUALS option. Using WHEN=GROUP, we push the contents of the D-flagged record onto the next record.

Using an OMIT condition, we validate the bytes from 4085 to the end of the record in chunks of up to 256 bytes: if they are equal we eliminate the record, and if they aren't equal we write the records out to the output file.

Code:

//STEP0100 EXEC PGM=ICETOOL                                     
//TOOLMSG  DD SYSOUT=*                                         
//DFSMSG   DD SYSOUT=*                                         
//IN       DD DSN=&&INP,DISP=SHR                               
//DUP      DD DSN=&&DUP,DISP=(,PASS),SPACE=(CYL,(1,1),RLSE)     
//UNQ      DD DSN=&&UNQ,DISP=(,PASS),SPACE=(CYL,(1,1),RLSE)     
//TOOLIN   DD *                                                 
  SELECT FROM(IN) TO(DUP) LASTDUP DISCARD(UNQ) -               
  ON(1,1500,CH) ON(1501,1500,CH) ON(3001,1084,CH) USING(CTL1)   
//CTL1CNTL DD *                                                 
  INREC OVERLAY=(4501:C'U')                                     
  OUTFIL FNAMES=DUP,OVERLAY=(4501:C'D')                         
//*                                                             
//STEP0200 EXEC PGM=SORT                                           
//SYSOUT   DD SYSOUT=*                                             
//SORTIN   DD DSN=&&DUP,DISP=SHR                                   
//         DD DSN=&&UNQ,DISP=SHR                                   
//SORTOUT  DD SYSOUT=*                                             
//SYSIN    DD *                                                     
  SORT FIELDS=(1,4084,CH,A),EQUALS                                 
  OUTREC IFTHEN=(WHEN=GROUP,BEGIN=(4501,1,CH,EQ,C'D'),             
  PUSH=(4502:4501,1,1,4500),RECORDS=2)                             
  OUTFIL IFOUTLEN=4500,                                             
  OMIT=(4501,1,CH,EQ,C'D',OR,(4501,002,CH,EQ,C'UD',AND,             
        4085,256,CH,EQ,8587,256,CH,AND,4341,160,CH,EQ,8843,160,CH)),
  IFTHEN=(WHEN=(4501,2,CH,EQ,C'UD'),BUILD=(1,4500,/,4503,4500))     
//*

Sqlcode
Intermediate


Posted: Tue Feb 23, 2010 10:48 pm

Throwing out something which may or may not be possible...

Steps
1) Break each input record (23,200 bytes) into pieces of 4,000 bytes each. This will create 6 records for each input record. Make sure your LRECL is 4000 bytes. You may need multiple passes over each input file. Also create a record-id for each record, which will be used later to merge them back.

Here is what I tested for a 30-byte record, breaking it into 23-byte records (15 bytes of data + an 8-byte record-id). Once again, I don't know if it's correct or not.

In the IFTHEN here, use any condition that is valid for your data. Can we just test the entire record for greater than spaces? I don't know...

Code:
//SORT01   EXEC PGM=SORT                                     
//SORTIN   DD  *                                             
123456789012345ABCDEFGHIJKLMNO                               
111112222233333AAAAABBBBBCCCCC                               
//SORTOUT  DD  DSN=TSOID.SPLIT.TEST,
//             DISP=(,CATLG,DELETE),
//             UNIT=SYSDA,SPACE=(TRK,(1,1),RLSE),
//             RECFM=FB,LRECL=23
//SYSIN DD *                                                 
* Build a 46-byte record: data bytes 1-15, an 8-byte ZD sequence number,
* data bytes 16-30, and the same sequence number again
   INREC BUILD=(1,15,SEQNUM,08,ZD,16,15,SEQNUM,08,ZD)
   SORT FIELDS=COPY                                         
   OUTFIL IFOUTLEN=23,IFTHEN=(WHEN=(1,1,CH,GT,C' '),         
                         BUILD=(01,15,16,08,/,24,15,39,08)) 
/*                                                           
//SYSOUT DD SYSOUT=*                                         
//*                                                         

2) Do your comparison using many-to-many compare logic. Now that the record length is reduced, you may be able to use the solution described above.

3) After your comparison, when you have identified the records to be kept, join them back together using the record-id, as in the rough sketch below.
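
Just to sketch what step 3 could look like for the 23-byte test records above: once you know which pieces to keep, sort them back into record-id order, drop the 8-byte id, and let ICETOOL's RESIZE operator glue every two 15-byte pieces back into a 30-byte record. This is only a rough, untested sketch: TSOID.SPLIT.KEPT and TSOID.SPLIT.RESTORED are made-up dataset names, and it assumes every kept record still has all of its pieces and that the pieces within each record-id are in their original order (EQUALS preserves that order during the sort).

Code:

//RESTORE  EXEC PGM=ICETOOL
//TOOLMSG  DD SYSOUT=*
//DFSMSG   DD SYSOUT=*
//KEPT     DD DSN=TSOID.SPLIT.KEPT,DISP=SHR
//ORDERED  DD DSN=&&ORD,DISP=(,PASS),UNIT=SYSDA,
//            SPACE=(TRK,(1,1),RLSE)
//OUT      DD DSN=TSOID.SPLIT.RESTORED,
//            DISP=(,CATLG,DELETE),
//            UNIT=SYSDA,SPACE=(TRK,(1,1),RLSE),
//            RECFM=FB,LRECL=30
//TOOLIN   DD *
* Group the 23-byte pieces by record-id and strip the id, then
* combine every two 15-byte pieces into one 30-byte record
  SORT FROM(KEPT) TO(ORDERED) USING(CTL1)
  RESIZE FROM(ORDERED) TO(OUT) TOLEN(30)
//CTL1CNTL DD *
  SORT FIELDS=(16,08,ZD,A),EQUALS
  OUTREC BUILD=(1,15)
//*

This only covers the toy 30-byte case. For the real 23,200-byte records the last 4,000-byte piece of each record is only partly filled, so the rebuild would need some extra trimming, and I have not tried it at that size.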