geetha001 Beginner
Joined: 22 Jun 2005 Posts: 41 Topics: 14
|
Posted: Sun May 23, 2010 6:15 pm Post subject: Performance improvements to a batch COBOL/DB2 insert pgm |
|
|
I have a batch COBOL program that performs slowly. Current information about the program:
Performs multi-row inserts (100 rows at a time)
Inserts into two tables, a parent and a child table
The primary key of each table is a number (a random number generated in the COBOL program)
The average number of inserts into each of the parent and child tables is 100,000 records per day
The parent and child have a one-to-one relationship and are connected through the ID number
Each of these tables has millions of records.
The parent table is not partitioned, but the child table is partitioned on the ID number by range.
LOAD would be a good option, but since the random number has to be generated in the COBOL program, the LOAD option is difficult. Also, GET DIAGNOSTICS is performed if a row insert fails due to a duplicate key; the random number is regenerated a limited number of times and the insert is retried.
Can anybody help me with suggestions on how to improve the performance of this insert job? |
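The generate-insert-retry pattern described above can be modeled outside COBOL. This is a minimal Python sketch, not the actual program: the retry limit, names, and key space are all assumptions (the post only says the number is regenerated "a limited number of times").

```python
import random

MAX_RETRIES = 5  # assumed limit; the post only says "a limited number of times"

def insert_with_retry(existing_keys, gen_key, max_retries=MAX_RETRIES):
    """Generate a key and 'insert' it; on a duplicate, regenerate and retry."""
    for attempt in range(max_retries):
        key = gen_key()
        if key not in existing_keys:   # stands in for the DB2 duplicate-key check
            existing_keys.add(key)
            return key, attempt        # attempt = number of wasted tries
    raise RuntimeError("could not generate a unique key")

# In the real program every failed attempt is a full INSERT round trip to
# DB2, so the retry count is a direct measure of wasted work.
used = set(range(1000))                # pretend keys 0..999 already exist
key, retries = insert_with_retry(used, lambda: random.randrange(10**6))
```

Each retry repeats the whole round trip, so the duplicate rate multiplies the insert cost directly.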
|
Back to top |
|
|
papadi Supermod
Joined: 20 Oct 2009 Posts: 594 Topics: 1
|
Posted: Sun May 23, 2010 6:39 pm Post subject: |
|
|
Quote: | have a batch cobol program which performs slowly | Compared to what? Another task that performs the same work but uses fewer resources? Someone's opinion?
100k is not a huge volume to insert. . .
How much data is read from these and other tables in an execution?
Until more is posted about the "insert" job, it will be quite difficult to suggest things that will improve it.
Have you identified where the lost time is going?
Have you talked with the DBA to see if this has come to their attention? _________________ All the best,
di |
|
Back to top |
|
|
geetha001 Beginner
Joined: 22 Jun 2005 Posts: 41 Topics: 14
|
Posted: Sun May 23, 2010 8:42 pm Post subject: |
|
|
In general the batch program takes a long time to run. The average run time is over an hour. I am looking to optimize this run time as we may have a much higher volume going forward.
Quote: | How much data is read from these and other tables in an execution?
|
This program does not read any data from these tables, but only does an insert into both the parent and child tables. |
|
Back to top |
|
|
kolusu Site Admin
Joined: 26 Nov 2002 Posts: 12375 Topics: 75 Location: San Jose
|
Posted: Sun May 23, 2010 8:50 pm Post subject: |
|
|
geetha001,
Did you run any kind of analysis as to where the program is spending time? Is it in the insert phase or the data preparation phase? Did you run an EXPLAIN on the SQL queries? How is the random number generated? Do you have the option of changing the DB2 table to have an identity column for the random-numbered key? Or how about adding a timestamp field to the DB2 table which would contain the time the key is inserted?
Check this link for running EXPLAIN:
http://www.mvsforums.com/helpboards/viewtopic.php?t=215&highlight=explain
Kolusu |
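For reference, the identity-column suggestion would look roughly like the DDL below. The table and column names are hypothetical (the thread never shows the real schema); the point is that GENERATED ALWAYS AS IDENTITY makes DB2 assign the key itself, which would eliminate both the random-number routine and the duplicate-key retries.

```sql
-- Hypothetical sketch only: real table/column names are not shown in the thread.
CREATE TABLE PARENT_TBL
      (ID         BIGINT NOT NULL
                  GENERATED ALWAYS AS IDENTITY
                  (START WITH 1, INCREMENT BY 1)
      ,CREATED_TS TIMESTAMP NOT NULL WITH DEFAULT   -- when the row was inserted
      -- ... remaining columns ...
      ,PRIMARY KEY (ID)
      )
```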
|
Back to top |
|
|
papadi Supermod
Joined: 20 Oct 2009 Posts: 594 Topics: 1
|
Posted: Sun May 23, 2010 9:01 pm Post subject: |
|
|
If you run the code as-is, with the INSERTs commented (so none are actually done) how long does the program run? _________________ All the best,
di |
|
Back to top |
|
|
geetha001 Beginner
Joined: 22 Jun 2005 Posts: 41 Topics: 14
|
Posted: Mon May 24, 2010 7:20 pm Post subject: |
|
|
Kolusu,
I have the results of the EXPLAIN plan from production. I personally have not run any EXPLAINs in the test region yet. From my understanding of the plan table, I could not see anything alarming. Most of the plan table columns do not give much information for the insert. I do see that the lock mode used during the insert is IX, though.
However, I would like to know if there is anything specific that I should check in the plan table for the inserts.
The 10-digit random number is generated from the current timestamp value.
Can you please clarify/expand on the below
Quote: |
How about adding a timestamp field to the DB2 which would contain the time the key is inserted.
|
Thanks |
|
Back to top |
|
|
papadi Supermod
Joined: 20 Oct 2009 Posts: 594 Topics: 1
|
Posted: Mon May 24, 2010 9:03 pm Post subject: |
|
|
Quote: | If you run the code as-is, with the INSERTs commented (so none are actually done) how long does the program run? | What is the result of running this. . .?
Quote: | Can you please clarify/expand the below | The suggestion is to add a timestamp so you can see the time it takes between inserts.
Quote: | However I would like to know if there is anything specific that I might have to check in the plan table for the inserts. | Suggest you first verify that the "lost time" is due to the inserts. I suspect it may not. . . _________________ All the best,
di |
|
Back to top |
|
|
jsharon1248 Intermediate
Joined: 08 Aug 2007 Posts: 291 Topics: 2 Location: Chicago
|
Posted: Tue May 25, 2010 7:58 am Post subject: |
|
|
Quote: | Also get diagnostics is performed if the row insert fails due to duplicate key, random number is regenerated a limited number of times and the insert is retried. |
Huh? You're generating what's supposed to be a unique key, issuing the insert, and then if the key is already in use, you start all over, and you're wondering why you're only getting 55 inserts a second? Come on. Have you bothered to count the failures? I'll bet you're really issuing 100+ inserts/second but are only successful on half of them. Also, how often do you commit? Just curious. |
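The "55 inserts a second" figure can be checked against the numbers earlier in the thread, assuming the quoted run time of roughly an hour is spent mostly in the inserts:

```python
# 100,000 rows into each of two tables (parent + child) in roughly an hour.
parent_rows = 100_000
child_rows = 100_000
run_seconds = 3_600      # "average run time is over an hour" per the thread

rate = (parent_rows + child_rows) / run_seconds   # successful inserts/second
```

200,000 / 3,600 is about 55.6, which matches the figure in the post; any duplicate-key retries are issued on top of that.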
|
Back to top |
|
|
geetha001 Beginner
Joined: 22 Jun 2005 Posts: 41 Topics: 14
|
Posted: Mon Jun 14, 2010 5:52 pm Post subject: |
|
|
After a lot of research, I found that there is an issue with the way the random number is generated. The current module that generates the random number is based on the current timestamp, and since the ID number is defined as an integer (leading zeroes suppressed), the resulting number can range anywhere from 1 to 10 digits in length (depending on the number of leading zeroes in the random number).
As mentioned by jsharon1248, there are a lot of duplicates generated and retried (I guess that is obvious by now from the above paragraph). Usually it is 10%, but on some days the duplicate-retry rate is 200%.
Now that the issue has been traced to the random number generator, I am looking for help on generating a unique random number of at least 15 digits for every row that is inserted. Although there are some common modules already available to do it, I was wondering if any of you had a good idea for generating such a number quickly (and uniquely), keeping in mind that there will be a huge number of records to insert.
I will continue my research... in the meantime, if any of you have tips, let me know.
Thanks |
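One hedged sketch of a 15-digit key that is unique within a run (the thread did not settle on a method, so this is only an illustration): prefix epoch seconds with an in-run sequence number. The result is unique and roughly time-ordered, but it is no longer random, it assumes fewer than 100,000 keys per second, and a second concurrently running task would still need its own tiebreaker.

```python
import itertools
import time

_seq = itertools.count()          # per-run sequence number

def unique_15_digit_key():
    """10 digits of epoch seconds + 5 digits of in-run sequence = 15 digits."""
    seconds = int(time.time())    # currently a 10-digit number (until year 2286)
    seq = next(_seq) % 100_000    # wraps after 100,000 keys - assume fewer per second
    return seconds * 100_000 + seq

keys = [unique_15_digit_key() for _ in range(1_000)]
```

Because the sequence part only repeats after 100,000 calls, by which time the seconds part has moved on, keys stay unique for any realistic single-task insert rate.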
|
Back to top |
|
|
Dibakar Advanced
Joined: 02 Dec 2002 Posts: 700 Topics: 63 Location: USA
|
Posted: Mon Jun 14, 2010 6:50 pm Post subject: |
|
|
Random numbers are generated for some specific purpose; why do you need one?
All you need is a unique key, and that can be achieved by adding 1 to the max of the existing keys. _________________ Regards,
Diba |
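The MAX+1 idea, sketched with in-memory SQLite standing in for DB2 (an assumption purely for the sake of a runnable example). In DB2 the same effect is safer via an identity column or a SEQUENCE, because SELECT MAX()+1 can hand two concurrent tasks the same value; with this thread's single batch task, though, the idea holds up:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE parent (id INTEGER PRIMARY KEY, payload TEXT)")
conn.executemany("INSERT INTO parent VALUES (?, ?)",
                 [(7, "a"), (42, "b")])        # pre-existing rows

# Next key = highest existing key + 1 (COALESCE covers the empty-table case).
next_id = conn.execute(
    "SELECT COALESCE(MAX(id), 0) + 1 FROM parent").fetchone()[0]
conn.execute("INSERT INTO parent VALUES (?, ?)", (next_id, "c"))
```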
|
Back to top |
|
|
dbzTHEdinosauer Supermod
Joined: 20 Oct 2006 Posts: 1411 Topics: 26 Location: germany
|
Posted: Mon Jun 14, 2010 6:54 pm Post subject: |
|
|
the suggestion for adding a timestamp
in an attempt to measure INSERT speed
will not work,
because the timestamp generated for 100 rows inserted during a mass insert will be the same.
as far as speeding up the process:
why the random number?
you are forcing your INSERTs all over the place;
the effect on index build is what is taking a lot of time,
and unless you are on an old version of DB2 (<6)
(or old DASD - I can't remember the reason)
you don't need to space out your INSERTs:
contiguous inserts are considered more efficient.
also, scattered inserts require continuous reorgs.
question:
within a 100-row insert, is there a combination of parent and related child rows,
or do you insert all 100 parent rows first and then the corresponding 100 into the related table?
as an experiment,
before you change any code,
run a test - create a benchmark.
then turn off the RI on the tables and run a test;
that will tell you what the RI is costing you.
Here is a link to a DB2 guru that explains more about analyzing RI, especially supporting indexes _________________ Dick Brenholtz
American living in Varel, Germany |
|
Back to top |
|
|
geetha001 Beginner
Joined: 22 Jun 2005 Posts: 41 Topics: 14
|
Posted: Mon Jun 14, 2010 7:39 pm Post subject: |
|
|
Quote: |
why the random number?
|
For whatever reason, this table was designed to use a random number. Only now are we seeing the issue with this, and I am trying to find an easy and simple way to fix it (generate the random number in a better way).
Quote: |
contiguous inserts are considered more efficient.
|
I guess you are talking about the APPEND YES option in the newer DB2 version. I have one question here: will there be the overhead of a REORG every day if APPEND YES is used? (Because this is a daily job.)
Quote: |
within a 100-row insert, are there a combination of parent and related child
or do you insert all 100 first and then the corresponding 100 to the related table?
|
The parent and child have a one-to-one relationship.
Yes, the 100 parent rows are inserted first and then the corresponding 100 child rows.
For a simpler change, one of the options was to see how the random number could be generated uniquely, because using an identity key would involve a lot more effort in converting the existing data, etc.
I appreciate all your inputs; thanks to all of you.
I would appreciate any more input on the random number generator.
Thanks |
|
Back to top |
|
|
papadi Supermod
Joined: 20 Oct 2009 Posts: 594 Topics: 1
|
Posted: Mon Jun 14, 2010 10:23 pm Post subject: |
|
|
There is a difference between a unique number and a random number. . .
Might you consider a unique number? _________________ All the best,
di |
|
Back to top |
|
|
dbzTHEdinosauer Supermod
Joined: 20 Oct 2006 Posts: 1411 Topics: 26 Location: germany
|
Posted: Tue Jun 15, 2010 8:41 am Post subject: |
|
|
the basic problem of a random number generator that must also generate a unique number
is that the two processes
(generating a random number & generating a unique number)
are not in sync.
the random number is created by your little routine, and the unique attribute is proved or disproved by the db2 INSERT.
unfortunately, there is no easy min(unused number) function,
so I suggest you determine which unique numbers are not already in use
(dump all the key numbers, pass thru, and
create another table/file of the numbers not in use).
Then use the unused numbers for your inserts.
Until you give up this old/out-dated methodology of skipping around to do inserts
(this philosophy was dreamed up due to many tasks making inserts near simultaneously -
you only have one task making inserts - there is no need to skip around)
you will continually have this problem.
use the file/table containing unused numbers,
build a load file, and use db2 LOAD to INSERT the rows.
dbz wrote: |
Quote:
contiguous inserts are considered more efficient. |
geetha001 wrote: |
I guess you are talking about the Append 'YES' option here with the newer DB2 version. I have one question here, will there be an overhead of Reorg every day if Append 'YES' is used? (because daily job) |
your lack of understanding obviously makes you worry about changing this process.
at present, you have an unacceptable methodology - 200% - even 20% duplicate inserts is unacceptable.
your goal should be to ensure that your inserts are 100% successful - 0% duplicates.
logic would conclude that if you only have to read contiguous data, the reorg process would
be faster than having to jump around or sort everything. _________________ Dick Brenholtz
American living in Varel, Germany |
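The "unused numbers" pass can be sketched as a generator: dump the existing keys (here just an in-memory set, purely an assumption; the real dump would be a sorted file), then hand out numbers that are not already taken.

```python
from itertools import count

def unused_numbers(used, start=1):
    """Yield candidate keys from `start` upward, skipping any already taken."""
    for n in count(start):
        if n not in used:
            yield n

used = {1, 2, 4, 5, 9}                        # keys dumped from the table
gen = unused_numbers(used)
first_three = [next(gen) for _ in range(3)]   # -> [3, 6, 7]
```

With the unused numbers pre-assigned, every insert (or LOAD record) is guaranteed collision-free, so the retry logic disappears entirely.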
|
Back to top |
|
|
NASCAR9 Intermediate
Joined: 08 Oct 2004 Posts: 274 Topics: 52 Location: California
|
Posted: Tue Jun 15, 2010 6:31 pm Post subject: |
|
|
Could this be incorporated in the query?
SELECT RAND()
FROM SYSIBM.SYSDUMMY1; _________________ Thanks,
NASCAR9 |
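On the RAND() suggestion: DB2's RAND() returns a floating-point value in [0, 1), so it would still have to be scaled to an n-digit integer, and like the COBOL routine it guarantees nothing about uniqueness. A sketch of the scaling, with Python standing in for the SQL arithmetic:

```python
def scaled_key(r, digits=10):
    """Map a uniform value r in [0, 1) to an integer in [0, 10**digits)."""
    return int(r * 10**digits)

# Two different draws can still scale to the same key, so the duplicate
# check and retry would remain.
low = scaled_key(0.0)    # 0
mid = scaled_key(0.5)    # 5_000_000_000
```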
|
Back to top |
|
|