kmrwrapper.py - combines the file-splitting and job-script generation scripts
kmrwrapper.py [-n num] [-e nodes] [-s pattern] [-p prefix] [-o outfile] [-d indir] [-O outdir] [-t rsctime] [-S sched] [-w scrfile] [-f] -m mapper -r reducer file
kmrwrapper.py splits a file and generates a job-script. It wraps the programs "kmrfsplit.py" and "kmrgenscript.py".
The following options are supported:
-n num
Specifies the number of parts into which the input file is split. Default is 1.
-e nodes
Specifies the number of processes to execute the job. Default is 1.
-s pattern
Specifies the separator pattern used to split the input file. A regular expression is accepted. Default is "\n".
-p prefix
Specifies the prefix for input file names. Default is "part".
-o outfile
Specifies the prefix for output file names. Default is "output".
-d indir
Specifies an input directory. Default is "./work".
-O outdir
Specifies the output directory that holds the job results. Default is "./outdir".
-t rsctime
Specifies the time limit requested in the job resources, given in "00:00:00" format. Default is "00:10:00".
-S sched
Specifies a scheduler. Only "K" is supported.
-w scrfile
Specifies the file to which the generated job-script is written. Default is STDOUT.
-f
Forces creation of the output directory if it does not exist.
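As an illustration of how the options above combine, the following sketch assembles a kmrwrapper.py command line in Python. The helper name build_kmrwrapper_command and the file names "./mapper.py", "./reducer.py", "input.txt", and "job.sh" are hypothetical and used only for illustration. The sketch assumes kmrwrapper.py is on the PATH. It only builds and prints the command; it does not run it.

```python
import shlex


def build_kmrwrapper_command(mapper, reducer, input_file, *,
                             num_splits=1, num_procs=1,
                             indir="./work", outdir="./outdir",
                             rsctime="00:10:00",
                             script_file=None, force=False):
    """Assemble an argument list using the options documented above.

    Keyword defaults mirror the documented defaults. This helper is a
    hypothetical convenience and is not part of kmrwrapper.py.
    """
    argv = [
        "kmrwrapper.py",
        "-n", str(num_splits),   # number of parts to split the input into
        "-e", str(num_procs),    # number of processes for the job
        "-d", indir,             # input directory
        "-O", outdir,            # output directory for job results
        "-t", rsctime,           # time limit, "00:00:00" format
        "-S", "K",               # the only supported scheduler
    ]
    if script_file is not None:
        argv += ["-w", script_file]   # otherwise the script goes to STDOUT
    if force:
        argv.append("-f")             # create the output directory if missing
    # Required arguments come last: mapper, reducer, and the input file.
    argv += ["-m", mapper, "-r", reducer, input_file]
    return argv


# Hypothetical mapper, reducer, and input file names.
argv = build_kmrwrapper_command("./mapper.py", "./reducer.py", "input.txt",
                                num_splits=4, num_procs=4,
                                script_file="job.sh", force=True)
print(shlex.join(argv))
```

The list returned by the helper can be passed directly to subprocess.run if the command should actually be executed.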