Skip to content

Development: Batching

Tony E Lewis edited this page Sep 12, 2017 · 2 revisions

Batching

A key issue is the number of batches into which the work is divided. When the compute cluster is busy, many small jobs can waste a lot of time in queuing.

Commands to get these sorts of numbers:

grep -c 'submit' 3.40.50.620.progress.log
grep -Po '\d+\s+submitted' 3.40.50.620.progress.log | awk '{print $1}' | sort -g | uniq -c | awk '{print $1, "*", $2}' | tr '\n' '+' | sed 's/+$//g' | xargs -I VAR perl -e 'print "".( VAR )."\n"; '