Monitors an obiwan production run using qdo
Classes
QdoList (outdir[, que_name, skip_succeed, …]) | Queries the qdo db and maps log files to tasks and task status
RunStatus (tasks, logs) | Tallies which QDO_RESULTS actually finished, what errors occurred, etc.
Functions
get_checkpoint_fn (outdir, brick, rowstart) |
get_deldirs (outdir, brick, rowstart[, …]) | If slurm timed out or the job failed, the logfile will exist in the final dir but other outputs will be in the interm dir.
get_final_dir (outdir, brick, rowstart[, …]) | Returns paths like outdir/replaceme/bri/brick/rs0
get_interm_dir (outdir, brick, rowstart[, …]) | Returns paths like outdir/bri/brick/rs0
get_logdir (outdir, brick, rowstart[, …]) |
get_logfile (outdir, brick, rowstart[, …]) |
get_slurm_files (outdir) |
obiwan.runmanager.status.QdoList(outdir, que_name='obiwan', skip_succeed=False, rand_num=None, firstN=None)
Queries the qdo db and maps log files to tasks and task status
change_task_state(task_ids, to=None, modify=False, rm_files=False)
Changes the state of the qdo tasks with the given task_ids to pending, failed, etc.
obiwan.runmanager.status.RunStatus(tasks, logs)
Tallies which QDO_RESULTS actually finished, what errors occurred, etc.
Args:
tasks: dict, each key maps to a list of qdo tasks
logs: dict, each key maps to a list of log files for each task
Defaults:
regex_errs: list of regular expressions matching possible log file errors
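As a rough illustration of how a list like regex_errs could drive the tally, the sketch below classifies log text by the first pattern that matches. The patterns and the names REGEX_ERRS, first_error and tally_errors are hypothetical; the actual defaults used by RunStatus are not listed on this page.

```python
import re
from collections import Counter

# Hypothetical patterns for illustration only; the real regex_errs
# defaults of RunStatus are not shown in this page.
REGEX_ERRS = [
    r"MemoryError",
    r"Traceback \(most recent call last\)",
    r"DUE TO TIME LIMIT",
]


def first_error(log_text, patterns=REGEX_ERRS):
    """Return the first pattern that matches log_text, or None."""
    for pat in patterns:
        if re.search(pat, log_text):
            return pat
    return None


def tally_errors(log_texts, patterns=REGEX_ERRS):
    """Count how many logs fall under each pattern (or under 'no match')."""
    counts = Counter()
    for text in log_texts:
        counts[first_error(text, patterns) or "no match"] += 1
    return counts
```

The sketch takes log contents as strings so that it stays self-contained; in a real run the text would presumably be read from the log files mapped to each task.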
obiwan.runmanager.status.get_deldirs(outdir, brick, rowstart, do_skipids='no', do_more='no')
If slurm timed out or the job failed, the logfile will exist in the final dir but other outputs will be in the interm dir. Returns a list of dirs covering all of these.
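The directory layout described above can be sketched as follows. This is only an approximation built from the example paths outdir/replaceme/bri/brick/rs0 and outdir/bri/brick/rs0: it assumes that "bri" is the first three characters of the brick name and that "rs0" is "rs" followed by rowstart. The sketch_* function names and the dirname parameter are hypothetical, and the do_skipids and do_more options are ignored because their effect on the paths is not documented here.

```python
import posixpath


def sketch_interm_dir(outdir, brick, rowstart):
    # Assumption: "bri" is brick[:3] and the last component is "rs" + rowstart.
    return posixpath.join(outdir, brick[:3], brick, "rs%s" % rowstart)


def sketch_final_dir(outdir, brick, rowstart, dirname="replaceme"):
    # "replaceme" is kept as the literal placeholder from the documented
    # example path; dirname is a hypothetical parameter for this sketch.
    return posixpath.join(outdir, dirname, brick[:3], brick, "rs%s" % rowstart)


def sketch_deldirs(outdir, brick, rowstart):
    # Assumption: the dirs to clean up are the final dir (which holds the
    # logfile) and the interm dir (which holds the other outputs).
    return [
        sketch_final_dir(outdir, brick, rowstart),
        sketch_interm_dir(outdir, brick, rowstart),
    ]
```

Returning both locations reflects the point made in the docstring: after a timeout or failure, the outputs of a single task are split across the final and interm dirs.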