TaskContainer
- class PlasmaCalcs.tools.multiprocessing.TaskContainer(tasks, *args_super, assign_task_idx=True, printable_process_name=None, errors_ok=False, result_missing=RESULT_MISSING, **kw_super)
Bases: Container
A container for multiple tasks; each Task is a function, args, & kwargs.
Calling self will perform all tasks, returning the result (and updating self.result as well).
- tasks: iterable of Task objects
- Tasks to perform. Expected type depends on subclass, see e.g. TaskList or TaskArray.
- assign_task_idx: bool
- whether to assign task.i for each task, based on its position in self.
- printable_process_name: None or str
- include in progress update printouts. If None, use a reasonable default.
- errors_ok: bool, Exception type, or tuple of Exception types
- whether it is okay for some tasks to produce certain errors.
  False –> crash if any task crashes. Equivalent to errors_ok=().
  True –> except Exception (not BaseException, though). Equivalent to errors_ok=Exception.
  Exception type or tuple –> except this type (or these types, if tuple).
- result_missing: any object
- result to record for tasks which crash (if errors_ok!=False). Default RESULT_MISSING.
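The behaviour described above can be sketched without PlasmaCalcs installed. The following is a minimal serial stand-in, not the library's implementation: `MiniTask`, `MiniTaskList`, and the local `RESULT_MISSING` sentinel are hypothetical names used only for illustration, and `errors_ok` is assumed to already be a tuple of Exception types.

```python
# Hypothetical stand-ins; these are NOT the PlasmaCalcs classes.
RESULT_MISSING = object()  # local sentinel, stands in for the library's RESULT_MISSING


class MiniTask:
    """A function plus the args and kwargs to call it with."""
    def __init__(self, f, *args, **kw):
        self.f, self.args, self.kw = f, args, kw
        self.i = None  # position within the container, assigned by the container

    def __call__(self):
        return self.f(*self.args, **self.kw)


class MiniTaskList:
    """Serial sketch of a task container (no multiprocessing)."""
    def __init__(self, tasks, assign_task_idx=True, errors_ok=(),
                 result_missing=RESULT_MISSING):
        self.tasks = list(tasks)
        self.errors_ok = errors_ok  # assumed to be a tuple of Exception types
        self.result_missing = result_missing
        self.result = None
        if assign_task_idx:
            for i, task in enumerate(self.tasks):
                task.i = i

    def __call__(self):
        self.result = [self.result_missing] * len(self.tasks)
        for i, task in enumerate(self.tasks):
            try:
                self.result[i] = task()
            except self.errors_ok:
                pass  # leave result_missing in place for this task
        return self.result


tasks = MiniTaskList(
    [MiniTask(pow, 2, 3), MiniTask(int, "not a number"), MiniTask(max, 4, 7)],
    errors_ok=(ValueError,),
)
results = tasks()  # the second task raises ValueError, which is tolerated here
```

With `errors_ok=()` the same call would propagate the ValueError instead of recording the sentinel.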
Methods
__call__(*[, kw, idx, reset, skip_done, ...])  perform all tasks in self, returning the results.
assign_task_idx()  assign task.i for tasks in self, based on their positions in self.
coarsen([ncoarse, idx])  return a TaskPartition containing TaskGroups of size ncoarse.
enumerate([idx])  iterate through i in idx, yielding (i, self[i]) pairs.
errors_ok_tuple([value])  returns tuple of okay errors.
init_result()  set self.result = container with similar shape as self, filled with RESULT_MISSING.
new_empty([fill])  return a new container of the same shape as self, filled with the value fill.
size([idx])  return the number of objects in the container, or in idx if provided.
_enumerate_all()  iterate through all objs in self, yielding (i, self[i]) pairs.
_size_all()  return the number of objects in the container.
Attributes
errors_ok  tuple of Exception types which are okay for tasks to raise.
printable_process_name  return the name to be used for progress updates, if any.
tasks  alias to data
- __call__(*, kw={}, idx=None, reset=False, skip_done=False, ncpu=None, timeout=None, ncoarse=1, print_freq=None, errors_ok=UNSET, result_missing=UNSET)
perform all tasks in self, returning the results.
OPTIONS (AFFECTS ALL TASKS)
- kw: dict
- kwargs for task will be task.kw, but updated with kw.
  E.g. if task.kw = {'x': 1} and kw = {'y': 2} –> task called with x=1, y=2.
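The merge of task.kw and kw can be illustrated with plain dicts. This is a sketch of the documented outcome only; `task_kw`, `merged`, and `f` are illustrative names, and how the library performs the merge internally is not shown here.

```python
task_kw = {"x": 1}   # plays the role of task.kw
kw = {"y": 2}        # plays the role of the kw passed to __call__

# Entries in kw are applied on top of task_kw, so kw wins on any shared key.
merged = {**task_kw, **kw}


def f(x, y):
    return (x, y)


result = f(**merged)  # called with x=1, y=2

# A key present in both dicts takes its value from kw.
overridden = {**{"x": 1}, **{"x": 5}}
```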
OPTIONS (AFFECTS WHICH TASKS ARE PERFORMED)
- idx: None or iterable of indices
- None –> perform all tasks in self.
  iterable of indices –> perform only these tasks.
- reset: bool
- whether to reset self.result to all RESULT_MISSING, before starting this operation.
- skip_done: bool
- whether to skip tasks that already have a result (i.e. self.result[idx] != RESULT_MISSING).
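How idx, reset, and skip_done combine to select tasks can be sketched as follows. `indices_to_run` is a hypothetical helper operating on a plain list, with a local sentinel standing in for RESULT_MISSING; it compares with `is`, which is an assumption of this sketch.

```python
RESULT_MISSING = object()  # local stand-in sentinel


def indices_to_run(result, idx=None, reset=False, skip_done=False):
    """Return the indices a call would perform (hypothetical helper).

    result is modified in place when reset=True.
    """
    if reset:
        # Forget all earlier results before deciding what to run.
        result[:] = [RESULT_MISSING] * len(result)
    candidates = range(len(result)) if idx is None else idx
    if skip_done:
        # Keep only the tasks that have no recorded result yet.
        return [i for i in candidates if result[i] is RESULT_MISSING]
    return list(candidates)


result = ["a", RESULT_MISSING, "c", RESULT_MISSING]
all_tasks = indices_to_run(result)
only_missing = indices_to_run(result, skip_done=True)
subset_missing = indices_to_run(result, idx=[0, 1], skip_done=True)
after_reset = indices_to_run(list(result), reset=True, skip_done=True)
```

Because reset is applied first, combining reset=True with skip_done=True selects every candidate again.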
OPTIONS (AFFECTS MULTIPROCESSING STRATEGY)
- ncpu: None or int
- max number of cpus to use for multiprocessing.
  None –> use multiprocessing.cpu_count()
  int –> use this value. If 0 or 1, do not use multiprocessing here.
  Note: will actually use min(ncpu, number of calls to be made);
  e.g. if ncpu=4 but len(arg_kw_tuples)=2, will only use 2 cpus.
- timeout: None or int
- max duration, in seconds. Must be None or integer (due to limitations of signal.alarm method).
  None –> no time limit.
  Note: if the time limit is reached, will raise a TimeoutError and save the result so far
  (in this case, any not-yet-calculated values will each be RESULT_MISSING).
- ncoarse: int
- if >1, group tasks into groups of size ncoarse before performing them.
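The ncpu rules above amount to a small calculation. The helpers below are hypothetical (`effective_ncpu` and `use_multiprocessing` are not library functions) and only restate the documented behaviour; the handling of negative values is not documented and is not addressed here.

```python
import multiprocessing


def effective_ncpu(ncpu, ncalls):
    """Number of cpus that would be used for ncalls calls (hypothetical helper)."""
    if ncpu is None:
        ncpu = multiprocessing.cpu_count()
    # Never use more cpus than there are calls to make.
    return min(ncpu, ncalls)


def use_multiprocessing(ncpu):
    """False when ncpu is 0 or 1, as documented for the ncpu option."""
    if ncpu is None:
        ncpu = multiprocessing.cpu_count()
    return ncpu not in (0, 1)


two_calls = effective_ncpu(4, 2)    # only 2 calls, so only 2 cpus
many_calls = effective_ncpu(2, 10)  # capped by ncpu
```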
OPTIONS (MISC)
- print_freq: None, or number (possibly negative or 0)
- >0 –> minimum number of seconds between progress updates.
  =0 –> print every progress update.
  <0 –> never print progress updates.
  None –> use DEFAULTS.PROGRESS_UPDATES_PRINT_FREQ
- errors_ok: UNSET or bool, Exception type, or tuple of Exception types
- whether it is okay for some tasks to produce certain errors.
  False –> crash if any task crashes. Equivalent to errors_ok=().
  True –> except Exception (not BaseException, though). Equivalent to errors_ok=Exception.
  Exception type or tuple –> except this type (or these types, if tuple).
  UNSET –> use self.errors_ok.
- result_missing: UNSET or any object
- result to record for tasks which crash (if errors_ok!=False). Default RESULT_MISSING.
  UNSET –> use self.result_missing.
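The UNSET defaults for errors_ok and result_missing follow a common sentinel pattern: a dedicated object marks "no value passed", so that meaningful values such as False or None can still be given explicitly. The sketch below uses a local `UNSET` and a hypothetical `Holder` class; neither is the library's own object.

```python
class _Unset:
    """Sentinel type meaning 'no value was passed' (stand-in for UNSET)."""
    def __repr__(self):
        return "UNSET"


UNSET = _Unset()


class Holder:
    """Hypothetical object with instance-level defaults, like a task container."""
    def __init__(self, errors_ok=False, result_missing=None):
        self.errors_ok = errors_ok
        self.result_missing = result_missing

    def resolve(self, errors_ok=UNSET, result_missing=UNSET):
        """Return the values a call would use, falling back to instance settings."""
        if errors_ok is UNSET:
            errors_ok = self.errors_ok
        if result_missing is UNSET:
            result_missing = self.result_missing
        return errors_ok, result_missing


h = Holder(errors_ok=True, result_missing="n/a")
defaults = h.resolve()                          # both taken from the instance
strict = h.resolve(errors_ok=False)             # per-call override of errors_ok
none_fill = h.resolve(result_missing=None)      # None is a real value, not "unset"
```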
- _enumerate_all()
iterate through all objs in self, yielding (i, self[i]) pairs.
Equivalent to self.enumerate(idx=None).
The implementation will depend on the container type; subclass should implement.
- _size_all()
return the number of objects in the container.
The implementation will depend on the container type; subclass should implement.
- assign_task_idx()
assign task.i for tasks in self, based on their positions in self.
- coarsen(ncoarse=5, *, idx=None)
return a TaskPartition containing TaskGroups of size ncoarse.
Useful for coarsening a TaskContainer for more efficient multiprocessing;
grouping tasks together can reduce the overhead of multiprocessing,
while still allowing for parallel processing as the groups are run in parallel.
If idx is provided, only group the tasks with those indices.
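The grouping idea can be sketched on plain indices. `group_indices` is a hypothetical helper, not the TaskPartition or TaskGroup machinery; it assumes consecutive grouping and a possibly smaller final group, neither of which is stated in the text above.

```python
def group_indices(ntasks, ncoarse=5, idx=None):
    """Split task indices into consecutive groups of at most ncoarse (sketch)."""
    indices = list(range(ntasks)) if idx is None else list(idx)
    return [indices[start:start + ncoarse]
            for start in range(0, len(indices), ncoarse)]


groups = group_indices(12, ncoarse=5)
# 12 tasks become 3 groups, so a pool would dispatch 3 calls instead of 12.

subset_groups = group_indices(12, ncoarse=2, idx=[1, 4, 7])
# Only the requested indices are grouped.
```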
- enumerate(idx=None)
iterate through i in idx, yielding (i, self[i]) pairs.
If idx is None, iterate through all objs in self (see self._enumerate_all).
- property errors_ok
tuple of Exception types which are okay for tasks to raise.
setting self.errors_ok = False –> use empty tuple, i.e. no errors are okay.
setting self.errors_ok = errtype –> use errors_ok = (errtype,).
setting errors_ok will crash if it includes any parent class of KeyboardInterrupt,
e.g. errors_ok=BaseException will crash, but errors_ok=Exception will be fine.
See also: self.errors_ok_tuple
- errors_ok_tuple(value=UNSET)
returns tuple of okay errors.
UNSET –> self.errors_ok.
False –> ().
errtype –> (errtype,).
If result includes any parent class of KeyboardInterrupt, raises InputError,
e.g. errors_ok_tuple(BaseException) will crash, but errors_ok_tuple(Exception) will be fine.
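The normalization rules above can be sketched as a standalone function. This is an illustration, not the library's code: it omits the UNSET case, raises ValueError where the documentation says InputError, and also rejects KeyboardInterrupt itself, which the text above does not state explicitly.

```python
def errors_ok_tuple(value):
    """Normalize an errors_ok setting to a tuple of exception types (sketch)."""
    if value is False:
        result = ()
    elif value is True:
        result = (Exception,)
    elif isinstance(value, tuple):
        result = value
    else:
        result = (value,)
    for err in result:
        # A parent class of KeyboardInterrupt would also swallow Ctrl-C.
        if issubclass(KeyboardInterrupt, err):
            # Stand-in for the InputError mentioned in the documentation.
            raise ValueError(f"errors_ok must not include {err.__name__}")
    return result


nothing_ok = errors_ok_tuple(False)
any_exception = errors_ok_tuple(True)
single = errors_ok_tuple(ZeroDivisionError)
several = errors_ok_tuple((KeyError, IndexError))
```

Exception is accepted because KeyboardInterrupt derives from BaseException directly, not from Exception.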
- init_result()
set self.result = container with similar shape as self, filled with RESULT_MISSING.
Then, return self.result.
The idea is that self.result[idx] will correspond to the result of self[idx].
- new_empty(fill=UNSET)
return a new container of the same shape as self, filled with the value fill.
The implementation will depend on the container type; subclass should implement.
- property printable_process_name
return the name to be used for progress updates, if any.
If None, use the default: “[type(self)].__call__”.
- size(idx=None)
return the number of objects in the container, or in idx if provided.
- property tasks
alias to data