Title: | A Distributed Worker Launcher Framework |
---|---|
Description: | In computationally demanding analysis projects, statisticians and data scientists asynchronously deploy long-running tasks to distributed systems, ranging from traditional clusters to cloud services. The 'NNG'-powered 'mirai' R package by Gao (2023) <doi:10.5281/zenodo.7912722> is a sleek and sophisticated scheduler that efficiently processes these intense workloads. The 'crew' package extends 'mirai' with a unifying interface for third-party worker launchers. Inspiration also comes from packages. 'future' by Bengtsson (2021) <doi:10.32614/RJ-2021-048>, 'rrq' by FitzJohn and Ashton (2023) <https://github.com/mrc-ide/rrq>, 'clustermq' by Schubert (2019) <doi:10.1093/bioinformatics/btz284>), and 'batchtools' by Lang, Bischel, and Surmann (2017) <doi:10.21105/joss.00135>. |
Authors: | William Michael Landau [aut, cre] , Daniel Woodie [ctb], Eli Lilly and Company [cph, fnd] |
Maintainer: | William Michael Landau <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.10.2.9001 |
Built: | 2024-11-21 20:29:39 UTC |
Source: | https://github.com/wlandau/crew |
In computationally demanding analysis projects,
statisticians and data scientists asynchronously deploy
long-running tasks to distributed systems, ranging from
traditional clusters to cloud services.
The NNG-powered
mirai
R package is a sleek and sophisticated scheduler
that efficiently processes these intense workloads.
The crew
package extends
mirai
with a unifying interface
for third-party worker launchers.
Inspiration also comes from packages
future
,
rrq
,
clustermq
,
and batchtools
.
Assert that a condition is true.
crew_assert(value = NULL, ..., message = NULL, envir = parent.frame())
crew_assert(value = NULL, ..., message = NULL, envir = parent.frame())
value |
An object or condition. |
... |
Conditions that use the |
message |
Optional message to print on error. |
envir |
Environment to evaluate the condition. |
NULL
(invisibly). Throws an error if the condition is not true.
Other utility:
crew_clean()
,
crew_deprecate()
,
crew_eval()
,
crew_random_name()
,
crew_retry()
,
crew_terminate_process()
,
crew_terminate_signal()
,
crew_worker()
crew_assert(1 < 2) crew_assert("object", !anyNA(.), nzchar(.)) tryCatch( crew_assert(2 < 1), crew_error = function(condition) message("false") )
crew_assert(1 < 2) crew_assert("object", !anyNA(.), nzchar(.)) tryCatch( crew_assert(2 < 1), crew_error = function(condition) message("false") )
Create an R6
object to manage local asynchronous quick
tasks with error detection.
crew_async(workers = NULL)
crew_async(workers = NULL)
workers |
Number of local |
crew_async()
objects are created inside launchers to allow
launcher plugins to run local tasks asynchronously, such as
calls to cloud APIs to launch serious remote workers.
An R6
async client object.
Other async:
crew_class_async
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { x <- crew_async() x$start() out <- x$eval(1 + 1) mirai::call_mirai_(out) out$data # 2 x$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { x <- crew_async() x$start() out <- x$eval(1 + 1) mirai::call_mirai_(out) out$data # 2 x$terminate() }
R6
async class.R6
class for async configuration.
See crew_async()
.
workers
See crew_async()
.
instance
Name of the current instance.
new()
TLS configuration constructor.
crew_class_async$new(workers = NULL)
workers
Argument passed from crew_async()
.
An R6
object with TLS configuration.
validate()
Validate the object.
crew_class_async$validate()
NULL
(invisibly).
start()
Start the local workers and error handling socket.
crew_class_async$start()
Does not create workers or an error handling socket
if workers
is NULL
or the object is already started.
NULL
(invisibly).
terminate()
Start the local workers and error handling socket.
crew_class_async$terminate()
Waits for existing tasks to complete first.
NULL
(invisibly).
started()
Show whether the object is started.
crew_class_async$started()
Logical of length 1, whether the object is started.
asynchronous()
Show whether the object is asynchronous (has real workers).
crew_class_async$asynchronous()
Logical of length 1, whether the object is asynchronous.
eval()
Run a local asynchronous task using a local compute profile.
crew_class_async$eval( command, substitute = TRUE, data = list(), packages = character(0L), library = NULL )
command
R code to run.
substitute
Logical of length 1, whether to substitute command
.
If FALSE
, then command
must be an expression object
or language object.
data
Named list of data objects required to run command
.
packages
Character vector of packages to load.
library
Character vector of library paths to load the packages from.
Used for launcher plugins with asynchronous launches and
terminations. If processes
is NULL
, the task will run locally.
Otherwise, the task will run on a local process in the local
mirai
compute profile.
If the processes
field is NULL
, a list with an object named
data
containing the result of evaluating expr
synchronously.
Otherwise, the task is evaluated asynchronously, and the result
is a mirai
task object. Either way, the data
element
of the return value will contain the result of the task.
Other async:
crew_async()
R6
client class.R6
class for mirai
clients.
See crew_client()
.
name
See crew_client()
.
workers
See crew_client()
.
host
See crew_client()
.
port
See crew_client()
.
tls
See crew_client()
.
seconds_interval
See crew_client()
.
seconds_timeout
See crew_client()
.
retry_tasks
See crew_client()
relay
Relay object for event-driven programming on a downstream condition variable.
started
Whether the client is started.
client
Process ID of the local process running the client.
dispatcher
Process ID of the mirai
dispatcher
new()
mirai
client constructor.
crew_class_client$new( name = NULL, workers = NULL, host = NULL, port = NULL, tls = NULL, seconds_interval = NULL, seconds_timeout = NULL, retry_tasks = NULL, relay = NULL )
name
Argument passed from crew_client()
.
workers
Argument passed from crew_client()
.
host
Argument passed from crew_client()
.
port
Argument passed from crew_client()
.
tls
Argument passed from crew_client()
.
seconds_interval
Argument passed from crew_client()
.
seconds_timeout
Argument passed from crew_client()
.
retry_tasks
Argument passed from crew_client()
.
relay
Argument passed from crew_client()
.
An R6
object with the client.
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() client$log() client$terminate() }
validate()
Validate the client.
crew_class_client$validate()
NULL
(invisibly).
start()
Start listening for workers on the available sockets.
crew_class_client$start()
NULL
(invisibly).
terminate()
Stop the mirai client and disconnect from the worker websockets.
crew_class_client$terminate()
NULL
(invisibly).
condition()
Get the nanonext
condition variable which tasks signal
on resolution.
crew_class_client$condition()
The nanonext
condition variable which tasks signal
on resolution. The return value is NULL
if the client
is not running.
resolved()
Get the true value of the nanonext
condition variable.
crew_class_client$resolved()
The value of the nanonext
condition variable.
summary()
Show an informative worker log.
crew_class_client$summary()
A tibble
with information on the workers, or NULL
if the client is not started. The tibble
has 1 row
per worker and the following columns:
worker
: integer index of the worker.
online
: TRUE
if the worker is online and connected to the
websocket URL, FALSE
otherwise.
instances
: integer, number of instances of mirai
daemons
(crew
workers) that have connected to the websocket URL
during the life cycle of the listener.
assigned
: number of tasks assigned to the current websocket URL.
complete
: number of tasks completed at the current websocket URL.
socket
: websocket URL. crew
changes the token at the end of the
URL path periodically as a safeguard while managing workers.
pids()
Get the process IDs of the local process and the
mirai
dispatcher (if started).
crew_class_client$pids()
An integer vector of process IDs of the local process and the
mirai
dispatcher (if started).
Other client:
crew_client()
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() client$log() client$terminate() } ## ------------------------------------------------ ## Method `crew_class_client$new` ## ------------------------------------------------ if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() client$log() client$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() client$log() client$terminate() } ## ------------------------------------------------ ## Method `crew_class_client$new` ## ------------------------------------------------ if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() client$log() client$terminate() }
R6
class for controllers.
See crew_controller()
.
client
Router object.
launcher
Launcher object.
tasks
A list of mirai::mirai()
task objects.
pushed
Number of tasks pushed since the controller was started.
popped
Number of tasks popped since the controller was started.
error
Tibble of task results (with one result per row)
from the last call to map(error = "stop)
.
backlog
Character vector of explicitly backlogged tasks.
autoscaling
TRUE
or FALSE
, whether async later
-based
auto-scaling is currently running
new()
mirai
controller constructor.
crew_class_controller$new(client = NULL, launcher = NULL)
client
Router object. See crew_controller()
.
launcher
Launcher object. See crew_controller()
.
An R6
controller object.
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() launcher <- crew_launcher_local() controller <- crew_controller(client = client, launcher = launcher) controller$start() controller$push(name = "task", command = sqrt(4)) controller$wait() controller$pop() controller$terminate() }
validate()
Validate the client.
crew_class_controller$validate()
NULL
(invisibly).
empty()
Check if the controller is empty.
crew_class_controller$empty(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
A controller is empty if it has no running tasks
or completed tasks waiting to be retrieved with push()
.
TRUE
if the controller is empty, FALSE
otherwise.
nonempty()
Check if the controller is nonempty.
crew_class_controller$nonempty(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
A controller is empty if it has no running tasks
or completed tasks waiting to be retrieved with push()
.
TRUE
if the controller is empty, FALSE
otherwise.
resolved()
Number of resolved mirai()
tasks.
crew_class_controller$resolved(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
resolved()
is cumulative: it counts all the resolved
tasks over the entire lifetime of the controller session.
Non-negative integer of length 1,
number of resolved mirai()
tasks.
The return value is 0 if the condition variable does not exist
(i.e. if the client is not running).
unresolved()
Number of unresolved mirai()
tasks.
crew_class_controller$unresolved(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
Non-negative integer of length 1,
number of unresolved mirai()
tasks.
unpopped()
Number of resolved mirai()
tasks available via pop()
.
crew_class_controller$unpopped(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
Non-negative integer of length 1,
number of resolved mirai()
tasks available via pop()
.
saturated()
Check if the controller is saturated.
crew_class_controller$saturated( collect = NULL, throttle = NULL, controller = NULL )
collect
Deprecated in version 0.5.0.9003 (2023-10-02). Not used.
throttle
Deprecated in version 0.5.0.9003 (2023-10-02). Not used.
controller
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
A controller is saturated if the number of unresolved tasks
is greater than or equal to the maximum number of workers.
In other words, in a saturated controller, every available worker
has a task.
You can still push tasks to a saturated controller, but
tools that use crew
such as targets
may choose not to.
TRUE
if the controller is saturated, FALSE
otherwise.
start()
Start the controller if it is not already started.
crew_class_controller$start(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
Register the mirai client and register worker websockets with the launcher.
NULL
(invisibly).
started()
Check whether the controller is started.
crew_class_controller$started(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
Actually checks whether the client is started.
TRUE
if the controller is started, FALSE
otherwise.
launch()
Launch one or more workers.
crew_class_controller$launch(n = 1L, controllers = NULL)
n
Number of workers to try to launch. The actual
number launched is capped so that no more than "workers
"
workers running at a given time, where "workers
"
is an argument of crew_controller()
. The
actual cap is the "workers
" argument minus the number of connected
workers minus the number of starting workers. A "connected"
worker has an active websocket connection to the mirai
client,
and "starting" means that the worker was launched at most
seconds_start
seconds ago, where seconds_start
is
also an argument of crew_controller()
.
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
NULL
(invisibly).
scale()
Auto-scale workers out to meet the demand of tasks.
crew_class_controller$scale(throttle = TRUE, controllers = NULL)
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
The scale()
method re-launches all inactive backlogged
workers, then any additional inactive workers needed to
accommodate the demand of unresolved tasks. A worker is
"backlogged" if it was assigned more tasks than it has completed
so far.
Methods push()
, pop()
, and wait()
already invoke
scale()
if the scale
argument is TRUE
.
For finer control of the number of workers launched,
call launch()
on the controller with the exact desired
number of workers.
NULL
(invisibly).
autoscale()
Run worker auto-scaling in a private later
loop
every controller$client$seconds_interval
seconds.
crew_class_controller$autoscale(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
Call controller$descale()
to terminate the
auto-scaling loop.
NULL
(invisibly).
descale()
Terminate the auto-scaling loop started by
controller$autoscale()
.
crew_class_controller$descale(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
NULL
(invisibly).
push()
Push a task to the head of the task list.
crew_class_controller$push( command, data = list(), globals = list(), substitute = TRUE, seed = NULL, algorithm = NULL, packages = character(0), library = NULL, seconds_timeout = NULL, scale = TRUE, throttle = TRUE, name = NA_character_, save_command = FALSE, controller = NULL )
command
Language object with R code to run.
data
Named list of local data objects in the evaluation environment.
globals
Named list of objects to temporarily assign to the
global environment for the task.
This list should
include any functions you previously defined in the global
environment which are required to run tasks.
See the reset_globals
argument
of crew_controller_local()
.
substitute
Logical of length 1, whether to call
base::substitute()
on the supplied value of the
command
argument. If TRUE
(default) then command
is quoted
literally as you write it, e.g.
push(command = your_function_call())
. If FALSE
, then crew
assumes command
is a language object and you are passing its
value, e.g. push(command = quote(your_function_call()))
.
substitute = TRUE
is appropriate for interactive use,
whereas substitute = FALSE
is meant for automated R programs
that invoke crew
controllers.
seed
Integer of length 1 with the pseudo-random number generator
seed to set for the evaluation of the task. Passed to the
seed
argument of set.seed()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
algorithm
Integer of length 1 with the pseudo-random number
generator algorithm to set for the evaluation of the task.
Passed to the kind
argument of RNGkind()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
recommended widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
packages
Character vector of packages to load for the task.
library
Library path to load the packages. See the lib.loc
argument of require()
.
seconds_timeout
Optional task timeout passed to the .timeout
argument of mirai::mirai()
(after converting to milliseconds).
scale
Logical, whether to automatically call scale()
to auto-scale workers to meet the demand of the task load. Also
see the throttle
argument.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
name
Optional name of the task. Must be a character string
or NA
.
save_command
Logical of length 1. If TRUE
, the controller
deparses the command and returns it with the output on pop()
.
If FALSE
(default), the controller skips this step to
increase speed.
controller
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
Invisibly return the mirai
object of the pushed task.
This allows you to interact with the task directly, e.g.
to create a promise object with promises::as.promise()
.
walk()
Apply a single command to multiple inputs, and return control to the user without waiting for any task to complete.
crew_class_controller$walk( command, iterate, data = list(), globals = list(), substitute = TRUE, seed = NULL, algorithm = NULL, packages = character(0), library = NULL, seconds_timeout = NULL, names = NULL, save_command = FALSE, scale = TRUE, throttle = TRUE, controller = NULL )
command
Language object with R code to run.
iterate
Named list of vectors or lists to iterate over.
For example, to run function calls
f(x = 1, y = "a")
and f(x = 2, y = "b")
,
set command
to f(x, y)
, and set iterate
to
list(x = c(1, 2), y = c("a", "b"))
. The individual
function calls are evaluated as
f(x = iterate$x[[1]], y = iterate$y[[1]])
and
f(x = iterate$x[[2]], y = iterate$y[[2]])
.
All the elements of iterate
must have the same length.
If there are any name conflicts between iterate
and data
,
iterate
takes precedence.
data
Named list of constant local data objects in the evaluation environment. Objects in this list are treated as single values and are held constant for each iteration of the map.
globals
Named list of constant objects to temporarily
assign to the global environment for each task. This list should
include any functions you previously defined in the global
environment which are required to run tasks.
See the reset_globals
argument of crew_controller_local()
.
Objects in this list are treated as single
values and are held constant for each iteration of the map.
substitute
Logical of length 1, whether to call
base::substitute()
on the supplied value of the
command
argument. If TRUE
(default) then command
is quoted
literally as you write it, e.g.
push(command = your_function_call())
. If FALSE
, then crew
assumes command
is a language object and you are passing its
value, e.g. push(command = quote(your_function_call()))
.
substitute = TRUE
is appropriate for interactive use,
whereas substitute = FALSE
is meant for automated R programs
that invoke crew
controllers.
seed
Integer of length 1 with the pseudo-random number generator
seed to set for the evaluation of the task. Passed to the
seed
argument of set.seed()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
recommended widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
algorithm
Integer of length 1 with the pseudo-random number
generator algorithm to set for the evaluation of the task.
Passed to the kind
argument of RNGkind()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
recommended widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
packages
Character vector of packages to load for the task.
library
Library path to load the packages. See the lib.loc
argument of require()
.
seconds_timeout
Optional task timeout passed to the .timeout
argument of mirai::mirai()
(after converting to milliseconds).
names
Optional character of length 1, name of the element of
iterate
with names for the tasks. If names
is supplied,
then iterate[[names]]
must be a character vector.
save_command
Logical of length 1, whether to store a text string version of the R command in the output.
scale
Logical, whether to automatically scale workers to meet
demand. See also the throttle
argument.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
controller
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
In contrast to walk()
, map()
blocks the local R session
and waits for all tasks to complete.
Invisibly returns a list of mirai
task objects for the
newly created tasks. The order of tasks in the list matches the
order of data in the iterate
argument.
map()
Apply a single command to multiple inputs, wait for all tasks to complete, and return the results of all tasks.
crew_class_controller$map( command, iterate, data = list(), globals = list(), substitute = TRUE, seed = NULL, algorithm = NULL, packages = character(0), library = NULL, seconds_interval = 0.5, seconds_timeout = NULL, names = NULL, save_command = FALSE, error = "stop", warnings = TRUE, verbose = interactive(), scale = TRUE, throttle = TRUE, controller = NULL )
command
Language object with R code to run.
iterate
Named list of vectors or lists to iterate over.
For example, to run function calls
f(x = 1, y = "a")
and f(x = 2, y = "b")
,
set command
to f(x, y)
, and set iterate
to
list(x = c(1, 2), y = c("a", "b"))
. The individual
function calls are evaluated as
f(x = iterate$x[[1]], y = iterate$y[[1]])
and
f(x = iterate$x[[2]], y = iterate$y[[2]])
.
All the elements of iterate
must have the same length.
If there are any name conflicts between iterate
and data
,
iterate
takes precedence.
data
Named list of constant local data objects in the evaluation environment. Objects in this list are treated as single values and are held constant for each iteration of the map.
globals
Named list of constant objects to temporarily
assign to the global environment for each task. This list should
include any functions you previously defined in the global
environment which are required to run tasks.
See the reset_globals
argument of crew_controller_local()
.
Objects in this list are treated as single
values and are held constant for each iteration of the map.
substitute
Logical of length 1, whether to call
base::substitute()
on the supplied value of the
command
argument. If TRUE
(default) then command
is quoted
literally as you write it, e.g.
push(command = your_function_call())
. If FALSE
, then crew
assumes command
is a language object and you are passing its
value, e.g. push(command = quote(your_function_call()))
.
substitute = TRUE
is appropriate for interactive use,
whereas substitute = FALSE
is meant for automated R programs
that invoke crew
controllers.
seed
Integer of length 1 with the pseudo-random number generator
seed to set for the evaluation of the task. Passed to the
seed
argument of set.seed()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
recommended widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
algorithm
Integer of length 1 with the pseudo-random number
generator algorithm to set for the evaluation of the task.
Passed to the kind
argument of RNGkind()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
recommended widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
packages
Character vector of packages to load for the task.
library
Library path to load the packages. See the lib.loc
argument of require()
.
seconds_interval
Number of seconds to wait between auto-scaling operations while waiting for tasks to complete.
seconds_timeout
Optional task timeout passed to the .timeout
argument of mirai::mirai()
(after converting to milliseconds).
names
Optional character of length 1, name of the element of
iterate
with names for the tasks. If names
is supplied,
then iterate[[names]]
must be a character vector.
save_command
Logical of length 1, whether to store a text string version of the R command in the output.
error
Character of length 1, choice of action if a task has an error. Possible values:
"stop"
: throw an error in the main R session instead of returning
a value. In case of an error, the results from the last errored
map()
are in the error
field
of the controller, e.g. controller_object$error
. To reduce
memory consumption, set controller_object$error <- NULL
after
you are finished troubleshooting.
"warn"
: throw a warning. This allows the return value with
all the error messages and tracebacks to be generated.
"silent"
: do nothing special.
warnings
Logical of length 1, whether to throw a warning in the interactive session if at least one task encounters an error.
verbose
Logical of length 1, whether to print progress messages.
scale
Logical, whether to automatically scale workers to meet
demand. See also the throttle
argument.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
controller
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
map()
cannot be used unless all prior tasks are
completed and popped. You may need to wait and then pop them
manually. Alternatively, you can start over: either call
terminate()
on the current controller object to reset it, or
create a new controller object entirely.
A tibble
of results and metadata: one row per task
and columns corresponding to the output of pop()
.
pop()
Pop a completed task from the results data frame.
crew_class_controller$pop( scale = TRUE, collect = NULL, throttle = TRUE, error = NULL, controllers = NULL )
scale
Logical of length 1,
whether to automatically call scale()
to auto-scale workers to meet the demand of the task load.
Scaling up on pop()
may be important
for transient or nearly transient workers that tend to drop off
quickly after doing little work.
See also the throttle
argument.
collect
Deprecated in version 0.5.0.9003 (2023-10-02).
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
error
NULL
or character of length 1, choice of action if
the popped task threw an error. Possible values:
"stop"
: throw an error in the main R session instead of returning
a value.
"warn"
: throw a warning.
NULL
or "silent"
: do not react to errors.
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
If not task is currently completed, pop()
will attempt to auto-scale workers as needed.
If there is no task to collect, return NULL
. Otherwise,
return a one-row tibble
with the following columns.
name
: the task name if given.
command
: a character string with the R command if save_command
was set to TRUE
in push()
.
result
: a list containing the return value of the R command.
seconds
: number of seconds that the task ran.
seed
: the single integer originally supplied to push()
,
NA
otherwise. The pseudo-random number generator state
just prior to the task can be restored using
set.seed(seed = seed, kind = algorithm)
, where seed
and
algorithm
are part of this output.
algorithm
: name of the pseudo-random number generator algorithm
originally supplied to push()
,
NA
otherwise. The pseudo-random number generator state
just prior to the task can be restored using
set.seed(seed = seed, kind = algorithm)
, where seed
and
algorithm
are part of this output.
status
: a character string. "success"
if the task did not
throw an error, "cancel"
if the task was canceled with
the cancel()
controller method, or "error"
if the task
threw an error.
code
: an integer code denoting the specific exit status:
0
for successful tasks, 1
for tasks with an error in the R
command of the task, and another positive integer with an NNG
status code if there is an error at the NNG/nanonext
level.
error
: the first 2048 characters of the error message if
the task threw an error, NA
otherwise.
trace
: the first 2048 characters of the text of the traceback
if the task threw an error, NA
otherwise.
warnings
: the first 2048 characters. of the text of
warning messages that the task may have generated, NA
otherwise.
launcher
: name of the crew
launcher where the task ran.
collect()
Pop all available task results and return them in a tidy
tibble
.
crew_class_controller$collect( scale = TRUE, throttle = TRUE, error = NULL, controllers = NULL )
scale
Logical of length 1,
whether to automatically call scale()
to auto-scale workers to meet the demand of the task load.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
error
NULL
or character of length 1, choice of action if
the popped task threw an error. Possible values:
"stop"
: throw an error in the main R session instead of returning
a value.
"warn"
: throw a warning.
NULL
or "silent"
: do not react to errors.
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
A tibble
of results and metadata of all resolved tasks,
with one row per task. Returns NULL
if there are no tasks
to collect.
promise()
Create a promises::promise()
object to asynchronously
pop or collect one or more tasks.
crew_class_controller$promise( mode = "one", seconds_interval = 0.1, scale = NULL, throttle = NULL, controllers = NULL )
mode
Character of length 1, what kind of promise to create.
mode
must be "one"
or "all"
. Details:
If mode
is "one"
, then the promise is fulfilled (or rejected)
when at least one task is resolved and available to pop()
.
When that happens, pop()
runs asynchronously, pops a result off
the task list, and returns a value.
If the task succeeded, then the promise
is fulfilled and its value is the result of pop()
(a one-row
tibble
with the result and metadata). If the task threw an error,
the error message of the task is forwarded to any error callbacks
registered with the promise.
If mode
is "all"
, then the promise is fulfilled (or rejected)
when there are no unresolved tasks left in the controller.
(Be careful: this condition is trivially met in the moment
if the controller is empty and you have not submitted any tasks,
so it is best to create this kind of promise only after you
submit tasks.)
When there are no unresolved tasks left,
collect()
runs asynchronously, pops all available results
off the task list, and returns a value.
If the task succeeded, then the promise
is fulfilled and its value is the result of collect()
(a tibble
with one row per task result). If any of the tasks
threw an error, then the first error message detected is forwarded
to any error callbacks registered with the promise.
seconds_interval
Positive numeric of length 1, delay in the
later::later()
polling interval to asynchronously check if
the promise can be resolved.
scale
Deprecated on 2024-04-10 (version 0.9.1.9003)
and no longer used. Now, promise()
always turns on auto-scaling
in a private later
loop (if not already activated).
throttle
Deprecated on 2024-04-10 (version 0.9.1.9003)
and no longer used. Now, promise()
always turns on auto-scaling
in a private later
loop (if not already activated).
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
Please be aware that pop()
or collect()
will happen
asynchronously at a some unpredictable time after the promise object
is created, even if your local R process appears to be doing
something completely different. This behavior is highly desirable
in a Shiny reactive context, but please be careful as it may be
surprising in other situations.
A promises::promise()
object whose eventual value will
be a tibble
with results from one or more popped tasks.
If mode = "one"
, only one task is popped and returned (one row).
If mode = "all"
, then all the tasks are returned in a tibble
with one row per task (or NULL
is returned if there are no
tasks to pop).
wait()
Wait for tasks.
crew_class_controller$wait( mode = "all", seconds_interval = 0.5, seconds_timeout = Inf, scale = TRUE, throttle = TRUE, controllers = NULL )
mode
Character of length 1: "all"
to wait for all tasks to
complete, "one"
to wait for a single task to complete.
seconds_interval
Number of seconds to interrupt the wait in order to scale up workers as needed.
seconds_timeout
Timeout length in seconds waiting for tasks.
scale
Logical, whether to automatically call scale()
to auto-scale workers to meet the demand of the task load.
See also the throttle
argument.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
The wait()
method blocks the calling R session and
repeatedly auto-scales workers for tasks that need them.
The function runs until it either times out or the condition
in mode
is met.
A logical of length 1, invisibly. TRUE
if the condition
in mode
was met, FALSE
otherwise.
push_backlog()
Push the name of a task to the backlog.
crew_class_controller$push_backlog(name, controller = NULL)
name
Character of length 1 with the task name to push to the backlog.
controller
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
pop_backlog()
pops the tasks that can be pushed
without saturating the controller.
NULL
(invisibly).
pop_backlog()
Pop the task names from the head of the backlog which can be pushed without saturating the controller.
crew_class_controller$pop_backlog(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
Character vector of task names which can be pushed to the
controller without saturating it. If the controller is saturated,
character(0L)
is returned.
summary()
Summarize the workers and tasks of the controller.
crew_class_controller$summary(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
A data frame of summary statistics on the workers and tasks. It has one row per worker websocket and the following columns:
controller
: name of the controller.
. * worker
: integer index of the worker.
tasks
: number of tasks which were completed by
a worker at the websocket and then returned by calling
pop()
on the controller object.
seconds
: total number of runtime and seconds of
all the tasks that ran on a worker connected to this websocket
and then were retrieved by calling pop()
on the controller
object.
errors
: total number of tasks which ran on a worker
at the website, encountered an error in R, and then retrieved
with pop()
.
warnings
: total number of tasks which ran on a worker
at the website, encountered one or more warnings in R,
and then retrieved with pop()
. Note: warnings
is actually the number of tasks, not the number of warnings.
(A task could throw more than one warning.
cancel()
Cancel one or more tasks.
crew_class_controller$cancel(names = character(0L), all = FALSE)
names
Character vector of names of tasks to cancel.
Those names must have been manually supplied by push()
.
all
TRUE
to cancel all tasks, FALSE
otherwise.
all = TRUE
supersedes the names
argument.
pids()
Get the process IDs of the local process and the
mirai
dispatcher (if started).
crew_class_controller$pids(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
An integer vector of process IDs of the local process and the
mirai
dispatcher (if started).
terminate()
Terminate the workers and the mirai
client.
crew_class_controller$terminate(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
NULL
(invisibly).
Other controller:
crew_controller()
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() launcher <- crew_launcher_local() controller <- crew_controller(client = client, launcher = launcher) controller$start() controller$push(name = "task", command = sqrt(4)) controller$wait() controller$pop() controller$terminate() } ## ------------------------------------------------ ## Method `crew_class_controller$new` ## ------------------------------------------------ if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() launcher <- crew_launcher_local() controller <- crew_controller(client = client, launcher = launcher) controller$start() controller$push(name = "task", command = sqrt(4)) controller$wait() controller$pop() controller$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() launcher <- crew_launcher_local() controller <- crew_controller(client = client, launcher = launcher) controller$start() controller$push(name = "task", command = sqrt(4)) controller$wait() controller$pop() controller$terminate() } ## ------------------------------------------------ ## Method `crew_class_controller$new` ## ------------------------------------------------ if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() launcher <- crew_launcher_local() controller <- crew_controller(client = client, launcher = launcher) controller$start() controller$push(name = "task", command = sqrt(4)) controller$wait() controller$pop() controller$terminate() }
R6
class for controller groups.
controllers
List of R6
controller objects.
relay
Relay object for event-driven programming on a downstream condition variable.
new()
Multi-controller constructor.
crew_class_controller_group$new(controllers = NULL, relay = NULL)
controllers
List of R6
controller objects.
relay
Relay object for event-driven programming on a downstream condition variable.
An R6
object with the controller group object.
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { persistent <- crew_controller_local(name = "persistent") transient <- crew_controller_local( name = "transient", tasks_max = 1L ) group <- crew_controller_group(persistent, transient) group$start() group$push(name = "task", command = sqrt(4), controller = "transient") group$wait() group$pop() group$terminate() }
validate()
Validate the client.
crew_class_controller_group$validate()
NULL
(invisibly).
empty()
See if the controllers are empty.
crew_class_controller_group$empty(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
A controller is empty if it has no running tasks
or completed tasks waiting to be retrieved with push()
.
TRUE
if all the selected controllers are empty,
FALSE
otherwise.
nonempty()
Check if the controller group is nonempty.
crew_class_controller_group$nonempty(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
A controller is empty if it has no running tasks
or completed tasks waiting to be retrieved with push()
.
TRUE
if the controller is empty, FALSE
otherwise.
resolved()
Number of resolved mirai()
tasks.
crew_class_controller_group$resolved(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
resolved()
is cumulative: it counts all the resolved
tasks over the entire lifetime of the controller session.
Non-negative integer of length 1,
number of resolved mirai()
tasks.
The return value is 0 if the condition variable does not exist
(i.e. if the client is not running).
unresolved()
Number of unresolved mirai()
tasks.
crew_class_controller_group$unresolved(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
Non-negative integer of length 1,
number of unresolved mirai()
tasks.
unpopped()
Number of resolved mirai()
tasks available via pop()
.
crew_class_controller_group$unpopped(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
Non-negative integer of length 1,
number of resolved mirai()
tasks available via pop()
.
saturated()
Check if a controller is saturated.
crew_class_controller_group$saturated( collect = NULL, throttle = NULL, controller = NULL )
collect
Deprecated in version 0.5.0.9003 (2023-10-02). Not used.
throttle
Deprecated in version 0.5.0.9003 (2023-10-02). Not used.
controller
Character vector of length 1 with the controller name.
Set to NULL
to select the default controller that push()
would choose.
A controller is saturated if the number of unresolved tasks
is greater than or equal to the maximum number of workers.
In other words, in a saturated controller, every available worker
has a task.
You can still push tasks to a saturated controller, but
tools that use crew
such as targets
may choose not to.
TRUE
if all the selected controllers are saturated,
FALSE
otherwise.
start()
Start one or more controllers.
crew_class_controller_group$start(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
NULL
(invisibly).
started()
Check whether all the given controllers are started.
crew_class_controller_group$started(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
Actually checks whether all the given clients are started.
TRUE
if the controllers are started, FALSE
if any are not.
launch()
Launch one or more workers on one or more controllers.
crew_class_controller_group$launch(n = 1L, controllers = NULL)
n
Number of workers to launch in each controller selected.
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
NULL
(invisibly).
scale()
Automatically scale up the number of workers if needed in one or more controller objects.
crew_class_controller_group$scale(throttle = TRUE, controllers = NULL)
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
See the scale()
method in individual controller classes.
NULL
(invisibly).
autoscale()
Run worker auto-scaling in a private later
loop
every controller$client$seconds_interval
seconds.
crew_class_controller_group$autoscale(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
NULL
(invisibly).
descale()
Terminate the auto-scaling loop started by
controller$autoscale()
.
crew_class_controller_group$descale(controllers = NULL)
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
NULL
(invisibly).
push()
Push a task to the head of the task list.
crew_class_controller_group$push( command, data = list(), globals = list(), substitute = TRUE, seed = NULL, algorithm = NULL, packages = character(0), library = NULL, seconds_timeout = NULL, scale = TRUE, throttle = TRUE, name = NA_character_, save_command = FALSE, controller = NULL )
command
Language object with R code to run.
data
Named list of local data objects in the evaluation environment.
globals
Named list of objects to temporarily assign to the
global environment for the task. See the reset_globals
argument of crew_controller_local()
.
substitute
Logical of length 1, whether to call
base::substitute()
on the supplied value of the
command
argument. If TRUE
(default) then command
is quoted
literally as you write it, e.g.
push(command = your_function_call())
. If FALSE
, then crew
assumes command
is a language object and you are passing its
value, e.g. push(command = quote(your_function_call()))
.
substitute = TRUE
is appropriate for interactive use,
whereas substitute = FALSE
is meant for automated R programs
that invoke crew
controllers.
seed
Integer of length 1 with the pseudo-random number generator
seed to set for the evaluation of the task. Passed to the
seed
argument of set.seed()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
algorithm
Integer of length 1 with the pseudo-random number
generator algorithm to set for the evaluation of the task.
Passed to the kind
argument of RNGkind()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
recommended widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
packages
Character vector of packages to load for the task.
library
Library path to load the packages. See the lib.loc
argument of require()
.
seconds_timeout
Optional task timeout passed to the .timeout
argument of mirai::mirai()
(after converting to milliseconds).
scale
Logical, whether to automatically scale workers to meet
demand. See the scale
argument of the push()
method of
ordinary single controllers.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
name
Optional name of the task. Replaced with a random name
if NULL
or in conflict with an existing name in the task list.
save_command
Logical of length 1. If TRUE
, the controller
deparses the command and returns it with the output on pop()
.
If FALSE
(default), the controller skips this step to
increase speed.
controller
Character of length 1,
name of the controller to submit the task.
If NULL
, the controller defaults to the
first controller in the list.
Invisibly return the mirai
object of the pushed task.
This allows you to interact with the task directly, e.g.
to create a promise object with promises::as.promise()
.
walk()
Apply a single command to multiple inputs, and return control to the user without waiting for any task to complete.
crew_class_controller_group$walk( command, iterate, data = list(), globals = list(), substitute = TRUE, seed = NULL, algorithm = NULL, packages = character(0), library = NULL, seconds_timeout = NULL, names = NULL, save_command = FALSE, scale = TRUE, throttle = TRUE, controller = NULL )
command
Language object with R code to run.
iterate
Named list of vectors or lists to iterate over.
For example, to run function calls
f(x = 1, y = "a")
and f(x = 2, y = "b")
,
set command
to f(x, y)
, and set iterate
to
list(x = c(1, 2), y = c("a", "b"))
. The individual
function calls are evaluated as
f(x = iterate$x[[1]], y = iterate$y[[1]])
and
f(x = iterate$x[[2]], y = iterate$y[[2]])
.
All the elements of iterate
must have the same length.
If there are any name conflicts between iterate
and data
,
iterate
takes precedence.
data
Named list of constant local data objects in the evaluation environment. Objects in this list are treated as single values and are held constant for each iteration of the map.
globals
Named list of constant objects to temporarily
assign to the global environment for each task. This list should
include any functions you previously defined in the global
environment which are required to run tasks.
See the reset_globals
argument of crew_controller_local()
.
Objects in this list are treated as single
values and are held constant for each iteration of the map.
substitute
Logical of length 1, whether to call
base::substitute()
on the supplied value of the
command
argument. If TRUE
(default) then command
is quoted
literally as you write it, e.g.
push(command = your_function_call())
. If FALSE
, then crew
assumes command
is a language object and you are passing its
value, e.g. push(command = quote(your_function_call()))
.
substitute = TRUE
is appropriate for interactive use,
whereas substitute = FALSE
is meant for automated R programs
that invoke crew
controllers.
seed
Integer of length 1 with the pseudo-random number generator
seed to set for the evaluation of the task. Passed to the
seed
argument of set.seed()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
recommended widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
algorithm
Integer of length 1 with the pseudo-random number
generator algorithm to set for the evaluation of the task.
Passed to the kind
argument of RNGkind()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
recommended widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
packages
Character vector of packages to load for the task.
library
Library path to load the packages. See the lib.loc
argument of require()
.
seconds_timeout
Optional task timeout passed to the .timeout
argument of mirai::mirai()
(after converting to milliseconds).
names
Optional character of length 1, name of the element of
iterate
with names for the tasks. If names
is supplied,
then iterate[[names]]
must be a character vector.
save_command
Logical of length 1, whether to store a text string version of the R command in the output.
scale
Logical, whether to automatically scale workers to meet
demand. See also the throttle
argument.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
controller
Character of length 1,
name of the controller to submit the tasks.
If NULL
, the controller defaults to the
first controller in the list.
In contrast to walk()
, map()
blocks the local R session
and waits for all tasks to complete.
Invisibly returns a list of mirai
task objects for the
newly created tasks. The order of tasks in the list matches the
order of data in the iterate
argument.
map()
Apply a single command to multiple inputs.
crew_class_controller_group$map( command, iterate, data = list(), globals = list(), substitute = TRUE, seed = NULL, algorithm = NULL, packages = character(0), library = NULL, seconds_interval = 0.5, seconds_timeout = NULL, names = NULL, save_command = FALSE, error = "stop", warnings = TRUE, verbose = interactive(), scale = TRUE, throttle = TRUE, controller = NULL )
command
Language object with R code to run.
iterate
Named list of vectors or lists to iterate over.
For example, to run function calls
f(x = 1, y = "a")
and f(x = 2, y = "b")
,
set command
to f(x, y)
, and set iterate
to
list(x = c(1, 2), y = c("a", "b"))
. The individual
function calls are evaluated as
f(x = iterate$x[[1]], y = iterate$y[[1]])
and
f(x = iterate$x[[2]], y = iterate$y[[2]])
.
All the elements of iterate
must have the same length.
If there are any name conflicts between iterate
and data
,
iterate
takes precedence.
data
Named list of constant local data objects in the evaluation environment. Objects in this list are treated as single values and are held constant for each iteration of the map.
globals
Named list of constant objects to temporarily
assign to the global environment for each task. This list should
include any functions you previously defined in the global
environment which are required to run tasks.
See the reset_globals
argument of crew_controller_local()
.
Objects in this list are treated as single
values and are held constant for each iteration of the map.
substitute
Logical of length 1, whether to call
base::substitute()
on the supplied value of the
command
argument. If TRUE
(default) then command
is quoted
literally as you write it, e.g.
push(command = your_function_call())
. If FALSE
, then crew
assumes command
is a language object and you are passing its
value, e.g. push(command = quote(your_function_call()))
.
substitute = TRUE
is appropriate for interactive use,
whereas substitute = FALSE
is meant for automated R programs
that invoke crew
controllers.
seed
Integer of length 1 with the pseudo-random number generator
seed to set for the evaluation of the task. Passed to the
seed
argument of set.seed()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
recommended widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
algorithm
Integer of length 1 with the pseudo-random number
generator algorithm to set for the evaluation of the task.
Passed to the kind
argument of RNGkind()
if not NULL
.
If algorithm
and seed
are both NULL
,
then the random number generator defaults to the
recommended widely spaced worker-specific
L'Ecuyer streams as supported by mirai::nextstream()
.
See vignette("parallel", package = "parallel")
for details.
packages
Character vector of packages to load for the task.
library
Library path to load the packages. See the lib.loc
argument of require()
.
seconds_interval
Number of seconds to wait between auto-scaling operations while waiting for tasks to complete.
seconds_timeout
Optional task timeout passed to the .timeout
argument of mirai::mirai()
(after converting to milliseconds).
names
Optional character of length 1, name of the element of
iterate
with names for the tasks. If names
is supplied,
then iterate[[names]]
must be a character vector.
save_command
Logical of length 1, whether to store a text string version of the R command in the output.
error
Character vector of length 1, choice of action if a task has an error. Possible values:
"stop"
: throw an error in the main R session instead of returning
a value. In case of an error, the results from the last errored
map()
are in the error
field
of the controller, e.g. controller_object$error
. To reduce
memory consumption, set controller_object$error <- NULL
after
you are finished troubleshooting.
"warn"
: throw a warning. This allows the return value with
all the error messages and tracebacks to be generated.
"silent"
: do nothing special.
warnings
Logical of length 1, whether to throw a warning in the interactive session if at least one task encounters an error.
verbose
Logical of length 1, whether to print progress messages.
scale
Logical, whether to automatically scale workers to meet
demand. See also the throttle
argument.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
controller
Character of length 1,
name of the controller to submit the tasks.
If NULL
, the controller defaults to the
first controller in the list.
The idea comes from functional programming: for example,
the map()
function from the purrr
package.
A tibble
of results and metadata: one row per task and
columns corresponding to the output of pop()
.
pop()
Pop a completed task from the results data frame.
crew_class_controller_group$pop( scale = TRUE, collect = NULL, throttle = TRUE, error = NULL, controllers = NULL )
scale
Logical, whether to automatically scale workers to meet
demand. See the scale
argument of the pop()
method of
ordinary single controllers.
collect
Deprecated in version 0.5.0.9003 (2023-10-02). Not used.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
error
NULL
or character of length 1, choice of action if
the popped task threw an error. Possible values:
"stop"
: throw an error in the main R session instead of returning
a value.
"warn"
: throw a warning.
NULL
or "silent"
: do not react to errors.
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
If there is no task to collect, return NULL
. Otherwise,
return a one-row tibble
with the same columns as pop()
for ordinary controllers.
collect()
Pop all available task results and return them in a tidy
tibble
.
crew_class_controller_group$collect( scale = TRUE, throttle = TRUE, error = NULL, controllers = NULL )
scale
Logical of length 1,
whether to automatically call scale()
to auto-scale workers to meet the demand of the task load.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
error
NULL
or character of length 1, choice of action if
the popped task threw an error. Possible values:
"stop"
: throw an error in the main R session instead of returning
a value.
"warn"
: throw a warning.
NULL
or "silent"
: do not react to errors.
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
A tibble
of results and metadata of all resolved tasks,
with one row per task. Returns NULL
if there are no available
results.
promise()
Create a promises::promise()
object to asynchronously
pop or collect one or more tasks.
crew_class_controller_group$promise( mode = "one", seconds_interval = 0.1, scale = NULL, throttle = NULL, controllers = NULL )
mode
Character of length 1, what kind of promise to create.
mode
must be "one"
or "all"
. Details:
If mode
is "one"
, then the promise is fulfilled (or rejected)
when at least one task is resolved and available to pop()
.
When that happens, pop()
runs asynchronously, pops a result off
the task list, and returns a value.
If the task succeeded, then the promise
is fulfilled and its value is the result of pop()
(a one-row
tibble
with the result and metadata). If the task threw an error,
the error message of the task is forwarded to any error callbacks
registered with the promise.
If mode
is "all"
, then the promise is fulfilled (or rejected)
when there are no unresolved tasks left in the controller.
(Be careful: this condition is trivially met in the moment
if the controller is empty and you have not submitted any tasks,
so it is best to create this kind of promise only after you
submit tasks.)
When there are no unresolved tasks left,
collect()
runs asynchronously, pops all available results
off the task list, and returns a value.
If the task succeeded, then the promise
is fulfilled and its value is the result of collect()
(a tibble
with one row per task result). If any of the tasks
threw an error, then the first error message detected is forwarded
to any error callbacks registered with the promise.
seconds_interval
Positive numeric of length 1, delay in the
later::later()
polling interval to asynchronously check if
the promise can be resolved.
scale
Deprecated on 2024-04-10 (version 0.9.1.9003)
and no longer used. Now, promise()
always turns on auto-scaling
in a private later
loop (if not already activated).
throttle
Deprecated on 2024-04-10 (version 0.9.1.9003)
and no longer used. Now, promise()
always turns on auto-scaling
in a private later
loop (if not already activated).
controllers
Not used. Included to ensure the signature is compatible with the analogous method of controller groups.
Please be aware that pop()
or collect()
will happen
asynchronously at a some unpredictable time after the promise object
is created, even if your local R process appears to be doing
something completely different. This behavior is highly desirable
in a Shiny reactive context, but please be careful as it may be
surprising in other situations.
A promises::promise()
object whose eventual value will
be a tibble
with results from one or more popped tasks.
If mode = "one"
, only one task is popped and returned (one row).
If mode = "all"
, then all the tasks are returned in a tibble
with one row per task (or NULL
is returned if there are no
tasks to pop).
wait()
Wait for tasks.
crew_class_controller_group$wait( mode = "all", seconds_interval = 0.5, seconds_timeout = Inf, scale = TRUE, throttle = TRUE, controllers = NULL )
mode
Character of length 1: "all"
to wait for
all tasks in all controllers to complete, "one"
to wait for
a single task in a single controller to complete. In this scheme,
the timeout limit is applied to each controller sequentially,
and a timeout is treated the same as a completed controller.
seconds_interval
Number of seconds to interrupt the wait in order to scale up workers as needed.
seconds_timeout
Timeout length in seconds waiting for results to become available.
scale
Logical of length 1, whether to call scale_later()
on each selected controller to schedule auto-scaling.
See the scale
argument of the wait()
method of
ordinary single controllers.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
The wait()
method blocks the calling R session and
repeatedly auto-scales workers for tasks that need them.
The function runs until it either times out or the condition
in mode
is met.
A logical of length 1, invisibly. TRUE
if the condition
in mode
was met, FALSE
otherwise.
push_backlog()
Push the name of a task to the backlog.
crew_class_controller_group$push_backlog(name, controller = NULL)
name
Character of length 1 with the task name to push to the backlog.
controller
Character vector of length 1 with the controller name.
Set to NULL
to select the default controller that push_backlog()
would choose.
pop_backlog()
pops the tasks that can be pushed
without saturating the controller.
NULL
(invisibly).
pop_backlog()
Pop the task names from the head of the backlog which can be pushed without saturating the controller.
crew_class_controller_group$pop_backlog(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
Character vector of task names which can be pushed to the
controller without saturating it. If the controller is saturated,
character(0L)
is returned.
summary()
Summarize the workers of one or more controllers.
crew_class_controller_group$summary(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
A data frame of aggregated worker summary statistics
of all the selected controllers. It has one row per worker,
and the rows are grouped by controller.
See the documentation of the summary()
method of the controller
class for specific information about the columns in the output.
pids()
Get the process IDs of the local process and the
mirai
dispatchers (if started).
crew_class_controller_group$pids(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
An integer vector of process IDs of the local process and the
mirai
dispatcher (if started).
terminate()
Terminate the workers and disconnect the client for one or more controllers.
crew_class_controller_group$terminate(controllers = NULL)
controllers
Character vector of controller names.
Set to NULL
to select all controllers.
NULL
(invisibly).
Other controller_group:
crew_controller_group()
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { persistent <- crew_controller_local(name = "persistent") transient <- crew_controller_local( name = "transient", tasks_max = 1L ) group <- crew_controller_group(persistent, transient) group$start() group$push(name = "task", command = sqrt(4), controller = "transient") group$wait() group$pop() group$terminate() } ## ------------------------------------------------ ## Method `crew_class_controller_group$new` ## ------------------------------------------------ if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { persistent <- crew_controller_local(name = "persistent") transient <- crew_controller_local( name = "transient", tasks_max = 1L ) group <- crew_controller_group(persistent, transient) group$start() group$push(name = "task", command = sqrt(4), controller = "transient") group$wait() group$pop() group$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { persistent <- crew_controller_local(name = "persistent") transient <- crew_controller_local( name = "transient", tasks_max = 1L ) group <- crew_controller_group(persistent, transient) group$start() group$push(name = "task", command = sqrt(4), controller = "transient") group$wait() group$pop() group$terminate() } ## ------------------------------------------------ ## Method `crew_class_controller_group$new` ## ------------------------------------------------ if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { persistent <- crew_controller_local(name = "persistent") transient <- crew_controller_local( name = "transient", tasks_max = 1L ) group <- crew_controller_group(persistent, transient) group$start() group$push(name = "task", command = sqrt(4), controller = "transient") group$wait() group$pop() group$terminate() }
R6
abstract class to build other subclasses
which launch and manage workers.
workers
Data frame of worker information.
name
Name of the launcher.
seconds_interval
See crew_launcher()
.
seconds_timeout
See crew_launcher()
.
seconds_launch
See crew_launcher()
.
seconds_idle
See crew_launcher()
.
seconds_wall
See crew_launcher()
.
tasks_max
See crew_launcher()
.
tasks_timers
See crew_launcher()
.
reset_globals
See crew_launcher()
.
reset_packages
See crew_launcher()
.
reset_options
See crew_launcher()
.
garbage_collection
See crew_launcher()
.
crashes_error
See crew_launcher()
.
tls
See crew_launcher()
.
processes
See crew_launcher()
.
asynchronously.
r_arguments
See crew_launcher()
.
options_metrics
See crew_launcher()
.
async
A crew_async()
object to run low-level launcher tasks
asynchronously.
throttle
A crew_throttle()
object to throttle scaling.
new()
Launcher constructor.
crew_class_launcher$new( name = NULL, seconds_interval = NULL, seconds_timeout = NULL, seconds_launch = NULL, seconds_idle = NULL, seconds_wall = NULL, seconds_exit = NULL, tasks_max = NULL, tasks_timers = NULL, reset_globals = NULL, reset_packages = NULL, reset_options = NULL, garbage_collection = NULL, crashes_error = NULL, launch_max = NULL, tls = NULL, processes = NULL, r_arguments = NULL, options_metrics = NULL )
name
See crew_launcher()
.
seconds_interval
See crew_launcher()
.
seconds_timeout
See crew_launcher()
.
seconds_launch
See crew_launcher()
.
seconds_idle
See crew_launcher()
.
seconds_wall
See crew_launcher()
.
seconds_exit
See crew_launcher()
.
tasks_max
See crew_launcher()
.
tasks_timers
See crew_launcher()
.
reset_globals
See crew_launcher()
.
reset_packages
See crew_launcher()
.
reset_options
See crew_launcher()
.
garbage_collection
See crew_launcher()
.
crashes_error
See crew_launcher()
.
launch_max
Deprecated.
tls
See crew_launcher()
.
processes
See crew_launcher()
.
r_arguments
See crew_launcher()
.
options_metrics
See crew_launcher()
.
An R6
object with the launcher.
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(workers = client$workers) launcher$launch(index = 1L) m <- mirai::mirai("result", .compute = client$name) Sys.sleep(0.25) m$data client$terminate() }
validate()
Validate the launcher.
crew_class_launcher$validate()
NULL
(invisibly).
set_name()
Set the name of the launcher.
crew_class_launcher$set_name(name)
name
Character of length 1, name to set for the launcher.
NULL
(invisibly).
settings()
List of arguments for mirai::daemon()
.
crew_class_launcher$settings(socket)
socket
Character of length 1, websocket address of the worker to launch.
List of arguments for mirai::daemon()
.
call()
Create a call to crew_worker()
to
help create custom launchers.
crew_class_launcher$call(socket, launcher, worker, instance)
socket
Socket where the worker will receive tasks.
launcher
Character of length 1, name of the launcher.
worker
Positive integer of length 1, index of the worker. This worker index remains the same even when the current instance of the worker exits and a new instance launches.
instance
Character of length 1 to uniquely identify the instance of the worker.
Character of length 1 with a call to crew_worker()
.
launcher <- crew_launcher_local() launcher$call( socket = "ws://127.0.0.1:5000/3/cba033e58", launcher = "launcher_a", worker = 3L, instance = "cba033e58" )
start()
Start the launcher.
crew_class_launcher$start(sockets = NULL)
sockets
For testing purposes only.
Creates the workers data frame. Meant to be called once at the beginning of the launcher life cycle, after the client has started.
NULL
(invisibly).
terminate()
Terminate the whole launcher, including all workers.
crew_class_launcher$terminate()
NULL
(invisibly).
summary()
Summarize the workers.
crew_class_launcher$summary()
NULL
if the launcher is not started. Otherwise, a tibble
with one row per crew
worker and the following columns:
worker
: integer index of the worker.
launches
: number of times the worker was launched. Each launch
occurs at a different websocket because the token at the end of the
URL is rotated before each new launch.
online
: logical vector, whether the current instance of each
worker was actively connected to its NNG socket during the time of
the last call to tally()
.
discovered
: logical vector, whether the current instance of each
worker had connected to its NNG socket at some point
(and then possibly disconnected) during the time of
the last call to tally()
.
assigned
: cumulative number of tasks assigned, reported by
mirai::daemons()
and summed over all
completed instances of the worker. Does not reflect the activity
of the currently running instance of the worker.
complete
: cumulative number of tasks completed, reported by
mirai::daemons()
and summed over all
completed instances of the worker. Does not reflect the activity
of the currently running instance of the worker.
crashes
: number of consecutive times a worker
launched without completing all its assigned tasks.
tally()
Update the daemons
-related columns of the internal
workers
data frame.
crew_class_launcher$tally(daemons = NULL)
daemons
mirai
daemons matrix. For testing only. Users
should not set this.
NULL
(invisibly).
unlaunched()
Get indexes of unlaunched workers.
crew_class_launcher$unlaunched(n = Inf)
n
Maximum number of worker indexes to return.
A worker is "unlaunched" if it has never connected
to the current instance of its websocket. Once a worker
launches with the launch()
method, it is considered "launched"
until it disconnects and its websocket is rotated with rotate()
.
Integer index of workers available for launch. The backlogged workers are listed first. A worker is backlogged if it is assigned more tasks than it completed.
booting()
Get workers that may still be booting up.
crew_class_launcher$booting()
A worker is "booting" if its launch time is within the last
seconds_launch
seconds. seconds_launch
is a configurable grace
period when crew
allows a worker to start up and connect to the
mirai
dispatcher. The booting()
function does not know about the
actual worker connection status, it just knows about launch times,
so it may return TRUE
for workers that have already connected
and started doing tasks.
active()
Get active workers.
crew_class_launcher$active()
A worker is "active" if its current instance is online and
connected, or if it is within its booting time window
and has never connected.
In other words, "active" means online | (!discovered & booting)
.
Logical vector with TRUE
for active workers and FALSE
for
inactive ones.
done()
Get done workers.
crew_class_launcher$done()
A worker is "done" if it is launched and inactive.
A worker is "launched" if launch()
was called
and the worker websocket has not been rotated since.
Integer index of inactive workers.
rotate()
crew_class_launcher$rotate()
Rotate websockets at all unlaunched workers and
throw an error if a worker launched at least crashes_error
times in a row without completing all its assigned tasks.
NULL
(invisibly).
launch()
Launch a worker.
crew_class_launcher$launch(index)
index
Positive integer of length 1, index of the worker to launch.
NULL
(invisibly).
forward()
Forward an asynchronous launch/termination error condition of a worker.
crew_class_launcher$forward(index, condition = "error")
index
Integer of length 1, index of the worker to inspect.
condition
Character of length 1 indicating what to do
with an error if found. "error"
to throw an error,
"warning"
to throw a warning,
"message"
to print a message,
and "character"
to return a character vector of specific
task-level error messages.
The return value is NULL
if no error is found.
Throw an error, throw a warning, or return a character string,
depending on the condition
argument.
errors()
Collect and return the most recent error messages from asynchronous worker launching and termination.
crew_class_launcher$errors()
Character vector of all the most recent error messages
from asynchronous worker launching and termination. NULL
if there are no errors.
wait()
Wait for any local asynchronous launch or termination tasks to complete.
crew_class_launcher$wait()
Only relevant if processes
is a positive integer.
NULL
(invisibly).
scale()
Auto-scale workers out to meet the demand of tasks.
crew_class_launcher$scale(demand, throttle = TRUE)
demand
Number of unresolved tasks.
throttle
TRUE
to skip auto-scaling if it already happened
within the last seconds_interval
seconds. FALSE
to auto-scale
every time scale()
is called. Throttling avoids
overburdening the mirai
dispatcher and other resources.
NULL
(invisibly)
launch_worker()
Abstract worker launch method.
crew_class_launcher$launch_worker(call, name, launcher, worker, instance)
call
Character of length 1 with a namespaced call to
crew_worker()
which will run in the worker and accept tasks.
name
Character of length 1 with an informative worker name.
launcher
Character of length 1, name of the launcher.
worker
Positive integer of length 1, index of the worker. This worker index remains the same even when the current instance of the worker exits and a new instance launches. It is always between 1 and the maximum number of concurrent workers.
instance
Character of length 1 to uniquely identify the current instance of the worker a the index in the launcher.
Launcher plugins will overwrite this method.
A handle to mock the worker launch.
crashes()
Return the number of consecutive times a worker launched without completing all its assigned tasks.
crew_class_launcher$crashes(index)
index
Non-negative integer, index of the worker pointing
to a row of the data frame output of the summary()
method
of the launcher.
Non-negative integer, number of consecutive times a worker launched without completing all its assigned tasks.
terminate_worker()
Abstract worker termination method.
crew_class_launcher$terminate_worker(handle)
handle
A handle object previously
returned by launch_worker()
which allows the termination
of the worker.
Launcher plugins will overwrite this method.
A handle to mock worker termination.
terminate_workers()
Terminate one or more workers.
crew_class_launcher$terminate_workers(index = NULL)
index
Integer vector of the indexes of the workers
to terminate. If NULL
, all current workers are terminated.
NULL
(invisibly).
Other launcher:
crew_launcher()
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(workers = client$workers) launcher$launch(index = 1L) m <- mirai::mirai("result", .compute = client$name) Sys.sleep(0.25) m$data client$terminate() } ## ------------------------------------------------ ## Method `crew_class_launcher$new` ## ------------------------------------------------ if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(workers = client$workers) launcher$launch(index = 1L) m <- mirai::mirai("result", .compute = client$name) Sys.sleep(0.25) m$data client$terminate() } ## ------------------------------------------------ ## Method `crew_class_launcher$call` ## ------------------------------------------------ launcher <- crew_launcher_local() launcher$call( socket = "ws://127.0.0.1:5000/3/cba033e58", launcher = "launcher_a", worker = 3L, instance = "cba033e58" )
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(workers = client$workers) launcher$launch(index = 1L) m <- mirai::mirai("result", .compute = client$name) Sys.sleep(0.25) m$data client$terminate() } ## ------------------------------------------------ ## Method `crew_class_launcher$new` ## ------------------------------------------------ if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(workers = client$workers) launcher$launch(index = 1L) m <- mirai::mirai("result", .compute = client$name) Sys.sleep(0.25) m$data client$terminate() } ## ------------------------------------------------ ## Method `crew_class_launcher$call` ## ------------------------------------------------ launcher <- crew_launcher_local() launcher$call( socket = "ws://127.0.0.1:5000/3/cba033e58", launcher = "launcher_a", worker = 3L, instance = "cba033e58" )
R6
class to launch and manage local process workers.
crew::crew_class_launcher
-> crew_class_launcher_local
options_local
crew::crew_class_launcher$active()
crew::crew_class_launcher$booting()
crew::crew_class_launcher$call()
crew::crew_class_launcher$crashes()
crew::crew_class_launcher$done()
crew::crew_class_launcher$errors()
crew::crew_class_launcher$forward()
crew::crew_class_launcher$launch()
crew::crew_class_launcher$rotate()
crew::crew_class_launcher$scale()
crew::crew_class_launcher$set_name()
crew::crew_class_launcher$settings()
crew::crew_class_launcher$start()
crew::crew_class_launcher$summary()
crew::crew_class_launcher$tally()
crew::crew_class_launcher$terminate()
crew::crew_class_launcher$terminate_workers()
crew::crew_class_launcher$unlaunched()
crew::crew_class_launcher$wait()
new()
Local launcher constructor.
crew_class_launcher_local$new( name = NULL, seconds_interval = NULL, seconds_timeout = NULL, seconds_launch = NULL, seconds_idle = NULL, seconds_wall = NULL, seconds_exit = NULL, tasks_max = NULL, tasks_timers = NULL, reset_globals = NULL, reset_packages = NULL, reset_options = NULL, garbage_collection = NULL, crashes_error = NULL, tls = NULL, processes = NULL, r_arguments = NULL, options_metrics = NULL, options_local = NULL )
name
See crew_launcher()
.
seconds_interval
See crew_launcher()
.
seconds_timeout
See crew_launcher()
.
seconds_launch
See crew_launcher()
.
seconds_idle
See crew_launcher()
.
seconds_wall
See crew_launcher()
.
seconds_exit
See crew_launcher()
.
tasks_max
See crew_launcher()
.
tasks_timers
See crew_launcher()
.
reset_globals
See crew_launcher()
.
reset_packages
See crew_launcher()
.
reset_options
See crew_launcher()
.
garbage_collection
See crew_launcher()
.
crashes_error
See crew_launcher()
.
tls
See crew_launcher()
.
processes
See crew_launcher()
.
r_arguments
See crew_launcher()
.
options_metrics
options_local
An R6
object with the local launcher.
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(sockets = client$summary()$socket) launcher$launch(index = 1L) task <- mirai::mirai("result", .compute = client$name) mirai::call_mirai_(task) task$data client$terminate() }
validate()
Validate the local launcher.
crew_class_launcher_local$validate()
NULL
(invisibly).
launch_worker()
Launch a local process worker which will dial into a socket.
crew_class_launcher_local$launch_worker(call, name, launcher, worker, instance)
call
Character of length 1 with a namespaced call to
crew_worker()
which will run in the worker and accept tasks.
name
Character of length 1 with a long informative worker name
which contains the launcher
, worker
, and instance
arguments
described below.
launcher
Character of length 1, name of the launcher.
worker
Positive integer of length 1, index of the worker. This worker index remains the same even when the current instance of the worker exits and a new instance launches. It is always between 1 and the maximum number of concurrent workers.
instance
Character of length 1 to uniquely identify the current instance of the worker a the index in the launcher.
The call
argument is R code that will run to
initiate the worker. Together, the launcher
, worker
,
and instance
arguments are useful for
constructing informative job names.
A handle object to allow the termination of the worker later on.
terminate_worker()
Terminate a local process worker.
crew_class_launcher_local$terminate_worker(handle)
handle
A process handle object previously
returned by launch_worker()
.
A list with the process ID of the worker.
Other plugin_local:
crew_controller_local()
,
crew_launcher_local()
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(sockets = client$summary()$socket) launcher$launch(index = 1L) task <- mirai::mirai("result", .compute = client$name) mirai::call_mirai_(task) task$data client$terminate() } ## ------------------------------------------------ ## Method `crew_class_launcher_local$new` ## ------------------------------------------------ if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(sockets = client$summary()$socket) launcher$launch(index = 1L) task <- mirai::mirai("result", .compute = client$name) mirai::call_mirai_(task) task$data client$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(sockets = client$summary()$socket) launcher$launch(index = 1L) task <- mirai::mirai("result", .compute = client$name) mirai::call_mirai_(task) task$data client$terminate() } ## ------------------------------------------------ ## Method `crew_class_launcher_local$new` ## ------------------------------------------------ if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(sockets = client$summary()$socket) launcher$launch(index = 1L) task <- mirai::mirai("result", .compute = client$name) mirai::call_mirai_(task) task$data client$terminate() }
Local monitor R6
class
See crew_monitor_local()
.
dispatchers()
List the process IDs of the running mirai
dispatcher
processes.
crew_class_monitor_local$dispatchers(user = ps::ps_username())
user
Character of length 1, user ID to filter on. NULL
to list processes of all users (not recommended).
Integer vector of process IDs of the running mirai
dispatcher
processes.
daemons()
List the process IDs of the locally running mirai
daemon
processes which are not crew
workers. The crew_async()
object can launch such processes: for example, when a positive
integer is supplied to the processes
argument of e.g.
crew.aws.batch::crew_controller_aws_batch()
.
crew_class_monitor_local$daemons(user = ps::ps_username())
user
Character of length 1, user ID to filter on. NULL
to list processes of all users (not recommended).
Integer vector of process IDs of the locally running
mirai
daemon processes which are not crew
workers.
workers()
List the process IDs of locally running crew
workers
launched by the local controller (crew_controller_local()
).
crew_class_monitor_local$workers(user = ps::ps_username())
user
Character of length 1, user ID to filter on. NULL
to list processes of all users (not recommended).
Only the workers running on your local computer are listed.
Workers that are not listed include jobs on job schedulers like
SLURM or jobs on cloud services like AWS Batch. To monitor
those worker processes, please consult the monitor objects in
the relevant third-party launcher plugins such as crew.cluster
and crew.aws.batch
.
Integer vector of process IDs of locally running crew
workers
launched by the local controller (crew_controller_local()
).
terminate()
Terminate the given process IDs.
crew_class_monitor_local$terminate(pids)
pids
Integer vector of process IDs of local processes to terminate.
Termination happens with the operating system signal
given by crew_terminate_signal()
.
NULL
(invisibly).
Other monitor:
crew_monitor_local()
R6
relay class.R6
class for relay configuration.
See crew_relay()
.
condition
Main condition variable.
from
Condition variable to relay from.
to
Condition variable to relay to.
validate()
Validate the object.
crew_class_relay$validate()
NULL
(invisibly).
start()
Start the relay object.
crew_class_relay$start()
NULL
(invisibly).
terminate()
Terminate the relay object.
crew_class_relay$terminate()
NULL
(invisibly).
set_from()
Set the condition variable to relay from.
crew_class_relay$set_from(from)
from
Condition variable to relay from.
NULL
(invisibly).
set_to()
Set the condition variable to relay to.
crew_class_relay$set_to(to)
to
Condition variable to relay to.
NULL
(invisibly).
wait()
Wait until an unobserved task resolves or the timeout is reached.
crew_class_relay$wait(seconds_timeout = 1000)
seconds_timeout
Positive numeric of length 1, Number of seconds to wait before timing out.
NULL
(invisibly).
Other relay:
crew_relay()
crew_relay()
crew_relay()
R6
throttle class.R6
class for throttle configuration.
See crew_throttle()
.
seconds_interval
Positive numeric of length 1, throttling interval in seconds.
polled
Positive numeric of length 1,
millisecond timestamp of the last time poll()
returned TRUE
.
NULL
if poll()
was never called on the current object.
new()
Throttle constructor.
crew_class_throttle$new(seconds_interval = NULL)
seconds_interval
Throttling interval in seconds.
An R6
object with throttle configuration.
throttle <- crew_throttle(seconds_interval = 0.5) throttle$poll() throttle$poll()
validate()
Validate the object.
crew_class_throttle$validate()
NULL
(invisibly).
poll()
Poll the throttler.
crew_class_throttle$poll()
TRUE
if poll()
did not return TRUE
in the last
seconds_interval
seconds, FALSE
otherwise.
reset()
Reset the throttle object so the next poll()
returns
TRUE
.
crew_class_throttle$reset()
NULL
(invisibly).
Other throttle:
crew_throttle()
throttle <- crew_throttle(seconds_interval = 0.5) throttle$poll() throttle$poll() ## ------------------------------------------------ ## Method `crew_class_throttle$new` ## ------------------------------------------------ throttle <- crew_throttle(seconds_interval = 0.5) throttle$poll() throttle$poll()
throttle <- crew_throttle(seconds_interval = 0.5) throttle$poll() throttle$poll() ## ------------------------------------------------ ## Method `crew_class_throttle$new` ## ------------------------------------------------ throttle <- crew_throttle(seconds_interval = 0.5) throttle$poll() throttle$poll()
R6
TLS class.R6
class for TLS configuration.
See crew_tls()
.
mode
See crew_tls()
.
key
See crew_tls()
.
password
See crew_tls()
.
certificates
See crew_tls()
.
new()
TLS configuration constructor.
crew_class_tls$new( mode = NULL, key = NULL, password = NULL, certificates = NULL )
mode
Argument passed from crew_tls()
.
key
Argument passed from crew_tls()
.
password
Argument passed from crew_tls()
.
certificates
Argument passed from crew_tls()
.
An R6
object with TLS configuration.
crew_tls(mode = "automatic")
validate()
Validate the object.
crew_class_tls$validate(test = TRUE)
test
Logical of length 1, whether to test the TLS configuration
with nanonext::tls_config()
.
NULL
(invisibly).
client()
TLS credentials for the crew
client.
crew_class_tls$client()
NULL
or character vector, depending on the mode.
worker()
TLS credentials for crew
workers.
crew_class_tls$worker(name)
name
Character of length 1 with the mirai
compute profile.
NULL
or character vector, depending on the mode.
Other tls:
crew_tls()
crew_tls(mode = "automatic") ## ------------------------------------------------ ## Method `crew_class_tls$new` ## ------------------------------------------------ crew_tls(mode = "automatic")
crew_tls(mode = "automatic") ## ------------------------------------------------ ## Method `crew_class_tls$new` ## ------------------------------------------------ crew_tls(mode = "automatic")
Terminate mirai
dispatchers and/or crew
workers
which may be lingering from previous workloads.
crew_clean( dispatchers = TRUE, workers = TRUE, user = ps::ps_username(), seconds_interval = 0.5, seconds_timeout = 60, verbose = TRUE )
crew_clean( dispatchers = TRUE, workers = TRUE, user = ps::ps_username(), seconds_interval = 0.5, seconds_timeout = 60, verbose = TRUE )
dispatchers |
Logical of length 1, whether to terminate dispatchers. |
workers |
Logical of length 1, whether to terminate workers. |
user |
Character of length 1. Terminate dispatchers and/or workers associated with this user name. |
seconds_interval |
Seconds to between polling intervals waiting for a process to exit. |
seconds_timeout |
Seconds to wait for a process to exit. |
verbose |
Logical of length 1, whether to print an informative message every time a process is terminated. |
Behind the scenes, mirai
uses an external R process
called a "dispatcher" to send tasks to crew
workers.
This dispatcher usually shuts down when you terminate the controller
or quit your R session, but sometimes it lingers. Likewise,
sometimes crew
workers do not shut down on their own.
The crew_clean()
function searches the process table on your
local machine and manually terminates any mirai
dispatchers
and crew
workers associated with your user name (or the
user name you select in the user
argument.
Unfortunately, it cannot reach remote workers such as those
launched by a crew.cluster
controller.
NULL
(invisibly). If verbose
is TRUE
, it does
print out a message for every terminated process.
Other utility:
crew_assert()
,
crew_deprecate()
,
crew_eval()
,
crew_random_name()
,
crew_retry()
,
crew_terminate_process()
,
crew_terminate_signal()
,
crew_worker()
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { crew_clean() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { crew_clean() }
Create an R6
wrapper object to manage the mirai
client.
crew_client( name = NULL, workers = 1L, host = NULL, port = NULL, tls = crew::crew_tls(), tls_enable = NULL, tls_config = NULL, seconds_interval = 0.5, seconds_timeout = 5, retry_tasks = TRUE )
crew_client( name = NULL, workers = 1L, host = NULL, port = NULL, tls = crew::crew_tls(), tls_enable = NULL, tls_config = NULL, seconds_interval = 0.5, seconds_timeout = 5, retry_tasks = TRUE )
name |
Name of the client object. If |
workers |
Integer, maximum number of parallel workers to run. |
host |
IP address of the |
port |
TCP port to listen for the workers. If |
tls |
A TLS configuration object from |
tls_enable |
Deprecated on 2023-09-15 in version 0.4.1.
Use argument |
tls_config |
Deprecated on 2023-09-15 in version 0.4.1.
Use argument |
seconds_interval |
Number of seconds between
polling intervals waiting for certain internal
synchronous operations to complete,
such as checking |
seconds_timeout |
Number of seconds until timing
out while waiting for certain synchronous operations to complete,
such as checking |
retry_tasks |
|
Other client:
crew_class_client
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() client$summary() client$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() client$summary() client$terminate() }
This function is for developers of crew
launcher plugins.
Users should use a specific controller helper such as
crew_controller_local()
.
crew_controller(client, launcher, auto_scale = NULL)
crew_controller(client, launcher, auto_scale = NULL)
client |
An |
launcher |
An |
auto_scale |
Deprecated. Use the |
Other controller:
crew_class_controller
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() launcher <- crew_launcher_local() controller <- crew_controller(client = client, launcher = launcher) controller$start() controller$push(name = "task", command = sqrt(4)) controller$wait() controller$pop() controller$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() launcher <- crew_launcher_local() controller <- crew_controller(client = client, launcher = launcher) controller$start() controller$push(name = "task", command = sqrt(4)) controller$wait() controller$pop() controller$terminate() }
Create an R6
object to submit tasks and launch workers
through multiple crew
controllers.
crew_controller_group(...)
crew_controller_group(...)
... |
|
Other controller_group:
crew_class_controller_group
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { persistent <- crew_controller_local(name = "persistent") transient <- crew_controller_local( name = "transient", tasks_max = 1L ) group <- crew_controller_group(persistent, transient) group$start() group$push(name = "task", command = sqrt(4), controller = "transient") group$wait() group$pop() group$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { persistent <- crew_controller_local(name = "persistent") transient <- crew_controller_local( name = "transient", tasks_max = 1L ) group <- crew_controller_group(persistent, transient) group$start() group$push(name = "task", command = sqrt(4), controller = "transient") group$wait() group$pop() group$terminate() }
Create an R6
object to submit tasks and
launch workers on local processes.
crew_controller_local( name = NULL, workers = 1L, host = "127.0.0.1", port = NULL, tls = crew::crew_tls(), tls_enable = NULL, tls_config = NULL, seconds_interval = 0.5, seconds_timeout = 60, seconds_launch = 30, seconds_idle = 300, seconds_wall = Inf, seconds_exit = NULL, retry_tasks = TRUE, tasks_max = Inf, tasks_timers = 0L, reset_globals = TRUE, reset_packages = FALSE, reset_options = FALSE, garbage_collection = FALSE, crashes_error = 5L, launch_max = NULL, r_arguments = c("--no-save", "--no-restore"), options_metrics = crew::crew_options_metrics(), options_local = crew::crew_options_local(), local_log_directory = NULL, local_log_join = NULL )
crew_controller_local( name = NULL, workers = 1L, host = "127.0.0.1", port = NULL, tls = crew::crew_tls(), tls_enable = NULL, tls_config = NULL, seconds_interval = 0.5, seconds_timeout = 60, seconds_launch = 30, seconds_idle = 300, seconds_wall = Inf, seconds_exit = NULL, retry_tasks = TRUE, tasks_max = Inf, tasks_timers = 0L, reset_globals = TRUE, reset_packages = FALSE, reset_options = FALSE, garbage_collection = FALSE, crashes_error = 5L, launch_max = NULL, r_arguments = c("--no-save", "--no-restore"), options_metrics = crew::crew_options_metrics(), options_local = crew::crew_options_local(), local_log_directory = NULL, local_log_join = NULL )
name |
Name of the client object. If |
workers |
Integer, maximum number of parallel workers to run. |
host |
IP address of the |
port |
TCP port to listen for the workers. If |
tls |
A TLS configuration object from |
tls_enable |
Deprecated on 2023-09-15 in version 0.4.1.
Use argument |
tls_config |
Deprecated on 2023-09-15 in version 0.4.1.
Use argument |
seconds_interval |
Number of seconds between
polling intervals waiting for certain internal
synchronous operations to complete,
such as checking |
seconds_timeout |
Number of seconds until timing
out while waiting for certain synchronous operations to complete,
such as checking |
seconds_launch |
Seconds of startup time to allow.
A worker is unconditionally assumed to be alive
from the moment of its launch until |
seconds_idle |
Maximum number of seconds that a worker can idle
since the completion of the last task. If exceeded, the worker exits.
But the timer does not launch until |
seconds_wall |
Soft wall time in seconds.
The timer does not launch until |
seconds_exit |
Deprecated on 2023-09-21 in version 0.5.0.9002. No longer necessary. |
retry_tasks |
|
tasks_max |
Maximum number of tasks that a worker will do before
exiting. See the |
tasks_timers |
Number of tasks to do before activating
the timers for |
reset_globals |
|
reset_packages |
|
reset_options |
|
garbage_collection |
|
crashes_error |
Positive integer scalar. If a worker exits
|
launch_max |
Deprecated on 2024-11-04 (version 0.10.1.9000).
Use |
r_arguments |
Optional character vector of command line arguments
to pass to |
options_metrics |
Either |
options_local |
An object generated by |
local_log_directory |
Deprecated on 2024-10-08. Use
|
local_log_join |
Deprecated on 2024-10-08. Use
|
Other plugin_local:
crew_class_launcher_local
,
crew_launcher_local()
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { controller <- crew_controller_local() controller$start() controller$push(name = "task", command = sqrt(4)) controller$wait() controller$pop() controller$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { controller <- crew_controller_local() controller$start() controller$push(name = "task", command = sqrt(4)) controller$wait() controller$pop() controller$terminate() }
crew
feature.Show an informative warning when a crew
feature is
deprecated.
crew_deprecate( name, date, version, alternative, condition = "warning", value = "x", skip_cran = FALSE, frequency = "always" )
crew_deprecate( name, date, version, alternative, condition = "warning", value = "x", skip_cran = FALSE, frequency = "always" )
name |
Name of the feature (function or argument) to deprecate. |
date |
Date of deprecation. |
version |
Package version when deprecation was instated. |
alternative |
Message about an alternative. |
condition |
Either "warning" or "message" to indicate the type of condition thrown on deprecation. |
value |
Value of the object. Deprecation is skipped
if |
skip_cran |
Logical of length 1, whether to skip the deprecation warning or message on CRAN. |
frequency |
Character of length 1, passed to the |
NULL
(invisibly). Throws a warning if a feature is deprecated.
Other utility:
crew_assert()
,
crew_clean()
,
crew_eval()
,
crew_random_name()
,
crew_retry()
,
crew_terminate_process()
,
crew_terminate_signal()
,
crew_worker()
suppressWarnings( crew_deprecate( name = "auto_scale", date = "2023-05-18", version = "0.2.0", alternative = "use the scale argument of push(), pop(), and wait()." ) )
suppressWarnings( crew_deprecate( name = "auto_scale", date = "2023-05-18", version = "0.2.0", alternative = "use the scale argument of push(), pop(), and wait()." ) )
Not a user-side function. Do not call directly.
crew_eval( command, name = NA_character_, string = NA_character_, data = list(), globals = list(), seed = NULL, algorithm = NULL, packages = character(0), library = NULL )
crew_eval( command, name = NA_character_, string = NA_character_, data = list(), globals = list(), seed = NULL, algorithm = NULL, packages = character(0), library = NULL )
command |
Language object with R code to run. |
name |
Character of length 1, name of the task. |
string |
Character of length 1, string representation of the command. |
data |
Named list of local data objects in the evaluation environment. |
globals |
Named list of objects to temporarily assign to the global environment for the task. |
seed |
Integer of length 1 with the pseudo-random number generator
seed to set for the evaluation of the task. Passed to the
|
algorithm |
Integer of length 1 with the pseudo-random number
generator algorithm to set for the evaluation of the task.
Passed to the |
packages |
Character vector of packages to load for the task. |
library |
Library path to load the packages. See the |
The crew_eval()
function evaluates an R expression
in an encapsulated environment and returns a monad with the results,
including warnings and error messages if applicable.
The random number generator seed, globals
, and global options
are restored to their original values on exit.
A monad object with results and metadata.
Other utility:
crew_assert()
,
crew_clean()
,
crew_deprecate()
,
crew_random_name()
,
crew_retry()
,
crew_terminate_process()
,
crew_terminate_signal()
,
crew_worker()
crew_eval(quote(1 + 1))
crew_eval(quote(1 + 1))
This function is useful for inheriting argument documentation
in functions that create custom third-party launchers. See
@inheritParams crew::crew_launcher
in the source code file of
crew_launcher_local()
.
crew_launcher( name = NULL, seconds_interval = 0.5, seconds_timeout = 60, seconds_launch = 30, seconds_idle = 300, seconds_wall = Inf, seconds_exit = NULL, tasks_max = Inf, tasks_timers = 0L, reset_globals = TRUE, reset_packages = FALSE, reset_options = FALSE, garbage_collection = FALSE, crashes_error = 5L, launch_max = NULL, tls = crew::crew_tls(), processes = NULL, r_arguments = c("--no-save", "--no-restore"), options_metrics = crew::crew_options_metrics() )
crew_launcher( name = NULL, seconds_interval = 0.5, seconds_timeout = 60, seconds_launch = 30, seconds_idle = 300, seconds_wall = Inf, seconds_exit = NULL, tasks_max = Inf, tasks_timers = 0L, reset_globals = TRUE, reset_packages = FALSE, reset_options = FALSE, garbage_collection = FALSE, crashes_error = 5L, launch_max = NULL, tls = crew::crew_tls(), processes = NULL, r_arguments = c("--no-save", "--no-restore"), options_metrics = crew::crew_options_metrics() )
name |
Name of the launcher. |
seconds_interval |
Number of seconds between
polling intervals waiting for certain internal
synchronous operations to complete,
such as checking |
seconds_timeout |
Number of seconds until timing
out while waiting for certain synchronous operations to complete,
such as checking |
seconds_launch |
Seconds of startup time to allow.
A worker is unconditionally assumed to be alive
from the moment of its launch until |
seconds_idle |
Maximum number of seconds that a worker can idle
since the completion of the last task. If exceeded, the worker exits.
But the timer does not launch until |
seconds_wall |
Soft wall time in seconds.
The timer does not launch until |
seconds_exit |
Deprecated on 2023-09-21 in version 0.5.0.9002. No longer necessary. |
tasks_max |
Maximum number of tasks that a worker will do before
exiting. See the |
tasks_timers |
Number of tasks to do before activating
the timers for |
reset_globals |
|
reset_packages |
|
reset_options |
|
garbage_collection |
|
crashes_error |
Positive integer scalar. If a worker exits
|
launch_max |
Deprecated on 2024-11-04 (version 0.10.1.9000).
Use |
tls |
A TLS configuration object from |
processes |
|
r_arguments |
Optional character vector of command line arguments
to pass to |
options_metrics |
Either |
Other launcher:
crew_class_launcher
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(sockets = client$summary()$socket) launcher$launch(index = 1L) task <- mirai::mirai("result", .compute = client$name) mirai::call_mirai_(task) task$data client$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(sockets = client$summary()$socket) launcher$launch(index = 1L) task <- mirai::mirai("result", .compute = client$name) mirai::call_mirai_(task) task$data client$terminate() }
Create an R6
object to launch and maintain
local process workers.
crew_launcher_local( name = NULL, seconds_interval = 0.5, seconds_timeout = 60, seconds_launch = 30, seconds_idle = Inf, seconds_wall = Inf, seconds_exit = NULL, tasks_max = Inf, tasks_timers = 0L, reset_globals = TRUE, reset_packages = FALSE, reset_options = FALSE, garbage_collection = FALSE, crashes_error = 5L, launch_max = NULL, tls = crew::crew_tls(), r_arguments = c("--no-save", "--no-restore"), options_metrics = crew::crew_options_metrics(), options_local = crew::crew_options_local(), local_log_directory = NULL, local_log_join = NULL )
crew_launcher_local( name = NULL, seconds_interval = 0.5, seconds_timeout = 60, seconds_launch = 30, seconds_idle = Inf, seconds_wall = Inf, seconds_exit = NULL, tasks_max = Inf, tasks_timers = 0L, reset_globals = TRUE, reset_packages = FALSE, reset_options = FALSE, garbage_collection = FALSE, crashes_error = 5L, launch_max = NULL, tls = crew::crew_tls(), r_arguments = c("--no-save", "--no-restore"), options_metrics = crew::crew_options_metrics(), options_local = crew::crew_options_local(), local_log_directory = NULL, local_log_join = NULL )
name |
Name of the launcher. |
seconds_interval |
Number of seconds between
polling intervals waiting for certain internal
synchronous operations to complete,
such as checking |
seconds_timeout |
Number of seconds until timing
out while waiting for certain synchronous operations to complete,
such as checking |
seconds_launch |
Seconds of startup time to allow.
A worker is unconditionally assumed to be alive
from the moment of its launch until |
seconds_idle |
Maximum number of seconds that a worker can idle
since the completion of the last task. If exceeded, the worker exits.
But the timer does not launch until |
seconds_wall |
Soft wall time in seconds.
The timer does not launch until |
seconds_exit |
Deprecated on 2023-09-21 in version 0.5.0.9002. No longer necessary. |
tasks_max |
Maximum number of tasks that a worker will do before
exiting. See the |
tasks_timers |
Number of tasks to do before activating
the timers for |
reset_globals |
|
reset_packages |
|
reset_options |
|
garbage_collection |
|
crashes_error |
Positive integer scalar. If a worker exits
|
launch_max |
Deprecated on 2024-11-04 (version 0.10.1.9000).
Use |
tls |
A TLS configuration object from |
r_arguments |
Optional character vector of command line arguments
to pass to |
options_metrics |
Either |
options_local |
An object generated by |
local_log_directory |
Deprecated on 2024-10-08. Use
|
local_log_join |
Deprecated on 2024-10-08. Use
|
Other plugin_local:
crew_class_launcher_local
,
crew_controller_local()
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(sockets = client$summary()$socket) launcher$launch(index = 1L) task <- mirai::mirai("result", .compute = client$name) mirai::call_mirai_(task) task$data client$terminate() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { client <- crew_client() client$start() launcher <- crew_launcher_local(name = client$name) launcher$start(sockets = client$summary()$socket) launcher$launch(index = 1L) task <- mirai::mirai("result", .compute = client$name) mirai::call_mirai_(task) task$data client$terminate() }
Create an R6
object to monitor local processes created by
crew
and mirai
.
crew_monitor_local()
crew_monitor_local()
Other monitor:
crew_class_monitor_local
crew
launcher options.Options for the local crew
launcher.
crew_options_local(log_directory = NULL, log_join = TRUE)
crew_options_local(log_directory = NULL, log_join = TRUE)
log_directory |
Either |
log_join |
Logical of length 1. If |
A classed list of options for the local launcher.
Other options:
crew_options_metrics()
crew_options_local()
crew_options_local()
crew_options_metrics()
configures the
crew
to record resource usage metrics (such as CPU and memory usage)
for each running worker.
To be activate resource usage logging,
the autometric
R package version 0.1.0 or higher
must be installed.
Logging happens in the background (through a detached POSIX)
so as not to disrupt
the R session. On Unix-like systems, crew_options_metrics()
can specify /dev/stdout
or /dev/stderr
as the log files, which will
redirect output to existing logs you are already using.
autometric::log_read()
and autometric::log_plot()
can read and
visualize resource usage data from multiple log files, even
if those files are mixed with other messages.
crew_options_metrics(path = NULL, seconds_interval = 5)
crew_options_metrics(path = NULL, seconds_interval = 5)
path |
Where to write resource metric log entries for workers.
After running enough tasks in |
seconds_interval |
Positive number, seconds between resource metric
log entries written to |
A classed list of options for logging resource usage metrics.
Other options:
crew_options_local()
crew_options_metrics()
crew_options_metrics()
Generate a random string that can be used as a name for a worker or task.
crew_random_name(n = 12L)
crew_random_name(n = 12L)
n |
Number of bytes of information in the random string
hashed to generate the name. Larger |
The randomness is not reproducible and cannot be set with
e.g. set.seed()
in R.
A random character string.
Other utility:
crew_assert()
,
crew_clean()
,
crew_deprecate()
,
crew_eval()
,
crew_retry()
,
crew_terminate_process()
,
crew_terminate_signal()
,
crew_worker()
crew_random_name()
crew_random_name()
crew
relay object.Create an R6
crew
relay object.
crew_relay()
crew_relay()
A crew
relay object keeps the signaling relationships
among condition variables.
An R6
crew
relay object.
Other relay:
crew_class_relay
crew_relay()
crew_relay()
Repeatedly retry a function while it keeps returning FALSE
and exit the loop when it returns TRUE
crew_retry( fun, args = list(), seconds_interval = 1, seconds_timeout = 60, max_tries = Inf, error = TRUE, message = character(0), envir = parent.frame() )
crew_retry( fun, args = list(), seconds_interval = 1, seconds_timeout = 60, max_tries = Inf, error = TRUE, message = character(0), envir = parent.frame() )
fun |
Function that returns |
args |
A named list of arguments to |
seconds_interval |
Nonnegative numeric of length 1,
number of seconds to wait between calls to |
seconds_timeout |
Nonnegative numeric of length 1, number of seconds to loop before timing out. |
max_tries |
Maximum number of calls to |
error |
Whether to throw an error on a timeout or max tries. |
message |
Character of length 1, optional error message if the wait times out. |
envir |
Environment to evaluate |
NULL
(invisibly).
Other utility:
crew_assert()
,
crew_clean()
,
crew_deprecate()
,
crew_eval()
,
crew_random_name()
,
crew_terminate_process()
,
crew_terminate_signal()
,
crew_worker()
crew_retry(fun = function() TRUE)
crew_retry(fun = function() TRUE)
Manually terminate a local process.
crew_terminate_process(pid)
crew_terminate_process(pid)
pid |
Integer of length 1, process ID to terminate. |
NULL
(invisibly).
Other utility:
crew_assert()
,
crew_clean()
,
crew_deprecate()
,
crew_eval()
,
crew_random_name()
,
crew_retry()
,
crew_terminate_signal()
,
crew_worker()
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { process <- processx::process$new("sleep", "60") process$is_alive() crew_terminate_process(pid = process$get_pid()) process$is_alive() }
if (identical(Sys.getenv("CREW_EXAMPLES"), "true")) { process <- processx::process$new("sleep", "60") process$is_alive() crew_terminate_process(pid = process$get_pid()) process$is_alive() }
Get a supported operating system signal for terminating a local process.
crew_terminate_signal()
crew_terminate_signal()
An integer of length 1: tools::SIGTERM
if your platform
supports SIGTERM
. If not, then crew_crew_terminate_signal()()
checks
SIGQUIT
, then SIGINT
, then SIGKILL
, and then returns the first
signal it finds that your operating system can use.
Other utility:
crew_assert()
,
crew_clean()
,
crew_deprecate()
,
crew_eval()
,
crew_random_name()
,
crew_retry()
,
crew_terminate_process()
,
crew_worker()
crew_terminate_signal()
crew_terminate_signal()
Create an R6
object for throttling.
crew_throttle(seconds_interval = 0.5)
crew_throttle(seconds_interval = 0.5)
seconds_interval |
Positive numeric of length 1, throttling interval.
The |
Throttling is a technique that limits how often a function is
called in a given period of time. crew_throttle()
objects support
the throttle
argument of controller methods, which ensures auto-scaling
only happen every seconds_interval
seconds. This helps avoid
overburdening the mirai
dispatcher and other resources.
An R6
object with throttle configuration settings and methods.
Other throttle:
crew_class_throttle
throttle <- crew_throttle(seconds_interval = 0.5) throttle$poll() throttle$poll()
throttle <- crew_throttle(seconds_interval = 0.5) throttle$poll() throttle$poll()
Create an R6
object with transport layer security (TLS)
configuration for crew
.
crew_tls( mode = "none", key = NULL, password = NULL, certificates = NULL, validate = TRUE )
crew_tls( mode = "none", key = NULL, password = NULL, certificates = NULL, validate = TRUE )
mode |
Character of length 1. Must be one of the following:
|
key |
If |
password |
If |
certificates |
If |
validate |
Logical of length 1, whether to validate the configuration
object on creation. If |
crew_tls()
objects are input to the tls
argument of
crew_client()
, crew_controller_local()
, etc.
See https://wlandau.github.io/crew/articles/risks.html for details.
An R6
object with TLS configuration settings and methods.
Other tls:
crew_class_tls
crew_tls(mode = "automatic")
crew_tls(mode = "automatic")
Launches a crew
worker which runs a mirai
daemon.
Not a user-side function. Users should not call crew_worker()
directly. See launcher plugins like crew_launcher_local()
for examples.
crew_worker( settings, launcher, worker, instance, options_metrics = crew::crew_options_metrics() )
crew_worker( settings, launcher, worker, instance, options_metrics = crew::crew_options_metrics() )
settings |
Named list of arguments to |
launcher |
Character of length 1, name of the launcher. |
worker |
Positive integer of length 1, index of the worker. This worker index remains the same even when the current instance of the worker exits and a new instance launches. |
instance |
Character of length 1 to uniquely identify the current instance of the worker. |
options_metrics |
Either |
NULL
(invisibly)
Other utility:
crew_assert()
,
crew_clean()
,
crew_deprecate()
,
crew_eval()
,
crew_random_name()
,
crew_retry()
,
crew_terminate_process()
,
crew_terminate_signal()