Annex Creator Function

Creates an object of class c("annex", "data.frame") required to calculate the statistics.

Usage

annex(formula, data, tz, duplicate.action = NULL, meta = NULL, verbose = FALSE)

# S3 method for class 'annex'
head(x, ...)

# S3 method for class 'annex'
tail(x, ...)

# S3 method for class 'annex'
subset(x, ...)

# S3 method for class 'annex_stats'
head(x, ...)

# S3 method for class 'annex_stats'
tail(x, ...)

# S3 method for class 'annex_stats'
subset(x, ...)

Arguments

formula: the formula to specify how the data set is set up. See 'Details' for more information.
data: data.frame containing the obervations/data.
tz: character, time zone definition (e.g., "Europe/Berlin" or "UTC"); required. OlsonNames() returns a list of possible time zones. The correct time zone is important to properly calculate month and time of day.
duplicate.action: NULL or a function which returns a single numeric value. Used to handle possible duplicates, see 'Details'.
meta: NULL (default) or a list with information about study, home, and room (see section 'Duplicates').
verbose: logical, defaults to FALSE. Can be set to TRUE to increase verbosity.
x: object of class annex.
...: arguments to be passed to or from other methods.

Details

In case the data set provided on data does only contain data of one study, home, and room, the fomula has two parts, looking e.g., as follows:

T + RH ~ datetime

The left hand side of the formula (left of ~) specifies the names of the variables of the observations to be processed, the right hand side is the name of the variable containing the time information (must be of class POSIXt). In this case, the meta argument is required to provide information about the study, home, and room.

If the grouping information is already in the data set, the analysis can be performed depending on the group information, typically:

T + RH ~ datetime | study + home + room

The latter allows to process observations from different studies, homes, and/or rooms all in one go.

Duplicates

Duplicated records can distort the statistics and should be handled properly. For each unique study, home, room only one observation (row) for a specific date and time should exist.

As there is no general way to deal with such duplicates, the function annex (as well as annex_prepare) by default throws a warning for the user if such duplicates exist (duplicate.action = NULL; default argument).

However, the package allows the user to provide a custom duplicate.action function, e.g., mean, min, max, ... This function must return one single numeric value (or an NA) when applied to a vector. If a function is provided, the annex function does the following:

Checks if there are any duplicates. If not, no changes are made. Else ...
Checking if the function is valid (returns single numeric or NA). If not, an error will be thrown.
Takes the measurements of each duplicate; if all values are missing, an NA will be returned. Else the users duplicate.action is applied to all remaining non-missing values. I.e., if duplicate.action = mean the average of all non-missing values will be used.

Author

Reto Stauffer

Examples