--- title: "Get started with mantis" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{Get started with mantis} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = TRUE, comment = "#>" ) ``` This vignette demonstrates how to use this package to generate different types of html reports that allow you to visually review multiple related time series quickly and easily. ## Data format The data frame containing the time series should be in long format, with the following columns (though the actual column names can be different): * one "timepoint" (date/posixt) column which will be used for the x-axes. Values should follow a regular pattern, e.g. daily or monthly, but do not have to be consecutive. * one or more "item" (character) columns containing categorical values identifying distinct time series. * one "value" (numeric) column containing the time series values which will be used for the y-axes. **Example:** The `example_prescription_numbers` dataset is provided with this package, and contains (synthetic) examples of aggregate numbers of antibiotic prescriptions given in a hospital over a period of a year. It contains 5 columns: * PrescriptionDate - The date the prescriptions were written * Antibiotic - The name of the antibiotic prescribed * Spectrum - The spectrum of activity of the antibiotic. This value is always the same for a particular antibiotic * NumberOfPrescriptions - The number of prescriptions written for this antibiotic on this day * Location - The hospital site where the prescription was written ```{r data} # first, attach the package if you haven't already library(mantis) # this example data frame contains numbers of antibiotic prescriptions # in long format data("example_prescription_numbers") head(example_prescription_numbers) ``` ### Specifying the data columns You must specify which column(s) in the supplied data frame should be used to identify the individual time series (items), and which columns contain the timepoint (x-axis) and value (y-axis) for the time series. These are set using the `item_cols`, `timepoint_col` and `value_col` parameters of `inputspec()` respectively. Any other columns in the data frame are ignored. Optionally, if there are multiple columns specified in `item_cols`, one of them can be used to group the time series into different tabs on the report, by using the `tab_col` parameter. Members of `item_cols` should be specified in the order that you want them to appear in the output. **Example:** For the `example_prescription_numbers` dataset above, the combination of "Antibiotic" and "Location" columns uniquely identify a time series in the data frame, and so both columns must be included in `item_cols`. The "Spectrum" column can also be added to `item_cols` as well if desired, in which case it will appear in the output as an additional column/label. Otherwise it will be ignored. Here are some options for specifying the data columns, depending on how you want the report to look: ```{r} # create a flat report, and include the "Location" and "Antibiotic" fields # in the content inspec_flat <- inputspec( timepoint_col = "PrescriptionDate", item_cols = c("Location", "Antibiotic"), value_col = "NumberOfPrescriptions", timepoint_unit = "day" ) # create a flat report, and include the "Location", "Spectrum", and "Antibiotic" # fields in the content inspec_flat2 <- inputspec( timepoint_col = "PrescriptionDate", item_cols = c("Location", "Spectrum", "Antibiotic"), value_col = "NumberOfPrescriptions", timepoint_unit = "day" ) # create a tabbed report, with a separate tab for each unique value of # "Location", and include just the "Antibiotic" field in the content of each tab inspec_tabbed <- inputspec( timepoint_col = "PrescriptionDate", item_cols = c("Antibiotic", "Location"), value_col = "NumberOfPrescriptions", tab_col = "Location", timepoint_unit = "day" ) # create a tabbed report, with a separate tab for each unique value of # "Location", and include the "Antibiotic" and "Spectrum" fields in the content # of each tab inspec_tabbed2 <- inputspec( timepoint_col = "PrescriptionDate", item_cols = c("Antibiotic", "Spectrum", "Location"), value_col = "NumberOfPrescriptions", tab_col = "Location", timepoint_unit = "day" ) # create a tabbed report, with a separate tab for each unique value of # "Antibiotic", and include just the "Location" field in the content of each tab inspec_tabbed3 <- inputspec( timepoint_col = "PrescriptionDate", item_cols = c("Antibiotic", "Location"), value_col = "NumberOfPrescriptions", tab_col = "Antibiotic", timepoint_unit = "day" ) ``` ### Specifying the timepoint unit In this example dataset, the time series contain daily values, which is the default pattern of timepoints expected by the package. If the time series were to contain e.g. monthly or hourly values, this should be specified using the `timepoint_unit` parameter of `inputspec()`. ## Generating a report The simplest way to create a report is to use the `mantis_report()` function. We need to decide where to save the report to. The file name can only contain alphanumeric, `-` and `_` characters, and should include the extension ".html". If a path is supplied, the directory should already exist. We can also optionally specify a short description for the dataset, which will appear on the report. There are 3 different options for visualising the time series. This is set using the `outputspec` parameter. ### Interactive plots This is the default visualisation, and produces a table with one time series in each row, with orderable and filterable columns showing the previously-specified `item_cols` for identifying the time series. Also in the table are the maximum value of each time series, and bar plots with adjustable axes and tooltips showing the individual dates and values. There are some options for adjusting the output, such as changing column labels, plot type, and whether or not to use the same y-axis scale across the table. This can be done using the `outputspec_interactive()` function when supplying the `outputspec` parameter. **Example:** For the `example_prescription_numbers` dataset above, we will save a report in the working directory, with separate tabs for each hospital site. ```{r interactive-test, include=FALSE} # quietly test can create the report from an rmd but show the code in the next # chunk in the vignette # temporarily set working directory to tempdir withr::with_dir( tempdir(), filename <- mantis_report( df = example_prescription_numbers, file = "example_prescription_numbers_interactive.html", inputspec = inspec_tabbed2, outputspec = outputspec_interactive(), report_title = "mantis report", dataset_description = "Antibiotic prescriptions by site", show_progress = FALSE ) ) # clean up file.remove(filename) ``` ```{r interactive-display, eval=FALSE} mantis_report( df = example_prescription_numbers, file = "example_prescription_numbers_interactive.html", inputspec = inspec_tabbed2, outputspec = outputspec_interactive(), report_title = "mantis report", dataset_description = "Antibiotic prescriptions by site", show_progress = TRUE ) ``` ### Static plots Alternatively, the report can output static plots. This can be useful if interactivity is not needed, or if file sizes need to be kept small. There are currently two types of static visualisations: heatmap or multipanel, and are selected by using the `outputspec_static_heatmap()` or `outputspec_static_multipanel()` function when supplying the `outputspec` parameter. #### Heatmap ```{r heatmap-test, include=FALSE} # quietly test can create the report from an rmd but show the code in the next # chunk in the vignette # temporarily set working directory to tempdir withr::with_dir( tempdir(), filename <- mantis_report( df = example_prescription_numbers, file = "example_prescription_numbers_heatmap.html", inputspec = inspec_tabbed, outputspec = outputspec_static_heatmap(), report_title = "mantis report", dataset_description = "Antibiotic prescriptions by site", show_progress = FALSE ) ) # clean up file.remove(filename) ``` ```{r heatmap-display, eval=FALSE} mantis_report( df = example_prescription_numbers, file = "example_prescription_numbers_heatmap.html", inputspec = inspec_tabbed, outputspec = outputspec_static_heatmap(), report_title = "mantis report", dataset_description = "Antibiotic prescriptions by site", show_progress = TRUE ) ``` #### Multipanel ```{r multipanel-test, include=FALSE} # quietly test can create the report from an rmd but show the code in the next # chunk in the vignette # temporarily set working directory to tempdir withr::with_dir( tempdir(), filename <- mantis_report( df = example_prescription_numbers, file = "example_prescription_numbers_multipanel.html", inputspec = inspec_tabbed, outputspec = outputspec_static_multipanel(), report_title = "mantis report", dataset_description = "Antibiotic prescriptions by site", show_progress = FALSE ) ) # clean up file.remove(filename) ``` ```{r multipanel-display, eval=FALSE} mantis_report( df = example_prescription_numbers, file = "example_prescription_numbers_multipanel.html", inputspec = inspec_tabbed, outputspec = outputspec_static_multipanel(), report_title = "mantis report", dataset_description = "Antibiotic prescriptions by site", show_progress = TRUE ) ```