[This article was first published on R-posts.com, and kindly contributed to R-bloggers].
In health research, a flowchart is the best way to show the flow of participants through a study when reporting results. But flowcharts can be tedious to prepare and can get on your nerves.
Fortunately, there are several packages in R for drawing flowcharts using different approaches. The problem is that the programming is generally quite complex, and the numbers have to be entered manually or parameterized beforehand. These flowcharts can have reproducibility problems, because if the data change we have to manually change the parameters again.
To make our lives easier, there’s a new {flowchart} package that uses the tidyverse workflow and lets us create many different types of flowcharts in just a few steps.
The package provides a set of functions designed to be combined with a pipe operator (%>% or |>) to create different flowchart designs directly from the study database. These functions are highly customizable and allow the user to create reproducible flowcharts in an easier and tidier way. We no longer need to manually set flowchart parameters such as the box coordinates or the numbers to display, because the flowchart automatically adapts to the data we have.
For example, we can create a flowchart of the entire participant study flow with this simple tidy workflow:
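The code for this workflow did not survive in this copy of the post, so what follows is a minimal sketch of what such a pipeline can look like, not necessarily the author's exact code. It assumes the built-in safo dataset with the group and itt columns that appear in the printed tibble further down, that itt is coded "Yes"/"No", and that group is missing for patients who were not randomized. The box labels are illustrative.

```r
library(flowchart)

# Sketch: build a participant flowchart straight from the study data.
fc <- safo |>
  as_fc(label = "Patients assessed for eligibility") |>
  # Keep randomized patients (assumption: group is NA otherwise);
  # show_exc = TRUE adds a box for the excluded patients.
  fc_filter(!is.na(group), label = "Randomized", show_exc = TRUE) |>
  # One branch per study group.
  fc_split(group) |>
  # Illustrative label; assumes itt is coded "Yes"/"No".
  fc_filter(itt == "Yes", label = "Included in ITT")

# Draw the result on the current graphics device.
fc_draw(fc)
```

Because every count is computed from the data, re-running the same pipeline on an updated dataset should refresh the numbers without any manual changes.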
Here, we will describe the steps involved in creating the flowchart in this example. We will use the built-in safo dataset that comes with the package, which is a randomly generated dataset from the SAFO clinical trial. For more information and other examples, you can visit the package vignette.
Installing and loading the package
As of March 2024, the package is available on CRAN:
install.packages("flowchart")
You can always install the development version from GitHub:
remotes::install_github("bruigtp/flowchart")
Initialize the flowchart
The first step is to initialize the flowchart with the function as_fc():
library(flowchart)
x <- safo |>
  as_fc(label = "Patients assessed for eligibility")
This will create an object of class fc, the class created for this package. Objects of this class are lists containing the dataset together with the information related to the flowchart being generated. Let’s see its structure for our example:
List of 2
$ data: tibble [925 × 21] (S3: tbl_df/tbl/data.frame)
$ fc : tibble [1 × 17] (S3: tbl_df/tbl/data.frame)
- attr(*, "class")= chr "fc"
The data tibble contains the entire SAFO dataset, as we haven’t performed any further operations:
# A tibble: 925 × 21
id inclusion_crit exclusion_crit chronic_heart_failure expected_death_24h
1 1 Yes No No No
2 2 No No No No
3 3 No No No No
4 4 No Yes No No
5 5 No No No No
6 6 No Yes No No
7 7 No No No No
8 8 No Yes No Yes
9 9 No No No No
10 10 No No No No
# ℹ 915 more rows
# ℹ 16 more variables: polymicrobial_bacteremia ,
# conditions_affect_adhrence , susp_prosthetic_valve_endocard ,
# severe_liver_cirrhosis , acute_sars_cov2 ,
# blactam_fosfomycin_hypersens , other_clinical_trial ,
# pregnancy_or_breastfeeding , previous_participation ,
# myasthenia_gravis , decline_part , group , itt , …
The fc tibble holds the information on the generated flowchart, which so far contains only an initial box indicating the total number of patients assessed for eligibility in the SAFO trial:
# A tibble: 1 × 17
id x y n N perc text type group just text_color text_fs
1 1 0.5 0.5 925 925 100 "Pat… init NA cent… black 8
# ℹ 5 more variables: text_fface , text_ffamily , text_padding ,
# bg_fill , border_color
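The commands that produced the outputs above are not shown in this copy of the post. One plausible way to reproduce them, using only base R inspection tools and the two list elements named in the text, is sketched below. The exact number of columns printed may differ between package versions.

```r
library(flowchart)

x <- safo |>
  as_fc(label = "Patients assessed for eligibility")

class(x)               # S3 class of the flowchart object
str(x, max.level = 1)  # top-level structure: the $data and $fc elements
x$data                 # the study dataset, unfiltered at this point
x$fc                   # one row per flowchart box (a single box so far)
```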
Drawing the flowchart
We can always use the fc_draw() function to draw the associated flowchart from an fc object:
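As a minimal sketch, drawing the initialized object from the previous section should produce a single box with the total number of patients:

```r
library(flowchart)

x <- safo |>
  as_fc(label = "Patients assessed for eligibility")

# Draws the flowchart on the current graphics device.
x |>
  fc_draw()
```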
Building the flowchart
To build the entire flowchart, we need to combine the initialized fc object with the desired functions until we obtain the final flowchart.
The second box, showing the patients excluded from randomization, can be obtained using the fc_filter() function, with show_exc = TRUE to show the box of excluded subjects as well. Now $data contains the database filtered to the randomized subjects only, while $fc contains the information for these new boxes.
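A sketch of this filtering step is shown here. The filter condition is an assumption: it treats patients with a non-missing group as randomized, and the label "Randomized" is illustrative.

```r
library(flowchart)

x <- safo |>
  as_fc(label = "Patients assessed for eligibility") |>
  # Assumption: group is NA for patients who were not randomized.
  fc_filter(!is.na(group), label = "Randomized", show_exc = TRUE)

nrow(x$data)                       # rows remaining after the filter
x$fc[, c("id", "n", "N", "type")]  # boxes generated so far

fc_draw(x)
```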
Now, we can split the flowchart by study group, using the fc_split() function:
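Continuing the same sketch, with the same assumed filter condition and illustrative label as before, the split adds one box per level of the group variable:

```r
library(flowchart)

x <- safo |>
  as_fc(label = "Patients assessed for eligibility") |>
  # Assumption: group is NA for patients who were not randomized.
  fc_filter(!is.na(group), label = "Randomized", show_exc = TRUE) |>
  # One new box per study group.
  fc_split(group)

fc_draw(x)
```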
The idea is to combine these basic functions, fc_filter() and fc_split(), in any way we want to create the desired flowchart. The resulting flowchart can be further customized and enhanced using the fc_modify() function, or combined with other flowcharts either horizontally or vertically using the fc_merge() and fc_stack() functions, respectively. Finally, once the final flowchart is drawn, it can be exported to the desired image format using the fc_export() function.
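As a rough illustration of combining and exporting, the sketch below builds two small flowcharts, places them side by side, and saves the drawing. The filter conditions, the labels, and the file name flowchart.png are all illustrative assumptions. The sketch also assumes that fc_merge() takes a list of fc objects and that fc_export() is called on the result of fc_draw(). Please check the package documentation for the full set of arguments.

```r
library(flowchart)

# First flowchart: randomized patients (assumption: group is NA otherwise).
fc1 <- safo |>
  as_fc(label = "Patients assessed for eligibility") |>
  fc_filter(!is.na(group), label = "Randomized")

# Second flowchart: ITT population (assumption: itt is coded "Yes"/"No").
fc2 <- safo |>
  as_fc(label = "Patients assessed for eligibility") |>
  fc_filter(itt == "Yes", label = "Included in ITT")

# Combine horizontally, draw, then export the drawn flowchart.
list(fc1, fc2) |>
  fc_merge() |>
  fc_draw() |>
  fc_export("flowchart.png")  # hypothetical file name, written to the working directory
```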