vermeulen provides the Biomarker data set by Vermeulen et al. (2009) in tidy format.

This data set is for a real-time quantitative PCR experiment that comprises:

  • The raw fluorescence data of 24,576 amplification curves.
  • 64 targets: 59 genes of interest and 5 reference genes.
  • 366 neuroblastoma cDNA samples and 18 dilution series samples.


Install vermeulen from CRAN:

# Install from CRAN
install.packages("vermeulen")

You can instead install the development version of vermeulen from GitHub:

# install.packages("remotes")


Because of CRAN size limits, the data is not bundled with the package at installation time. After installation, retrieve it from this GitHub repository with the function get_biomarker_dataset().


library(vermeulen)
library(tibble)  # as_tibble()
library(dplyr)   # distinct(), count()

# Takes a few seconds (downloading from GitHub...)
biomarker <- as_tibble(get_biomarker_dataset())
biomarker
#> # A tibble: 1,226,880 × 11
#>    plate well  dye   target target_t…¹ sample sampl…² copies dilut…³ cycle fluor
#>    <fct> <fct> <fct> <fct>  <fct>      <chr>  <fct>    <int>   <dbl> <int> <dbl>
#>  1 AHCY  A1    SYBR  AHCY   toi        1495   unk         NA      NA     1  1.10
#>  2 AHCY  A1    SYBR  AHCY   toi        1495   unk         NA      NA     2  1.45
#>  3 AHCY  A1    SYBR  AHCY   toi        1495   unk         NA      NA     3  1.46
#>  4 AHCY  A1    SYBR  AHCY   toi        1495   unk         NA      NA     4  1.47
#>  5 AHCY  A1    SYBR  AHCY   toi        1495   unk         NA      NA     5  1.47
#>  6 AHCY  A1    SYBR  AHCY   toi        1495   unk         NA      NA     6  1.45
#>  7 AHCY  A1    SYBR  AHCY   toi        1495   unk         NA      NA     7  1.48
#>  8 AHCY  A1    SYBR  AHCY   toi        1495   unk         NA      NA     8  1.46
#>  9 AHCY  A1    SYBR  AHCY   toi        1495   unk         NA      NA     9  1.47
#> 10 AHCY  A1    SYBR  AHCY   toi        1495   unk         NA      NA    10  1.46
#> # … with 1,226,870 more rows, and abbreviated variable names ¹​target_type,
#> #   ²​sample_type, ³​dilution
#> # ℹ Use `print(n = ...)` to see more rows
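
Each amplification curve is the set of rows sharing one plate and well. The following is a minimal sketch of pulling out a single curve with base R. To stay runnable without the download, it uses a stand-in data frame named `toy` (a hypothetical name), built from the ten rows printed above and reduced to four columns. With the real data, use `biomarker` in its place.

```r
# Stand-in for the downloaded data: the ten rows printed above,
# reduced to four columns. `toy` is an illustrative name only.
toy <- data.frame(
  plate = "AHCY",
  well  = "A1",
  cycle = 1:10,
  fluor = c(1.10, 1.45, 1.46, 1.47, 1.47, 1.45, 1.48, 1.46, 1.47, 1.46)
)

# One amplification curve = all cycles recorded for one plate/well pair.
curve <- subset(toy, plate == "AHCY" & well == "A1")

# Make sure the rows are in cycle order before inspecting or plotting.
curve <- curve[order(curve$cycle), ]
curve
```

The resulting `curve` can then be passed to a plotting function, for example with `cycle` on the x-axis and `fluor` on the y-axis.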

Types of samples:

# count() and distinct() are from dplyr
count(
  distinct(biomarker, plate, well, sample_type, copies, dilution),
  sample_type, copies, dilution
)
#> # A tibble: 7 × 4
#>   sample_type copies dilution     n
#>   <fct>        <int>    <dbl> <int>
#> 1 ntc              0      Inf   192
#> 2 std             15    10000   192
#> 3 std            150     1000   192
#> 4 std           1500      100   192
#> 5 std          15000       10   192
#> 6 std         150000        1   192
#> 7 unk             NA       NA 23424
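
As a quick sanity check, the counts in this table can be reconciled with the figures given at the top of this document. The sketch below only does arithmetic on numbers already shown; the variable names are illustrative. The products are consistent with every sample being measured once per target, which is an inference from the numbers rather than something stated here.

```r
# Curves per row of the table above: ntc, five std levels, then unk.
n_curves <- c(rep(192, 6), 23424)

targets          <- 64   # 59 genes of interest + 5 reference genes
cdna_samples     <- 366  # neuroblastoma cDNA samples
dilution_samples <- 18   # dilution series samples

total <- sum(n_curves)

# All curves: 64 targets x (366 + 18) samples
total == targets * (cdna_samples + dilution_samples)

# Unknowns (unk): 64 targets x 366 cDNA samples
n_curves[7] == targets * cdna_samples

# No-template controls and standards (ntc + std): 64 targets x 18 samples
sum(n_curves[1:6]) == targets * dilution_samples
```

All three comparisons evaluate to TRUE, and `total` equals the 24,576 amplification curves quoted above.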

