Skip to contents

This function retrieves the Biomarker data set, a data set containing raw fluorescence amplification data: 24,576 amplification curves, of 50 cycles each.

Usage

get_biomarker_dataset()

Value

A data frame with 24,576 amplification curves, of 50 cycles each:

plate

Plate identifier. Because one plate was used per gene, the name of the plate is the same as the values in target.

well

Well identifier.

dye

In all reactions the SYBR Green I master mix (Roche) was used, so the value is always "SYBR".

target

Target identifier, in almost all cases the name of a gene.

target_type

Target type: either target of interest ("toi") or reference target ("ref").

sample

Sample identifier.

sample_type

Sample type.

copies

Standard copy number.

dilution

Dilution factor. Higher number means greater dilution.

cycle

PCR cycle.

fluor

Raw fluorescence values.

Details

Data was gathered from Ruijter et al. (2013), doi:10.1016/j.ymeth.2012.08.011 but original source is by Vermeulen et al. (2009), doi:10.1016/S1470-2045(09)70154-8 . The tidy version of the data is kept at the repository of the source of {vermeulen} package. This function fetches such data and thus requires internet connection. It takes a few seconds to run.

The Biomarker data set comprises a set of 59 targets previously identified as a 59-mRNA gene expression signature, that has been developed and validated for improved outcome prediction of children with neuroblastoma. In short, 59 biomarkers and 5 reference genes were measured in 8 µl reactions in a 384-well plate using the LightCycler480 SYBR Green Master (Roche) in a sample maximization experiment design. The 59 genes were carefully selected as being previously reported as prognostic genes in neuroblastoma in at least two independent studies. Each plate contained 366 cDNA samples (n = 1) from primary tumor biopsies, a 5-point 10-fold serial dilution series based on an external oligonucleotide standard (n = 3, from 150,000 to 15 copies), and a no template control (NTC, n = 3). Raw (baseline uncorrected) fluorescent data were exported from the LightCycler480 instrument software.

Examples

# \donttest{
# Takes ~ 10-30 sec
head(get_biomarker_dataset())
#>   plate well  dye target target_type sample sample_type copies dilution cycle
#> 1  AHCY   A1 SYBR   AHCY         toi   1495         unk     NA       NA     1
#> 2  AHCY   A1 SYBR   AHCY         toi   1495         unk     NA       NA     2
#> 3  AHCY   A1 SYBR   AHCY         toi   1495         unk     NA       NA     3
#> 4  AHCY   A1 SYBR   AHCY         toi   1495         unk     NA       NA     4
#> 5  AHCY   A1 SYBR   AHCY         toi   1495         unk     NA       NA     5
#> 6  AHCY   A1 SYBR   AHCY         toi   1495         unk     NA       NA     6
#>      fluor
#> 1 1.102538
#> 2 1.453846
#> 3 1.463030
#> 4 1.471323
#> 5 1.473840
#> 6 1.453012
# }