Skip to contents

The studies object consists of eight slots, each a table (tibble), that combined form a relational database of a subset of GWAS Catalog studies. Each study is an observation (row) in the studies table --- main table. All tables have the column study_id as primary key.

Slots

studies
study_id

GWAS Catalog study accession identifier, e.g., "GCST002735".

reported_trait

Phenotypic trait as reported by the authors of the study, e.g. "Breast cancer".

initial_sample_size

Free text description of the initial cohort sample size.

replication_sample_size

Free text description of the replication cohort sample size.

gxe

Whether the study investigates a gene-environment interaction.

gxg

Whether the study investigates a gene-gene interaction.

snp_count

Number of variants passing quality control.

qualifier

Qualifier of number of variants passing quality control.

imputed

Whether variants were imputed.

pooled

Whether samples were pooled.

study_design_comment

Any other relevant study design information.

full_pvalue_set

Whether full summary statistics are available for this study.

user_requested

Whether the addition of this study to the GWAS Catalog was requested by a user.

genotyping_techs

A tibble listing genotyping technologies employed in each study. Columns:

study_id

GWAS Catalog study accession identifier.

genotyping_technology

Genotyping technology employed, e.g. "Exome genotyping array", "Exome-wide sequencing", "Genome-wide genotyping array", "Genome-wide sequencing", or "Targeted genotyping array".

platforms

A tibble listing platforms used per study.

study_id

GWAS Catalog study accession identifier.

manufacturer

Platform manufacturer, e.g., "Affymetrix", "Illumina", or "Perlegen".

ancestries

A tibble listing ancestry of samples used in each study.

study_id

GWAS Catalog study accession identifier.

ancestry_id

Ancestry identifier.

type

Stage of the ancestry sample: either 'initial' or 'replication'.

number_of_individuals

Number of individuals comprising this ancestry sample.

ancestral_groups

A tibble listing ancestral groups used in each ancestry.

study_id

GWAS Catalog study accession identifier.

ancestry_id

Ancestry identifier.

ancestral_group

Genetic ancestry groups present in the sample.

countries_of_origin

A tibble listing countries of origin of samples.

study_id

GWAS Catalog study accession identifier.

ancestry_id

Ancestry identifier.

country_name

Country name, according to The United Nations M49 Standard of Geographic Regions.

major_area

Region name, according to The United Nations M49 Standard of Geographic Regions.

region

Sub-region name, according to The United Nations M49 Standard of Geographic Regions.

countries_of_recruitment

A tibble listing countries of recruitment of samples.

study_id

GWAS Catalog study accession identifier.

ancestry_id

Ancestry identifier.

country_name

Country name, according to The United Nations M49 Standard of Geographic Regions.

major_area

Region name, according to The United Nations M49 Standard of Geographic Regions.

region

Sub-region name, according to The United Nations M49 Standard of Geographic Regions.

publications

A tibble listing publications associated with each study.

study_id

GWAS Catalog study accession identifier.

pubmed_id

PubMed identifier.

publication_date

Publication date (online date if available) formatted as ymd.

publication

Abbreviated journal name.

title

Publication title.

author_fullname

Last name and initials of first author.

author_orcid

Author's ORCID iD (Open Researcher and Contributor ID).