Suitability of Some Data Quality Controls Thresholds for Genetic Association Studies of Admixed Population

NCT ID: NCT02770001

Last Updated: 2019-12-17

Study Results

Results pending

The study team has not published outcome measurements, participant flow, or safety data for this trial yet.

Basic Information

Recruitment Status

WITHDRAWN

Study Classification

OBSERVATIONAL

Study Start Date

2016-05-05

Study Completion Date

2017-05-15

Brief Summary

Background: In genetic studies, the quality of DNA samples is tested first. Low-quality samples are not used. Some studies involve minority ethnic groups. An example is admixed African Americans. These studies often have small sample sizes. It is important to make sure samples are not discarded unnecessarily. This may happen when quality control (QC) thresholds designed for homogeneous groups are used. These may not be appropriate for an admixed group. Researchers want to study samples that failed certain QC tests. They want to see whether the failures have to do with the ancestry of the outliers or the quality of the samples.

Objectives:

To study samples that fail heterozygosity and sample genotype call rate QC. To see if the failing rates have to do with the ancestry composition of the outliers or the quality of the samples.

Eligibility:

No new participants. Researchers will review data that has already been collected.

Design:

Researchers will study DNA samples in a lab.

The samples will not include data that can identify the person the sample came from.

Detailed Description

The purpose of this IRB proposal is to gain access to genetic data generated from participants of publicly funded genomic studies and deposited into dbGaP. We intend to use dbGaP data to conduct a secondary analysis of the influence of admixture on the outcome of data quality control (QC) in genetic association studies, in order to inform future studies about the optimal QC metrics for genetic association analysis of admixed populations. The data we intend to use are deposited in dbGaP and come from the University of Michigan Health and Retirement Study (HRS). This protocol is being sent to you because of a dbGaP requirement for this specific de-identified dataset to be reviewed by an IRB as that was not in the protocol.

This work will require statistical analysis tools to estimate the genetic ancestry make-up of each sample from the genotype data and to determine how that ancestry relates to QC outcomes (i.e., whether a sample might be excluded from an analysis because of its ancestral genetic composition rather than its genetic quality).
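One way to relate ancestry to QC outcomes can be sketched as follows. This is only an illustration, not the study's stated method: it assumes a per-sample ancestry proportion has already been estimated by some external tool, and it swaps in a simple permutation test on the difference in mean ancestry between QC-failed and QC-passed samples. The function name `ancestry_vs_qc` and its parameters are hypothetical.

```python
import numpy as np


def ancestry_vs_qc(ancestry, failed, n_perm=10000, seed=0):
    """Compare mean ancestry proportion of QC-failed vs QC-passed samples.

    ancestry: per-sample ancestry proportion in [0, 1] (assumed to come
              from an external ancestry-estimation tool).
    failed:   per-sample boolean flag, True if the sample failed QC.

    Returns the observed difference in means (failed minus passed) and a
    two-sided permutation p-value.
    """
    a = np.asarray(ancestry, dtype=float)
    f = np.asarray(failed, dtype=bool)
    if f.all() or not f.any():
        raise ValueError("need at least one failed and one passed sample")

    observed = a[f].mean() - a[~f].mean()

    # Shuffle the QC labels to build the null distribution of the difference.
    rng = np.random.default_rng(seed)
    count = 0
    for _ in range(n_perm):
        p = rng.permutation(f)
        diff = a[p].mean() - a[~p].mean()
        if abs(diff) >= abs(observed):
            count += 1

    # Add-one correction so the p-value is never exactly zero.
    p_value = (count + 1) / (n_perm + 1)
    return observed, p_value
```

A small p-value here would suggest that QC failure is associated with ancestry composition, which is the pattern this analysis is looking for. It would not by itself rule out poor sample quality.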

Objectives and specific aims: This work aims to investigate samples failing heterozygosity and sample genotype call rate quality control (QC) to determine whether the samples' call rate and heterozygosity rate reflect the ancestry composition of the outliers rather than the quality of the samples, and to inform future studies of the potential sample loss if general QC is applied to genetic data from admixed sample sets.

Rationale and Background: In genetic association studies, DNA sample quality can vary widely across study participants, and this variation affects genotype call rate and genotype accuracy; samples of low DNA quality tend to have a lower genotype call rate and lower genotype accuracy. Heterozygosity rate (proportion of heterozygous loci per individual) and genotype failure rate (proportion of missing genotypes per individual) are routinely used together to identify samples with low DNA quality at the data quality control (QC) stage of genetic association studies. An excessive heterozygosity rate may indicate sample contamination, whilst a reduced heterozygosity rate could indicate inbreeding [1]. Samples with a genotype failure rate above 3-7% [2,3] and heterozygosity more than 2-3 standard deviations from the mean heterozygosity are usually excluded from genetic case-control studies.
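The two per-sample metrics defined above can be computed as in the following minimal sketch. The genotype encoding (0, 1, 2 allele counts with -1 for a missing call), the function names, and the default thresholds (3% missingness, 3 standard deviations) are assumptions chosen for illustration from the ranges quoted above, not values fixed by the study.

```python
import numpy as np


def sample_qc_metrics(genotypes):
    """Per-sample genotype failure rate and heterozygosity rate.

    genotypes: 2-D array (samples x SNPs) of allele counts 0, 1, 2,
               with -1 marking a missing call (hypothetical encoding).
    """
    g = np.asarray(genotypes)
    called = g >= 0
    n_called = called.sum(axis=1)

    # Proportion of missing genotypes per individual.
    missing_rate = 1.0 - n_called / g.shape[1]

    # Proportion of heterozygous loci among called genotypes;
    # NaN when a sample has no called genotypes at all.
    het_count = (g == 1).sum(axis=1)
    het_rate = np.where(n_called > 0, het_count / np.maximum(n_called, 1), np.nan)
    return missing_rate, het_rate


def flag_outliers(missing_rate, het_rate, max_missing=0.03, n_sd=3.0):
    """Flag samples exceeding the missingness or heterozygosity thresholds."""
    mean = np.nanmean(het_rate)
    sd = np.nanstd(het_rate)
    fail_missing = missing_rate > max_missing
    fail_het = np.abs(het_rate - mean) > n_sd * sd
    return fail_missing, fail_het
```

Note that the heterozygosity cut-off is relative to the mean and spread of the whole sample set, so in a set with mixed ancestry the same threshold can flag samples whose heterozygosity differs for reasons other than DNA quality.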

Generally, the sample sizes of genetic association studies involving minority ethnic groups, such as admixed African Americans, tend to be small. It is therefore important to ensure that samples are not discarded unnecessarily, which reduces statistical power, by applying QC thresholds designed for homogeneous groups that might not be appropriate for an admixed sample set. The aim of this analysis is to investigate samples failing heterozygosity and sample genotype call rate QC to determine whether the samples' call rate and heterozygosity rate reflect the ancestry composition of the outliers rather than the quality of the samples. The motivation is to inform future studies of potential loss if general

Conditions

African American Genetics

Keywords

Admixture, African American, African American Genetics

Study Design

Observational Model Type

OTHER

Study Time Perspective

OTHER

Eligibility Criteria

Inclusion Criteria

* All the genotype data will be used and no individual will be excluded based on any phenotype.

Minimum Eligible Age

50 Years

Eligible Sex

ALL

Accepts Healthy Volunteers

No

Sponsors

National Human Genome Research Institute (NHGRI)

NIH

Sponsor Role: lead

Responsible Party

Responsibility Role: SPONSOR

Principal Investigators

Sharon K Davis

Role: PRINCIPAL_INVESTIGATOR

National Human Genome Research Institute (NHGRI)

Countries

United States

Other Identifiers

16-HG-N110

Identifier Type: -

Identifier Source: secondary_id

999916110

Identifier Type: -

Identifier Source: org_study_id