Concept Part 1: Genome-Wide Association Study (GWAS) and the Basics of Genetics

A brief look at Genome-Wide Association Study (GWAS) and the Basics of Genetics

1. Introduction to Genome-wide Association Study

Genome-Wide Association Study (GWAS) is a scientific method used to establish an association between a condition, usually a disease, and the genetic variants in the human genome. The entire human genome reaches 3 billion nucleotides, which are composed of the letters A, C, T, G. To put it into perspective, that is enough information to fill roughly 100 200-page books!

Conventionally, researchers do not look at all of the 3 billion letters in the human genome since it is quite rare or expensive to use statistical software that can handle such computational intensity. For that reason, in GWAS, researchers prototype and study the genome sequence on specific parts of the body. We are typically working with data that come from a type of technology called a genotype array which only measures/records a subset of those nucleotides. Another type of technology that’s becoming more common (as it gets cheaper) is known as whole genome sequencing which actually measures/records all 3 billion nucleotides.

2. The basics of genetics

Before diving into our method, let’s define some key genetic concepts in this topic. They are:

In the picture above, we have 4 individuals with most of their base pairs are the same, except from the ones highlighted in orange. Those orange-highlighted parts of the chromosomes that are different from one another are SNPs!

References