Top Banner
Statistics for Human Genetics and Molecular Biology Lecture 1: Review Basic Terminology of Genetics Dr. Yen-Yi Ho ([email protected]) Sep 09, 2015 1/25
25

Statistics for Human Genetics and Molecular Biology Lecture 1

Sep 12, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Statistics for Human Genetics and Molecular Biology Lecture 1

Statistics for Human Genetics and Molecular BiologyLecture 1: Review Basic Terminology of Genetics

Dr. Yen-Yi Ho ([email protected])

Sep 09, 2015

1/25

Page 2: Statistics for Human Genetics and Molecular Biology Lecture 1

Logistics

Lectures M W F& Labs: 1:25 to 2:15Office Hours : Yen-Yi MW 2:30-3:30

Cavan MW 2:30-3:30Zhiyuan (Jason) Xu Tue 3-4p in Mayo A446

Textbook: Foulkes (2009): Applied Statistical Genetics with RHahne, Huber, Gentleman, and Falcon (2008):Bioconductor Case StudiesJohn Verzani’s SimpleR notes

Website: http://www.biostat.umn.edu/∼cavanr/pubh7445.html

2/25

Page 3: Statistics for Human Genetics and Molecular Biology Lecture 1

Goals for the Course

• Basic knowledge of R

• Basics of statistics for human genetics

• Basics of genetic data analyses using R/Bioconductor

• Interpreting results and simple diagnoses

3/25

Page 4: Statistics for Human Genetics and Molecular Biology Lecture 1

Objectives of Lecture 1

I Review basic terminology of geneticsI Central dogma of molecular biologyI Chromosomes, genes, DNA, RNA, and proteinsI Gene expressionI Genetic variationI Mutations

I Technologies for Genome Analysis

4/25

Page 5: Statistics for Human Genetics and Molecular Biology Lecture 1

Mendelian Genetics (1866)

Segregation of alleles in the production of sex cells1. the principle of segregation2. the principle of independent assortment

5/25

Page 6: Statistics for Human Genetics and Molecular Biology Lecture 1

Mendelian Genetics Translates to Modern Genetics

I A parent contributes only a single chromosome within a pairto the offspring.

I A fixed location on a chromosome pair is called a locus, andonly those loci coding (for proteins or functional RNA) aretypically called genes.

I An allele is the state or type of genetic info at a locus on asingle chromosome. Thus there are two alleles at each locusin an individual (for autosomes, and for sex chromosomes infemales).

6/25

Page 7: Statistics for Human Genetics and Molecular Biology Lecture 1

I Example: A particular disease locus has two possible alleletypes in the population: d (the disease allele) and D (normal).

I Genotype: the joint (unordered) state of the two alleles.Could be dd, DD (called homozygous genotypes), or Dd (heterozygous genotype).

I Alleles that are common in the population are often calledwild type while disease alleles are called mutant.

I Phenotype: an observed trait we care about, such as diseasestatus, etc.

7/25

Page 8: Statistics for Human Genetics and Molecular Biology Lecture 1

Mendelian Genetics Translates to Modern Genetics

Adapted from NHGRI Talking Glossary

8/25

Page 9: Statistics for Human Genetics and Molecular Biology Lecture 1

Central Dogma of Biology: Classic View

9/25

Page 10: Statistics for Human Genetics and Molecular Biology Lecture 1

10/25

Page 11: Statistics for Human Genetics and Molecular Biology Lecture 1

Base Pairs

Humans have ≈ 3 × 109 base pairs intheir nuclear genome.

IUPAC code Base

a adeninec cytosineg guanine

t (or u) thymine (or uracil)r a/gy c/ts g/cw a/tk g/tm a/cb c/g/td a/g/th a/c/tv a/c/gn any base./ - gap

11/25

Page 12: Statistics for Human Genetics and Molecular Biology Lecture 1

GeneGene: a functional and inheritable element in the genome, usuallycodes for a protein; human genome ≈20,000 genes.The gene consists of three major structures:

• Regulatory segment

• Exons

• Introns

souce: http://www.nobelprize.org/educational/medicine/dna/a/replication/gene.html

12/25

Page 13: Statistics for Human Genetics and Molecular Biology Lecture 1

Transcription

Transcription is the process of making RNA from DNA.

13/25

Page 14: Statistics for Human Genetics and Molecular Biology Lecture 1

Translation

Translation is the process of translating the sequence of nucleotidebases in DNA/RNA into a sequence of amino acids in a protein.

14/25

Page 15: Statistics for Human Genetics and Molecular Biology Lecture 1

15/25

Page 16: Statistics for Human Genetics and Molecular Biology Lecture 1

Gene Expression

Gene expression is a highly specific process. Only a small fractionof the genes are expressed, or turned ”on,” in any particular typeof cell.

gene expression in different tissues gene expression in the same tissue,

but different points in time

16/25

Page 17: Statistics for Human Genetics and Molecular Biology Lecture 1

Putting it all together

source:

http://www.nobelprize.org/educational/medicine/dna/index.html

I DNA:Info on chromosome isstatic, and essentially thesame across cells withinthe individual

I mRNA:Not as relevant as protein,but easier to quantify

I Protein:Difficult to quantifyglobally, though veryrelevant

17/25

Page 18: Statistics for Human Genetics and Molecular Biology Lecture 1

Source of Variation

18/25

Page 19: Statistics for Human Genetics and Molecular Biology Lecture 1

Environment Vs. Gene

Any two individuals are 99.9% identical in their DNA

19/25

Page 20: Statistics for Human Genetics and Molecular Biology Lecture 1

Genetic Variations (Polymorphisms)

That 0.1 % is very important in defining our differences

• single nucleotide polymorphisms(SNPs, every 300 nucleotide onaverage)

• small-scale mutation, insertions,deletions

• copy number variations(AAGAAGAAGAAG)

source: http://ghr.nlm.nih.gov/handbook/genomicresearch/snp

20/25

Page 21: Statistics for Human Genetics and Molecular Biology Lecture 1

Mutations

21/25

Page 22: Statistics for Human Genetics and Molecular Biology Lecture 1

Genome Analysis Technologies

1. DNA

• Microarrays:SNP, Copy numbervariation (CNV),Methylation

• DNA sequencing:SNP, Insertion,Deletion, Mutation,CNV, Methylation

2. mRNA

• Microarrays• RNA sequencing

3. Protein

• 2-D electrophoresis• Maldi-Tof mass spec

22/25

Page 23: Statistics for Human Genetics and Molecular Biology Lecture 1

General Steps in Obtaining Gene Expression Data

23/25

Page 24: Statistics for Human Genetics and Molecular Biology Lecture 1

General Steps in Next-Generation Sequencing

24/25

Page 25: Statistics for Human Genetics and Molecular Biology Lecture 1

Next Lecture

I Review basic terminology of population geneticsI Crossing OverI DNA RecombinationI Genetic MarkersI Genetic Association Analysis

I Structures of Genetic Data

25/25