Statistics for Human Genetics and Molecular Biology Lecture 1: Review Basic Terminology of Genetics Dr. Yen-Yi Ho ([email protected]) Sep 09, 2015

Mar 30, 2018




Logistics

Lectures M W F & Labs: 1:25 to 2:15

Office Hours:
Yen-Yi MW 2:30-3:30
Cavan MW 2:30-3:30
Zhiyuan (Jason) Xu Tue 3-4p in Mayo A446

Textbook:
Foulkes (2009): Applied Statistical Genetics with R
Hahne, Huber, Gentleman, and Falcon (2008): Bioconductor Case Studies
John Verzani's SimpleR notes

Website: http://www.biostat.umn.edu/~cavanr/pubh7445.html


Goals for the Course

• Basic knowledge of R

• Basics of statistics for human genetics

• Basics of genetic data analyses using R/Bioconductor

• Interpreting results and simple diagnoses


Objectives of Lecture 1

• Review basic terminology of genetics
    • Central dogma of molecular biology
    • Chromosomes, genes, DNA, RNA, and proteins
    • Gene expression
    • Genetic variation
    • Mutations

• Technologies for Genome Analysis


Mendelian Genetics (1866)

Segregation of alleles in the production of sex cells

1. the principle of segregation
2. the principle of independent assortment
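The two principles can be made concrete by enumerating the gametes a parent can produce. The following is a minimal sketch, not part of the original slides: the allele labels A/a and B/b and the function name `gametes` are illustrative assumptions, and the loci are assumed to be unlinked so that independent assortment holds.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def gametes(genotype):
    """Return the probability of each gamete for a parent's genotype.

    `genotype` is a list of loci, each a pair of alleles, e.g.
    [("A", "a"), ("B", "b")].
    Segregation: each gamete receives exactly one of the two alleles
    at a locus, each with probability 1/2.
    Independent assortment (assumes unlinked loci): the allele chosen
    at one locus does not affect the allele chosen at another.
    """
    combos = list(product(*genotype))  # pick one allele per locus
    counts = Counter("".join(combo) for combo in combos)
    total = len(combos)
    return {gamete: Fraction(n, total) for gamete, n in counts.items()}


single = gametes([("A", "a")])              # one heterozygous locus
double = gametes([("A", "a"), ("B", "b")])  # two heterozygous loci
print(single)
print(double)
```

With one heterozygous locus the two gamete types are equally likely; with two unlinked heterozygous loci the four combinations AB, Ab, aB, and ab each have probability 1/4.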


Mendelian Genetics Translates to Modern Genetics

• A parent contributes only a single chromosome within a pair to the offspring.

• A fixed location on a chromosome pair is called a locus, and only those loci coding (for proteins or functional RNA) are typically called genes.

• An allele is the state or type of genetic info at a locus on a single chromosome. Thus there are two alleles at each locus in an individual (for autosomes, and for sex chromosomes in females).


• Example: A particular disease locus has two possible allele types in the population: d (the disease allele) and D (normal).

• Genotype: the joint (unordered) state of the two alleles. Could be dd, DD (called homozygous genotypes), or Dd (heterozygous genotype).

• Alleles that are common in the population are often called wild type while disease alleles are called mutant.

• Phenotype: an observed trait we care about, such as disease status, etc.
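The genotype definitions above can be illustrated by enumerating the offspring of a cross at the D/d locus. This is a hedged sketch rather than anything from the lecture: the function names `offspring_genotypes` and `zygosity` are hypothetical, and each parent is assumed to transmit either allele with probability 1/2.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def offspring_genotypes(parent1, parent2):
    """Genotype probabilities for a cross at a single locus.

    Each parent is a two-character string such as "Dd" and passes on
    one of its two alleles with probability 1/2. Genotypes are
    unordered, so "dD" and "Dd" are counted as the same genotype.
    """
    counts = Counter(
        "".join(sorted(a + b)) for a, b in product(parent1, parent2)
    )
    total = sum(counts.values())
    return {g: Fraction(n, total) for g, n in counts.items()}


def zygosity(genotype):
    """Classify a two-allele genotype as homozygous or heterozygous."""
    return "homozygous" if genotype[0] == genotype[1] else "heterozygous"


cross = offspring_genotypes("Dd", "Dd")
for genotype, prob in sorted(cross.items()):
    print(genotype, zygosity(genotype), prob)
```

For two heterozygous parents this gives DD, Dd, and dd with probabilities 1/4, 1/2, and 1/4. How a genotype maps to a phenotype (for example, whether d is recessive) is a separate modeling choice that the sketch does not make.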


Mendelian Genetics Translates to Modern Genetics

Adapted from NHGRI Talking Glossary


Central Dogma of Biology: Classic View


Base Pairs

Humans have ≈ 3 × 10^9 base pairs in their nuclear genome.

IUPAC code    Base
a             adenine
c             cytosine
g             guanine
t (or u)      thymine (or uracil)
r             a/g
y             c/t
s             g/c
w             a/t
k             g/t
m             a/c
b             c/g/t
d             a/g/t
h             a/c/t
v             a/c/g
n             any base
. / -         gap
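The ambiguity codes in the table can be used to test whether a concrete sequence is compatible with a pattern. A minimal sketch, assuming lower-case input and treating u as equivalent to t; the gap symbols are left out, and the names `IUPAC` and `matches` are illustrative rather than taken from any library.

```python
# IUPAC nucleotide codes from the table, mapped to the bases each allows.
# Assumption: "u" is treated as "t"; gap symbols are not handled.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t", "u": "t",
    "r": "ag", "y": "ct", "s": "gc", "w": "at", "k": "gt", "m": "ac",
    "b": "cgt", "d": "agt", "h": "act", "v": "acg", "n": "acgt",
}


def matches(pattern, sequence):
    """True if `sequence` (plain a/c/g/t) is consistent with IUPAC `pattern`."""
    if len(pattern) != len(sequence):
        return False
    return all(
        base in IUPAC[code]
        for code, base in zip(pattern.lower(), sequence.lower())
    )


print(matches("ary", "agc"))  # r allows g, y allows c
print(matches("ary", "acc"))  # r does not allow c
```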


Gene

Gene: a functional and heritable element in the genome that usually codes for a protein; the human genome has ≈20,000 genes.

The gene consists of three major structures:

• Regulatory segment

• Exons

• Introns

source: http://www.nobelprize.org/educational/medicine/dna/a/replication/gene.html


Transcription

Transcription is the process of making RNA from DNA.
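As a rough illustration of the sequence relationship, the sketch below derives an mRNA string from a DNA string in two equivalent ways. It is a simplification under stated assumptions: the toy sequences and function names are hypothetical, both strands are written 5' to 3', and splicing of introns is ignored.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def transcribe(coding_strand):
    """mRNA from the coding (sense) strand: same sequence with T replaced by U."""
    return coding_strand.upper().replace("T", "U")


def transcribe_template(template_strand):
    """mRNA from the template strand, with both written 5'->3'.

    The mRNA is the reverse complement of the template, with U in place of T.
    """
    rev_comp = "".join(
        COMPLEMENT[base] for base in reversed(template_strand.upper())
    )
    return rev_comp.replace("T", "U")


coding = "ATGGCCTAA"    # hypothetical toy sequence
template = "TTAGGCCAT"  # reverse complement of `coding`
print(transcribe(coding))
print(transcribe_template(template))
```

Both calls give the same mRNA, because the coding strand and the mRNA are each complementary to the template strand.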


Translation

Translation is the process of converting the sequence of nucleotide bases in mRNA (transcribed from DNA) into a sequence of amino acids in a protein.
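A small sketch of this mapping, using the standard genetic code written in its conventional compact form. The function name `translate` and the toy input are illustrative assumptions; the sketch reads from the first AUG to the first stop codon and ignores everything else a real cell does (ribosome binding, alternative start sites, and so on).

```python
from itertools import product

# Standard genetic code in compact form: codons are enumerated with
# bases in the order U, C, A, G (third position varying fastest);
# "*" marks a stop codon.
BASES = "UCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(mrna):
    """Translate from the first AUG to the first stop codon.

    Returns the protein as a string of one-letter amino acid codes,
    or an empty string if no start codon is found.
    """
    mrna = mrna.upper().replace("T", "U")  # accept DNA-style input too
    start = mrna.find("AUG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(mrna) - 2, 3):
        aa = CODON_TABLE[mrna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("AUGGCCUAA"))  # AUG -> M, GCC -> A, UAA -> stop
```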


Gene Expression

Gene expression is a highly specific process. Only a small fraction of the genes are expressed, or turned "on," in any particular type of cell.

[Figure panels: gene expression in different tissues; gene expression in the same tissue, but at different points in time]


Putting it all together

source: http://www.nobelprize.org/educational/medicine/dna/index.html

• DNA: Info on chromosome is static, and essentially the same across cells within the individual

• mRNA: Not as relevant as protein, but easier to quantify

• Protein: Difficult to quantify globally, though very relevant


Source of Variation


Environment Vs. Gene

Any two individuals are 99.9% identical in their DNA


Genetic Variations (Polymorphisms)

That 0.1% is very important in defining our differences

• single nucleotide polymorphisms (SNPs, every 300 nucleotides on average)

• small-scale mutations, insertions, deletions

• copy number variations (AAGAAGAAGAAG)

source: http://ghr.nlm.nih.gov/handbook/genomicresearch/snp
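The figures quoted in this lecture (a genome of about 3 × 10^9 base pairs, 99.9% identity between two individuals, and one SNP per 300 nucleotides on average) imply some back-of-the-envelope counts. The sketch below only does that arithmetic and shows how single-base differences could be located in two aligned toy sequences; the sequences and the function name `differing_positions` are hypothetical.

```python
GENOME_SIZE = 3_000_000_000  # base pairs in the nuclear genome (approximate)
DIFF_PER_BASES = 1_000       # 0.1% difference = 1 differing base per 1,000
SNP_SPACING = 300            # one SNP every 300 nucleotides on average

# Bases expected to differ between two individuals.
differing_bases = GENOME_SIZE // DIFF_PER_BASES
# SNP sites implied by the average spacing.
snp_sites = GENOME_SIZE // SNP_SPACING

print(f"{differing_bases:,}")
print(f"{snp_sites:,}")


def differing_positions(seq1, seq2):
    """0-based positions where two aligned, equal-length sequences differ."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    return [i for i, (a, b) in enumerate(zip(seq1, seq2)) if a != b]


# Two hypothetical aligned sequences differing at a single site.
print(differing_positions("AAGCCTA", "AAGCTTA"))
```

The two counts answer different questions: the first is the expected number of differences between one pair of genomes, while the second counts sites that vary somewhere in the population, so they need not agree.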


Mutations


Genome Analysis Technologies

1. DNA

• Microarrays: SNP, copy number variation (CNV), methylation

• DNA sequencing: SNP, insertion, deletion, mutation, CNV, methylation

2. mRNA

• Microarrays

• RNA sequencing

3. Protein

• 2-D electrophoresis

• MALDI-TOF mass spec


General Steps in Obtaining Gene Expression Data


General Steps in Next-Generation Sequencing


Next Lecture

• Review basic terminology of population genetics
    • Crossing Over
    • DNA Recombination
    • Genetic Markers
    • Genetic Association Analysis

• Structures of Genetic Data
