Top Banner
Genetic diversity, population structure and phylogeography among belugas (Delphinapterus leucas) in Canadian waters: broad to fine-scale approaches to inform conservation and management strategies by Lianne D. Postma A Thesis submitted to the Faculty of Graduate Studies of The University of Manitoba in partial fulfilment of the requirements of the degree of DOCTOR OF PHILOSOPHY Department of Biological Sciences University of Manitoba Winnipeg Copyright © 2017 by Lianne D. Postma
314

Genetic diversity, population structure and phylogeography ...

Mar 26, 2023

Download

Documents

Khang Minh
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Genetic diversity, population structure and phylogeography ...

Genetic diversity, population structure and phylogeography among belugas

(Delphinapterus leucas) in Canadian waters: broad to fine-scale approaches to inform

conservation and management strategies

by

Lianne D. Postma

A Thesis submitted to the Faculty of Graduate Studies of

The University of Manitoba

in partial fulfilment of the requirements of the degree of

DOCTOR OF PHILOSOPHY

Department of Biological Sciences

University of Manitoba

Winnipeg

Copyright © 2017 by Lianne D. Postma

Page 2: Genetic diversity, population structure and phylogeography ...

ii

Abstract

This thesis examines the genetic diversity, population structure and phylogeography of belugas

(Delphinapterus leucas) in Canadian waters that encounter multiple stressors throughout their

seasonal distributions. Data were collected from multiple scales, including: samples covering a

temporal scale of 25 years; broad geographic comparisons to finer-scale, within-group

comparisons; and varying amounts of genetic information from partial mtDNA sequences to

whole mitogenome sequences, and multiple nuclear microsatellite loci. At least nine genetically

distinct summer aggregations, with an additional distinct winter sample collection of unknown

summer distribution, were identified. This information contributed to the identification of

Designatable Units (DUs) of belugas for future assessments by the Committee on the Status of

Endangered Wildlife in Canada (COSEWIC). Phylogenetic analyses of mtDNA sequences

revealed that the most divergent lineages are found at the east, west and southern edges of the

Canadian distribution, with the central area characteristic of a contact zone displaying an

admixture of lineages marked by incomplete lineage sorting. The geographic distribution of

these lineages suggests multiple glacial refugia as sources of ancestral beluga populations that

recolonized Canadian Arctic and sub-Arctic waters. Preliminary tests of selection detected the

presence of purifying selection on all mtDNA protein-coding genes of belugas. However, no

signals of adaptive selection were detected among genetic lineages or geographic groups. Within

nearshore summer aggregations of Beaufort Sea belugas, three distinct maternal lineages were

identified and patterns of genetic relatedness suggest clusters of related females form in the

overall area. However, these clusters of related belugas did not form fine-scale kin structure

corresponding to aggregation/harvesting locations. Thus, disturbances and subsistence harvesting

Page 3: Genetic diversity, population structure and phylogeography ...

iii

in particular areas where belugas are aggregating will not be necessarily putting a discrete

genetic unit of the stock at risk. These results provide a better understanding of the diversity and

spatial differentiation among and within Canadian beluga stocks, inferences about past responses

to climate changes, approaches to investigate fine-scale structure within seasonal aggregations,

and new tools to infer adaptive potential of these whales. This information, and studies of beluga

fossils plus additional samples across global distributions, will improve conservation and

management planning for this culturally important and charismatic species.

Page 4: Genetic diversity, population structure and phylogeography ...

iv

Acknowledgements

It is very difficult to capture the full scope of people who have contributed to this body of

work. It has been shaped by experiences over the course of an entire career that I have been so

fortunate to have at the Freshwater Institute (DFO) since I was a young undergraduate student.

Without the mentoring, wisdom and support of many co-workers, students and colleagues over

the last several decades, I would never have made my way to this achievement. However, to

combine a PhD program with a busy career requires a special group of people and commitment.

My supervisor, Margaret Docker, was a model of patience, encouragement and sound advice for

every step of this process. She was supported by a “dream-team” of a committee composed of

Steve Ferguson, James Hare and Micheline Manseau. I thank my external examiner Dr. David

Coltman (University of Alberta) for his thoroughness and constructive review of this thesis.

Denise Tenkula has been my partner in the lab for many years, and Susie Bajno and Vanessa

Kornelsen also supported the work in this project. Assistance as training and guidance for

Chapter 3 was generously provided by Tim Frasier of St. Mary’s University, Halifax, NS.

Helpful review and comments were also provided by Eline Lorenzen, Natural History Museum

of Denmark and University of Copenhagen.

Funding for this work was provided by Fisheries and Oceans Canada (DFO) as salary to

L. Postma, through the DFO Genomics Research and Development Initiative (GRDI), the DFO

Ecosystem Research initiative (ERI), the Nunavut Implementation Fund (NIF), and from the

Fisheries Joint Management Committee (FJMC) in the Inuvialuit Settlement Region.

However, my family and friends have been my true touchstone during this endeavour.

Their love and support kept me going when the going got tough. I cannot thank you enough.

Page 5: Genetic diversity, population structure and phylogeography ...

v

Dedication

I dedicate this to my mom. She was my very first teacher and the first to inspire my love of

nature and science when she put earthworms in my small hand and explained how important they

were to the garden. While she is not here to see the end of this journey, she told me almost every

day how proud she was of me for everything I have accomplished. That has been the foundation

of all the work, and the strength in my perseverance, that has brought me, finally, to here.

Page 6: Genetic diversity, population structure and phylogeography ...

vi

Table of Contents

Abstract…………………………………………………………………………………… ii

Acknowledgements ……………………………………………………………………… iv

Dedication ……………………………………………………………………………….. v

List of Tables……………………………………………………………………………… ix

List of Figures …………………………………………………………………………….. xi

Chapter 1: Introduction: Conservation goals and challenges for belugas (Delphinapterus

leucas) in the Canadian Arctic and sub-Arctic

1.1 Conservation and belugas…………………………………………………….. 1

1.2 Evolution and adaptive potential of belugas………………………………….. 6

1.3 Thesis outline…………………………………………………………………. 10

1.4 References …………………………………………………………………….. 14

Chapter 2: Mitochondrial DNA sequence diversity, population structure, and

phylogeography of belugas (Delphinapterus leucas) in Canadian waters

Abstract…………………………………………………………………………… 23

2.1 Introduction…………………………………………………………………… 24

2.2 Materials and methods

2.2.1 Data collection………………………………………………………. 31

2.2.2 Genetic variability and population structure of geographic

samples……………………………………………………………………. 34

2.2.3 Phylogenetic analyses……………………………………………….. 36

Page 7: Genetic diversity, population structure and phylogeography ...

vii

2.2.4 Demographic expansions and haplogroups…………………………. 40

2.3 Results

2.3.1 mtDNA sequence diversity and population structure of

belugas in Canadian waters……………………………………………….. 42

2.3.2 Phylogeographic patterns of belugas inferred from gene

trees and haplotype network………………………………………………. 54

2.3.3 Divergence patterns and dating……………………………………... 64

2.4 Discussion

2.4.1 Population structure of belugas in Canadian waters………………… 75

2.4.2 Influence of dispersal, both historical and contemporary,

on extant beluga population genetic structure…………………………….. 83

2.4.3 Predicting changes in movement patterns and population genetic

structure among Canadian beluga populations……………………………. 94

2.5 Acknowledgements…………………………………………………………… 96

2.6 References …………………………………………………………………….. 96

Appendix 2.1. Summary of results from previous research studies of

mtDNA in beluga populations…………………………………………………….. 113

Appendix 2.2. Information for geographic sample collections used for the

analyses of haplotype diversity (Table 2.2) and geographic differentiation

(Table 2.3)………………………………………………………………………… 116

Appendix 2.3. Results of re-analysis of summer and winter beluga sample

collections using only samples collected in the period 2000-2008

(Nsamples = 718)…………………………………………………………………….. 119

Page 8: Genetic diversity, population structure and phylogeography ...

viii

Appendix 2.4 Alignment of polymorphic sites in N=83 haplotypes found

in main geographic beluga sample collections (N=1377 samples)………………… 122

Chapter 3: Fine-scale genetic structure of nearshore beluga (Delphinpaterus leucas)

aggregations in the Eastern Beaufort Sea: are kin groups being impacted by harvesting?

Abstract……………………………………………………………………………. 125

3.1 Introduction……………………………………………………………………. 126

3.2 Materials and methods

3.2.1 Study area and samples……………………………………………… 133

3.2.2 DNA extraction and sex identification……………………………… 136

3.2.3 Microsatellite genotyping…………………………………………… 136

3.2.4 Validity and variability of microsatellite markers…………………... 137

3.2.5 Relatedness within geographic areas of interest…………………….. 139

3.2.6 Network clustering analyses based on pairwise

relatedness of individuals…………………………………………………. 141

3.2.7 Bayesian and multivariate clustering analyses……………………… 143

3.2.8 Mitochondrial DNA control region sequencing…………………….. 145

3.3 Results

3.3.1 Validity and variability of microsatellite markers…………………... 148

3.3.2 Relatedness and network clustering analyses……………………….. 152

3.3.3 Analyses with STRUCTURE……………………………………….. 164

3.3.4 Discriminant Analysis of Principal Components (DAPC)………….. 166

3.3.5 Mitochondrial DNA control region sequencing…………………….. 169

Page 9: Genetic diversity, population structure and phylogeography ...

ix

3.4 Discussion …………………………………………………………………….. 180

3.5 Acknowledgements…………………………………………………………… 188

3.6 References …………………………………………………………………….. 189

Appendix 3.1 Multiplex amplification reaction mixtures,

amplification conditions, and pooling ratios for beluga microsatellite

genotypes in this study……………………………………………………………. 202

Chapter 4: Mitochondrial genome diversity and phylogenetic patterns among Canadian

belugas (Delphinapterus leucas), with comparisons to narwhal (Monodon monoceros)

mitogenomes

Abstract…………………………………………………………………………… 211

4.1 Introduction…………………………………………………………………… 212

4.2 Materials and Methods

4.2.1 Sample selection……………………………………………………. 217

4.2.2 Complete mitochondrial genome sequencing………………………. 222

4.2.3 Complete mitochondrial genome assembly………………………… 224

4.2.4 Mitogenome sequence variability and differentiation among

sample collections……………………………………………………….... 226

4.2.5 Phylogenetic analyses of complete mitogenome sequences………... 227

4.2.6 Mitogenome codon substitution patterns and evidence

of selection………………………………………………………………... 230

Page 10: Genetic diversity, population structure and phylogeography ...

x

4.3 Results

4.3.1 Complete mitogenome sequencing workflow for beluga and

narwhal…………………………………………………………………… 233

4.3.2 Mitogenome sequence data analyses………………………………. 237

4.3.3 Phylogenetics of complete mitogenome sequences of beluga

and narwhal……………………………………………………………….. 246

4.3.4 Mitogenome codon diversity and signatures of selection…………... 256

4.4 Discussion …………………………………………………………………….. 259

4.5 Acknowledgements…………………………………………………………… 269

4.6 References …………………………………………………………………….. 269

Appendix 4.1. Haplotypes for beluga samples based on complete mitochondrial

genome sequences (16,385bp) as compared to control region mtDNA (609bp)

haplotypes…………………………………………………………………………. 280

Chapter 5: Summary, directions for further research and conclusions

5.1 Introduction…………………………………………………………………… 283

5.2 Key findings and future research directions

5.2.1 Conservation and management units for beluga whales in

Canadian waters…………………………………………………………... 284

5.2.2 Phylogeography of Canadian belugas………………………………. 286

5.2.3 Relatedness and fine-scale stock structure for nearshore

aggregations of Beaufort Sea belugas…………………………………….. 287

Page 11: Genetic diversity, population structure and phylogeography ...

xi

5.2.4 Mitogenomics as a tool for investigating beluga population

genetics and molecular ecology…………………………………………... 289

5.3 Conclusion……………………………………………………………………. 291

5.4 References …………………………………………………………………….. 292

Page 12: Genetic diversity, population structure and phylogeography ...

xii

List of Tables

Table 2.1. Distribution of haplotypes (N=111) from beluga samples collected in

Canadian Waters (Figure 2.1)……………………………………………………………... 45

Table 2.2. Summary of haplotype diversity found in only the summer and winter

beluga sample collections (Nsamples=1377)……………………………………………….... 48

Table 2.3. Patterns of differentiation based on genetic distance among summer and

winter beluga sample collections (Nsamples=1377)…………………………………………. 51

Table 2.4. Results of Maximum Likelihood tests for fits of different nucleotide

substitution models for 1377 haplotype sequences (609bp) from summer and

winter beluga sample collections (see Figure 2.1) as calculated in MEGA6……………... 57

Table 2.5. Pairwise divergence estimates based on nucleotide diversities for the

3 Haplogroups of beluga whale haplotypes……………………………………………….. 64

Table 2.6. Results of neutrality tests for significantly differentiated sample

collections (Table 2.3) and for all sample haplotypes combined according

to geographic haplogroup patterns (Figure 2.9)…………………………………………... 68

Table 2.7. Parameters of demographic expansion for geographic lineages

found in Canadian beluga whale samples as estimated by mismatch analysis…………… 74

Table A2.1. Summary of haplotype diversity found in only the summer

and winter beluga sample collections 2000-2008………………………………………… 119

Table A2.2. Patterns of differentiation based on genetic distance among summer

and winter beluga sample collections 2000-2008………………………………………… 120

Table 3.1. Summary of samples analyzed from each location. Note that number

of females and number of males do not always total final number as PCR

amplifications for sex determination failed in some samples…………………………… 150

Table 3.2. Measures of nuclear genetic diversity by sampling locations………………... 151

Table 3.3. Summary of mtDNA haplotype diversity found in beluga sample

collections……………………………………………………………………………….. 170

Page 13: Genetic diversity, population structure and phylogeography ...

xiii

Table 3.4. Patterns of mtDNA differentiation based on genetic distance among

sample locations (Nsamples=1032)………………………………………………………… 173

Table 3.5. Patterns of differentiation based on mtDNA genetic distance

among sample collections (Nsamples=1032) divided by decades 1980s, 1990s,

and 2000s………………………………………………………………………………... 177

Table 3.6. Summary of mtDNA haplotype diversity found in beluga sample

collections grouped by decade 1980s, 1990s, and 2000s……………………………….. 180

Table 4.1. Mitogenome sequencing run summary statistics for beluga and narwhal

samples (averaged across sequencing runs/samples)……………………………………... 235

Table 4.2. Complete mitogenome genetic variability among beluga sample

collections and beluga clades (as identified in Section 4.3.3)…………………………….. 238

Table 4.3. Summary of beluga mitogenome haplotypes found in multiple samples............ 243

Table 4.4. Comparison of mtDNA control region (CR) sequence information to

complete mitochondrial genomes for beluga and narwhal………………………………... 245

Table 4.5. Intra- and interspecific diversity and divergence in mtDNA protein-coding

genes of beluga compared to narwhal…………………………………………………….. 258

Table 4.6. McDonald-Kreitman test to evaluate departures from selective neutrality

in mtDNA protein-coding genes among beluga clades…………………………………… 258

Page 14: Genetic diversity, population structure and phylogeography ...

xiv

List of Figures

Figure 1.1. The global extant range of belugas…………………………………………….. 2

Figure 2.1. Locations of beluga sample sources for all mtDNA haplotypes

(N=2540) analyzed in this study………………………………………………………….. 32

Figure 2.2. Principal Coordinates Analysis (PCoA) using ΦPT genetic distance matrix

for mtDNA haplotypes from summer and winter beluga whale sample collections

(Table 2.3)………………………………………………………………………………… 52

Figure 2.3. Evolutionary relationships of summer and winter beluga whale sample

collections (Nsamples=1377) based on mtDNA haplotypes and inferred using the

Neighbour-Joining method……………………………………………………………….. 53

Figure 2.4. Evolutionary relationships of beluga whale mtDNA control region

haplotype sequences (N=83) inferred from phylogenetic analysis using

Bayesian inference………………………………………………………………………... 58

Figure 2.5. Evolutionary relationships of beluga whale mtDNA control region

haplotype sequences (N=83) inferred from phylogenetic analysis using

Maximum Parsimony……………………………………………………………………... 59

Figure 2.6. Evolutionary relationships of beluga whale mtDNA control region

haplotype sequences (N=83) inferred from phylogenetic analysis by

Maximum Likelihood…………………………………………………………………….. 60

Figure 2.7. Median-joining phylogenetic network for all beluga mtDNA haplotypes

(609bp sequences) identified in this study (Table 2.1) and found in

N>2 individuals (Nhaplotypes= 85, Nsamples= 2506)…………………………………………. 62

Figure 2.8. Distribution of haplogroups (Haplogroup 1A, Haplogroup 1B and

Haplogroup 2) among summer and winter beluga whale sample collections

(Table 2.3)………………………………………………………………………………… 63

Figure 2.9. Observed mismatch distribution (dotted line) for all Canadian beluga

summer and winter samples combined (N=1377) compared to the bell-shaped

curve (solid line) expected for the data if population expansion has occurred

in the past………………………………………………………………………………….. 69

Page 15: Genetic diversity, population structure and phylogeography ...

xv

Figure 2.10. Observed mismatch distribution (dotted line) for Beaufort Sea samples,

characterized by Haplogroup 1B (N=301), compared to the bell-shaped curve

(solid line) expected for the data if population expansion has occurred in the past………. 70

Figure 2.11. Observed mismatch distribution (dotted line) for central Canadian

sample locations i.e. High Arctic, Cumberland Sound, Western Hudson Bay,

and Belcher Islands harvest, characterized by Haplogroup 1A (N=798), compared

to the bell-shaped curve (solid line) expected for the data if population expansion

has occurred in the past……………………………………………………………………. 71

Figure 2.12. Observed mismatch distribution (dotted line) for combined Belcher

Island entrapment samples and Long Island, characterized by Haplogroup 2 (N=95),

compared to the bell-shaped curve (solid line) expected for the data if population

expansion has occurred in the past………………………………………………………… 72

Figure 2.13. Observed mismatch distribution (dotted line) for (A) James Bay (N=28);

(B) Eastern Hudson Bay (N=30); and (C) St. Lawrence Estuary beluga samples

(N=125) compared to the curve (solid line) expected under a model population

growth or decline………………………………………………………………………….. 73

Figure 2.14. Distribution of haplotypes among summer and winter sample

collections that are significantly differentiated based on genetic distance (Table 2.3)…… 76

Figure A2.1 Principal Coordinates Analysis (PCoA) using ΦPT genetic distance

matrix for mtDNA haplotypes from summer and winter beluga whale sample

collections 2000-2008…………………………………………………………………….. 121

Figure 3.1. Sampling locations for Eastern Beaufort Sea stock of beluga whales

used in this study. Symbols indicate the current hunting camps (black triangle)

and main communities (white square) participating in beluga harvest monitoring………. 134

Figure 3.2. Relatedness of beluga samples from Shallow/Niaquunaq Bay harvested

by hunting camps on the same day………………………………………………………... 153

Figure 3.3. Relatedness of beluga samples from Shallow/Niaqunnaq Bay harvested

by hunting camps in the same decade…………………………………………………….. 154

Page 16: Genetic diversity, population structure and phylogeography ...

xvi

Figure 3.4. Relatedness of male beluga samples (N=706) from: Shallow/Niaqunnaq

Bay (NBNB) (note that the duplication of “NB” and other location abbreviations

denotes an estimation of relatedness within the aggregation area rather than between

areas); East Mackenzie Bay (EMEM); Kugmallit Bay (KBKB); Paulatuk (PAPA);

2006 Husky Lakes (HAHA); and the St. Lawrence Estuary population (SLSL)………… 155

Figure 3.5. Relatedness of female beluga samples (N=209) from: Shallow/Niaqunnaq

Bay (NBNB) (note that the duplication of “NB” and other location abbreviations

denotes an estimation of relatedness within the aggregation area rather than

between areas); East Mackenzie Bay (EMEM); Kugmallit Bay (KBKB);

Paulatuk (PAPA); 2006 Husky Lakes (HAHA); and the St. Lawrence

Estuary population (SLSL)……………………………………………………………….. 156

Figure 3.6. Relatedness of all (male and female) beluga samples (N=964) from

Shallow/Niaqunnaq Bay (NBNB) (note that the duplication of “NB” and other

location abbreviations denotes an estimation of relatedness within the aggregation

area rather than between areas); East Mackenzie Bay (EMEM); Kugmallit Bay

(KBKB); Paulatuk (PAPA); 2006 Husky Lakes (HAHA); and the St. Lawrence

Estuary population (SLSL)………………………………………………………………... 157

Figure 3.7. Network of male and female beluga samples from all Beaufort Sea

sampling locations (Shallow/Niaqunnaq Bay, East Mackenzie Bay, Kugmallit

Bay, and Husky Lakes), and the St. Lawrence Estuary, coloured based on

group assignment: (A) to three groups found by the fast greedy method;

(B) to three groups found by the leading eigenvector method; and (C) to six

groups found by the spinglass community method……………………………………….. 160

Figure 3.8. Network of male beluga samples (N=668) from all Beaufort Sea

sampling locations (Shallow/Niaqunnaq Bay, East Mackenzie Bay, Kugmallit

Bay, and Husky Lakes), coloured based on group assignment: (A) to two

groups found by the fast greedy method; (B) to three groups found by the

leading eigenvector method; and (C) to three groups found by the spinglass

community method……………………………………………………………………….. 161

Figure 3.9. Network of female beluga samples (N=168) from all Beaufort Sea

sampling locations (Shallow/Niaqunnaq Bay, East Mackenzie Bay, Kugmallit

Bay, and Husky Lakes), coloured based on group assignment: (A) to four groups

found by the fast greedy method; (B) to three groups found by the leading

eigenvector method; and (C) to four groups found by the spinglass community

method…………………………………………………………………………………….. 162

Page 17: Genetic diversity, population structure and phylogeography ...

xvii

Figure 3.10. Network of all (male and female) beluga samples (N=858) from

all Beaufort Sea sampling locations (Shallow/Niaqunnaq Bay, East Mackenzie

Bay, Kugmallit Bay, and Husky Lakes), coloured based on group assignment:

(A) to three groups found by the fast greedy method; (B) to three groups found

by the leading eigenvector method; and (C) to nine groups found by the spinglass

community method. For ease of visualization, all connections with relatedness

values < 0.35 were truncated to 0…………………………………………………………. 163

Figure 3.11. (A) Bayesian clustering assignment of individual beluga samples

(N=964) into K=2 clusters based on microsatellite data. Each vertical column

represents one individual, with the length of the coloured segments proportional

to the assignment strength of that individual to one of the clusters. Samples are

grouped by sampling locations from left to right: 1) Kugmallit Bay; 2) East

Mackenzie Bay; 3) Shallow/Niaqunnaq Bay; 4) Paulatuk; 5) 2006 Husky Lakes;

6) 1989 Husky Lakes; 7) 1996 Husky Lakes; and 8) St. Lawrence Estuary. Within

each sample location group, samples are ordered by sampling year. (B) Detection

of the number of K groups that best fit the beluga data performed with the Evanno

method and implemented in Structure Harvester…………………………………………. 161

Figure 3.12. Discriminant Analysis of Principal Components (DAPC) clustering

of male (A) and female (B) beluga samples from all Beaufort Sea

sampling locations (Shallow/Niaqunnaq Bay, East Mackenzie Bay, Kugmallit

Bay, Paulatuk and Husky Lakes), coloured based on group assignment…………………. 167

Figure 3.13. Discriminant Analysis of Principal Components (DAPC) clustering

of male and female beluga samples combined from: (A) all Beaufort Sea sampling

locations (Shallow/Niaqunnaq Bay, East Mackenzie Bay, Kugmallit Bay, Paulatuk

and Husky Lakes); and (B) all Beaufort Sea sampling locations and the

St. Lawrence Estuary, coloured based on group assignment……………………………… 168

Figure 3.14. Distribution of mtDNA haplotypes among sampling locations…………….. 172

Figure 3.15. Principal Coordinates Analysis (PCoA) using ФPT genetic distance

matrix for mtDNA sequence data for beluga sample locations…………………………… 174

Figure 3.16. Principal Coordinates Analysis (PCoA) using ФPT genetic distance

matrix for mtDNA sequence data for beluga samples grouped by location and

decade of sampling………………………………………………………………………... 176

Figure 4.1. Sampling locations and sample sizes for beluga and narwhal mitochondrial

genome sequences………………………………………………………………………… 218

Page 18: Genetic diversity, population structure and phylogeography ...

xviii

Figure 4.2. Summer aggregation ranges of putative narwhal stocks in Canada………….. 221

Figure 4.3. Examples of read coverage against reference mitogenome sequence.

(A) High threshold coverage typical of most samples included in this study;

(B) Poor (below) threshold coverage for samples not included in analyses……………… 236

Figure 4.4. Sequence variation observed within the beluga (A) and narwhal

(B) mitochondrial genomes………………………………………………………………. 240

Figure 4.5. Bootstrap (1000 replicates) consensus Neighbour-Joining tree inferred

from complete narwhal mitochondrial genomes…………………………………………. 250

Figure 4.6. Bootstrap (1000 replicates) consensus Neighbour-Joining tree inferred

from complete beluga mitochondrial genomes…………………………………………… 251

Figure 4.7. Maximum Likelihood tree inferred from complete beluga mitochondrial

genomes split into two parts to highlight sample membership in clades…………………. 252

Figure 4.7, Part B. Clade A (A1 and A2) and Clade B…………………………………… 253

Figure 4.8. Median-joining phylogenetic network for Canada-wide geographic beluga

mitochondrial genome haplotypes (Table 4.1)…………………………………………… 254

Figure 4.9. Median-joining phylogenetic network for Eastern Beaufort Sea beluga

mitochondrial genome haplotypes (Table 4.1)…………………………………………… 255

Figure 4.10. Linear map of complete beluga mitochondrial genome…………………….. 257

Page 19: Genetic diversity, population structure and phylogeography ...

1

Chapter 1: Introduction: Conservation goals and challenges for belugas (Delphinapterus

leucas) in the Canadian Arctic and sub-Arctic

1.1 Conservation and belugas

Conservation of wild species is a concept that spans a broad spectrum of goals and

approaches. At one end there is the motivation of how natural biodiversity provides goods,

services and economic benefits for human societies (Cardinale et al. 2012, Reyers et al. 2012,

Kareiva and Marvier 2012). At the other is the desire to conserve species for their own sake, with

an undefinable value derived from their existence (Ehrenfeld 1976, Soulé 1985, Doak et al.

2014). Deciding what, where and how to conserve a species or population is a complicated

formula involving societal values, political intent, and scientific input influenced by limited

human and financial resources (Tear et al. 2005). Priority setting then becomes a critical

exercise. Determining conservation priorities at the species level is usually done using a listing

process that assigns rankings based on levels of threat and likelihood of extinction (Coates and

Atkins 2001, ICUN 2012, COSEWIC 2015). However, actions for conservation efforts often

arise from a mixture of ecological and biological parameters, human values, and practical

management and regulatory considerations (Margules and Usher 1981).

Though Soulé (1985) described conservation biology as a crisis discipline, important

baseline information can be gathered from intact ecosystems and populations that can be used as

references for framing conservation goals and setting targets (Margules and Usher 1981, Caro et

al. 2011). The need for proactive conservation approaches is even more pertinent when dealing

with migratory species, especially if “the migration is seen as a phenomenon of abundance” and

the goal is to protect the abundance rather than preventing the animal’s extinction (Wilcove and

Page 20: Genetic diversity, population structure and phylogeography ...

2

Wikelski 2008). These species and populations need to be protected while they are still able to

fulfil the ecological properties and services associated with that abundance (Wilcove and

Wikelski 2008, Rands et al. 2010).

Belugas (Delphinapterus leucas) are an Arctic species of cetacean that inhabit a

discontinuous polar distribution mainly in the waters of Canada, Alaska, Russia, Norway and

Greenland (Breton and Smith 1990, Smith et al. 1990) (Figure 1.1).

Figure 1.1. The global extant range of belugas (modified from Jefferson et al. 2012). Isolated

southernmost populations are circled.

Page 21: Genetic diversity, population structure and phylogeography ...

3

Because these marine mammals inhabit an environment dominated by ice for much of the

year, they follow movement patterns that often involve annual migrations covering distances of

thousands of kilometres between summering and wintering areas (Laidre et al. 2008). The global

population of belugas is estimated to be more than 150,000 individuals (Jefferson et al. 2012). In

many regions of their distribution, tens of thousands of belugas form abundant aggregations

along coastlines and in estuaries during the summer (e.g. Harwood et al. 1996, Innes et al. 2002a,

Richard 2005). These predictable seasonal movements of belugas into estuaries and their habit of

congregating close to shore creates an opportunity for the whales to become a target species for

Northern subsistence fisheries (Smith et al. 1990). Throughout their range, belugas are harvested

for consumption (Robards and Reeves 2011, Laidre et al. 2015), but beluga hunts also play an

important role for the cultural identity and social relationships among indigenous peoples (e.g.

Tyrrell 2007). In addition to being a source of food, belugas can also provide local communities

economic value from whale-watching (Orams 2000, Dressler et al. 2001) that draws the interest

of the public to “popular charismatic species” (Barua et al. 2011). In fact, the economic

importance of eco-tourism may be of greater value than harvesting in some locations (Stewart et

al. 2005, Stewart and Draper 2007). Thus, the conservation of belugas, from the species, to

populations, and to local subpopulations, becomes important not just for the inherent value of

protecting biodiversity for its own sake (Doak et al. 2014), but also to meet the food, cultural,

and economic benefits for many northern communities (e.g. Richard and Pike 1993).

The conservation risk status for belugas in Canada is assessed by the Committee on the

Status of Endangered Wildlife in Canada (COSEWIC 2015, 2016). This group provides

information and advice that forms the basis of management objectives for conservation and

management units, or stocks, of belugas. Summer aggregations of beluga in estuaries

Page 22: Genetic diversity, population structure and phylogeography ...

4

traditionally have been the basis for defining stocks in the context of whaling (Reeves and

Mitchell 1989, IWC 2000). Stock boundaries, however, are not fixed – they sometimes overlap

spatially and may have a temporal dimension as well. Also, belugas often form large herds of

several hundred to more than a thousand animals, and group structure appears to be fluid, with

few stable associations (O’Corry-Crowe 2009). When possible, stock definition decisions should

be informed by as much information as possible such as TEK (Traditional Ecological

Knowledge), genetic, morphological, behavioural, telemetric, and contaminant evidence (IWC

2000, COSEWIC 2004, COSEWIC 2016). However, the interactions of people and beluga as a

resource could also be reflected in the identification of a stock. Innes et al. (2002b) define beluga

stocks based on the annual migration path (summering areas and migration routes), and the

opportunity hunters have to hunt the whales during the migration. Furthermore, Innes et al.

(2002b) argue that the migration pattern is a cultural trait of the beluga, and it is this trait, not a

biological one, that determines which individual belugas are hunted by which hunters. Stewart

(2008) reviews this, plus several other concepts of ‘stock’, and defines a stock as “a specific part

of a population impacted by human activity (including potential utilization) in a way that affects

population productivity”. Thus, the approaches that are used to refine the identification of beluga

stocks may vary, but all ultimately aim to achieve management of marine resources that

effectively combines the objectives of meeting societal needs and eliminating or mitigating

detrimental anthropogenic impacts (Waples et al. 2008).

Currently, COSEWIC (2016) recognizes eight Designatable Units (DUs) of belugas in

Canada. These DUs, which are units of the taxonomic species, are defined using a combination

of discreteness, which may involve genetic, geographic and/or eco-geographic region

discreteness; and various indicators of evolutionary significance. In this case, significance is

Page 23: Genetic diversity, population structure and phylogeography ...

5

evaluated in terms of the evolutionary legacy of the species unit, and whether or not its loss

would be irreplaceable through natural means (COSEWIC 2012, 2016). Based on previous and

current COSEWIC assessments (COSEWIC 2004, 2014, 2016), belugas DUs in Canada include:

St. Lawrence Estuary (Endangered); Cumberland Sound (Threatened); Eastern Hudson Bay

(Endangered); James Bay (Not Assessed); Eastern High Arctic-Baffin Bay (Special Concern);

Ungava Bay (Endangered); Eastern Beaufort Sea (Not at Risk); and Western Hudson Bay

(Special Concern). In addition to natural mortality (e.g. predation by killer whales and polar

bears, ice entrapments, harmful algal blooms), these belugas are subject to different levels of

known or potential anthropogenic impacts (e.g. harvesting, oil and gas exploration, commercial

fisheries, hydroelectric development, shipping, habitat disturbance, and environmental pollution)

(COSEWIC 2004, Huntington 2009, Jefferson 2012). Furthermore, the effects of climate change

will likely amplify the nature and magnitude of these activities (Huntington 2009, Jefferson et al.

2012) and will impact the distribution and abundance of sea-ice critical to beluga habitat and

ecosystems (Laidre et al. 2008, O’Corry-Crowe 2008). In the winter, beluga depend on areas

where they can have access to air such as in recurring polynyas and open water formed under the

influence of bottom topography and currents (Breton and Smith 1990). Changing climate could

alter the stability of polynyas, both regarding the timing of sea ice changes and increased

competition for resources (Heide-Jørgensen et al. 2013). Changes in sea ice conditions

associated with winter habitats used by belugas may also increase the risks of ice entrapment

mortality (Laidre and Heide-Jørgensen 2005).

Increasing the concerns over climate change are the observations that warming effects are

most prominent in the polar regions and they are happening at an unprecedented rate (e.g. Corell

2006, Anisimov et al. 2007, Moline et al. 2007, Post et al. 2009, Smol 2012). The impacts of

Page 24: Genetic diversity, population structure and phylogeography ...

6

these trends will have a cascading effect throughout ecosystems with both immediate and long-

term consequences (Anisimov et al. 2007, Schindler 2011). In the Arctic, the rate of warming is

almost twice as high compared to the rest of the world and this trend is expected to accelerate in

the future (Corell 2006). The isolated beluga populations at the southern margins of the species’

range (Figure 1.1) may be the ones particularly vulnerable to climate change (O’Corry-Crowe et

al. 2015). Genetic characterization of these whales can provide valuable insights as to how these

populations became established and the potential for their future persistence (Sexton et al. 2009,

O’Corry-Crowe et al. 2015).

1.2 Evolution and adaptive potential of belugas

For billions of years, life on the planet has been responding to changes in climate by

either adapting or going extinct (Schindler 2011, Smol 2012). This phenomenon is often

associated with the phrase ‘survival of the fittest’, from Darwin’s theory of evolution (Darwin

1859), but this term can sometimes cause confusion about the concepts of natural selection,

fitness and adaptation (Orr 2005, 2009). From an evolutionary genetics perspective, genetic

change arises from natural selection acting on variation in fitness among individuals, species,

and populations that leads to the heritable transmission of the variation from parents to offspring

(Ellegren and Sheldon 2008, Stern and Orgogozo 2009). Fitness can have many definitions,

from conceptual to mathematical, but it is essentially the ability of organisms to survive and

reproduce in the environment in which they live (e.g. Orr 2009).

The capacity of species and populations to evolve in response to changes in their

environments has been termed “evolvability”, or adaptive potential (Houle 1992, Willi et al.

2006). Cetaceans (Cetacea or Neoceti) are a good example of this capacity (Marx and Uhen

Page 25: Genetic diversity, population structure and phylogeography ...

7

2010). Their evolution began as a transition from land to an obligate aquatic environment

(Thewissen et al. 2001). Subsequent adaptive radiation among cetaceans has been linked to

periods of ocean restructuring (Steeman et al. 2009) and to changes in oceanic currents and sea

surface temperatures that resulted in increased diatom diversity and productivity (Marx and Uhen

2010). In modern cetaceans, two whale clades are identified by adaptive strategies for feeding:

the mysticetes, or filter-feeding baleen whales, and the odontocetes, the toothed whales that

pursue larger prey using echolocation (Marx and Uhen 2010).

The only two living species of the cetacean family Monodontidae, beluga and narwhal

(Monodon monoceros), are also the only toothed-whale species found year-round in Arctic

waters. The oldest fossil evidence of an extinct Miocene monodontid lineage (Denebola

brachycephala) was found in Baja California, Mexico (Barnes 1984). The dating of these

materials suggests that monodontids originated in warm water environments at temperate

latitudes (Barnes 1984). Extinct whale lineages in fossil deposits found along the North Carolina

and Virginia coastlines of the North Atlantic Ocean were also dated to the Miocene and Pliocene

(approximately 17 to 4.5 million annum (Ma) before present (BP)) (Whitmore 1994). It is from

these same deposits that an additional early Pliocene beluga-like cetacean was identified,

Bohaskaia monodontoides (Vélez-Juarbe and Pyenson 2011). These occurrences of extinct

monodontid lineages at temperate latitudes in both the Pacific and Atlantic Ocean basins provide

some evidence that cold-climate adaptations may have evolved recently in cetacean lineages.

These lineages may have then developed into extant belugas and narwhals; therefore the

expansion of monodontids into the Arctic and sub-Arctic could be a relatively recent

phenomenon on an evolutionary scale (Vélez-Juarbe and Pyenson 2011).

Page 26: Genetic diversity, population structure and phylogeography ...

8

Adaptations may occur by processes that are linked to both short-term ecological and

long-term evolutionary responses (Ungerer et al. 2008). In line with both types of processes,

belugas show adaptations that include physical, physiological, and behavioural features for

Arctic conditions that are common among polar mammals (Blix 2016). Physical adaptations

include a thick, insulating blubber layer that can comprise over 40% of body weight (Sergeant

and Brodie 1969). This barrier may also serve as an energy store during times of limited food

availability, as well as other functions specific to a cold water aquatic environment (Koopman

2007). Belugas have a small head, tail and flippers (O’Corry-Crowe 2009) that are consistent

with Allen’s rule that the lengths of limbs and other extremities tend to decrease from warmer to

colder climates as a strategy to reduce heat loss (Blix 2016, Hagen 2017). The presence of a

dorsal ridge instead of a dorsal fin in belugas is considered to be an adaptation for surfacing in an

ice-covered habitat or limiting heat loss (O’Corry-Crowe et al. 2009). However, the dorsal ridge

may not, in fact, be an adaptation specific to ice-covered waters as it is not unique to whales in

these environments (Werth et al. 2012). Instead, Werth et al. (2012) suggest that fat pads along

the abdomen are more likely to be an adaptation in monodontids for moving and foraging in ice-

covered habitats. These assist in controlling agile movements like inverted swimming, rolling

and whole body turns characteristic of belugas and narwhal. Belugas also display physiological

specializations for a sea-ice environment, such as high myoglobin content at birth that increases

rapidly to adult levels (Noren and Suydam 2016). This biochemistry facilitates breath-holding

over long periods for foraging and moving among breathing holes (Noren and Suydam 2016).

Whale evolution has also involved significant transformations of brain size and brain

morphology, to the point that brain complexity in modern cetaceans, particularly in odontocetes

(the toothed whales), is surpassed only by humans (Marino 2004). Coincident with these big

Page 27: Genetic diversity, population structure and phylogeography ...

9

brain sizes, odontocetes have also evolved sophisticated social systems (Connor et al. 1998,

Marino 2004, Silk 2007a). The evolution of sociality in mammals occurs when the benefits of

living in groups (e.g. protection from predators, foraging success, mating success) outweighs the

costs that can arise from adverse effects such as competition (Silk 2007b). Thus, social behaviour

has the potential to influence individual fitness and may be important for processes leading to

adaptations (Silk 2007b, Vander Wal et al. 2015).

Social learning behaviours that are shared within subsets of a population are common

within cetacean species (Whitehead et al. 2004, Cantor and Whitehead 2013). This type of

learning is thought to promote increased ecological success by avoiding the time and energy

costs associated with individual learning (Franz and Nunn 2009). Social learning may, therefore,

be important to one of the most characteristic features of many whales, migration (e.g. Esteban et

al. 2016, Kavanaugh et al. 2017). The evolution of long-distance migration is also a strategy to

maximize fitness in seasonal environments and is facilitated by both behavioural and

physiological adaptations (Alerstam et al. 2003, Avgar et al. 2013). In belugas, the extended

period calves spend with their mothers during nursing and weaning is thought to establish social

learning behaviours related to communication, foraging and movement patterns all important for

migration strategies (Brodie 1969, Tyack 1986, Colbeck et al. 2013, Krasnova et al. 2014).

Migration patterns vary among beluga stocks and populations across Canada, with associated

costs (e.g. energy use, predation, ice-entrapments, human disturbance) and benefits (e.g. reliable

habitats and resources, energetic benefits, predator avoidance) differing as well. Migration

patterns are thought to have evolutionary flexibility through a dynamic process that can make

rapid changes to movement behaviour (Alerstam et al. 2003). Modern belugas appear to have a

high degree of fidelity to migration patterns but have been shown to shift the timing of

Page 28: Genetic diversity, population structure and phylogeography ...

10

movements in response to changes in sea surface temperatures (Bailleul et al. 2012). Thus, the

potential for further behavioural adaptations for seasonal movements, at least for migration

phenology, is present in belugas.

1.3 Thesis outline

It has been over 20 years since beluga population genetics have been investigated over

the entire Canadian distribution (Brown Gladden et al. 1997, 1999). Since that time, targeted

studies have been completed to address conservation and management issues at smaller, regional

scales (e.g. de March et al. 2002, de March and Postma 2003, Turgeon et al. 2012, Postma et al.

2012, Colbeck et al. 2013). These studies employed incremental increases in sample sizes;

however, gaps in robust sample sizes and locations remained and, except for Colbeck et al.

(2013), genetic markers were only used to examine broad population and stock level differences.

Predicting how belugas, from individuals to populations, will respond to the cumulative effects

of climate change and human activities is difficult when gaps in data may obscure the variability

of potential responses among different sub-populations (Laidre et al. 2015).

This thesis examines patterns of molecular genetic diversity among belugas in Canadian

waters at a finer scale to provide new information and approaches relevant to ongoing

conservation and management efforts. Throughout the thesis, I take advantage of the

accumulation of samples from beluga research and monitoring programs over the last 25 years

that have contributed to increased sample sizes and new seasonal collections. The objectives of

each chapter of the thesis also employ advances in molecular genetics technologies to gather

larger amounts of genetic information and evolving approaches for the analysis of genetics data.

Page 29: Genetic diversity, population structure and phylogeography ...

11

Chapter 1 introduces the importance of belugas as a species and a resource in Canada, as

well as how conservation and management objectives in Canada are defined. Among Arctic

marine mammals, belugas have been assessed as being moderately sensitive to climate effects

(Laidre et al. 2008), but are in the top 10% of cetacean species predicted to be most affected by

climate-driven changes (Alter et al. 2010). To better understand the sensitivity of belugas to

disturbance and changes in their environment, Chapter 1 also provides some background on the

adaptations belugas display for life in an Arctic environment. Detailed knowledge of genetic

variation and patterns may help to understand this type of adaptive evolution across different

timescales and the potential for change in response to climate effects (Brodersen et al. 2014,

Mathiesen et al. 2017). Any improvement in the knowledge of the variability in movements,

habitat use, behavioural plasticity, and genetic traits that contribute to resilience will be

important elements of beluga management plans (Laidre et al. 2015).

Climate change effects are anticipated to be regionally specific, variable at different

locations, vary in time and space, and be compounded by multiple stressors, and impacts will

vary among species and populations within species (Anisimov et al. 2007, Laidre et al. 2015).

Thus, an important part of the process for defining conservation goals for belugas is the

identification of units for conservation and management at the smallest reasonable scale across

the full extent of the historical range (Reeves and Mitchell 1989, IWC 2000, Waples and Naish

2009). In Chapter 2, therefore, I re-define genetic units for belugas in Canadian waters using

increased sample sizes, greater coverage of the Canadian distribution, and a larger portion of

mitochondrial DNA (mtDNA) control region sequence. The use of a larger number of samples

and more mtDNA sequence information is predicted to identify a greater number of haplotypes

that might provide further divisions of existing stock units. These results could directly impact

Page 30: Genetic diversity, population structure and phylogeography ...

12

the future definition of Designatable Units (DUs) of Canadian belugas for COSEWIC (2016),

and will contribute to further considerations of conservation definitions and fisheries

management stocks of belugas. The spatial differentiation of these stocks and degree of overlap

provide a basis for hypotheses about contemporary seasonal habitat use and migration patterns.

Chapter 2 also uses phylogeographic analyses and coalescent approaches to make inferences

about historical patterns of beluga range expansions, timing of haplotype divergence, and post-

glacial colonization routes (e.g. Provan and Bennett 2008, Hickerson et al. 2009). Insight into the

genetic resiliency of belugas during past responses to climate changes in their environment may

help to inform predictions for future impacts.

For belugas, social learning among close kin is considered to be important for the

transmission of seasonal, philopatric patterns of habitat use (Colbeck et al. 2013). As a result of

this behaviour, maternal lineages are highly structured among different areas of summer

aggregations (Brown Gladden et al. 1997, de March and Postma 2003, this thesis Chapter 2). In

Chapter 3, I explore patterns of relatedness within one of the largest beluga stocks in Canada, in

the eastern Beaufort Sea (EBS), to investigate possible fine-scale social structure. Whales present

in the summer form aggregations along nearshore areas over a fairly broad spatial distribution

(Harwood et al. 2014). Previous genetic analyses of this stock suggested microgeographic

structure among locations where whales are hunted for aboriginal subsistence purposes (Brown

Gladden et al. 1997). In Chapter 3, I use microsatellite analysis to evaluate patterns of spatial and

temporal relatedness among samples of harvested belugas, and network and clustering analysis

to test for structure. I hypothesize that fine-scale genetic structure will be detected in the large

EBS stock using comparisons at the individual level. The local geographic structure of mtDNA

haplotypes among harvest samples is also re-assessed using significantly more samples and a

Page 31: Genetic diversity, population structure and phylogeography ...

13

larger portion of control region sequence. From this information, possible impacts of human

disturbance and hunting on potential genetic sub-units are assessed.

The technological power to characterize an increasing amount of genetic variation, both

neutral and adaptive, in natural populations has changed dramatically in the last five decades

(Allendorf 2017). As costs increasingly become more affordable to analyze larger numbers of

samples, genomic data are being used to address a broad range of conservation problems such as

delimiting conservation units, assessing past and present connectivity, and assessing adaptive

potential (Corlett 2017). Currently, population genomic resources for belugas have not been fully

developed. Given the utility of mitochondrial DNA for addressing some conservation and

management goals (see Chapters 2 and 3), I develop protocols to sequence complete beluga

mitogenomes using next generation sequencing for Canadian range-wide comparisons (Chapter

4). The utility of this approach to improve the identification of conservation and management

units and to refine phylogenetic patterns is examined. I hypothesize that whole mitogenome

sequences will offer more power to investigate population diversity, structure, and signals of

selection.

A summary of general results and conclusions are presented in Chapter 5, as well as

additional suggestions for further studies.

Page 32: Genetic diversity, population structure and phylogeography ...

14

1.4 References

Alerstam, T., Hedenström, A. and Åkesson, S. 2003. Long‐distance migration: evolution and

determinants. Oikos 103: 247-260.

Allendorf, F.W. 2017. Genetics and the conservation of natural populations: allozymes to

genomes. Molecular Ecology 26: 420-430.

Alter, S.E., Simmonds, M.P. and Brandon, J.R. 2010. Forecasting the consequences of climate-

driven shifts in human behavior on cetaceans. Marine Policy 34: 943-954.

Anisimov, O.A., D.G. Vaughan, T.V. Callaghan, C. Furgal, H. Marchant, T.D. Prose, H.

Vilhjálmsson and J.E. Walsh. 2007. Polar regions (Arctic and Antarctic). In M.L. Parry, O.F.

Canziani, J.P. Palutikof, P.J. van der Linden and C.E. Hanson (Eds.), Climate Change 2007:

Impacts, Adaptation and Vulnerability. Contribution of Working Group II to the Fourth

Assessment Report of the Intergovernmental Panel on Climate Change, Cambridge University

Press, Cambridge, pp.653-685.

Avgar, T., Street, G. and Fryxell, J.M. 2013. On the adaptive benefits of mammal migration.

Canadian Journal of Zoology 92: 481-490.

Bailleul, F., Lesage, V., Power, M., Doidge, D.W. and Hammill, M.O. 2012. Migration

phenology of beluga whales in a changing Arctic. Climate Research 53: 169-178.

Barua, M., Root-Bernstein, M., Ladle, R.J. and Jepson, P. 2011. Defining flagship uses is critical

for flagship selection: a critique of the IUCN climate change flagship fleet. AMBIO: A Journal

of the Human Environment 40: 431-435.

Blix, A.S. 2016. Adaptations to polar life in mammals and birds. Journal of Experimental

Biology 219: 1093-1105.

Bonin, A., Nicole, F., Pompanon, F., Miaud, C. and Taberlet, P. 2007. Population adaptive

index: a new method to help measure intraspecific genetic diversity and prioritize populations for

conservation. Conservation Biology 21: 697-708.

Breton, M. and Smith, T.G. 1990. Underwater world: the beluga. Fisheries and Oceans Canada.

Brodersen, J. and Seehausen, O. 2014. Why evolutionary biologists should get seriously

involved in ecological monitoring and applied biodiversity assessment programs. Evolutionary

Applications 7: 968-983.

Page 33: Genetic diversity, population structure and phylogeography ...

15

Cantor, M. and Whitehead, H. 2013. The interplay between social networks and culture:

theoretically and among whales and dolphins. Proceedings of the Royal Society of London B:

Biological Sciences 368: 20120340.

Cardinale, B.J., Duffy, J.E., Gonzalez, A., Hooper, D.U., Perrings, C., Venail, P., Narwani, A.,

Mace, G.M., Tilman, D., Wardle, D.A. and Kinzig, A.P. 2012. Biodiversity loss and its impact

on humanity. Nature 486: 59-67.

Caro, T., Darwin, J., Forrester, T., LeDoux-Bloom, C., and Wells, C. 2011. Conservation in the

Anthropocene. Conservation Biology, 26: 185-188.

Coates, D.J. and Atkins, K.A. 2001. Priority setting and the conservation of Western Australia's

diverse and highly endemic flora. Biological Conservation 97: 251-263.

Colbeck, G.J., Duchesne, P., Postma, L.D., Lesage, V., Hammill, M.O. and Turgeon, J. 2013.

Groups of related belugas (Delphinapterus leucas) travel together during their seasonal

migrations in and around Hudson Bay. Proceedings of the Royal Society of London B:

Biological Sciences 280: 20122552.

Corell, R.W. 2006. Challenges of climate change: an Arctic perspective. Ambio 35: 144-152.

Corlett, R.T. 2017. A bigger toolbox: biotechnology in biodiversity conservation. Trends in

Biotechnology 35: 55-65.

COSEWIC. 2004. COSEWIC assessment and update status report on the beluga whale

Delphinapterus leucas in Canada. Committee on the Status of Endangered Wildlife in Canada.

Ottawa. ix + 70 p.

COSEWIC. 2012. Definitions and abbreviations. [Online:

http://www.cosewic.gc.ca/eng/sct2/sct2_6_e.cfm].

COSEWIC 2014. COSEWIC assessment and update status report on the beluga whale (St

Lawrence Estuary population) Delphinapterus leucas in Canada. Committee on the Status of

Endangered Wildlife in Canada. Ottawa

COSEWIC. 2015. Committee on the Status of Endangered Wildlife in Canada

Assessment Process, Categories and Guidelines.

COSEWIC. 2016. Designatable Units for Beluga Whales (Delphinapterus leucas) in Canada.

Committee on the Status of Endangered Wildlife in Canada. Ottawa. 73 pp.

Page 34: Genetic diversity, population structure and phylogeography ...

16

Darwin, C.R. 1859. The Origin of the Species. J. Murray, London.

Doak, D.F., Bakker, V.J., Goldstein, B.E. and Hale, B. 2014. What is the future of conservation?

Trends in Ecology and Evolution 29: 77-81.

Dressler, W.H., Berkes, F. and Mathias, J. 2001. Beluga hunters in a mixed economy: managing

the impacts of nature-based tourism in the Canadian western Arctic. Polar Record 37: 35-48.

Ehrenfeld, D.W. 1976. The conservation of non-resources: conservation cannot rely solely on

economic and ecological justifications. There is a more reliable criterion of the value of species

and communities. American Scientist 64: 648-656.

Ellegren, H. and B.C. Sheldon. 2008. Genetic basis of fitness differences in natural populations.

Nature 452: 169-175.

Esteban, R., Verborgh, P., Gauffier, P., Giménez, J., Martín, V., Pérez-Gil, M., Tejedor, M.,

Almunia, J., Jepson, P.D., García-Tíscar, S. and Barrett-Lennard, L.G. 2016. Using a multi-

disciplinary approach to identify a critically endangered killer whale management unit.

Ecological Indicators 66: 291-300.

Frankham, R., 2003. Genetics and conservation biology. Comptes Rendus Biologies 326: 22-29.

Franz, M. and Nunn, C.L. 2009. Rapid evolution of social learning. Journal of Evolutionary

Biology 22: 1914-1922.

Frost, K.J. and Lowry, L.F. 1990. Distribution, abundance and movements of beluga whales,

Delphinapterus leucas, in coastal waters of western Alaska. Canadian Bulletin of Fisheries and

Aquatic Sciences 224: 39-57.

Gienapp, P., C. Teplitsky, J.S. Alho, J.A. Mills and J. Merila. 2008. Climate change and

evolution: disentangling environmental and genetic responses. Molecular Ecology 17: 167-178.

Gilg, O., K.M. Kovacs, J. Aars, J. Fort, G. Gauthier, D. Grémillet, R.A. Ims, H. Meltofte, J.

Moreau, E. Post, N.M. Schmidt, G. Yannic and L. Bollache. 2012. Climate change and the

ecology and evolution of Arctic vertebrates. Annals of the N.Y. Academy of Sciences 1249: 166-

190.

Hagen, J.B. 2017. Bergmann’s Rule, adaptation, and thermoregulation in Arctic animals:

Conflicting perspectives from physiology, evolutionary biology, and physical anthropology after

World War II. Journal of the History of Biology 50: 235-265.

Page 35: Genetic diversity, population structure and phylogeography ...

17

Hansen, M.M., Olivieri, I., Waller, D.M., Nielsen, E.E., and the GeM Working Group. 2012.

Monitoring adaptive genetic responses to environmental change. Molecular Ecology 21: 1311-

1329.

Harwood, L.A., Innes, S., Norton, P. and Kingsley, M.C. 1996. Distribution and abundance of

beluga whales in the Mackenzie estuary, southeast Beaufort Sea, and west Amundsen Gulf

during late July 1992. Canadian Journal of Fisheries and Aquatic Sciences 53: 2262-2273.

Heide-Jørgensen, M.P., Burt, L.M., Hansen, R.G., Nielsen, N.H., Rasmussen, M., Fossette, S.

and Stern, H. 2013. The significance of the North Water Polynya to Arctic top predators. Ambio

42: 596-610.

Hendry, A.P., Farrugia, T.J., and Kinnison, M.T. 2008. Human influences on rates of phenotypic

change in wild animal populations. Molecular Ecology 17: 20-29.

Hickerson, M.J., Carstens, B.C., Cavender-Bares, J., Crandall, K.A., Graham, C.H., Johnson,

J.B., Rissler, L., Victoriano, P.F. and Yoder, A.D. 2010. Phylogeography’s past, present, and

future: 10 years after Avise 2000. Molecular Phylogenetics and Evolution 54: 291-301.

Hobbs, R.C., Laidre, K.L., Vos, D.J., Mahoney, B.A., and Eagleton, M. 2005. Movements and

area use of belugas, Delphinapterus leucas, in a Subarctic Alaskan estuary. Arctic 58: 331-340.

Houle, D. 1992. Comparing evolvability and variability of quantitative traits. Genetics 130: 195-

204.

Huntington, H.P. 2009. A preliminary assessment of threats to arctic marine mammals and their

conservation in the coming decades. Marine Policy 33: 77-82.

Innes, S., Heide-Jørgensen, M.P., Laake, J.L., Laidre, K.L., Cleator, H.J., Richard, P. and

Stewart, R.E. 2002. Surveys of belugas and narwhals in the Canadian High Arctic in 1996.

NAMMCO Scientific Publications 4: 169-190.

Innes, S., Muir, D.C., Stewart, R.E., Heide-Jørgensen, M.P. and Dietz, R. 2002b. Stock identity

of beluga (Delphinapterus leucas) in Eastern Canada and West Greenland based on

organochlorine contaminants in their blubber. NAMMCO Scientific Publications 4: 51-68.

IUCN 2012. IUCN Red List Categories and Criteria: version 3.1. Gland, Switzerland and

Cambridge, UK: IUCN. iv + 32pp.

Page 36: Genetic diversity, population structure and phylogeography ...

18

IWC. 2000. Status of Monodontid whales: white whale. J. Cetac. Res. Manage. 2 (SUPPL.),

243–250.

Jefferson, T.A., Karkzmarski, L., Laidre, K., O’Corry-Crowe, G., Reeves, R., Rojas-Bracho, L.,

Secchi, E., Slooten, E., Smith, B.D., Wang, J.Y. & Zhou, K. 2012. Delphinapterus leucas. The

IUCN Red List of Threatened Species 2012: e.T6335A17690692.

http://dx.doi.org/10.2305/IUCN.UK.2012.RLTS.T6335A17690692.en

Kareiva, P. and Marvier, M., 2012. What is conservation science? BioScience 62:962-969.

Kavanaugh, A.S, Noad, M.J., Blomberg, S.P., Goldizen, A.W., Kniest, E., Cato, D.H., and

Dunlop, R.A. 2017. Factors driving the variability in diving and movement behavior of

migrating humpback whales (Megaptera novaengliae): implications for anthropogenic

disturbance studies. Marine Mammal Science 33: 413-439.

Laidre, K.L. and Heide-Jørgensen, M.P. 2005. Arctic sea ice trends and narwhal vulnerability.

Biological Conservation 121: 509-517.

Laidre, K.L., I. Stirling, L.F. Lowry, Ø. Wiig, M.P. Heide-Jørgensen and S.F. Ferguson. 2008.

Quantifying the sensitivity of Arctic marine mammals to climate-induced habitat change.

Ecological Applications 18 (Supplement): S97-S125.

Laidre, K.L., Stern, H., Kovacs, K.M., Lowry, L., Moore, S.E., Regehr, E.V., Ferguson, S.H.,

Wiig, Ø., Boveng, P., Angliss, R.P. and Born, E.W. 2015. Arctic marine mammal population

status, sea ice habitat loss, and conservation recommendations for the 21st century. Conservation

Biology 29: 724-737.

Lande, R. 2009. Adaptation to an extraordinary environment by evolution of phenotypic

plasticity and genetic assimilation. Journal of Evolutionary Biology 22: 1435-1446.

Lesage, V. and Kingsley, M.C.S. 1998. Updated status of the St. Lawrence River population of

beluga, Delphinapterus leucas. The Canadian Field-Naturalist 11: 98-114.

MacDonald, G.M. 2010. Global warming and the Arctic: a new world beyond the reach of the

Grinnellian niche? Journal of Experimental Biology 213: 855-861.

Margules, C. and Usher, M.B. 1981. Criteria used in assessing wildlife conservation potential: a

review. Biological Conservation 21: 79-109.

Page 37: Genetic diversity, population structure and phylogeography ...

19

Mathiesen, S.S., Thyrring, J., Hemmer‐Hansen, J., Berge, J., Sukhotin, A., Leopold, P., Bekaert,

M., Sejr, M.K. and Nielsen, E.E. 2017. Genetic diversity and connectivity within Mytilus spp. in

the subarctic and Arctic. Evolutionary Applications 10: 39-55.

Marx, F.G. and Uhen, M.D. 2010. Climate, critters, and cetaceans: Cenozoic drivers of the

evolution of modern whales. Science 327: 993-996.

Moline, M., Karnovsky,N.J., Brown, Z., Divoky, G.J., Frazer,T.K., Jacoby, C.A., Torres, J.J.,

and Fraser, W.R. 2008. High latitude changes in ice dynamics and their impact on polar marine

ecosystems. Annals of the NewYork Academy of Science 1134: 267-319.

Möller, L.M. 2012. Sociogenetic structure, kin associations and bonding in delphinids.

Molecular Ecology 21: 745-764.

Noren, S.R. and Suydam, R. 2016. Navigating under sea ice promotes rapid maturation of diving

physiology and performance in beluga whales. Journal of Experimental Biology 219: 2828-2836.

O’Corry-Crowe, G. 2008. Climate change and the molecular ecology of Arctic marine mammals.

Ecological Adaptations 18 (Supplement): S56-S76.

O’Corry-Crowe, G. 2009. Beluga whale, Delphinapterus leucas. In W.F. Perrin, B. Würsig and

J.G.M. Thewissen (Eds.), Encyclopedia of Marine Mammals, Elsevier Academic Press,

Burlingtom, MA, USA.

O'Corry-Crowe, G., Lucey, W., Archer, F.I. and Mahoney, B. 2015. The genetic ecology and

population origins of the beluga whales, Delphinapterus leucas, of Yakutat Bay. Marine

Fisheries Review 77: 47-58.

Orams, M.B. 2000. Tourists getting close to whales, is it what whale-watching is all about?

Tourism Management 21: 561-569.

Orr, H.A. 2005. The genetic theory of adaptation: a brief history. Nature Reviews Genetics 6:

119-127.

Orr, H. Allen. 2009. Fitness and its role in evolutionary genetics. Nature Reviews Genetics 10:

531-539.

Post, E., Forchhammer, M.C., Bret-Harte, M.S., Callaghan, T.V., Christensen, T.R., Elberling,

B., Fox, A.D., Gilg, O., Hik, D.S., Høye, T.T. and Ims, R.A. 2009. Ecological dynamics across

the Arctic associated with recent climate change. Science 325: 1355-1358.

Page 38: Genetic diversity, population structure and phylogeography ...

20

Postma, L.D., Petersen, S.D., Turgeon, J., Hammill, M.O., Lesage, V., and Doniol-Valcroze, T.

2012. Belugas in James Bay: a separate entity from Eastern Hudson Bay? Canadian Science

Advisory Secretariat Research Document 2012/074.

Provan, J. and Bennett, K.D. 2008. Phylogeographic insights into cryptic glacial refugia. Trends

in Ecology and Evolution, 23: 564-571.

Rands, M.R., Adams, W.M., Bennun, L., Butchart, S.H., Clements, A., Coomes, D., Entwistle,

A., Hodge, I., Kapos, V., Scharlemann, J.P. and Sutherland, W.J. 2010. Biodiversity

conservation: challenges beyond 2010. Science 329: 1298-1303.

Reeves, R.R. and Mitchell, E. 1989. Status of white whales, Delphinapterus leucas, in Ungava

Bay and eastern Hudson Bay. Canadian Field-Naturalist 103: 220-239.

Reyers, B., Polasky, S., Tallis, H., Mooney, H.A. and Larigauderie, A. 2012. Finding common

ground for biodiversity and ecosystem services. BioScience 62:503-507.

Richard, P.R. and Pike, D.G., 1993. Small whale co-management in the eastern Canadian Arctic:

a case history and analysis. Arctic 46: 138-143.

Richard, P.R., 2005. An estimate of the Western Hudson Bay beluga population size in 2004.

Canadian Science Advisory Secretariat Research Document 2005/017.

Robards, M.D. and Reeves, R.R. 2011. The global extent and character of marine mammal

consumption by humans: 1970–2009. Biological Conservation 144: 2770-2786.

Schindler, D. 2011. The boiling point. Outdoor Canada, May 2011: 71-76.

Sexton, J.P., McIntyre, P.J., Angert, A.L. and Rice, K.J. 2009. Evolution and ecology of species

range limits. Annual Review of Ecology, Evolution, and Systematics 40: 415-436.

Silk, J.B. 2007a. Social components of fitness in primate groups. Science 317: 1347-1351.

Silk, J.B. 2007b. The adaptive value of sociality in mammalian groups. Philosophical

Transactions of the Royal Society B: Biological Sciences 362: 539-559.

Smith, T.G., St. Aubin, D.J., and Geraci, J.R. 1990. Research on beluga whales, Delphinapterus

leucas: introduction and overview. Canadian Bulletin of Fisheries and Aquatic Sciences 224: 1-

6.

Page 39: Genetic diversity, population structure and phylogeography ...

21

Smol, J.P. 2012. A planet in flux: how is life on Earth reacting to climate change? Nature 483:

S12-S15.

Soulé, M.E. 1985. What is conservation biology? BioScience 35: 727-734.

Steeman, M.E., Hebsgaard, M.B., Fordyce, R.E., Ho, S.Y., Rabosky, D.L., Nielsen, R., Rahbek,

C., Glenner, H., Sørensen, M.V. and Willerslev, E. 2009. Radiation of extant cetaceans driven by

restructuring of the oceans. Systematic Biology 58: 573-585.

Stern, D.L. and Orgogozo, V. 2009. Is genetic evolution predictable? Science 323: 746-751.

Stewart, R.E. 2008. Redefining walrus stocks in Canada. Arctic 61: 292-308.

Stewart, E.J., Draper, D. and Johnston, M.E. 2005. A review of tourism research in the polar

regions. Arctic: 383-394.

Stewart, E.J. and Draper, D. 2007. A collaborative approach to understanding local stakeholder

perceptions of tourism in Churchill, Manitoba (Canada). Polar Geography 30: 7-35.

Tear, T.H., Kareiva, P., Angermeier, P.L., Comer, P., Czech, B., Kautz, R., Landon, L.,

Mehlman, D., Murphy, K., Ruckelshaus, M. and Scott, J.M. 2005. How much is enough? The

recurrent problem of setting measurable objectives in conservation. BioScience 55: 835-849.

Thewissen, J.G., Williams, E.M., Roe, L.J. and Hussain, S.T. 2001. Skeletons of terrestrial

cetaceans and the relationship of whales to artiodactyls. Nature 413: 277-281.

Tyrrell, M., 2007. Sentient beings and wildlife resources: Inuit, beluga whales and management

regimes in the Canadian Arctic. Human Ecology 35:575-586.

Ungerer, M.C., Johnson, L.C., and Herman, M.A. 2008. Ecological genomics: understanding

gene and genome function in the natural environment. Heredity 100: 178-183.

Vander Wal, E., Festa-Bianchet, M., Réale, D., Coltman, D.W. and Pelletier, F. 2015. Sex‐based

differences in the adaptive value of social behavior contrasted against morphology and

environment. Ecology 96: 631-641.

Vélez-Juarbe, J. and Pyenson, N.D. 2012. Bohaskaia monodontoides, a new monodontid

(Cetacea, Odontoceti, Delphinoidea) from the Pliocene of the western North Atlantic Ocean.

Journal of Vertebrate Paleontology 32: 476-484.

Page 40: Genetic diversity, population structure and phylogeography ...

22

Waples, R.S., Punt, A.E. and Cope, J.M. 2008. Integrating genetic data into management of

marine resources: how can we do it better? Fish and Fisheries 9: 423-449.

Waples, R.S. and Naish, K.A., 2009. Genetic and evolutionary considerations in fishery

management: research needs for the future. In R.J. Beamish and B.J. Rothschild (Eds.), The

Future of Fisheries Science in North America, Springer, Netherlands, pp. 427-451.

Whitehead, H., Rendell, L., Osborne, R.W. and Würsig, B. 2004. Culture and conservation of

non-humans with reference to whales and dolphins: review and new directions. Biological

Conservation 120: 427-437.

Whitmore, F.C. 1994. Neogene climatic change and the emergence of the modern whale fauna of

the North Atlantic Ocean. Proceedings of the San Diego Society of Natural History 29: 223-227.

Wilcove, D.S. and Wikelski, M. 2008. Going, going, gone: is animal migration disappearing.

PLoS Biology 6: p.e188.

Willi, Y., Van Buskirk, J. and Hoffmann, A.A. 2006. Limits to the adaptive potential of small

populations. Annual Review of Ecology, Evolution and Systematics 37: 433-458.

Page 41: Genetic diversity, population structure and phylogeography ...

23

Chapter 2: Mitochondrial DNA sequence diversity, population structure, and

phylogeography of belugas (Delphinapterus leucas) in Canadian waters

Abstract

The phylogeographic diversity among belugas (Delphinapterus leucas) with seasonal

distributions in Canadian waters can offer important evolutionary and ecological insights for

defining conservation goals for these whales. The geographic patterns of beluga migrations and

summer aggregations can be characterized with molecular genetic markers to examine

biodiversity across the Canadian range, define units for conservation and management

assessments, and provide insights about the historical use of climate refugia and recolonizations

of post-glacial habitats. In this study, I used 609bp of control region mitochondrial DNA

(mtDNA) sequence from 2500 Canadian beluga samples to examine geographic genetic diversity

and evolutionary patterns among 83 haplotypes. The fine-scale resolution of haplotypes and the

addition of new and larger sample collections resulted in the identification of at least nine

genetically distinct summer aggregations, with an additional different sample collection of

unknown summer distribution in southern Hudson Bay and/or James Bay. The patterns of

haplotypes unique to particular sample collections indicate a larger number of non-migratory

beluga populations in Canada than previously known. This result further highlights the

importance of polynyas as winter habitat around the Belcher Islands in SE Hudson Bay and

James Bay. The evolutionary relationships among the haplotypes reveal the most divergent

lineages are found at the east, west and southern edges of the Canadian distribution of belugas,

with the central area characteristic of a contact zone displaying an admixture of lineages.

Furthermore, the present geographic distribution of these lineages suggests multiple glacial

Page 42: Genetic diversity, population structure and phylogeography ...

24

refugia as sources of ancestral beluga populations for recolonization of Canadian Arctic and sub-

Arctic waters. Analyses of ancient DNA from historical beluga samples and fossils would

enhance the interpretation of modern day patterns of mtDNA genetic diversity.

2.1 Introduction

Understanding the intraspecific genetic relationships of animals that are highly mobile

and distributed across large ranges is challenging. Over large areas, geographic barriers (such as

rivers, e.g. Eriksson et al. 2004; mountains and valleys, e.g. Zhou et al. 2017; and peninsulas,

e.g. Pelletier et al. 2012) can influence spatial structuring of genetic diversity by restricting gene

flow. Similarly, anthropogenic landscape barriers (such as roads, e.g. Wilson et al. 2015;

fences/barriers, e.g. Bracken et al. 2015; urbanization, e.g. Breyne et al. 2014; and agricultural

development, e.g. Stronen et al. 2012 ) can also shape population genetic substructure,

potentially within a few generations (Wilson et al. 2015).

However, in environments where there are no obvious geographic barriers to dispersal,

population genetic structuring can still occur due to a variety of other factors (Irwin 2002).

Isolation by distance (IBD) (Wright 1943, Malécot 1967, Kimura and Weiss 1964) will cause

patterns of genetic variability due to individual migration distances and mating patterns being

smaller than the species distribution as a whole (e.g. Spice et al. 2012). Thus, with IBD, both

ecological and genetic nuances can shape population structure at different scales (Ishida 2009).

For example, even though seabirds are highly mobile animals with few limitations to dispersal,

genetic differentiation of populations and colonies can be strong and is commonly linked to

selective or behavioural processes (Friesen et al. 2007). Even within the same habitat and

geographic scale, two different seabird species can have sharply contrasting levels of genetic

Page 43: Genetic diversity, population structure and phylogeography ...

25

differentiation among colonies due to behavioural differences (i.e. natal and breeding philopatry)

(Levin and Parker 2012). Evaluating the elements that lead to genetic population differentiation

can be even more difficult when both geographic barriers and behavioural processes are at play.

In one study of a tropical bird species (El Oro parakeet, Pyrrhura orcesi), fine-scale genetic

structure was considered to be influenced by a combination of the loss of suitable habitat at

lower elevations due to climate change, human deforestation practices, and a complex social

breeding system that all contributed to limited dispersal (Klauke et al. 2016).

The dispersal potential for marine mammals in many ways parallels that of birds. These

highly mobile animals are often characterized by broad, continuous distributions with few

physical or geographic barriers to movements and gene flow that may facilitate genetically

homogenous populations. For example, despite extensive spatial and temporal sampling, current

genetic analyses of North East Atlantic minke whales (Balaenoptera acutorostrata

acutorostrata) fail to provide any support for geographic or cryptic population genetic structure

(Quintela et al. 2014). However, many marine mammal species do display a high level of

subpopulation differentiation that is shaped by a variety of factors and at different scales.

Worldwide patterns of genetic structure among sperm whales (Physeter macrocephalus) within

and among oceanic basins are thought to result from recent population expansions, breeding

behaviours, female social groups, and geographic philopatry (Alexander et al. 2016). Atlantic

spotted dolphins (Stenella frontalis) show higher-level genetic differences between oceanic and

shelf morphotypes, but also habitat related genetic structure within the shelf lineage that is linked

to depth and sea-surface temperature (Viricel and Rosel 2014). Coastal and pelagic bottlenose

dolphins (Tursiops truncatus) in the North East Atlantic also have a fine-scale population

structure that is maintained by habitat preference (Louis et al. 2014). In long-finned pilot whales

Page 44: Genetic diversity, population structure and phylogeography ...

26

(Globicephala melas) from the North Atlantic, social structure due to sex-biased dispersal and

female philopatry, along with possible foraging specializations, are thought to contribute to

regional divergence and genetic population structure (Monteiro et al. 2015). Prey specialization

is also considered to shape genetic differentiation among populations of killer whales (Orcinus

orca), along with dispersal and mating strategies, kin associations, and maternal social structure

(e.g. Hoelzel et al. 2007, Pilot et al. 2010, Parsons et al. 2013).

Belugas (Delphinapterus leucas) are primarily an Arctic species of whale with a large,

discontinuous circumpolar distribution (Figure 1.1; Stewart and Stewart 1989) and a variety of

behavioural and ecological characteristics that could shape genetic differentiation among

populations and subpopulations. For example, over the species distribution, the whales display a

variety of movement patterns that are closely linked to sea-ice and sea-surface temperatures (e.g.

Bailleul et al. 2012, Hauser et al. 2014, Hornby et al. 2016, Hauser et al. 2016). Some beluga

populations have annual patterns of summer and winter area site fidelity resulting in migrations

that may cover thousands of kilometres for some beluga populations (Richard et al. 2001, Hauser

et al. 2014). Conversely, annual movements may only involve relatively small shifts away from

land-fast ice for other, generally isolated, populations (Hobbs et al. 2005, Lefebvre et al. 2012).

Although ice has the potential to limit movements of belugas and even cause death due to ice

entrapments (Heide-Jørgensen et al. 2002), this species, especially adult males, has been shown

to move into areas of dense (>90%) ice cover (Richard et al. 2001, Suydam et al. 2001) to pursue

foraging opportunities (Loseto et al. 2006, 2008).

Evidence from reproductive tract metrics indicates that belugas have a promiscuous

mating system (Kelley et al. 2015) and breeding is thought to occur in late winter to early spring

while whales are in or near wintering areas where several subpopulations, or stocks, may overlap

Page 45: Genetic diversity, population structure and phylogeography ...

27

(O’Corry-Crowe 2009, Citta et al. 2017). This behaviour can cause gene flow among stocks and

contribute to a decrease in genetic heterogeneity. Conversely, several other behaviours exhibited

by belugas are thought to promote structuring within populations. Predictable summer

distributions of belugas are generally associated with bays and shallow estuaries (e.g. Richard et

al. 2001, Martin et al. 2001, Gosselin et al. 2002) that have warmer, brackish waters and rocky

bottom substrate thought to facilitate an annual moult (St. Aubin et al. 1990, Boily 1995).

Females with calves and juveniles stay near these shallow coastal areas during the summer,

perhaps for energetic benefits and/or protection from predators such as killer whales (Loseto et

al. 2006). However, males generally disperse offshore to forage in areas of higher productivity

(Suydam et al. 2001, Richard et al. 2001, Loseto et al. 2006). Though Arctic cod (Boreogadus

saida) are a primary prey species for belugas, diet and foraging ecology varies among

populations and stocks with respect to geographic areas and habitats, season, migration patterns

and duration, and range extent (Loseto et al. 2006 , Kelley et al. 2010, Marcoux et al. 2012,

Quakenbush et al. 2015). Similar to other odontocetes, beluga females spend an extended period

(up to 3 years) nursing their calves, with the age of weaning variable among individuals

(Matthews and Ferguson 2015). Maternal investment in nursing and weaning calves is thought to

provide the foundation of beluga social organization, based on learning behaviours related to

communication, foraging and philopatric migration patterns (Brodie 1969, Tyack 1986, Colbeck

et al. 2013, Krasnova et al. 2014).

Other challenges for understanding contemporary genetic structure in populations arise

from the need to interpret the influences of current demographic processes along with

evolutionary and historical signals. Climate fluctuations play a significant role in shaping the

genetic diversity and geographic distribution of both species and genealogical lineages by

Page 46: Genetic diversity, population structure and phylogeography ...

28

influencing fragmentation, range expansions and long range colonizations (Friesen et al. 2007).

Historical patterns of glaciation forced species into refugia that, in turn, shaped genetic

signatures of post-glacial movements (Hewitt 1996, 2000, 2004). Specifically, intraspecific

genetic diversity is expected to be high within the refugial areas, followed by declining diversity

away from the refugia due to successive founder events along recolonization routes as ice

recedes (Hewitt 1996, 2000, Slatkin and Excoffier 2012). However, genetic signatures of glacial

refugia may be difficult to detect (Provan and Bennet 2008). Gavin et al. (2014) assert the need

to use a combination of fossil records, species distribution models, and molecular

phylogeographic approaches to investigate the existence of past refugia and to understand their

influence on the current and future “spatiotemporal trajectories of species and populations”.

However, ice-age marine refugia are particularly difficult to identify because of sea level

changes altering coastlines and a lack of a fossil record for many marine species (Provan and

Bennett 2008, Maggs et al. 2008). In this case, phylogeography, the spatial and temporal

distribution of genetic lineages (Avise et al. 1987, 2009), may be the best approach, especially

for within-species and among-population questions about historical migration patterns and the

link between evolutionary and demographic processes (Gavin et al. 2014).

Information about genetic structure and phylogeographic patterns are important elements

for the identification of conservation and management units for wildlife. The Committee on the

Status of Endangered Wildlife in Canada (COSEWIC) recently re-examined belugas in Canada

under the criteria to define Designatable Units (DUs) that are “discrete and evolutionary

significant units of the taxonomic species, where significant means that the unit is important to

the evolutionary legacy of the species as a whole and if lost would likely not be replaced through

natural dispersion” (COSEWIC 2012, 2016). One of the lines of evidence required to identify

Page 47: Genetic diversity, population structure and phylogeography ...

29

discreteness and one for evolutionary significance both rely on genetic information from neutral,

relatively slow-evolving genetic markers (COSEWIC 2016).

Mitochondrial DNA (mtDNA) sequence has proven to be a useful tool for the study of

many aspects of beluga population structure, phylogeography, dispersal patterns, and social

structure (see references in Appendix 1). The dominant findings of these studies reveal that

migratory belugas are highly philopatric to relatively discrete summering areas, and these groups

of animals are representative of distinctive maternal genetic lineages (Brennin et al. 1997, Brown

Gladden et al. 1997, O’Corry-Crowe et al. 1997, de March and Postma 2003, Turgeon et al.

2012). Furthermore, phylogeographic relationships among the haplotypes identified patterns of

postglacial recolonizations of North American waters from an eastern or Atlantic refugium and a

western or Pacific refugium (Brennin et al. 1997, Brown-Gladden et al. 1997, de March and

Postma 2003). However, at a finer scale, some phylogenetic and population structure patterns,

especially in Canadian waters, remain unclear (de March and Postma 2003, Turgeon et al. 2012).

In all studies, a small number (N=2 or 3) of dominant haplotypes were found in the majority of

samples (>50%) (Appendix 1). This result may be due to the relatively small portion of

mitochondrial control region sequence analyzed (234 base pairs, bp). Louis et al. (2014) found

that longer (682bp vs. 324bp) mtDNA control region sequences were necessary for the resolution

of bottlenose dolphin ecotypes, and recommended that more extended sequences be used for

investigations of fine-scale genetic structure in delphinids. In addition, samples sizes, especially

in certain areas, may not be sufficient to reveal more complex patterns due to “signal:noise ratio

problems” (Waples 1998). In species with high gene flow, such as belugas, the genetic signal

used to indicate stock structure may be relatively weak, and the “noise” due to biases in the

samples may have a greater impact. One approach to mitigate this effect is to increase sample

Page 48: Genetic diversity, population structure and phylogeography ...

30

size and replicate samples over time (Waples 1998). Furthermore, some potentially genetically

distinct aggregations of belugas have not been sampled at all (de March and Postma 2003,

Turgeon et al. 2012). All of these factors have been implicated as a cause for some of the

confusing or inconsistent results for geographic comparisons in these studies, and are likely

contributing to an “insufficient phylogenetic signal” (Palsböll et al. 2002) for mtDNA analyses

of Canadian beluga samples.

In this study, mitochondrial DNA haplotype diversity is assessed across the full

distribution of belugas in Canadian waters, with increased sample sizes at most locations and

samples from previously unsampled areas. The length of the control region analyzed has also

been significantly increased. With this new data, the goals of the analyses are to:

1. Refine our understanding of extant mtDNA genetic variation and population structure of

beluga whales in Canadian waters to be able to update the identification of Designatable

Units (DUs) of belugas in Canada;

2. Infer phylogeographic patterns among beluga haplotypes found in different geographic

summer aggregations and other seasonal groups that define Canadian beluga stocks and

add new information about possible historical patterns underlying present-day genetic

structure;

3. Place this information in the context of potential changes to beluga population structure

in response to changing climate and human activities.

Page 49: Genetic diversity, population structure and phylogeography ...

31

2.2 Materials and Methods

2.2.1 Data collection

Sample collection

Beluga whale samples in Canada have been collected for many years through harvest

monitoring programs, research programs such as satellite tagging studies, and opportunistic

sampling of carcases and sloughed skin. Samples in this study have been accessed from archived

tissues (primarily skin and muscle) dating back to the 1980s and held at the Freshwater Institute,

Winnipeg, Manitoba (DFO Central and Arctic). A broad range of geographic sample locations

was considered (Figure 2.1) to encompass the greatest amount of the beluga distribution in

Canadian waters as possible. The sample collections could be grouped into different categories:

1) animals sampled from community harvests occurring along the migration routes of the

belugas in the spring and fall (mainly in N Hudson Bay, Hudson Strait and Ungava Bay); 2)

whales sampled from harvests of animals while they are in annual summer aggregations,

generally late June to August; 3) samples collected from summer aggregations of whales during

satellite tag attachment for tracking programs, or opportunistically from sloughed skin; 4)

belugas sampled from humane harvests of whales unable to escape from winter ice entrapments,

December to March (considered to be ‘winter samples’); and 5) samples collected from

necropsies of beachcast belugas, mainly from the St. Lawrence River Estuary. (See Appendix 2.2

for detailed information on samples for main geographic sample collection comparisons).

Page 50: Genetic diversity, population structure and phylogeography ...

32

Figure 2.1. Locations of beluga sample sources for all mtDNA haplotypes (N=2540) analyzed in this study. Sample

collections for the overall analyses include three types of samples: 1) from animals in summer aggregations from

Beaufort Sea (BeS), Grise Fiord (EHA, E High Arctic), Cunningham Inlet (CHA, Central High Arctic), Foxe Basin

(FB), Cumberland Sound (Csd), W Hudson Bay (WHB, 4 locations indicated in green), E Hudson Bay (EHB, 8

locations indicated in purple), Belcher Is. harvests (SQH, Sanikiluaq harvest), James Bay (JB), Long Island (LI), and

St. Lawrence Estuary (SLE); 2) Winter ice-entrapment samples from Belcher Is (SQE); 3) Migrating whales sampled

from N Hudson Bay (NHB, 4 locations indicated in red) and Hudson Strait (NQ, 18 locations along the N Quebec

coast, indicated in yellow).

Samples taken in the field were either frozen or preserved in a salt-saturated 20% DMSO

solution (Seutin et al. 1991) and frozen upon arrival at the lab. For all locations except the

Central High Arctic (Cunningham Inlet), tissue samples for genetic analyses were taken from

Page 51: Genetic diversity, population structure and phylogeography ...

33

dead animals, or satellite tagging programs where there was confidence that replicate sampling

of individuals did not occur. Tissue samples are a by-product during tag attachment, and all

whale handling protocols for satellite telemetry studies were approved by Fisheries and Oceans

Canada Animal Care Committee at the FWI. The Cunningham Inlet samples were collected from

sloughed beluga skin washed onto shore by high wave action. These tissues were all previously

analyzed using 15 microsatellite loci (L. Postma, unpublished data) and replicate samples of

individuals identified using GenAlEx ver. 6 (Peakall and Smouse 2006). Only skin samples from

unique individuals from this location were included in analyses for this study.

DNA extraction and sequencing

Total cellular DNA extractions were performed using proteinase K digestions followed

by a variety of separation methods including phenol:chloroform (Amos and Holzel 1992),

Qiagen spin columns (DNeasy Blood and Tissue Kits), and the Biosprint automated platform

(Qiagen Inc, Valencia, CA, USA).

Mitochondrial DNA (mtDNA) sequences (approximately 700bp) were generated using

amplification reactions designed to target a portion of the mtDNA control region. Primers Belmt-

5 (GAT AGA GTT TTT TGA GCC CG) and Belmt-6 (TCA CCA CCA ACA CCC AAA G)

were used in a polymerase chain reaction (PCR) mixture containing a 1x buffer, 25mM MgCl2,

10mM dNTP mix, 20uM of each primer, 0.5units of Taq polymerase and approximately 50-

200ng of template DNA. The PCR reactions were performed in a total volume of 50µL under the

following conditions: 95°C for10 min.; 35 cycles of 95°C for 20s, 55°C for 30s, 72°C for 45s;

extension at 72°C for 10 min. Products were visualized using agarose gel electrophoresis, and

successfully amplified samples were cleaned using QiaQuick PCR clean-up kits (Qiagen Inc.).

Page 52: Genetic diversity, population structure and phylogeography ...

34

DNA sequencing was performed using BigDye ver3.5 (Applied Biosystems) with the Belmt-6

primer as the sequencing primer (2uM). The PCR sequencing temperature profile was: 96°C for

1min.; 32 cycles of 96°C for 10s, 50°C for 30s, and 60°C for 4min; and an extension at 72°C for

7min. Sequencing was performed on an Applied Biosystems 3130xl genetic analyzer (Life

Technologies).

Sequence alignment, editing and identification of haplotypes

DNA sequences were aligned and edited using a combination of approaches (largely due

to the introduction of new software resources over time), either using the Life Technologies

software SeqScape ver. 2.5 for the 3010xl genetic analyzer, or using MEGA (ver. 3 Kumar et al.

2004, ver. 4 Tamura et al. 2007, ver. 5 Tamura et al. 2011). Individual sample sequence

haplotypes, based on 609bp of unambiguous sequence, were assigned using SeqScape ver 2.5 or

using GenAlEx ver. 6 (Peakall and Smouse 2006). All sequences were also visually checked by

eye and haplotypes confirmed. The resulting ‘library’ of unique haplotypes was verified each

time a new haplotype was found using alignment and visualization in the software package DNA

Alignment (www.fluxus-engineering.com). Samples resulting in a new haplotype were also re-

analyzed to verify the sequence. Due to the use of various versions of software and alignment

approaches, after all the data in this study had been collected, N=89 randomly chosen samples

were re-extracted and re-sequenced to estimate the error in haplotype identification.

2.2.2 Genetic variability and population structure of geographic samples

Haplotype frequency distributions and geographic patterns were evaluated for the entire

dataset (all samples from all seasons) and separately for a subset of samples collected from

Page 53: Genetic diversity, population structure and phylogeography ...

35

animals located in summer aggregations and winter ice-entrapments. It has been shown that

genetic structuring within beluga populations is influenced by matrilineal migrations patterns

(Brown Gladden et al. 1997, de March and Postma 2003, Turgeon et al. 2010) and that stock

identification is based on annual returns of animals to particular geographic summering areas

(IWC 2000, Richard 2010). Examination of the haplotypes for all samples will allow for the full

range of haplotype diversity in Canadian belugas to be investigated, while analysis of only

summer and winter samples will provide a basis for testing population structuring and

demographic patterns among discrete geographic groups of samples.

Genetic diversity and differentiation were estimated for the summer and winter

geographic sample collections which included Western Hudson Bay (N=118), Beaufort Sea

(N=301), Central High Arctic (N=54), Eastern High Arctic (N=57), Foxe Basin (N=72), James

Bay (N=28), Long Island (N=39), Eastern Hudson Bay (N=30), St. Lawrence Estuary (N=125),

Cumberland Sound (N=186), Belcher Islands harvest (N=311), and Belcher Islands winter ice

entrapments (N=56). The number of different haplotypes (NH), haplotype diversity (h),

nucleotide diversity (π), the number of segregating sites in each group and the average number of

nucleotide differences (k) within sample collections were determined using DnaSP ver. 5.1

(Librado and Rozas 2009). The presence of population structure was assessed using analysis of

molecular variance (AMOVA) between and within the sample collection groups and pairwise

ΦST comparisons using ARLEQUIN ver. 3.5.1.2 (Excoffier and Lischer 2010). AMOVA was

also performed in GenAlEx ver. 6.4 (Peakall and Smouse 2012) under a slightly different null

hypothesis. Instead of a hypothesis of ‘no difference in genetic variation among populations or

sub-populations’, the Ho is that subpopulations are part of a single randomly mating genetic

population. The FST analogue here, ΦPT, is also tested for significance in a slightly different way

Page 54: Genetic diversity, population structure and phylogeography ...

36

using randomization of the data with the observed value being included as another permutation

(Peakall and Smouse 2010).

The matrix of ΦPT pairwise differences produced in GenAlEx (Peakall and Smouse 2012)

was then used in a Principal Coordinates Analysis (PCoA) as a way to visualize the patterns of

genetic relationships among the summer and winter sample collections based on genetic

distance. Evolutionary relationships among the collections were also inferred from a Neighbour-

Joining (NJ) tree calculated in MEGA ver. 6 (Tamura et al. 2013) using an evolutionary distance

matrix (FST) calculated using DnaSP (Librado and Rozas 2009).

2.2.3 Phylogenetic analyses

Evolutionary histories among beluga sample collections were also investigated using

gene trees based on the mtDNA haplotypes, which allows for the characterization of population

subdivision through the identification of lineages with a common ancestor (Nichols 2001). For

these phylogenetic analyses, only haplotypes that occurred at a frequency of >2 in the overall

dataset were used to exclude rare haplotypes that were possible errors during haplotype

identification (see Results for details of final numbers).

Phylogenetic trees were reconstructed using Maximum Likelihood (ML) in PhyML

(Guindon et al. 2005), Bayesian Inference (BI) in MrBayes ver. 3.2 (Ronquist et al. 2011), and

Maximum Parsimony (MP) in MEGA ver. 6 (Tamura et al. 2013). The selection of the most

appropriate substitution model for the data was determined using methods contained within

PhyML and MEGA software packages, and also using jModelTest ver, 2.1.7 (Posada 2008). In

each case, the optimal model was selected based on the lowest Akaike Information Criterion

(AIC) and Bayesian Information Criterion (BIC) scores and was considered to describe the

Page 55: Genetic diversity, population structure and phylogeography ...

37

substitution pattern in the data best. This model was used for all the subsequent phylogenetic

analyses. Analyses were run without an outgroup and later rooted using an ingroup branch. This

approach was done to ensure that branching patterns within the ingroup (beluga haplotypes) were

not being obscured by the outgroup branch (Holland et al. 2003). This form of rooting the tree is

acceptable as long as the distance between the group/taxon used to root the tree is greater than

any distances within the resulting clades.

Phylogenetic analysis by Maximum Likelihood (an approach that infers an evolutionary

tree that makes the data most likely, Hall 2011) was implemented in the online version of

PhyML using a large, parallel computing platform (Guindon et al. 2010, http://www.atgc-

montpellier.fr/phyml/ ). Unique haplotype sequences for the summer and winter sample

collections (N=83) were submitted and the substitution model HKY85 (Hasegawa-Kishino-Yano

model, Hasegawa et al. 1985) was selected based on previous trial runs of the PhyML program

using the automatic model selection option, the results of MEGA analyses and the results of the

jModelTest analysis. The equilibrium frequencies and the transition/transversion ratio were both

estimated, and the proportion of invariable sites was set to 0.89 (also determined from various

model test results). The number of substitution rate categories was 4, and the gamma shape

parameter was estimated by the program. Tree searches used a BIONJ tree as a starting tree, and

the tree improvement was assessed using SPR (subtree pruning and regrafting topological

moves, Hordijk and Gascuel 2005). This method was recommended for datasets larger than 50

sequences and where a weak phylogenetic signal is expected (PhyML manual http://www.atgc-

montpellier.fr/download/papers/phyml_manual_2012.pdf). Branch support was estimated using

the approximate likelihood ratio test (“aLRT SH-like” option) (Anisimova and Gascuel 2006).

Page 56: Genetic diversity, population structure and phylogeography ...

38

A Bayesian Inference approach was also used to estimate a phylogenetic tree; however,

the tree that is produced in this analysis is the one that is most likely given the data and the

substitution model selected (Hall 2011). The Bayesian analyses were conducted using MrBayes

ver. 3.2.5 (Ronquist et al. 2012) with two independent runs each consisting of four simultaneous

chains (one “heated chain” and three “cold” chains). Given that the data were from a single non-

coding mtDNA region, the data was not partitioned, and the nucleotide model was left at the

default setting of ‘4 by 4’. Based on the previous results of jModelTest (Posada 2008) and

MEGA (Tamura et al. 2013) analyses for the optimal substitution model for the data, the NST

parameter of the likelihood model was set to ‘2’ which encompasses the K2, HKY, T3P and

TN93 models (MrBayes ver. 3.2 manual (http://mrbayes.sourceforge.net/mb3.2_manual.pdf).

Similarly, the rates parameter was set to ‘invgamma’, which indicates that a proportion of the

sites are invariable and that the rate for the remaining sites should be drawn from a gamma

distribution. This distribution is approximated using four categories, and the proportion of

invariable sites was estimated from a uniform distribution. All priors for the phylogenetic model

were left at the default parameters; therefore, no molecular clock was assumed, rate

heterogeneity was modelled by the analyses, and any combination of base frequency was given

equal prior weight. The two concurrent runs of 7,000,000 generations were sampled every 1000

generations to assess for convergence using an evaluation of the standard deviation of the split

frequencies between runs and a Potential Scale Reduction Factor (PRSF) convergence diagnostic

(MrBayes ver. 3.2 manual (http://mrbayes.sourceforge.net/mb3.2_manual.pdf). The effective

sample sizes (ESS) of parameters sampled from the MCMC analysis and the overall trace data

were also evaluated using a 25% burn-in applied by Tracer ver. 1.6 (Rambaut et al. 2014). A

Page 57: Genetic diversity, population structure and phylogeography ...

39

50% majority-rule consensus tree was then constructed to summarize the posterior probabilities

for each identified clade.

Maximum Parsimony (MP) analyses do not use the selection of a particular evolutionary

model during the estimation of trees but instead employ methods that search for trees constructed

from the minimum number of steps (representing the fewest evolutionary changes) that explain

the data (Hall 2011). Beluga haplotype MP trees were produced using MEGA ver. 6 (Tamura et

al. 2013) using the Tree-Bisection-Regrafting (TBR) algorithm for branch-swapping (Nei and

Kumar 2000). Branch lengths were calculated using the average pathway method (Nei and

Kumar 2000) and support for the nodes was determined using bootstrap analysis of 2000

replicates with 10 random-addition sequence replicates (Felsenstein 1985).

Consensus networks offer an alternative to inferring the evolutionary history of species

using gene trees because they are less prone to errors arising from factors such as stochastic

processes, sampling error, and complex biological processes (e.g. hybridization) (Holland et al.

2004). This situation is particularly the case for intraspecific data where phylogenetically

informative characters may have undergone recurrent mutation (Bandelt et al. 2000). Therefore,

an unrooted median-joining phylogenetic network using all beluga haplotypes that occurred at a

frequency of >2 in the overall dataset was constructed using Network ver. 4.6 (www.fluxus-

engineering.com). As the beluga data contained only substitutions that were 99% transitions,

characters were not weighted, and the setting was left at the default value of 10 (Network 4.6

User Guide http://www.fluxus-engineering.com/Network4600_user_guide.pdf). The initial

median-joining network was built using an epsilon (a weighted genetic distance measure) value

of 10 after exploring runs with epsilon = 0, 10 and 20 to determine the clearest result. The

“Greedy FHP” method for distance calculation (Foulds et al. 1979) was selected and so was the

Page 58: Genetic diversity, population structure and phylogeography ...

40

MP (maximum parsimony) option that identifies and eliminates unnecessary median vectors and

links. The resulting network was drawn and modified for clarity using Network Publisher ver.

2.0 (http://www.fluxus-engineering.com/nwpub.htm).

Based on the results of phylogenetic trees and network analyses, haplogroups were

identified that grouped haplotypes based on ancestral origins indicating a common ancestor

(International Society of Genetic Genealogy http://isogg.org/wiki/Genetics_Glossary). For

mitochondrial DNA, founder events and genetic drift influence the development of haplogroups,

which are often restricted to specific geographic areas (Achilli et al. 2004). Branching patterns in

phylogenetic trees and “star-like” clusters in phylogenetic networks are indicative of such

haplogroups (Richards et al. 1998, Troy et al. 2001, Avise 2009). To visualize haplogroup

patterns in the beluga dataset, the summer and winter sample collection haplotypes were

combined into the identified haplogroups, and the frequencies of each haplogroup graphed based

on the geographic origin of the samples. Divergence estimates of total raw divergence (Dxy) and

net divergence (Da) among the haplogroups were calculated using 2000 replicates for the

permutation test in DnaSP ver. 5.1 (Librado and Rozas 2009).

2.2.4 Demographic expansions and haplogroups

Demographic analyses for indications of population expansion for both summer and

winter collection samples and haplogroups were performed using several different statistical

approaches. First, Tajima’s D statistic (Tajima, 1989) and Fu’s FS test statistic (Fu 1997) were

used for assessing departures from an evolutionary model of neutral mutation. These tests are

slightly different as Tajima’s D uses information from the mutation frequency and Fu’s FS uses

information from the haplotype distribution (Ramos-Onsins and Rozas 2002). Both statistics

Page 59: Genetic diversity, population structure and phylogeography ...

41

were determined using Arlequin ver. 3.5.1.2 (Excoffier and Lischer 2010) and significance were

tested using 10,000 coalescent simulations. Significantly negative Tajima’s D tests and Fu’s FS

tests are considered to be indicative of population expansion. The R2 statistic, a third neutrality

test, was also used as it has been found to be more powerful than Tajima’s D and performs better

for small sample sizes than Fu’s FS statistic (Ramos-Onsins and Rozas 2002). This R2 test was

conducted using DnaSP ver. 5.1 (Librado and Rozas 2009) and the significance evaluated with

2000 coalescent simulations. Finally, the mismatch distribution method (Rogers and Harpending

1992) was used to test if the sample collections in the analysis belong to a population of constant

size over time. This process examines the distribution of the number of pairwise differences

between sequences within a set of samples. The pattern of the distribution will indicate either a

model of population expansion (a smooth, unimodal wave) or a model of constant population

size (a “non-wave-like” or multi-modal ragged distribution) (Rogers and Harpending 1992,

Harpending 1994, Schenekar and Weiss 2011). The distribution pattern was assessed by

quantifying a raggedness statistic, R (Harpending 1994), and by a sum of squared differences

statistic (SSD) where both use a bootstrap approach to compare the observed and expected

mismatch distribution based on simulations (Schneider and Excoffier 1999). Both of these

statistics were calculated in Arlequin ver. 3.5.1.2 (Excoffier and Lischer 2010) with 10,000

simulations to test significance. Graphs of the mismatch distribution data were produced in

DnaSP ver. 5.1 (Librado and Rozas 2009) using parameters of Ɵ0 and tau being estimated from

the data assuming a Ɵfinal of infinity, and a Ɵfinal for the graphical simulation set to 1000 (Rogers

and Harpending 1992).

Mismatch distribution can also be used to estimate, or at least compare, the timing of

demographic expansion (Rogers and Harpending 1992, Schenekar and Weiss 2011). This

Page 60: Genetic diversity, population structure and phylogeography ...

42

approach uses the equation t = Ƭ/2u where t is the time since expansion, Ƭ is the mode of the

mismatch distribution and is a moment estimator of mutational time, and u is the cumulative

(across the sequence) nucleotide mutation rate. The calculation of u must take into account the

amount of sequence information in base pairs and the substitution rate for the particular segment

of the sequence used in the mismatch distribution analysis (Rogers and Harpending 1992). A

cetacean (dusky dolphin, Lagenorhynchus obscurus) mtDNA mutation rate (µ) of 7.0 x 10-8 per

base per year (Harlin et al. 2003) was used to determine an aggregate mutation rate for the

beluga haplotype sequences for u = 609µ = 4.26 x 10-5. The time of expansion based on

mismatch distribution analyses for beluga haplogroups was therefore estimated using t

=Ƭ/0.000085.

2.3 Results

2.3.1 mtDNA sequence diversity and population structure of belugas in Canadian waters

The analysis of 2540 beluga whales sampled from widely distributed locations in

Canadian waters (Figure 2.1) resulted in the identification of 111 unique haplotypes based on

609bp of mtDNA control region sequence (Table 2.1). Replicate sequencing analysis (Morin et

al. 2010) and haplotype identification of a randomly selected subgroup of samples (N=83, 6 of

the original 89 samples failed to yield good sequences) had an error rate of 8% (7 out of 83

haplotypes did not match). While this is a high incidence (typical error rate for beluga mtDNA

sequencing is <1% as in Chapter 3, Section 3.3.5), in all cases the samples that did not match

were older samples (1986-1999), and sample degradation (causing poor quality template

resulting in sequencing errors) is suspected to have contributed to the results. Also, the

misidentification was with a haplotype of similar sequence (one mutational difference), and this

Page 61: Genetic diversity, population structure and phylogeography ...

43

type of error is unlikely to affect overall analyses in this study (see below). To test the influence

of older samples on analyzed results, the dataset was re-analyzed using only post-1999 samples.

The results did not change any of the patterns as compared to the full dataset (Appendix 2.3),

except for Foxe Basin where the sample size was significantly reduced (N=72 to N=9). Thus, the

full dataset analyses were retained for all interpretations.

Rare haplotypes in the overall dataset could also be an indication of sequencing errors

and error in identifying haplotypes; therefore, haplotypes that occurred in ≤2 individuals were

removed. In total, 40 individuals corresponding to 28 haplotypes were excluded. The majority of

these samples were old (or from sloughed skin) and of poorer DNA template quality. The

sequences produced from these types of samples are more vulnerable to sequencing errors and

subjective haplotype scoring by different lab technicians. After this data quality screening, a final

dataset of 2500 individuals and 83 unique haplotypes was retained. This almost doubles the

number of samples examined in previous beluga mtDNA studies and more than doubles the

number of haplotypes identified (references in Appendix 2.1).

Increasing the number of base pairs sequenced from 234bp to 609bp increased the

number of variable positions from 19 (Brown Gladden et al. 1997) to 39 in this study. This

resulted in an increased haplotype diversity (Table 2.2) for all locations, except the St. Lawrence

River, as compared to de March and Postma (2003). All of the mutations observed in the 609bp

of mtDNA sequence used to identify haplotypes were due to single nucleotide substitutions (i.e.

rather than insertions or deletions). Of the 39 variable positions, 38 displayed transitions in all

samples, except for position 220 where a transversion (C->G) was observed in three samples

from Grise Fiord.

Page 62: Genetic diversity, population structure and phylogeography ...

44

Overall haplotype diversity (h) was 0.9378, nucleotide diversity (π) was 0.0083, and the

average number of nucleotide differences (k) was 5.03 (Table 2.2). The smallest haplotype

diversities were found in the St. Lawrence Estuary sample collection (0.4493) and James Bay

(0.5212), whereas all other distinct areas had diversities larger than 0.7333.

The increased number of haplotypes in this study resulted in new patterns of unique

haplotypes and new proportions of dominant haplotypes found among different locations (see

Appendix 2.4 for example alignment of polymorphic sites). In previous beluga studies, haplotype

H02 was the dominant haplotype, found in 36% (Brown Gladden et al. 1997), 52% (de March

and Postma 2003) and 38% (Turgeon et al. 2012) of all the samples analyzed. In this study,

haplotype H02 was differentiated into 21 different haplotypes (E02, E03, E04, E05, E06, E07,

E08, E34, E40, E52, E58, E65, E71, E72, E83, E101, E130, E133, E138, E141, and E176) that

collectively represented 49% of the total sample set (Table 2.1). Also, the resolution of H02 into

new haplotypes was different among various sampling collections, and this provided a finer scale

of genetic information that revealed new geographic patterns. Haplotype H02 in Western Hudson

Bay samples broke down into mostly E72 (29%), E02 (25%) and E05 (10%) haplotypes. In the

Sanikiluaq harvest samples, the proportions of haplotypes from H02 were mainly E02 (28%),

E06 (15%), E07 and E08 (together 5%). Eastern Hudson Bay samples did not have a large

number of H02 haplotypes reported in previous studies (15%, e.g. Turgeon et al. 2012), and in

contrast to other Hudson Bay samples, this remained approximately the same in this study with

12% of these samples all E02.

Page 63: Genetic diversity, population structure and phylogeography ...

45

Table 2.1. Distribution of haplotypes (N=111) from beluga samples collected in Canadian waters (Figure 2.1).

Abbreviations: Hap, Haplotype ID; BeS, Beaufort Sea (Hendrickson Is.); CHA, Central High Arctic (Cunningham

Inlet); EHA, Eastern High Arctic (Grise Fiord); FB, Foxe Basin (Igloolik); NHB, Northern Hudson Bay (4

locations); WHB, Western Hudson Bay (4 locations); JB, James Bay; LI, Long Island; EHB, Eastern Hudson Bay (8

locations); SQH, Belchers Is. Harvest (Sanikiluaq); SQE, Belcher Is. Ice entrapments (Sanikiluaq); SLE, St.

Lawrence Estuary; NQ, Northern Quebec along Hudson Strait (18 locations); CSd, Cumberland Sound

(Pangnirtung); Tot1, Total number of samples with each haplotype; Tot2, Total number of samples in each

collection. Blank cells equal N=0.

Hap BeS CHA EHA FB NHB WHB JB LI EHB SQH SQE SLE NQ CSd Tot1

E01 2 2

E02 24 6 1 6 21 44 5 3 20 87 12 238 21 488

E03 1 2 1 23 1 28

E04 1 1

E05 2 18 2 8 1 31

E06 1 3 46 2 5 57

E07 12 2 14

E08 3 3

E09 15 2 19 36

E10 32 3 7 13 55

E11 10 5 8 3 3 19 12 11 20 4 37 132

E13 9 5 19 1 34

E15 1 8 15 24

E16 1 4 2 30 1 7 7 52

E17 1 9 1 11

E18 1 1 40 3 18 63

E19 4 1 5

E22 20 15 19 4 1 9 68

E23 1 1 1 6 2 11

E24 4 1 1 1 1 5 9 22

E25 3 3

E28 4 3 2 9

E29 12 12

E30 2 2 2 6

E31 3 3

E32 1 4 2 7 14

E33 3 3

E34 2 2

E35 1 1 1 3

E36 2 2 1 3 5 13

E37 7 2 18 12 39

E38 3 1 1 1 2 17 25

E40 5 5

E41 1 4 3 1 1 13 23

E42 3 2 2 7

E50 1 6 1 8

E51 7 7

E52 5 1 6

E55 2 2 4

E56 1 2 1 10 14

E57 1 3 1 8 2 15

E58 1 1 2

Page 64: Genetic diversity, population structure and phylogeography ...

46

Hap BeS CHA EHA FB NHB WHB JB LI EHB SQH SQE SLE NQ CSd Tot1

E59 1 2 4 3 10

E61 2 1 1 2 6

E62 1 3 1 5

E63 8 4 4 2 18

E65 1 4 3 2 10

E67 1 3 4

E68 4 2 6 1 13

E69 1 1 1 7 10

E70 1 1 2

E71 1 1 2

E72 4 25 13 39 92 52 6 7 260 63 561

E75 1 1 1 22 25

E76 2 2

E77 1 8 9

E79 1 8 9

E80 4 15 1 1 5 5 31

E81 1 3 3 7

E82 2 1 1 4

E83 2 2

E84 1 1

E86 3 3

E88 1 1

E90 7 7

E92 8 8

E94 1 13 14

E95 1 1 2

E96 1 1

E97 1 1 3 5

E100 4 4

E101 1 1

E103 1 3 4

E110 1 2 3

E112 1 1 4 6

E113 22 1 23

E114 2 2

E115 2 2 3 7

E116 1 7 4 12

E117 1 1 2

E119 1 1

E120 131 1 1 133

E121 4 4

E123 2 2

E124 1 1

E127 1 1

E129 1 1

E130 1 4 5

E131 1 1

E132 1 1

E133 1 1

E138 2 8 10

E140 2 2 4

E141 2 2

Page 65: Genetic diversity, population structure and phylogeography ...

47

Hap BeS CHA EHA FB NHB WHB JB LI EHB SQH SQE SLE NQ CSd Tot1

E142 6 6

E143 1 1

E144 6 6

E146 7 7

E147 30 30

E148 5 5

E149 1 1

E150 3 3

E153 1 4 5

E154 3 3

E155 92 92

E156 3 3

E176 4 4

E177 2 4 6

E178 1 1

E181 3 3

E187 1 1

Tot2 261 54 57 72 210 178 28 39 157 319 54 102 807 202 2540

Page 66: Genetic diversity, population structure and phylogeography ...

48

Table 2.2. Summary of haplotype diversity found in only the main geographic beluga sample collections of interest

(Nsamples=1377). Abbreviations: WHB, Western Hudson Bay (Arviat); BeS, Beaufort Sea (Hendrickson Island); CHA,

Central High Arctic (Cunningham Inlet); EHA, Eastern High Arctic (Grise Fiord); FB, Foxe Basin (Igloolik); JB,

James Bay; LI, Long Island; EHB, Eastern Hudson Bay (Nastapoka River); SLE, St. Lawrence Estuary; CSd,

Cumberland Sound (Pangnirtung); SQH, Belcher Islands harvests (Sanikiluaq); SQE, Belchers Islands ice

entrapments. The order of highest (1) to lowest (12) levels of haplotype diversity and nucleotide diversity are indicated

in parentheses (1-12).

Sample Collection

Number of sequences

Number of haplotypes

Number of segregating sites

Avg. number

nucleotide diff. (k)

Haplotype diversity (h)

Nucleotide diversity (π)

WHB 118 25 23 3.19 0.85 (6) 0.005 (4) BeS 301 27 18 2.21 0.78 (7) 0.004 (10) CHA 54 12 15 2.73 0.76 (8) 0.005 (7) EHA 57 14 12 2.48 0.86 (5) 0.004 (8) FB 72 9 15 1.58 0.63 (10) 0.003 (12) JB 28 6 15 2.05 0.52 (11) 0.003 (11) LI 39 15 21 5.23 0.87 (2) 0.009 (2) EHB 30 7 15 3.74 0.73 (9) 0.006 (3) SLE 125 13 18 2.43 0.45 (12) 0.004 (9) CSd 186 24 17 2.85 0.86 (3) 0.005 (6) SQH 311 37 27 3.12 0.88 (1) 0.005 (5) SQE 56 10 15 5.82 0.86 (4) 0.010 (1) Total data 1377 107 39 5.03 0.94 0.0083

Page 67: Genetic diversity, population structure and phylogeography ...

49

Cumberland Sound samples had haplotypes of E72 (31%), E02 (10%) and E138 (4%) as

the main constituents of the previous H02. Finally, the High Arctic samples, which in previous

studies had a large proportion of H02 (de March et al. 2002), were split into E72 and E02 of

different proportions between the Central High Arctic (Cunningham Inlet) and the Eastern High

Arctic (Grise Fiord) samples.

Haplotype H07 was the second most dominant haplotype in previous beluga studies,

occurring in >15% of the overall sample set (Brown Gladden et al. 1997, de March et al 2002).

This haplotype was especially distinctive in the Beaufort Sea and High Arctic samples. In the

present analysis of a greater amount of mtDNA sequence, H07 was differentiated into seven new

haplotypes (E11, E62, E88, E97, E103, E120, and E153) that once again changed the geographic

patterns of haplotypes found in beluga samples from the Beaufort Sea and High Arctic. Most

notable was the identification of haplotype E120, which was found in over 50% (N=131) of the

Beaufort Sea samples but only occurred in 2 individual samples elsewhere (Table 1). In High

Arctic samples, H07 was resolved into E103 in the Central High Arctic (CHA) and E97 in the

Eastern High Arctic (EHA). These two locations were also differentiated in the present study by

the identification of a unique haplotype in the CHA (E100 found in 7% of the samples), and a

large proportion of haplotype E80 (26%) in the EHA samples.

Unique, or private, haplotypes were found in relatively large proportions in several of the

sample collections. Haplotypes E29 (4% of all samples) and E92 (3%) were found only in

belugas harvested near the Belcher Islands (Table 2.1). Belugas samples from winter ice

entrapments around the Belcher Islands also revealed large numbers of haplotypes E94 (24%),

E69 (13%), and E177 (7%) that were unique to that area. As in previous studies, Eastern Hudson

Bay samples had several haplotypes found in very small numbers elsewhere (E16, E18), but in

Page 68: Genetic diversity, population structure and phylogeography ...

50

this study had a unique haplotype (E51) in 4% of the samples. Beaufort Sea samples had many

more unique haplotypes as compared to previous studies, including E121, E142, E143, E144,

E146, E147, E148 and E150. Collectively, these were found in 24% of the samples. Finally, the

St. Lawrence beluga samples were clearly distinct from all other collections, containing four

unique haplotypes in 99% of the samples (Table 2.1), the majority of which (90%) were E155.

This lack of overlap with any other haplotypes from other locations is another new result as

compared to previous studies (Brown Gladden et al. 1997, de March and Postma 2003).

Overall, analyses of pairwise genetic differences confirmed results of previous studies

(Brown Gladden et al. 1997, de March and Postma 2003, Turgeon et al. 2012), but also identified

new significant differentiation of sample collections from new and previously sampled locations

(Table 2.3). ΦST values ranged from 0.027 to 0.525, with the greatest distances found between

the St. Lawrence Estuary and all other sites, followed by Eastern Hudson Bay and the Beaufort

Sea. All summer and winter sample collections were significantly differentiated, except for

James Bay and Long Island, and in one analysis between Foxe Basin and Western Hudson Bay.

Samples from the Central High Arctic (Cunningham Inlet), the Eastern High Arctic (Grise Fiord)

and the Belcher Islands harvests (Sanikiluaq) were not significantly differentiated in previous

analyses of samples from these areas (Brown Gladden et al. 1997, de March and Postma 2003).

These patterns of spatial relationships among geographic sample collections were echoed by the

results of Principal Coordinates Analysis (PCoA) using genetic distance (Figure 2.2) and the

evolutionary relationships inferred from a Neighbour-Joining tree (Figure 2.3).

Page 69: Genetic diversity, population structure and phylogeography ...

51

Table 2.3. Patterns of differentiation based on genetic distance among main geographic beluga sample collections

(Nsamples= 1377). ФST values, calculated in Arlequin ver. 3.5 (below diagonal), and ФPT values, calculated in GenAlEx

ver. 6.5 (upper diagonal), are both FST analogues based on nucleotide divergence among haplotypes. Minimum

significance levels using Bonferroni correction (Holm 1979) were P=0.006 and P=0.032 respectively. All pairwise

comparisons were significant (P<0.0001 and P<0.008 respectively, data not shown), except for those highlighted in

grey. Abbreviations: WHB, Western Hudson Bay (Arviat); CSd, Cumberland Sound (Pangnirtung); SQH, Belcher

Islands harvests (Sanikiluaq); SQE, Belchers Islands ice entrapments; BeS, Beaufort Sea (Hendrickson Island);

CHA, central High Arctic (Cunningham Inlet); EHA, Eastern High Arctic (Grise Fiord); FB, Foxe Basin (Igloolik);

JB, James Bay; LI, Long Island; EHB, Eastern Hudson Bay (Nastapoka River); SLE, St. Lawrence Estuary.

WHB CSd SQH SQE BeS CHA EHA FB JB LI EHB SLE

WHB * 0.054 0.087 0.384 0.462 0.066 0.188 0.020 0.339 0.296 0.604 0.716

CSd 0.027 * 0.094 0.450 0.429 0.052 0.174 0.095 0.340 0.338 0.660 0.749

SQH 0.088 0.088 * 0.404 0.399 0.109 0.176 0.141 0.276 0.279 0.617 0.709

SQE 0.110 0.113 0.056 * 0.534 0.418 0.453 0.478 0.410 0.234 0.121 0.369

BeS 0.172 0.167 0.141 0.162 * 0.377 0.307 0.561 0.203 0.272 0.698 0.742

CHA 0.031 0.030 0.132 0.162 0.212 * 0.067 0.147 0.248 0.242 0.659 0.757

EHA 0.071 0.058 0.113 0.127 0.181 0.056 * 0.323 0.120 0.185 0.687 0.765

FB 0.038 0.082 0.198 0.239 0.268 0.061 0.153 * 0.540 0.429 0.727 0.793

JB 0.253 0.257 0.189 0.225 0.287 0.287 0.218 0.405 * 0.066 0.668 0.754

LI 0.126 0.128 0.081 0.098 0.170 0.155 0.098 0.257 0.087 * 0.441 0.614

EHB 0.190 0.191 0.170 0.151 0.238 0.242 0.184 0.323 0.309 0.155 * 0.318

SLE 0.349 0.327 0.306 0.366 0.361 0.419 0.373 0.469 0.525 0.386 0.428 *

Page 70: Genetic diversity, population structure and phylogeography ...

52

Figure 2.2. Principal Coordinates Analysis (PCoA) using ΦPT genetic distance matrix for mtDNA haplotypes from

summer and winter beluga whale sample collections (Table 2.3). Symbols are coloured according to previously

described eastern (blue) and western (yellow) haplotype assemblages (Brown Gladden et al. 1997) and new sample

collections (red). Abbreviations: FB, Foxe Basin; WHB, Western Hudson Bay; CSd, Cumberland Sound; SQH,

Belchers Is. (Sanikiluaq) harvests; CHA, Central High Arctic; EHA, Eastern High Arctic; JB, James Bay; BeS,

Beaufort Sea; SQE, Belcher Is. ice entrapments; EHB, Eastern Hudson Bay; SLE, St. Lawrence Estuary.

Page 71: Genetic diversity, population structure and phylogeography ...

53

Figure 2.3. Evolutionary relationships of summer and winter beluga whale sample collections (Nsamples= 1377) based

on mtDNA haplotypes and inferred using the Neighbour-Joining method (Saitou and Nei 1987). The evolutionary

distances (FST) as a pairwise matrix were generated in DnaSP v5 (Librado and Rozas 2009). The tree was rooted using

the SLE/EHB/SQE branch as an outgroup and is drawn to scale, with branch lengths (in distance units) indicated.

New sample collections (as compared to Brown Gladden et al. 1997) are indicated with red diamonds.

Abbreviations: FB, Foxe Basin; WHB, Western Hudson Bay; CSd, Cumberland Sound; SQH, Belchers Is.

(Sanikiluaq) harvests; CHA, Central High Arctic; EHA, Eastern High Arctic; JB, James Bay; BeS, Beaufort Sea; SQE,

Belcher Is. ice entrapments; EHB, Eastern Hudson Bay; SLE, St. Lawrence Estuary.

.

Distance

Page 72: Genetic diversity, population structure and phylogeography ...

54

2.3.2 Phylogeographic patterns of belugas inferred from gene trees and haplotype network

Phylogenetic analyses were performed to estimate gene trees for the 83 main haplotypes

identified in this study. All model tests (PhyML, MEGA and jModelTest) agreed that the best

model of sequence evolution for the beluga data was HKY+G+I (Table 2.4). All phylogenetic

approaches, Bayesian Inference (Figure 2.4), Maximum Parsimony (Figure 2.5), and Maximum

Likelihood (Figure 2.6), identified two well-supported lineages (>80% support). These lineages

correspond to the two “assemblages” of haplotypes identified in previous studies (see references

in Appendix 1) with the haplotypes highlighted in blue in Figures 2.4 - 2.6) corresponding to the

“eastern assemblage”. Branching patterns within this lineage were also consistent and reasonably

well-supported across the different phylogenetic methods. E154 and E155, the two haplotypes

found in almost all (93%) of the St. Lawrence Estuary beluga samples, consistently grouped

together and were the most distantly related haplotypes to all others. Haplotypes E17, E18, and

E19 also formed a consistent group and were found almost exclusively in a large proportion of

the Eastern Hudson Bay samples.

The other lineage (roughly corresponding to the “western assemblage”) was composed of

branching patterns that were less consistent across the different phylogenetic analyses. However,

both the Maximum Parsimony (MP) (Figure 2.5) and Maximum Likelihood (ML) (Figure 2.6)

analyses clustered the same sets of haplotypes in similar patterns and as most distant from other

haplotypes in the lineage. While the MP branch support was very weak, the ML tree had strong

aLRT support for most branches and suggested the division of the overall “western” lineage into

two subgroups. The Bayesian tree did not provide a similar resolution of the polytomy in this

lineage (Figure 2.4). Some haplotypes were grouped in several clusters, but with lower posterior

probabilities at the nodes than the level of support that was found in the ML interpretation of

Page 73: Genetic diversity, population structure and phylogeography ...

55

haplotype relationships. It is unlikely that this is a failure of the BI analysis, as all diagnostics

indicated that convergence was achieved (i.e. all PSRF values = 1.000 – 1.001; all parameter

ESS (effective sample size) values greater than 900, and no obvious trend in the trace plot

suggesting that the MCMC was still converging).

The median-joining (MJ) phylogenetic network (Figure 2.7) provides more clarity for

assessing the different haplotype relationships represented by the different gene tree analyses.

This type of network, and others allows for the inclusion of ancestral nodes, multifurcations

(multiple connections as opposed to strictly bifurcating branches) and reticulations (more

complex structures in the connections) (Posada and Crandall 2001). Because of this, more

phylogenetic information from the dataset can be determined from the interconnections of the

haplotypes depicted by a network. Ancestral haplotypes will be found central to a cluster of

multiple descendant haplotypes in a network analysis, and will also “occupy a branch of zero

length at the basal node of a cluster” in a traditional bifurcating tree (Posada and Crandall 2001).

Haplotypes E120, E11, E02, E72, E13, E41, E16 and E18 all have this characteristic defining

them as ancestral haplotypes in both the ML tree and the MJ network (Figures 2.6 and 2.7). Also,

these older haplotypes will often be present in higher frequencies (Watterson and Guess 1977)

and will be “common” alleles (i.e. found over a wider geographic range) (Watterson 1985). The

network results (Figure 2.7) confirm this pattern. Using this information and further comparison

of the ML phylogenetic tree (Figure 2.6) and the MJ network (Figure 2.7), three haplogroups

(evolutionary closely related haplotypes, e.g. Klüsch et al. 2012, Lindqvist et al. 2016) for the

beluga mtDNA data in this study were identified. These haplogroups appear to have formed from

different ancestral lineages: Haplogroup 1A and Haplogroup 1B corresponding to the “western

Page 74: Genetic diversity, population structure and phylogeography ...

56

assemblage” identified in previous studies (references in Appendix 1) and Haplogroup 2

corresponding to the “eastern assemblage” (Figure 2.6 and 2.7).

The grouping of individual samples into the three haplogroups and comparison of the

haplogroup frequencies among the summer and winter collection locations revealed both north-

south and east-west patterns over the Canadian beluga distribution (Figure 2.8). In the western

Canadian Arctic (Beaufort Sea), the samples were predominantly Haplogroup 1B and in the

eastern edge of the distribution (St. Lawrence Estuary) samples were almost entirely Haplogroup

2. In the middle of the Canadian beluga distribution, there was a latitudinal gradient from south

to north of decreasing frequencies for haplotypes belonging to Haplogroup 2. Conversely,

Haplogroup 1B had a general trend of decreasing frequency from north to south, with the

exception of James Bay and the southeast corner of Hudson Bay (Long Island and Belcher

Islands). Haplogroup 1A was the dominant haplogroup in the central portion of the beluga

distribution, with a slightly decreasing frequency occurring from south to north.

The belugas sampled in the St. Lawrence Estuary represent an isolated population and

have the lowest amount of haplotype diversity (Table 2.2). Though the haplotypes found in this

sample collection belong to Haplogroup 2, they were unique to this population and were

predominantly (90%) a single haplotype. Based on these results, this location was considered

separately for further analyses.

Page 75: Genetic diversity, population structure and phylogeography ...

57

Table 2.4. Results of Maximum Likelihood tests for fits of different nucleotide substitution models for 1377 haplotype sequences (609bp) from summer and

winter beluga sample collections (see Figure 1) as calculated in MEGA6 (Tamura et al. 2013). Model abbreviations: GTR: General Time Reversible; HKY:

Hasegawa-Kishino-Yano; TN93: Tamura-Nei; T92: Tamura 3-parameter; K2: Kimura 2-parameter; JC: Jukes-Cantor; +G: gamma shape parameter estimated;

+I: fraction of invariant sites estimated.

Page 76: Genetic diversity, population structure and phylogeography ...

58

Figure 2.4. Evolutionary relationships of beluga whale mtDNA control region haplotype sequences (N=83) inferred

from phylogenetic analysis using Bayesian inference (MrBayes ver3.2, Ronquist et al. 2011). The tree was rooted

using the group of haplotypes outlined in blue and branch lengths are proportional to the scale (measured as the number

of substitutions per site). Red numbers adjacent to nodes are Bayesian posterior probabilities indicating clade

confidence.

Page 77: Genetic diversity, population structure and phylogeography ...

59

Figure 2.5. Evolutionary relationships of beluga whale mtDNA control region haplotype sequences (N=83) inferred from

phylogenetic analysis using Maximum Parsimony (MEGA ver. 6, Tamura et al. 2013). Tree #1 out of 4 most parsimonious

trees is shown. The tree was rooted using the group of haplotypes outlined in blue. Branch lengths were calculated using

the average pathway method and lengths are proportional to the scale (measured as units of the number of changes over

the whole sequence). Red numbers adjacent

to nodes are the percentage of trees in which

associated haplotype sequences clustered

together during bootstrap analysis.

Page 78: Genetic diversity, population structure and phylogeography ...

60

Figure 2.6. Evolutionary relationships of beluga whale mtDNA control region haplotype sequences (N=83) inferred

from phylogenetic analysis by Maximum Likelihood (PhyML ver3.0, Guindon et al. 2010). The tree was rooted

using the group of haplotypes highlighted in blue, and branch lengths greater than 0.001 are shown (measured as the

number of substitutions per site). Red numbers adjacent to nodes are aLRT (approximate Likelihood Ratio Test)

measures of support for the branches. Additionally, two main haplogroups are identified that generally correspond to

the previously described eastern and western haplotype assemblages (Brown Gladden et al. 1997). However, the

“western” assemblage has been subdivided into two haplotype subgroups based on the new mtDNA control region

sequence (609bp) information included in this study. Circled haplotypes have been identified as ancestral

haplotypes.

Page 79: Genetic diversity, population structure and phylogeography ...

61

Haplogroup 2

(“eastern”

assemblage)

Haplogroup 1B

Haplogroup

1A

Page 80: Genetic diversity, population structure and phylogeography ...

62

Figure 2.7. Median-joining phylogenetic network for all beluga mtDNA haplotypes (609bp sequences) identified in

this study (Table 2.1) and found in N>2 individuals (Nhaplotypes= 83, Nsamples= 2500). Each circle (node) represents an

individual haplotype and the size of the node is proportional to the overall frequency of occurrence of the haplotype

in the complete dataset. Nodes are coloured according to haplogroups and subhaplogroups identified using the

Maximum Likelihood phylogenetic tree (Figure 2.7): Haplogroup 1A, yellow; Haplogroup 1B, orange; Haplogroup

2, blue. Red numbers along branches connecting nodes are sequence positions of nucleotide changes between

haplotypes. Nodes labelled as “mv” are median vectors that refer to missing haplotypes. The network was

reconstructed using Network ver. 4.6 (Fluxus Technology 2012).

Page 81: Genetic diversity, population structure and phylogeography ...

63

Figure 2.8. Distribution of haplogroups (Haplogroup 1A, Haplogroup 1B and Haplogroup 2) among summer and

winter beluga whale sample collections (Table 3). Haplogroups are identified based on the Maximum Likelihood

phylogenetic tree (Figure 2.6) and haplotype network (Figure 2.7).

Page 82: Genetic diversity, population structure and phylogeography ...

64

2.3.3 Divergence patterns and dating

Divergence estimates (Table 2.5) among the haplogroups support a much closer

relationship between Haplogroup 1A and Haplogroup 1B, whereas Haplogroup 2 was more

highly divergent. These results echoed the branching patterns depicting the evolutionary

relationships among the haplotypes in the phylogenetic trees and the haplotype network.

Table 2.5. Pairwise divergence estimates based on nucleotide diversities for the 3 haplogroups of beluga whale

haplotypes.

Group 1 vs. Group 2 Dxy (total raw divergence)

Da (net divergence)

Haplogroup 2 Haplogroup 1B 0.01650 0.01144 Haplogroup 2 Haplogroup 1A 0.01693 0.01168

Haplogroup 1B Haplogroup 1A 0.00893 0.00391

The neutrality tests (Table 2.6) and mismatch distribution analyses (Figure 2.9-2.13)

revealed different signals of spatial and demographic expansion for the seasonal geographic

samples of belugas. Significantly negative D, R2, and Fs values are an indication of population

expansion. Neither the Tajima’s D statistic nor the Ramos-Osins and Rozas R2 were significant

for any of the distinct sample collections or the geographic groupings (all probabilities > 0.05)

(Table 2.6). Also, Fu’s Fs was not significantly negative for the sample collections from SE

Hudson Bay or the St. Lawrence Estuary. These results are not able to reject a null hypothesis of

a constant population size, and for some sample collections (Belcher Islands entrapment samples,

eastern Hudson Bay) positive neutrality test results could also indicate a past bottleneck event

(Zlojutro et al. 2006). It should be noted, though, that neutrality tests such as these have been

Page 83: Genetic diversity, population structure and phylogeography ...

65

shown to have weak power when sampling occurs too early (before substantial population

growth has occurred) or too late (after the population has reached a steady size) (Fu, 1997).

However, Fu’s Fs was significantly negative for five out of nine locations (the High

Arctic samples were combined), all of which had predominantly Haplogroup 1A haplotypes.

Thus, as expected, this combined set of samples representing Haplogroup 1A also had a highly

significant Fu’s Fs (-25.34, P = 0.0003). While the weight of evidence is stronger with congruent

test results, it has been shown that Tajima’s D, Ramos-Osins and Rozas R2, and Fu’s Fs are much

different in their ability to detect departures from population equilibrium (Fu 1997, Ramos-

Onsins and Rozas 2002, Ray et al. 2003). The power of Tajima’s D is greatly influenced by gene

flow and the age of the expansion event, whereas Fu’s Fs is more robust, especially for larger

sample sizes such as in this beluga dataset (Ramos-Onsins and Rozas 2002, Ray et al. 2003).

And since the R2 statistic is based on the mutation frequencies rather than the haplotype

distribution, Fs is more sensitive than R2 to the presence of rare haplotypes expected in a recently

expanded population (Fu 1997, Ramos-Onsins and Rozas 2002).

Based on geographic patterns of haplogroup distributions and the results of the neutrality

tests, pairwise sequence mismatch distributions were examined for groups of samples that

represent potential post-glacial expansions from different refugia. The mismatch distribution

graphs echoed the results of the neutrality tests. The pairwise difference distribution for the

complete summer and winter sample dataset did not have a unimodal pattern (Figure 2.9) and did

not have any significant departures from a model of neutral evolution and constant population

size with any of the neutrality tests (Table 2.6). Instead, there were two modes in the pairwise

distribution. Given the broad geographic range of this large dataset (N=1377), this multimodal

distribution could reflect population substructure (Marjoram and Donnelly 1994, Zlojutro et al.

Page 84: Genetic diversity, population structure and phylogeography ...

66

2006). However, Rogers et al. (1996) asserted that pooling data from subdivided populations has

a limited effect on the results of mismatch distribution analyses. Thus, the larger mode (at ~ 11)

most likely corresponds to the number of differences between the two main beluga lineages and

the smaller mode (at ~ 4) reflects the differences among individuals within lineages (Bernatchez

2001).

For the analyses of individual haplogroups, the mismatch distributions for both the

Beaufort Sea samples (Figure 2.10) and the central Canadian locations (Figure 2.11) displayed a

unimodal distribution (at ~3 pairwise differences) that fit the mode and shape of a curve expected

for a population that had undergone expansion in the recent past. The highly significant Fs tests

for each of these sample collections support this interpretation (Table 2.6), as do the non-

significant P-values for the sum of squares deviation (SSD) and raggedness index (Table 2.7).

Based on the mismatch distribution results, the time since expansion for the Beaufort Sea

samples (Haplogroup 1B) was estimated at 51,765 years ago (ya). For Haplogroup 1A (the

central Canadian locations: High Arctic, Hudson Bay, and Cumberland Sound), the mismatch

distribution results estimate a time since expansion of 14,118 ya.

Mismatch distribution results for both the southeastern Hudson Bay samples (James Bay

and Eastern Hudson Bay) and the St. Lawrence Estuary had different patterns and did not display

a typical unimodal bell-shape curve. However, samples from each of these locations did roughly

conform to the expected mismatch distribution under a model of population expansion (Figure

2.13), though the higher raggedness index (Table 2.7) did indicate a poorer fit of the data to the

model. Still, non-significant P-values for the SSD and raggedness index do not deviate from an

expansion hypothesis (Table 2.7). Thus, based on mismatch distribution analyses, the

Page 85: Genetic diversity, population structure and phylogeography ...

67

approximate time of expansion for the Eastern Hudson Bay sample collection was estimated at

4706 ya.

The sample collections from the Belcher Islands entrapments and Long Island resulted in

a multimodal distribution (Figure 2.12) similar to the overall sample distribution, with modes

also similar. Samples from these locations were characterized by haplotypes belonging to a

proportional mixture of all the haplogroups, and the mismatch distribution indicates that the

samples from these locations are from at least two very diverse lineages. However, the same

pattern remained if the locations were analyzed separately. Taken together, the mix of

haplogroups and the bimodal mismatch distribution for each of these locations most likely

indicates that samples in these collections are from a mixture of populations or stocks (Matias et

al. 2013).

Page 86: Genetic diversity, population structure and phylogeography ...

68

Table 2.6. Results of neutrality tests for significantly differentiated sample collections (Table 2.3) and for all sample

haplotypes combined according to geographic haplogroup patterns (Figure 2.9). Abbreviations: FB, Foxe Basin;

WHB, Western Hudson Bay; CSd, Cumberland Sound; SQH, Belchers Is. (Sanikiluaq) harvests; CHA, Central High

Arctic; EHA, Eastern High Arctic; JB, James Bay; BeS, Beaufort Sea; SQE, Belcher Is. ice entrapments; EHB, eastern

Hudson Bay; SLE, St. Lawrence Estuary.

Collection/

“Haplogroup”

Tajima’s

D

P Fu’s FS P Ramos-

Onsins

and

Rozas R2

P

BeS (N=301) -0.5746 0.3262 -12.2389 0.0035* 0.0613 0.3585

WHB-FB (N=190) -0.9362 0.1833 -14.5826 0.0007** 0.0563 0.2050

CHA (N=54) -0.5172 0.3549 -1.9217 0.2305 0.0870 0.3322

EHA (N=57) -0.1345 0.5068 -3.9303 0.0545 0.1016 0.5098

HA (CHA and EHA

combined N=111)

-0.8225 0.2175 -6.5564 0.0208* 0.0666 0.2440

CSd (N=186) -0.0744 0.5407 -7.6776 0.0214* 0.0834 0.5165

SQH (N=311) -0.7267 0.2606 -17.7801 0.0009** 0.0576 0.2695

SQE (N=56) 2.3556 0.9911 3.0247 0.8749 0.1940 0.9980

JB-LI (N=67) -0.3755 0.4085 -2.0703 0.2445 0.0899 0.4295

EHB (N=30) -0.0369 0.5436 1.6771 0.7916 0.1211 0.5460

SLE (N=125) -0.7532 0.2480 -1.4773 0.3328 0.0672 0.2885

Combined

HA/CSd/WHB/SQH

(N=798)

-0.9608 0.1640 -25.3428 0.0003** 0.0433 0.1978

Combined

SQE/JB/LI/EHB

(N=153)

1.2983 0.9198 -2.1442 0.3138 0.1297 0.9300

All summer and

winter collection

samples (N=1377)

-0.0992 0.5855 -0.3429 0.5665 0.0645 0.2332

Bolded P-values and * indicates significant (P<0.05) or ** highly significant (P<0.001) results supporting

a hypothesis of recent population growth or expansion.

Page 87: Genetic diversity, population structure and phylogeography ...

69

Figure 2.9. Observed mismatch distribution (dotted line) for all Canadian beluga summer and winter samples

combined (N=1377) compared to the bell-shaped curve (solid line) expected for the data if population expansion has

occurred in the past. The y-axis indicates the frequency at which each different category was found in the pairwise

sample comparisons.

Page 88: Genetic diversity, population structure and phylogeography ...

70

Figure 2.10. Observed mismatch distribution (dotted line) for Beaufort Sea samples, characterized by Haplogroup 1B

(N=301), compared to the bell-shaped curve (solid line) expected for the data if population expansion has occurred in

the past. The y-axis indicates the frequency at which each different category was found in the pairwise sample

comparisons.

Page 89: Genetic diversity, population structure and phylogeography ...

71

Figure 2.11. Observed mismatch distribution (dotted line) for central Canadian sample locations i.e. High Arctic,

Cumberland Sound, Western Hudson Bay, and Belcher Islands harvest, characterized by Haplogroup 1A (N=798),

compared to the bell-shaped curve (solid line) expected for the data if population expansion has occurred in the past.

The y-axis indicates the frequency at which each different category was found in the pairwise sample comparisons.

Page 90: Genetic diversity, population structure and phylogeography ...

72

Figure 2.12. Observed mismatch distribution (dotted line) for combined Belcher Island entrapment samples and Long

Island, characterized by Haplogroup 2 (N=95), compared to the bell-shaped curve (solid line) expected for the data if

population expansion has occurred in the past. The y-axis indicates the frequency at which each different category

was found in the pairwise sample comparisons.

Page 91: Genetic diversity, population structure and phylogeography ...

73

Figure 2.13. Observed mismatch distribution (dotted line) for (A) James Bay (N=28); (B) Eastern Hudson Bay

(N=30); and (C) St. Lawrence Estuary beluga samples (N=125) compared to the curve (solid line) expected under a

model population growth or decline. The y-axis indicates the frequency at which each different category was found

in the pairwise sample comparisons.

C

B

A

Page 92: Genetic diversity, population structure and phylogeography ...

74

Table 2.7. Parameters of demographic expansion for geographic lineages found in Canadian beluga whale samples as

estimated by mismatch analysis. Tau (Ƭ) is the age of expansion, ϴ0 is the effective population size before expansion,

and ϴ1 is the effective population size following expansion (all expressed in units of mutational time). SSD is a

goodness of fit test statistic between expected and observed mismatch distributions. P(SSDobs) is the probability of

observing by chance a less good fit between the observed and the mismatch distribution for a demographic history of

the lineage defined by the estimated parameters. P(Ragobs) is the probability of observing by chance a higher value of

the raggedness index than the observed one given a model of population expansion. t is the estimated time since

expansion.

Lineage Ƭ

(95% CI)

ϴ0

(95% CI)

ϴ1

(95% CI)

SSD P

(SSD)

Ragged

-ness

P

(Rag.)

t

(years)

Beaufort Sea

Lineage

(Haplogroup

1B)

4.4

(0.0-8.5)

0.0

(0.0-1.4)

3.3

(1.6-

99999.0)

0.021 0.207 0.049 0.570 51,765

HA-HB-Csd

Lineage

(Haplogroup

1A)

1.2

(1.1-1.6)

0.0

(0.0-0.06)

99999.0

(29.6-

99999.0)

0.003 0.354 0.026 0.472 14,118

SQE-LI 11.1

(1.8-17.0)

0.0

(0.0-4.7)

12.3

(7.3-

99999.0)

0.037 0.052 0.026 0.727 bimodal

James Bay 0.0

(0.0-0.6)

0.0

(0.0-0.003)

99999.0

99869.0-

99999.0)

0.038 0.563 0.303 0.617 -

EHB 0.4

(0.0-3.5)

1.1

(0.0-1.8)

99999.0

(4.2-

99999.0)

0.046 0.303 0.101 0.658 4,706

St. Lawrence

Lineage

(Haplogroup

2)

0.0

(0.0-0.5)

0.0

(0.0-0.01)

99999.0

(99869.0-

99999.0)

0.014 0.599 0.247 0.676 -

Page 93: Genetic diversity, population structure and phylogeography ...

75

2.4 Discussion

2.4.1 Population structure of beluga whales in Canadian waters

This study provides the most extensive analysis to date of mtDNA genetic population

structure for beluga whales in Canadian waters. It demonstrates that the knowledge of matrilineal

genetic structure within and among beluga aggregations continues to improve as more locations

and different seasonal samples are included, and as approaches to data analysis evolve. Ten

genetically distinct groups of belugas were identified based on pairwise comparisons of genetic

distance among haplotypes (Table 2.3) that correspond to different geographic, seasonal samples

(mostly summer) (Figure 2.14). Results in this study, for sampling locations in common with

previous studies, confirmed broad patterns of genetic distinctiveness among philopatric Canadian

summering stocks of belugas (Brown Gladden et al. 1997, de March and Postma 2003). The

significant increase in sample sizes for some areas, such as the Belcher Islands and the St.

Lawrence Estuary, has enhanced previous conclusions about the distinctiveness of populations

and stocks in these areas, providing finer scale resolution than previously possible. Both the

PCoA ordination of genetic distance ΦPT (Figure 2.2) and the neighbour-joining tree based on

FST (Figure 2.3) revealed a clear separation of “western” and “eastern” samples. However, the

inclusion of samples from new collections (James Bay, Long Island and winter Belcher Island

ice entrapments) grouped the ice-entrapment samples with the Eastern Hudson Bay and St.

Lawrence Estuary belugas instead of the James Bay/Long Island samples and the James Bay and

Long Island samples with the Beaufort Sea belugas.

Page 94: Genetic diversity, population structure and phylogeography ...

76

Figure 2.14. Distribution of haplotypes among summer and winter sample collections that are significantly

differentiated based on genetic distance (Table 2.3). Rare haplotypes (represented by <4 individuals in the entire

dataset) were not included and some of the “star-like” clusters of haplotypes in the network were combined when they

were unique to a particular collection. For clarity, only haplotypes highlighted in the Results and Discussion are

labelled in the pie diagrams.

Page 95: Genetic diversity, population structure and phylogeography ...

77

Belugas summering in the Canadian Central High Arctic (Creswell Bay, Elwin Bay,

Cunningham Inlet) have been considered to be part of a single High Arctic stock, also shared

with western Greenland, based on genetic analyses of five samples from Creswell Bay (Brown

Gladden et al. 1997, 1999). However, satellite tracking studies of belugas tagged in these areas

have challenged this information and suggest the possibility of further segregation among, and

even within, summering localities in the High Arctic (Heide-Jørgensen et al. 2003). Of the

whales tracked in this study, only the belugas tagged in Creswell Bay moved to West Greenland.

This result was incongruent with the hypothesis supported by the genetic analyses at that time

which indicated that the Canadian High Arctic and West Greenland whales were mixing as a

single stock. Additional samples used in this study revealed haplotypes unique to Cunningham

Inlet (CHA) that were not found in the other High Arctic (EHA) location of Grise Fiord (Figure

2.1 and 2.14), which also had unique haplotypes revealed (Table 2.1). These haplotypes, along

with differing frequencies of shared haplotypes, resulted in significant differentiation of the two

geographic High Arctic samples (Table 2.3), providing some genetic evidence in support of the

satellite tracking hypothesis of “segregation within the beluga population in Canada” (Heide-

Jørgensen et al. 2003).

Aerial surveys of James Bay have shown that a large number of whales (approximately

15,000) are distributed throughout the Bay in August (Gosselin et al. 2013). Given the lack of

hunting in this area and failures of biopsy efforts, samples were unavailable to earlier genetic

studies of Canadian beluga populations and critical information has therefore been missing (de

March and Postma 2003, Turgeon et al. 2012). A small number of samples were collected by

hunters in 2003-2004, including Cape Jones Island in the northwest corner of James Bay near

Long Island (Figure 2.1) and Cape Hope Island in the southeastern corner of James Bay. Satellite

Page 96: Genetic diversity, population structure and phylogeography ...

78

tags were deployed from Cape Hope Island from 2007 to 2009 along with the collection of tissue

samples from 14 belugas (11 females and 3 males) (Bailleul et al. 2012). All of these samples

were analyzed with the same mtDNA sequencing analyses as this study and included in

geographic comparisons. Of these accumulated samples, 86% were haplotype E11, which is

found in low proportions in other Hudson Bay summer aggregation samples (Postma et al. 2012,

Figure 2.14 this study) and also in the Beaufort Sea and High Arctic samples in low proportions

(Figure 2.14). Haplotype E11 was not found in the Foxe Basin samples, Cumberland Sound

samples, or in the St. Lawrence Estuary samples (Table 2.1).

Adding to the uniqueness of the James Bay samples was evidence from microsatellite

analyses that these samples came from a group of highly related individuals, even though they

were collected over different years (Postma et al. 2012). Also, all 14 of the whales tagged over

three years at Cape Hope Island were haplotype E11 (Postma et al. 2012). The satellite tracking

results revealed that the whales tagged did not leave the southeast portion of James Bay during

the months of August to February, suggesting that these are non-migratory whales belonging to a

distinct population that overwinters in polynyas within James Bay (Bailleul et al. 2012). Results

of all genetic analyses would further suggest that this population is small and isolated given the

similarities of high relatedness and genetic homozygosity of the samples to date with those that

have been characterized for the St. Lawrence Estuary (Table 2.2 and Figure 3.6). It is possible

that these animals belong to a family unit that has been sampled (Palsbøll et al. 2002), but again,

it would be unusual to have a family sampled together over multiple years. Even within years,

kin-groups (except for mothers and calves) are thought to disperse in summering areas (Colbeck

et al. 2012).

Page 97: Genetic diversity, population structure and phylogeography ...

79

This does, however, raise questions about the large groups of belugas that have been

recorded during the summer from aerial and other observational surveys along the northwest

coastline of James Bay (Gosselin et al. 2013) and in several river estuaries along the north coast

of Ontario (Armstrong 2013). The genetic, spatial, and seasonal cohesiveness of the Cape Hope

Island belugas would indicate that they do not overlap with the northwest James Bay or north

Ontario coast groups. Thus, the Ontario coastal whales may form (one or more) additional

distinct summering groups of belugas (Gosselin et al. 2013). These whales are currently

considered to belong to the western Hudson Bay stock based on a continuous pattern of beluga

distribution in major river estuaries along the Hudson Bay coast of Manitoba and Ontario

(Armstrong 2013). Given the patterns of finer genetic subdivision of Hudson Bay belugas

revealed in our analyses, this may not be true.

Polynyas, or areas of open water surrounded by ice, have been found to recur each winter

in the same area and provide necessary habitat for migrating or overwintering species of marine

birds and mammals (Stirling 1997). At least 61 distinct, recurring polynyas have been identified

in the Northern Hemisphere, with 21 identified within or partly within Canadian waters (Barber

and Massom 2007). Some of these polynyas are very well-known (e.g. the North Water polynya

in northern Baffin Bay and southern Smith Sound) while others are too small to be mapped with

remote sensing technologies and have no published information (Barber and Massom 2007). The

non-migratory belugas tagged near Cape Hope Island in James Bay are likely utilizing this type

of small, undocumented polynya (Bailleul et al. 2012). However, around the Belcher Islands, in

southeast Hudson Bay and north of James Bay (Figure 2.1), some documented recurring

polynyas have been observed to provide important winter habitat for marine mammals and birds,

including belugas that commonly overwinter to the southwest of the Islands (Gilchrist and

Page 98: Genetic diversity, population structure and phylogeography ...

80

Robertson 2000). This study has revealed new, never previously sequenced haplotypes in

samples collected from several winter ice entrapments of belugas around the Belcher Islands

(N=60). These private haplotypes were found in high proportion among the samples (E94, 26%;

E69, 12% and E177, 7%) and in each year entrapments were sampled (2004, 2011, 2013). This

suggests the presence of an unrecognized population of belugas that does not migrate out of

Hudson Bay for the winter. If migration was occurring along traditional migration routes, it is

likely that the haplotypes would have been detected in our harvest samples of migratory whales

(‘NQ’ in Table 2.1). This additional Hudson Bay population most likely includes the groups of

belugas found in the summer along the north Ontario coast and/or in the northwestern part of

James Bay. However, this is speculative, and cannot be confirmed without genetic analyses of

samples from these summering concentrations of whales.

The combination of ice-entrapment samples and extensive harvest sampling from the

Belcher Islands (by the community of Sanikiluaq) has substantially increased sample size for this

area (SQH N=311, SQE N=56) compared to previous studies using only harvest samples

(N=100, de March and Postma 2003; N=152, Turgeon et al. 2012). The Sanikiluaq harvest

samples contained four private haplotypes (E08, E29, E92, E176) that were found in 9% of the

overall sample. Six haplotypes were found almost exclusively in Sanikiluaq samples (>80%),

with the few other occurrences in areas along what could potentially be a migratory route

through Hudson Strait (NQ in Table 2.1) or in an offshore distribution range to eastern Hudson

Bay (EHB) and Long Island (LI) near the opening of James Bay (Figure 2.1). Furthermore, the

division of the dominant haplotype, H02, of previous studies (Brown Gladden et al. 1997, de

March et al. 2003, Turgeon et al. 2012) into 21 different haplotypes by sequencing a larger

segment of mtDNA provided clearer partitioning with less overlap of haplotypes among all of

Page 99: Genetic diversity, population structure and phylogeography ...

81

the Hudson Bay stocks examined (Table 2.1), and has more clearly distinguished the Sanikiluaq

harvest samples from western Hudson Bay samples (Figure 2.14).

Current beluga harvest management approaches that use the genetic assignment of

samples from harvested whales taken by Sanikiluaq hunters only consider Western Hudson Bay

and Eastern Hudson Bay stocks as sources for the harvested whales (DFO 2016). This is in large

part due to a lack of knowledge on the distribution, migratory movements, and abundance of a

possible Sanikiluaq stock. Spatial and temporal overlap of this stock is known to occur with the

genetically distinctive Eastern Hudson Bay (EHB) beluga stock in the summer. Evidence of this

comes from the almost continuous distribution of animals observed during aerial surveys

(Gosselin et al. 2013) and the proportion of mtDNA haplotypes characteristic to the EHB stock

found in the Sanikiluaq summer harvest samples (de March and Postma 2003, Turgeon et al.

2012, present study). Overlap of the Sanikiluaq stock with other, as yet uncharacterized, stocks is

also possible. The Belcher Islands may be at the centre of a genetic “melting pot”, both

historically (Petit et al. 2003) and currently, for various beluga populations and stocks in Hudson

Bay and this could indicate the area is of ecological importance. Under this model, an increase in

genetic diversity (as indicated in Table 2.2, where the Belcher Islands harvest samples had the

highest gene diversity) can result from a post-glacial redistribution of the genetic diversity from

multiple source refugia (Petit et al. 2003). Despite the unknowns, genetic evidence is now quite

compelling that beluga harvests around the Belcher Islands area are composed of a more

complex genetic mixture of stocks than the simple WHB-EHB model currently used.

Furthermore, private haplotypes found consistently over time in this harvest suggest the presence

of a local summer Belcher Islands stock.

Page 100: Genetic diversity, population structure and phylogeography ...

82

The analyses in this thesis have supported the identification of at least eight (8)

Designatable Units for belugas in Canada (COSEWIC 2016), including: Eastern Beaufort Sea

(EBS), Eastern High Arctic-Baffin Bay (EHA-BB), Cumberland Sound (CS), Ungava Bay (UB),

Western Hudson Bay (WHB), Eastern Hudson Bay (EHB), St. Lawrence Estuary (STL), and

James Bay (JB).The beluga DU report also recognized the evidence of another DU, not officially

assessed and requiring further sampling, in southeastern Hudson Bay as indicated by the samples

from ice-entrapped whales near the Belcher Islands (COSEWIC 2016). Again, this data and

information resulted from the preliminary results of this thesis. However, the analyses in this

thesis also indicate finer scale genetic structure in the EHA-BB and WHB DUs. The authors of

the report acknowledge that “the EHA-BB DU is under-sampled, especially in the high and

central Arctic regions” (COSEWIC 2016). Given the multiple lines of evidence employed in the

report for DU identification, it is possible that the data and analyses in this thesis do not meet the

threshold required for a separate DU around Cunningham Inlet (Central High Arctic). However,

its genetic distinctiveness should be recognized for further efforts for sampling, genetic analyses

and re-evaluation.

For the Western Hudson Bay Designatable Unit of beluga (WHB-DU), there is some

recognition that further substructure may be present due to estuary specific lineages (de March

and Postma 2003, COSEWIC 2016). The report (COSEWIC 2016) focusses on the Seal,

Churchill and Nelson Rivers as examples, but fails to mention the complex mixture of

haplotypes from whales harvested in the spring and summer around the Belcher Islands. The data

in this study represent a large number of samples (N=310) collected over almost two decades

from seasonal harvesting and results consistently indicate the presence of unique haplotypes and

haplotype mixtures not found in other Hudson Bay sampling collections. Again, perhaps the

Page 101: Genetic diversity, population structure and phylogeography ...

83

multiple lines of evidence are not available to identify the evolutionary significance of this

sample collection, but the genetic distinctiveness is evident, and further research should be

focussed in this area.

2.4.2 Influence of dispersal, both historical and contemporary, on extant beluga population

genetic structure

The geographical pattern of genetic variation arises from a combination of genetic

structure and population history (Templeton et al. 1995). The finer scale resolution of mtDNA

diversity and structure among Canadian beluga stocks revealed by this study provides an

opportunity to re-visit the evolutionary relationships of haplotypes from different geographic

locations, phylogeographic hypotheses about the use of glacial refugia during the last glacial

maximum and the possible routes of post-glacial range expansions by belugas. As in previous

studies (Brown Gladden et al. 1997, de March et al. 2003), at least two divergent clades of

haplotypes were supported by all methods of phylogenetic analyses (gene trees, Figures 2.4 –

2.6, and median-joining network, Figure 2.7). However, the joint interpretation of the maximum

likelihood tree and the network (Figure 2.6 and 2.7) supports three main groups of haplotypes

that cluster together based on ancestral nodes with multiple descendant haplotypes. Furthermore,

the geographical distribution of these haplogroups (Figure 2.8) offers a different model for the

location of possible refugia and the patterns of historical expansion and recolonizations by

Canadian stocks of belugas compared to the previous hypothesis (Brown Gladden et al. 1997, de

March et al. 2003).

Glacial refugia have been defined in different ways, but essentially were areas or habitats

that allowed the survival of species during periods of adverse conditions (Bennett and Provan

2008, Keppel et al. 2012, Gavin et al. 2014). Characteristics of refugia will vary spatially and

Page 102: Genetic diversity, population structure and phylogeography ...

84

temporally, but generally, species will expand away from refugia once conditions become more

favourable (Gavin et al. 2014). These range expansions can “produce genetic patterns that persist

for many hundreds or thousands of generations” (Nichols and Hewitt 1994) and therefore leave

some predictable modern day signatures. First, areas of glacial refugia will have the highest

within-population and between-population genetic diversity due to a longer demographic history

than the postglacial colonies (Taberlet et al. 1998, Provan and Bennett 2008, Maggs et al. 2008).

Second, because the leading edge of range expansions by long-distance dispersers causes a series

of founder events (Hewitt 2000), there will be a steady reduction of genetic diversity with

increasing distance from the refugial, or ancestral population (Slatkin and Excoffier 2014). And

third, contact zones will form where recolonizing populations meet (Bennett and Provan 2008).

These areas are characterized by high levels of genetic diversity but lack unique or “private”

haplotypes more typical in refugial populations that were isolated during the glacial maximum

(Petit et al. 2003, Maggs et al. 2008). Genetic patterns of expansion will also be influenced by

the effective population sizes before and after the expansion (Avise 1984, Rogers and

Harpending 1992).

Glacial refugia for North American belugas were thought have been south of the current

beluga distribution (Brown Gladden et al. 1997). Previous patterns of genetic variation suggested

that most of the summer aggregations of belugas currently distributed in Canadian waters are the

result of historical range expansion and recolonizations from a Pacific Ocean refugium (Brown

Gladden et al. 1997, de March and Postma 2003). The results of our study support a hypothesis

of a more complex series of postglacial expansions by belugas from a more diverse source of

multiple refugia.

Page 103: Genetic diversity, population structure and phylogeography ...

85

Analyses of beluga samples from the Beaufort Sea in our study are consistent with an

expansion of whales from a western Arctic refugial area. Genetic studies of other Pacific Arctic

marine species have proposed the presence of refugia in the eastern Bering Sea and/or near

southeastern Alaska (e.g. Barrie and Conway 1999, O’Corry-Crowe et al. 2008, Canino et al.

2010). Signals of beluga expansion were detected in the mismatch distribution (Figure 2.10,

Table 2.7), a significant Fu’s Fs test (Table 2.6), the moderate level of genetic diversity (Table

2.2), a large number of private haplotypes in the Beaufort Sea samples (Table 2.1), and

significantly different haplotypes from all other Canadian locations (Table 2.3, Figure 2.2, Figure

2.14). The presence of a single haplotype (E120) in greater than 50% of the samples, along with

private haplotypes connected by a single mutation to E120 in a star-like pattern in the network

(Figure 2.7), suggests this is an ancestral haplotype (Posada and Crandall 2001) unique to the

Beaufort Sea area. The time of expansion for this stock, as estimated by mismatch analysis

(Table 2.7), is approximately 53 kilo anni (ka) before present (BP), which suggests beluga

expansion into the western Canadian Arctic occurred before the glacial retreat from the last

glacial maximum approximately 12 ka BP. Approximately 73 ka BP, the largest volcano

eruption of the last two million years occurred at the Toba volcano in northern Sumatra and was

considered in some studies to have contributed to climate cooling and changes in global

environments (Williams et al. 2009). Despite this major climate event, it is unclear what direct

and enduring ecological effects the eruption may have had on global climate and wild

populations (Louys 2007, Williams 2012). The late Pleistocene, in general, was characterized by

environmental fluctuations (Louys 2012), and beluga expansions in the western Canadian Arctic

may have coincided with glacial/interglacial cycles of the late Pleistocene epoch.

Page 104: Genetic diversity, population structure and phylogeography ...

86

However, dominant expansion from the west (Bering Sea) into the central Canadian

Arctic is not supported by our current data. Mismatch distribution in these samples (Figure 2.11)

and significant Fu’s Fs values (Table 2.6) do support a pattern of a recent expansion into this

area. However, it would be expected that samples from locations along pathways of

recolonizations from a single western refugium would be marked by a steady reduction of

genetic diversity and evidence of founder events (Hewitt 2000, Excoffier et al. 2009, Slatkin and

Excoffier 2012). Sample collections for this area (High Arctic locations, Western Hudson Bay,

Belcher Islands, Cumberland Sound) had the highest genetic diversity of all the summering areas

(Table 2.2). Given that these areas were covered with an ice sheet complex at the last glacial

maximum (Dyke 2004), it is unlikely that any of these locations were themselves refugial areas.

The patterns of genetic diversity instead indicate that the central Canadian Arctic is a contact

zone of colonizing belugas emigrating from multiple different refugia which resulted in

admixture of ancestral haplotype lineages (Petit et al. 2003, Maggs et al. 2008).

The present geographic distribution of haplotype E11, found in all sample collections

except Cumberland Sound and the St. Lawrence Estuary, suggests it is one of these ancestral

haplotypes. This distribution, and its position in the network (Figure 2.7) and ML gene tree

(Figure 2.6) clustering with E120, would indicate that there was at least some expansion of

beluga from a western refugium (N Pacific/E Bering Sea) into the contact zone, especially in the

High Arctic locations.

With this interpretation, the presence of haplotype E11 in 100% of whales tagged over

three years at Cape Hope Island in SE James Bay is quite unusual. A small occurrence of a

“western haplotype” was noted by de March and Postma (2003) for the Belcher Islands and

eastern Hudson Bay, and was further suggested to have infiltrated the area via recolonization

Page 105: Genetic diversity, population structure and phylogeography ...

87

routes from the west through Fury and Hecla Strait approximately 5 ka BP. At this time, glacial

retreat and depressed coastlines would have provided connections between the western Arctic

and Hudson Bay. This connection would have disappeared around 3 to 4 ka BP when land levels

rose due to postglacial rebound, which then contributed to a loss of continued gene flow between

these areas (de March and Postma 2003). Studies of the postglacial distribution of radiocarbon

aged bowhead whale (Balaena mysticetus) fossils (Dyke et al. 1996) and the historical and

present-day patterns of gene flow in bowhead whales (Alter et al. 2012) provide evidence in

support of this hypothesis.

Based on the samples available for our analyses, an origin for the SE James Bay beluga

genetic signature could be the result of a leptokurtic founder event at the leading edge of a west-

to-east postglacial recolonization route through Fury and Hecla Strait. Revised maps of North

American deglaciation, with updated timing of ice margin history, indicate this route was open

approximately 7 ka BP (Dyke 2004). Leptokurtic movement is a rare, long-distance dispersal

event where some individuals move far ahead of the distribution of the main population (Hewitt

1996). This type of range expansion can result in the “establishment of pocket populations….that

tend to show a high level of inbreeding within populations and of differentiation between

populations” (Ibrahim et al. 1996). Given that belugas have been shown to travel thousands of

kilometres in short time periods (one whale was shown to travel over 5000 km in just over three

months (Heide-Jørgensen et al. 2003)), this pattern for beluga dispersal is possible. Recent

research has further suggested that leptokurtic dispersal may be linked to individual variation of

behavioural traits for boldness and exploration of habitats (Fraser et al. 2001, Hawkes 2009,

Canestrelli et al. 2016) and that these behavioural polymorphisms can shape “current geographic

patterns of species distributions and genetic diversity” (Canestrelli et al. 2016). In fact,

Page 106: Genetic diversity, population structure and phylogeography ...

88

individual behavioural characteristics such as this (i.e. related to movement patterns) can be

important for the evolution and establishment of novel traits, and it can occur over a relatively

fast evolutionary timescale (fewer than 100 generations) (Zuk et al. 2014). Predator avoidance,

foraging and food preference, and mating and mate preferences are other behaviours that can act

as novel selection pressures (Zuk et al. 2014), and may also have the potential to influence trait

evolution in belugas.

The High Arctic locations, western and northern Hudson Bay and Cumberland Sound all

have large proportions of an ancestral haplotype E72, which is absent or only present in small

proportions in all other sample collections (Figure 2.10). The absence of this haplotype in the

Beaufort Sea samples suggests a non-western refugial source, likely in the eastern Canadian

Arctic, contributed to the expansion of belugas into these areas after ice receded, and contributed

to the observed genetic data patterns consistent with a postglacial contact zone. The important

distinction in this study is that this refugium is not the same as the source of the “eastern

assemblage” of haplotypes hypothesized previously (Brown Gladden et al. 1997, de March and

Postma 2003). The haplotypes connected to E72 (Haplogroup 1A) are significantly differentiated

with high divergence (Table 2.5) from this eastern assemblage (included in this study with

Haplogroup 2).

Samples from Cumberland Sound have genetic signatures of high diversity (h = 0.8635,

Table 2.2), large numbers of private haplotypes (Figure 2.10) and are from an area with a small,

non-migrating population of belugas (Richard et al. 2010). It is likely that these samples have

been collected from a mixture of this resident population and migrating whales from other stocks

that temporarily move into parts of Cumberland Sound; however, the data would suggest that a

refugial population of belugas was located near the coast of Southeast Baffin Island. Coastal,

Page 107: Genetic diversity, population structure and phylogeography ...

89

periglacial refugia have been proposed for the North Atlantic (Maggs et al. 2008) and several

possible offshore refugial areas indicated for marine mammals (O’Corry-Crowe et al. 2008).

Patterns of ice retreat at the end of the last glacial maximum indicate that Cumberland Sound

started to open approximately 10.25 ka BP (Dyke 2004). Thus, this may have been the first area

recolonized by expansion from a nearby refugium.

Genetic patterns of range expansion by belugas from the North Atlantic or Baffin Bay

into Hudson Bay and towards the High Arctic are also consistent with deglaciation patterns. Ice

retreat at the east end of Hudson Strait started to fluctuate around16 ka BP (Dyke 2004). By

approximately 9 ka BP, Hudson Strait was open from the east as was Lancaster Sound and the

Gulf of Boothia in the High Arctic (though movement north along the east coast of Baffin Island

to the entrance of Lancaster Sound was possible much earlier) (Dyke 2004). Movement by

belugas into the central part of Hudson Bay would have been possible around 7.8 – 7.6 ka BP

(Dyke 2004). The estimated time since expansion for this group of samples of 14 ka BP based on

mismatch distribution analyses (Haplogroup 1A, Table 2.7) supports this hypothesis.

All of these estimated times since expansion need to be interpreted in the context of the

phylogenetic analyses. Disagreement in the outcomes of different tree-building methods using

the same dataset, as observed in the analyses of the beluga data, is often due to conflicts arising

from the complex interconnections of within-species data (Morrison 2005, Huson and

Scornavacca 2010). Bifurcating trees with simple hierarchical connections are often not able to

capture the evolutionary impact of population-level processes, nor the relationship complexities

of inter-breeding populations (Posada and Crandall 2001). Furthermore, the beluga phylogenetic

trees for all methods resulted in branching patterns (i.e. short branches) typical of incomplete

lineage sorting that often occurs when there has been few generations since divergence

Page 108: Genetic diversity, population structure and phylogeography ...

90

(Maddison 1997). Both large ancestral effective population sizes (Nichols 2001) and relatively

few generations since divergence (Maddison 1997) result in a higher likelihood that lineages will

fail to completely sort out at the same time as speciation events, or population sub-division

within a species (Maddison and Knowles 2006).

The most cryptic patterns for beluga haplotype distributions and diversity remain with the

“eastern mtDNA assemblage” (Brown Gladden et al. 1997, de March and Postma 2003). In our

study, the beluga haplotype lineages that cluster in Haplogroup 2 were highly divergent from all

other haplotypes (Figures 2.3 – 2.6, Table 2.5), and the number of mutations separating the two

clades (4+ in the haplotype network, Figure 2.7) indicates a long separation between the

ancestral population of this group from all other Canadian beluga stocks (Maggs et al. 2008).

However, the geographic distances between the two SE Hudson Bay sample collections and the

St. Lawrence Estuary population, and the differences in movement patterns and habitat use

among these areas suggest further evolutionary patterns. The very low genetic diversity in the St.

Lawrence samples (Table 2.2), plus the lack of any shared haplotypes with any other location

(Figure 2.14), suggests that colonization of the St. Lawrence Estuary was from a different glacial

refugium as eastern Hudson Bay and the Belcher Islands. This pattern of multiple refugia with

little or no contact has been described for other species (see Maggs et al. 2008), notably for

hermit crabs (Pagurus longicarpus) in areas of the North Atlantic around Maine (US) and Nova

Scotia (Canada) (Young et al. 2002). Beluga may have inhabited a similar refugial area to hermit

crabs. The Champlain Sea was an extension of the Atlantic Ocean that encompassed the St.

Lawrence River around 13 – 9 ka BP (Cronin et al. 2008). Approximately 80% of whale remains

found in the area of the Champlain Sea have been identified as beluga (N= 21 beluga fossils)

(Harington 2006, 2008). A nearly complete beluga skeleton has been radiocarbon dated to 10.7

Page 109: Genetic diversity, population structure and phylogeography ...

91

ka BP (Harington 2006), and several other fossilized beluga remains to 10.4 ka BP (Harington

2008). The prominence of beluga fossils in the Champlain Sea and the ages of these fossils

suggest that this was suitable habitat for belugas as deglaciation of the St. Lawrence River began

and the Champlain Sea formed approximately 11.5 ka BP (Dyke 2004, Cronin et al. 2008).

Ancient DNA analyses of these beluga fossils would add additional data to further molecular

phylogeographic investigations of refugia in this area of the North Atlantic (Gavin et al. 2014).

The samples collected from the Belcher Islands ice entrapments also contained, at

relatively high frequency, haplotypes that were not found in any other collection (Table 2.1,

Figure 2.14). Similarly, the eastern Hudson Bay samples contained the highest proportion of

other haplotypes clustered in Haplogroup 2. This indicates that belugas used different post-

glacial colonization routes into southern Hudson Bay with different timing and/or refugia than

eastern Hudson Bay belugas (Maggs et al. 2008, Keppel et al. 2012). These refugia were both

likely in the Atlantic Ocean, with expansion occurring along routes through Hudson Strait. The

mismatch distribution estimated time of expansion for the eastern Hudson Bay beluga stock is

approximately 4.7 ka BP (Table 2.7). This estimate agrees very closely with radiocarbon dated

beluga fossil material from the Great Whale River, Quebec, which suggested belugas had

recolonized eastern Hudson Bay by at least 4.3 ka BP (Harington 2003). The timing of

deglaciation patterns for Hudson Strait and Hudson Bay indicate this would have been possible

from an expansion from the northeast Atlantic well before that time, approximately 7.6 – 7.2 ka

BP (Dyke 2004).

However, as the Laurentide Ice Sheet began to recede from the south, along its edge, the

Champlain Sea was joined to a series of interconnected glacial lakes approximately 11.5 ka BP

(Cronin et al. 2008), including Lake Agassiz and Lake Ojibway. These “gigantic” lakes

Page 110: Genetic diversity, population structure and phylogeography ...

92

disappeared after northwards drainage into Hudson Bay approximately 7.7 ka BP (Dyke 2004).

Furthermore, ice scour patterns indicate that the initial drainage of these lakes may have been

into western Hudson Bay and that more than one northward flushing may have occurred (Dyke

2004). The presence of beluga fossils, discussed previously, provide evidence of belugas

inhabiting the Champlain Sea (Harington 2006, 2008) and gives support to the possibility of

belugas recolonizing Hudson Bay via an expansion from the south (de March and Postma 2003).

Belugas have been shown to move into warmer, brackish waters and lakes (St. Aubin et al. 1990,

Boily 1995, Kocho-Schellenberg 2010), and warmer sea surface temperatures in the North

Atlantic at the end of last glacial maximum (Hewitt et al. 2001) may have allowed persistence in

these glacial lakes.

An alternate possibility is the existence of cryptic glacial refugia within Hudson Bay

itself. The existence of these refugia, not identified in paleorecords (Keppel et al. 2012), have

been proposed in the Arctic for a diverse range of species, including white spruce (Picea glauca)

(Anderson et al 2006), polar tiger moths (Pararctia subnebulosa) (Bolotov et al. 2015), and

collared lemmings (Dicrostonyx groenlandicus) (Fedorov and Stenseth 2002). Computer

modelling of the Laurentide ice sheet(s) in North America indicates that erratics in ice formation

and ice flow were common around the Belcher Islands and Hudson-James Bay lowland (Prest

1990). Given the ability of modern belugas to navigate areas of shifting and heavy ice cover

(Richard et al. 2001, Suydam et al. 2001), perhaps suitable habitat for belugas was present and

allowed for the persistence of whales in these areas throughout the last glacial maximum.

Drawing conclusions about historical expansion patterns from phylogeographic analyses

of modern genetic diversity must consider a number of confounding influences. Marjoram and

Donnelly (1994) point out that the interpretation of non-unimodal distributions of mtDNA

Page 111: Genetic diversity, population structure and phylogeography ...

93

pairwise differences need to consider not only a hypothesis of constant population size or

geographic subdivision but a variety of other population genetics models such as selection,

alternate mechanisms of mutation and unique population or migration dynamics. Certainly,

migration strategies of belugas vary among regions and within regions, such as SE Hudson Bay

and James Bay. However, present migration patterns may not entirely reflect post-glacial

colonization routes. Migration routes are shaped by numerous factors other than historic and

genetic influences, such as: barriers, mortality costs and/or predation, energy costs, and

navigational cues including search behaviours and memory (Alerstam et al. 2003, Avgar et al.

2013, Shaw and Couzin 2013, Fagan et al. 2013). Evidence of close kin associations during

migration in extant belugas has demonstrated the importance of social learning as a behavioural

driver of migration routes (Colbeck et al. 2012), which can also evolve over ecological

timescales (Zuk et al. 2014).

The impact of historic commercial harvesting and other human activities (such as

hydroelectric development, shipping and industrial pollution) are also likely to have influenced

the extant genetic patterns of Canadian beluga stocks (e.g. Allendorf et al. 2008). Starting in the

1500s, Spanish and French Basque commercial whalers hunted several species of whales nearly

to extinction along the east coast of Canada (Pope 2015). Though bowhead whales and right

whales (Eubaleana glacialis) were the primary targets of commercial whaling (McLeod et al.

2008), belugas were also taken, especially from the 1850s to the early 1900s in Cumberland

Sound, Ungava Bay, eastern Hudson Bay and the St. Lawrence Estuary (Reeves and Mitchell

1989, Reeves and Smith 2003, Lawson et al. 2006). This historical commercial whaling reduced

the numbers of belugas in these areas by approximately 75% (Cumberland Sound, Eastern

Hudson Bay) to 90% (St. Lawrence Estuary and Ungava Bay) (Lawson et al. 2006). The low

Page 112: Genetic diversity, population structure and phylogeography ...

94

level of haplotype diversity in the St. Lawrence Estuary (Table 2.2) likely reflects a demographic

bottleneck as a result of the severe reduction in population size (e.g. Nyström et al. 2006).

Conversely, high levels of haplotype diversity in Cumberland Sound and eastern Hudson Bay

beluga samples could indicate that the source populations for expansion into these areas may

have been large in size and/or high in genetic diversity and were able to retain diversity despite

high levels of historic commercial exploitation. For example, a similar history of commercial

exploitation and near extinction of Atlantic walrus (Odebenus rosmarus rosmarus) in Svalbard

was assessed for genetic impacts using ancient DNA techniques (Lindqvist et al. 2016). Ancient

sample haplotypes were nested in the modern samples, and higher than expected mtDNA

sequence variation was found in the modern Svalbard samples, indicating that intense hunting

may not have resulted in an extensive loss of mtDNA genetic variation. Again, ancient DNA

analyses of pre-commercial whaling beluga specimens and fossils would enhance the

interpretation of genetic patterns of modern beluga samples.

2.4.3 Predicting changes in movement patterns and population genetic structure among

Canadian beluga populations

Genetic information is essential for the development and modification of conservation

and management plans that seek to maintain species biodiversity vulnerable to stressors such as

changing climate and anthropogenic activities (e.g. Waples et al. 2008, Sgrò et al. 2010). The

identification of Canadian beluga stocks based on body size differences, summering aggregation

areas and migration patterns has been used for decades to evaluate the impacts of human

activities such as industrial development (e.g. hydro-electric development, shipping) and hunting

on the species biodiversity, abundance, and population trajectories (Reeves and Mitchell 1989,

Page 113: Genetic diversity, population structure and phylogeography ...

95

COSEWIC 2004). The concerns over the impacts of climate change, both direct and indirect, on

belugas have been increasing as the potential magnitude of threats due to cumulative stressors

are assessed (e.g. Laidre 2008, O’Corry-Crowe et al. 2008, Moore and Huntington 2008,

Huntington 2009, Laidre et al. 2015). Genetic tools can add both evolutionary insights for beluga

population histories and contemporary data for understanding current population diversity and

structure (O’Corry-Crowe et al. 2008).

Belugas are considered to be moderately sensitive to climate change and was the least

sensitive of the Arctic cetaceans because of its overall relative flexibility (Laidre et al. 2008).

The results of this study reveal a genetic diversity and suggest a phylogeographic history across

the Canadian range of belugas that further supports this idea of climate change resilience. In

contrast, narwhals (Monodon monoceros) are considered the most sensitive Arctic cetacean to

climate change as a consequence of close associations with ice and specialized feeding (Laidre et

al. 2008). Despite ecological similarities to belugas in migrations, area usage and large

population sizes, narwhals have very low levels of mtDNA genetic diversity (Palsbøll et al.

1997, de March et al. 2003) The identification of greater haplotype diversity in the Canada-wide

distribution of belugas has yielded more insight about the demographic histories of these whales

in many areas. Southeastern Hudson Bay and James Bay have a more complex mixture of

overlapping stocks than previously defined, and genetic evidence supports observations of

diverse migration and movement patterns and could indicate “uniquely adapted ecologies and

behaviours” in this area (Reeves and Mitchell 1989, O’Corry-Crowe et al. 2008). However, the

effects of a demographic bottleneck and reduction of the number of haplotypes in the St.

Lawrence Estuary appear more severe that previously indicated (Brown Gladden et al. 1997, de

March and Postma 2003) and highlight the impact that human activities can have on isolated

Page 114: Genetic diversity, population structure and phylogeography ...

96

beluga populations. The inferred ability of belugas to respond to historic periods of climate

change through the use of multiple refugia, routes of range expansions and recolonizations routes

reinforce the idea of the flexibility of this species to shift its range in response to changing

climate (Huntington 2008).

2.5 Acknowledgements

Most of the samples for genetic analyses in this study were collected by hunters from

communities in the Inuvialuit Settlement Region (ISR), Nunavut, and Nunavik during harvest

monitoring programs funded by the Fisheries Joint Management Committee (FJMC), the

Nunavut Wildlife Management Board (NWMB), the Nunavik Inuit Land Claims Agreement

(NICLA), and Fisheries and Oceans Canada. Countless scientific studies would not be possible

without these partnerships. The efforts of many dedicated individuals to coordinate these sample

collections and other sampling endeavours, the shipping and sorting of sample kits, archiving

materials in an organized system and maintaining sample information databases are immensely

appreciated. Denise Tenkula (DFO Central and Arctic) provided laboratory assistance, as well as

students and others (Vanessa Kornelsen, Lucy Johnson, Susie Bajno, Sheri Friesen) who helped

out over the duration of this study.

2.6 References

Achilli, A., Rengo, C., Magri, C., Battaglia, V., Olivieri, A., Scozzari, R., Cruciani, F., Zeviani,

M., Briem, E., Carelli, V. and Moral, P. 2004. The molecular dissection of mtDNA haplogroup

H confirms that the Franco-Cantabrian glacial refuge was a major source for the European gene

pool. The American Journal of Human Genetics 75: 910-918.

Alerstam, T., Hedenström, A. and Åkesson, S. 2003. Long‐distance migration: evolution and

determinants. Oikos 103: 247-260.

Page 115: Genetic diversity, population structure and phylogeography ...

97

Alexander, A., Steel, D., Hoekzema, K., Mesnick, S.L., Engelhaupt, D., Kerr, I., Payne, R. and

Baker, C.S. 2016. What influences the worldwide genetic structure of sperm whales (Physeter

macrocephalus)? Molecular Ecology 25: 2754-2772.

Amos, W. and Hoelzel, A.R. 1992. Applications of molecular genetic techniques to the

conservation of small populations. Biological Conservation 61: 133-144.

Anderson, B. G. and Borns Jr, H. W. 1997. The ice age world. An introduction to Quaternary

history and research with emphasis on North America and Europe during the last 2.5 million

years. Scandinavian University Press, Oslo, Norway.

Anderson, L.L., Hu, F.S., Nelson, D.M., Petit, R.J. and Paige, K.N. 2006. Ice-age endurance:

DNA evidence of a white spruce refugium in Alaska. Proceedings of the National Academy of

Sciences USA 103: 12447-12450.

Anisimova, M., and Gascuel, O. 2006. Approximate likelihood-ratio test for branches: a fast,

accurate, and powerful alternative. Systematic Biology 55: 539-552.

Armstrong, T. 2013. Management plan for the beluga (Delphinapterus leucas) in Ontario.

Ontario Management Plan Series. Prepared for the Ontario Ministry of Natural Resources.

Peterborough, Ontario. Vi + 58pp.

Avgar, T., Street, G. and Fryxell, J.M. 2013. On the adaptive benefits of mammal migration.

Canadian Journal of Zoology 92: 481-490.

Avise, J.C., Arnold, J., Ball, R.M., Bermingham, E., Lamb, T., Neigel, J.E., Reeb, C.A. and

Saunders, N.C. 1987. Intraspecific phylogeography: the mitochondrial DNA bridge between

population genetics and systematics. Annual Review of Ecology and Systematics 18: 489-522.

Avise, J.C., 2009. Phylogeography: retrospect and prospect. Journal of Biogeography 36: 3-15.

Bailleul, F., Lesage, V., Power, M., Doidge, D.W. and Hammill, M.O. 2012. Migration

phenology of beluga whales in a changing Arctic. Climate Research 53: 169-178.

Bandelt, H.-J., Forster, P., and Röhl, A. 1999. Median-joining networks for inferring

intraspecific phylogenies. Molecular Biology and Evolution 16: 37-48.

Bandelt, H.-J., Macauley, V., and Richards, M. 2000. Median networks: speedy construction and

greedy reduction, one simulation, and two case studies from human mtDNA. Molecular

Phylogenetics and Evolution 16: 8-28.

Page 116: Genetic diversity, population structure and phylogeography ...

98

Barber, D.G. andMassom, R.A. 2007. The role of sea ice in Arctic and Antarctic polynyas. In

W.O. Smith Jr. and D.G. Barber (Eds.) Polynyas: Windows to the World. Elsevier Oceanography

Series 74, doi 10.1016/S0422-9894(06)74001-6, pp. 1-54.

Barrie, J.V. and Conway, K.W. 1999. Late Quaternary glaciation and postglacial stratigraphy of

the northern Pacific margin of Canada. Quaternary Research 51: 113-123.

Bennett, K.D. and Provan, J. 2008. What do we mean by ‘refugia’?. Quaternary Science Reviews

27: 2449-2455.

Bernatchez, L. 2001. The evolutionary history of brown trout (Salmo trutta L.) inferred from

phylogeographic, nested clade, and mismatch analyses of mitochondrial DNA variation.

Evolution 55: 351-379.

Boily, P., 1995. Theoretical heat flux in water and habitat selection of phocid seals and beluga

whales during the annual molt. Journal of Theoretical Biology 172: 235-244.

Bolotov, I.N., Tatarinov, A.G., Filippov, B.Y., Gofarov, M.Y., Kondakov, A.V., Kulakova, O.I.,

Potapov, G.S., Zubryi, N.A. and Spitsyn, V.M. 2015. The distribution and biology of Pararctia

subnebulosa (Dyar, 1899)(Lepidoptera: Erebidae: Arctiinae), the largest tiger moth species in the

High Arctic. Polar Biology 38: 905-911.

Bracken, F.S., Hoelzel, A.R., Hume, J.B. and Lucas, M.C., 2015. Contrasting population genetic

structure among freshwater‐resident and anadromous lampreys: the role of demographic history,

differential dispersal and anthropogenic barriers to movement. Molecular Ecology 24: pp.1188-

1204.

Brennin, R., Murray, B.W., Friesen, M.K., Maiers (Postma), L.D., Clayton, J.W., and White,

B.N. 1997. Population genetic structure of beluga whales (Delphinapterus leucas): mitochondrial

DNA sequence variation within and among North American populations. Canadian Journal of

Zoology 75: 795-802.

Breyne, P., Mergeay, J. and Casaer, J., 2014. Roe deer population structure in a highly

fragmented landscape. European Journal of Wildlife Research 60: 909-917.

Brodie, P.F. 1969. Duration of lactation in cetacea: an indicator of required learning? American

Midland Naturalist 82: 312-314.

Page 117: Genetic diversity, population structure and phylogeography ...

99

Brown Gladden, J.G., Ferguson, M.M, and Clayton, J.W. 1997. Matriarchal genetic population

structure of North American beluga whales Delphinapterus leucas (Cetacea: Monodontidae).

Molecular Ecology 6: 1033-1046.

Canestrelli, D., Bisconti, R. and Carere, C. 2016. Bolder takes all? The behavioral dimension of

biogeography. Trends in Ecology & Vvolution 31: 35-43.

Canino, M.F., Spies, I.B., Cunningham, K.M., Hauser, L. and Grant, W.S., 2010. Multiple ice‐

age refugia in Pacific cod, Gadus macrocephalus. Molecular Ecology 19: 4339-4351.

Citta, J.J., Richard, P., Lowry, L.F., O'Corry‐Crowe, G., Marcoux, M., Suydam, R., Quakenbush,

L.T., Hobbs, R.C., Litovka, D.I., Frost, K.J. and Gray, T. 2017. Satellite telemetry reveals

population specific winter ranges of beluga whales in the Bering Sea. Marine Mammal Science

33: 236-250.

Colbeck, G.J., Duchesne, P., Postma, L.D., Lesage, V., Hammill, M.O. and Turgeon, J., 2013.

Groups of related belugas (Delphinapterus leucas) travel together during their seasonal

migrations in and around Hudson Bay. Proceedings of the Royal Society of London B:

Biological Sciences 280: 20122552.

COSEWIC 2004. COSEWIC assessment and update status report on the beluga whale

Delphinapterus leucas in Canada. Committee on the Status of Endangered Wildlife in Canada.

Ottawa. Ix + 70 pp. (www.sararegistry.gc.ca/status/status_e.cfm).

COSEWIC. 2012. Definitions and abbreviations. [Online:

http://www.cosewic.gc.ca/eng/sct2/sct2_6_e.cfm].

COSEWIC 2015. Guidelines for recognizing Designatable Units. www.cosewic.gc.ca Status

Reports. Information for Preparing Status Reports.

COSEWIC. 2016. Designatable Units for Beluga Whales (Delphinapterus leucas) in Canada.

Committee on the Status of Endangered Wildlife in Canada. Ottawa. 73 pp.

Cronin, T.M., Manley, P.L., Brachfeld, S., Manley, T.O., Willard, D.A., Guilbault, J.P.,

Rayburn, J.A., Thunell, R. and Berke, M. 2008. Impacts of post-glacial lake drainage events and

revised chronology of the Champlain Sea episode 13–9 ka. Palaeogeography, Palaeoclimatology,

Palaeoecology 262: 46-60.

Page 118: Genetic diversity, population structure and phylogeography ...

100

de March, B.G.E., Maiers (Postma), L.D., and Friesen, M.K. 2002. An overview of genetic

relationships of Canadian and adjacent populations of belugas (Delphinapterus leucas) with

emphasis on Baffin Bay and Canadian eastern Arctic populations. NAMMCO Scientific

Publications 4: 17-35.

de March, B.G.E and Postma, L.D. 2003. Molecular stock discrimination of belugas

(Delphinapterus leucas) hunted in eastern Hudson Bay, northern Quebec, Hudson Strait, and

Sanikiluaq (Belcher Islands), Canada, and comparisons to adjacent populations. Arctic 56: 111-

124.

de March, B.G.E., Tenkula, D.A, and Postma, L.D. 2003. Molecular genetics of narwhal

(Monodon monoceros) from Canada and West Greenland (1982-2001). Canadian Science

Advisory Secretariat Research Document 2003/080, i + 19pp.

DFO. 2016. Updated genetic mixture analysis of Nunavik beluga (Delphinapterus leucas)

harvests. DFO Canadian Science Advisory Secretariat Science Advisory Report 2016/025.

Dyke, A.S., 2004. An outline of North American deglaciation with emphasis on central and

northern Canada. Developments in Quaternary Sciences 2: 373-424.

Dyke, A.S., Hooper, J. and Savelle, J.M., 1996. A history of sea ice in the Canadian Arctic

Archipelago based on postglacial remains of the bowhead whale (Balaena mysticetus). Arctic 49:

235-255.

Eriksson, J., Hohmann, G., Boesch, C. and Vigilant, L., 2004. Rivers influence the population

genetic structure of bonobos (Pan paniscus). Molecular Ecology 13: 3425-3435.

Fagan, W.F., Lewis, M.A., Auger‐Méthé, M., Avgar, T., Benhamou, S., Breed, G., LaDage, L.,

Schlägel, U.E., Tang, W.W., Papastamatiou, Y.P. and Forester, J. 2013. Spatial memory and

animal movement. Ecology Letters 16: 1316-1329.

Fedorov, V.B. and Stenseth, N.C. 2002. Multiple glacial refugia in the North American Arctic:

inference from phylogeography of the collared lemming (Dicrostonyx groenlandicus).

Proceedings of the Royal Society of London B: Biological Sciences 269: 2071-2077.

Felsenstein, J. 1985. Confidence limits on phylogenies: an approach using the bootstrap.

Evolution 39: 783-791.

Foulds, L.R., Hendy, M.D., and Penny, D. 1979. A graph theoretic approach to the development

of minimal phylogenetic trees. Journal of Molecular Evolution 13: 127-149.

Page 119: Genetic diversity, population structure and phylogeography ...

101

Friesen, V.L., Burg, T.M. and McCoy, K.D., 2007. Mechanisms of population differentiation in

seabirds. Molecular Ecology 16: 1765-1785.

Fraser, D.F., Gilliam, J.F., Daley, M.J., Le, A.N., and Skalski, G.T. 2001. Explaining leptokurtic

movement distributions: intrapopulation variation in boldness and exploration. The American

Naturalist 158: 124-135.

Fratini, S., Ragionieri, L., and Cannicci, S. 2010. Stock structure and demographic history of the

Indo-West Pacific mud crab Scylla serrata. Estuarine, Coastal and Shelf Science 86: 51-61.

Fu, Y.-X. 1997. Statistical tests of neutrality against population growth, hitchhiking and

background selection. Genetics 147: 915-925.

Gavin, D.G., Fitzpatrick, M.C., Gugger, P.F., Heath, K.D., Rodríguez‐Sánchez, F., Dobrowski,

S.Z., Hampe, A., Hu, F.S., Ashcroft, M.B., Bartlein, P.J. and Blois, J.L. 2014. Climate refugia:

joint inference from fossil records, species distribution models and phylogeography. New

Phytologist 204: 37-54.

Gilchrist, H.G. and Robertson, G.J. 2000. Observations of marine birds and mammals wintering

at polynyas and ice edges in the Belcher Islands, Nunavut, Canada. Arctic 53: 61-68.

Gosselin, J.-F., Lésage, V., Hammill, M., and Bourdages, H. 2002. Abundance indices of

belugas in James Bay, eastern Hudson Bay and Ungava Bay in summer 2001. Canadian Science

Advisory Secretariat Research Document 2002/042, 27pp.

Guindon, S., Lethiec, F., Duroux, P., and Gascuel, O. 2005. PHYML online – a web server for

fast maximum likelihood-based phylogenetic inference. Nucleic Acids Research 33: W557-

W559.

Hall, B. 2011. Phylogenetic Trees Made Easy. Sinauer Associates Inc. Publishers,

Massachusetts, U.S.A., 282pp.

Harlin, A.D., Markowitz, T., Baker, C.S, Würsig, B., and Honeycutt, R.L. 2003. Genetic

structure, diversity and historical demography of New Zealand’s dusky dolphin

(Lagenorhynchus obscurus). Journal of Mammalogy 84: 702-717.

Harington, C.R., 2003. Quaternary vertebrates of Québec: a summary. Géographie physique et

Quaternaire 57:85-94.

Page 120: Genetic diversity, population structure and phylogeography ...

102

Harington, C.R. 2006. Félix: a late Pleistocene white whale (Delphinapterus leucas) skeleton

from the Champlain Sea deposits at Saint-Félix-de-Valois, Québec. Géographie physique et

Quaternaire 60: 183-198.

Harington, C.R. 2008. The evolution of Arctic marine mammals. Ecological Applications 18

(Supplement): S23-S40.

Harpending, H.C. 1994. Signature of ancient population growth in a low-resolution

mitochondrial DNA mismatch distribution. Human Biology 66: 591-600.

Hasegawa, M., Kishino, K., and Yano, T. 1985. Dating the human-ape splitting by a molecular

clock of mitochondrial DNA. Journal of Molecular Evolution 22: 160-174.

Hauser, D.D., Laidre, K.L., Suydam, R.S. and Richard, P.R. 2014. Population-specific home

ranges and migration timing of Pacific Arctic beluga whales (Delphinapterus leucas). Polar

Biology 37: 1171-1183.

Hauser, D.D., Laidre, K.L., Stafford, K.M., Stern, H.L., Suydam, R.S. and Richard, P.R. 2016.

Decadal shifts in autumn migration timing by Pacific Arctic beluga whales are related to delayed

annual sea ice formation. Global Change Biology, doi: 10.1111/gcb.13564.

Hawkes, C., 2009. Linking movement behaviour, dispersal and population processes: is

individual variation a key? Journal of Animal Ecology 78: 894-906.

Heide-Jørgensen, M.P., Richard, P., Ramsay, M. and Akeeagok, S. 2002. Three recent ice

entrapments of Arctic cetaceans in West Greenland and the eastern Canadian High Arctic.

NAMMCO Scientific Publications 4: 143-148.

Hewitt, C.D., Broccoli, A.J., Mitchell, J.F. and Stouffer, R.J. 2001. A coupled model study of the

last glacial maximum: Was part of the North Atlantic relatively warm? Geophysical Research

Letters 28: 1571-1574.

Hewitt, G.M. 1996. Some genetic consequences of ice ages, and their role in divergence and

speciation. Biological Journal of the Linnean Society 58: 247-276.

Hewitt, G.M. 2000. The genetic legacy of the ice ages. Nature 405: 907-913.

Hewitt, G.M. 2004. Genetic consequences of climatic oscillations in the Quaternary.

Philosophical Transactions of the Royal Society of London B 359: 183-195.

Page 121: Genetic diversity, population structure and phylogeography ...

103

Hoelzel, A.R., Hey, J., Dahlheim, M.E., Nicholson, C., Burkanov, V. and Black, N. 2007.

Evolution of population structure in a highly social top predator, the killer whale. Molecular

Biology and Evolution 24: 1407-1415.

Holland, B.R., Penny, D., and Hendy, M.D. 2003. Outgroup misplacement and phylogenetic

inaccuracy under a molecular clock- a simulation study. Systematic Biology 52: 229-238.

Holland, B.R., Huber, K.T., Moulton, V., and Lockhart, P.J. 2004. Using consensus networks to

visualize contradictory evidence for species phylogeny. Molecular Biology and Evolution 21:

1459-1461.

Holm, S. 1979. A simple sequential rejective method procedure. Scandinavian Journal of

Statistics 6: 65-70.

Hordijk, W., and Gascuel, O. 2005. Improving the efficiency of SPR moves in phylogenetic tree

search methods based on maximum likelihood. Bioinformatics 21: 4338-4347.

Hornby, C.A., Hoover, C., Iacozza, J., Barber, D.G. and Loseto, L.L. 2016. Spring conditions

and habitat use of beluga whales (Delphinapterus leucas) during arrival to the Mackenzie River

Estuary. Polar Biology 39: 2319-2334.

Huson, D.H. and Scornavacca, C. 2011. A survey of combinatorial methods for phylogenetic

networks. Genome Biology and Evolution 3: 23-35.

Ibrahim, K.M., Nichols, R.A., and Hewitt, G.M. 1996. Spatial patterns of genetic variation

generated by different forms of dispersal during range expansion. Heredity 77: 282-291.

Irwin, D.E., 2002. Phylogeographic breaks without geographic barriers to gene flow. Evolution

56: 2383-2394.

IWC 2000. Status of Monodontid whales: white whale. Journal of Cetacean Research and

Management, 2 (SUPPL.), 243-250.

Kelley, T.C., Loseto, L.L., Stewart, R.E.A., Yurkowski, M. and Ferguson, S.H. 2010.

Importance of eating capelin: unique dietary habits of Hudson Bay beluga. In S.H. Ferguson et

al. (Eds.) A Little Less Arctic: Top Predators in the World’s Largest Northern Inland Sea,

Hudson Bay. Springer, Netherlands, pp 53-70, doi: 10.1007/978-90-481-9121-5_3.

Page 122: Genetic diversity, population structure and phylogeography ...

104

Kelley, T.C., Stewart, R.E., Yurkowski, D.J., Ryan, A. and Ferguson, S.H. 2015. Mating ecology

of beluga (Delphinapterus leucas) and narwhal (Monodon monoceros) as estimated by

reproductive tract metrics. Marine Mammal Science 31: 479-500.

Keppel, G., Van Niel, K.P., Wardell‐Johnson, G.W., Yates, C.J., Byrne, M., Mucina, L., Schut,

A.G., Hopper, S.D. and Franklin, S.E. 2012. Refugia: identifying and understanding safe havens

for biodiversity under climate change. Global Ecology and Biogeography 21: 393-404.

Klauke, N., Schaefer, H.M., Bauer, M. and Segelbacher, G., 2016. Limited dispersal and

significant fine-scale genetic structure in a tropical montane parrot species. PLoS One 11:

p.e0169165.

Klütsch, C.F., Manseau, M. and Wilson, P.J. 2012. Phylogeographical analysis of mtDNA data

indicates postglacial expansion from multiple glacial refugia in woodland caribou (Rangifer

tarandus caribou). PLoS One 7, p.e52661.

Knowles, L. L. and Maddison, W.P. 2002. Statistical phylogeography. Molecular Ecology 11:

2623-2635.

Kocho-Schellenberg, J.E. 2010. Understanding the evolution of beluga entrapment co-

management in the Inuvialuit Settlement Region using social network analysis. Master’s Thesis,

Faculty of Environment, Earth and Resources, University of Manitoba, Winnipeg, 138pp.

Krasnova, V.V., Chernetsky, A.D., Zheludkova, A.I. and Bel’kovich, V.M., 2014. Parental

behavior of the beluga whale (Delphinapterus leucas) in natural environment. Biology Bulletin,

41: 349-356.

Kumar, S., Tamura, K., and Nei, M. 2004. MEGA3: Integrated software for Molecular

Evolutionary Genetics Analysis and Sequence Alignment. Briefings in Bioinformatics 5: 150-

163.

Laidre, K.L., Stirling, I., Lowry, L.F., Wiig, Ø., Heide-Jørgensen, M.P. and Ferguson, S.H. 2008.

Quantifying the sensitivity of Arctic marine mammals to climate‐induced habitat change.

Ecological Applications 18 (Supplement): S97-S125.

Lawson, J., Hammill, M., and Stenson, G. 2006. Characteristics for recovery: beluga whale.

DFO Canadian Science Advisory Secretariat Research Document 2006/075. ii + 16pp.

Page 123: Genetic diversity, population structure and phylogeography ...

105

Lefebvre, S.L., Michaud, R., Lesage, V. and Berteaux, D. 2012. Identifying high residency areas

of the threatened St. Lawrence beluga whale from fine-scale movements of individuals and

coarse-scale movements of herds. Marine Ecology Progress Series: 243-257.

Levin, I.I. and Parker, P.G., 2012. Philopatry drives genetic differentiation in an island

archipelago: comparative population genetics of Galapagos Nazca boobies (Sula granti) and

great frigatebirds (Fregata minor). Ecology and Evolution 2: 2775-2787.

Librado, P. and Rozas, J. 2009. DnaSP v5: A software for comprehensive analysis of DNA

polymorphism data. Bioinformatics 25: 1451-1452.

Lindqvist, C., Roy, T., Lydersen, C., Kovacs, K.M., Aars, J., Wiig, Ø. and Bachmann, L. 2016.

Genetic diversity of historical Atlantic walruses (Odobenus rosmarus rosmarus) from Bjørnøya

and Håøya (Tusenøyane), Svalbard, Norway. BMC Research Notes 9: 112.

Loseto, L.L., Richard, P., Stern, G.A., Orr, J. and Ferguson, S.H. 2006. Segregation of Beaufort

Sea beluga whales during the open-water season. Canadian Journal of Zoology 84: 1743-1751.

Loseto, L.L., Stern, G.A. and Ferguson, S.H. 2008. Size and biomagnification: how habitat

selection explains beluga mercury levels. Environmental Science & Technology 42: 3982-3988.

Louis, M., Viricel, A., Lucas, T., Peltier, H., Alfonsi, E., Berrow, S., Brownlow, A., Covelo, P.,

Dabin, W., Deaville, R. and Stephanis, R. 2014. Habitat‐driven population structure of bottlenose

dolphins, Tursiops truncatus, in the North‐East Atlantic. Molecular Ecology 23: 857-874.

Louys, J. 2007. Limited effect of the Quaternary's largest super-eruption (Toba) on land

mammals from Southeast Asia. Quaternary Science Reviews 26: 3108-3117.

Louys, J. 2012. Mammal community structure of Sundanese fossil assemblages from the Late

Pleistocene, and a discussion on the ecological effects of the Toba eruption. Quaternary

International 258: 80-87.

Maddison, W.P., 1997. Gene trees in species trees. Systematic Biology 46: 523-536.

Maddison, W.P. and Knowles, L.L. 2006. Inferring phylogeny despite incomplete lineage

sorting. Systematic Biology 55: 21-30.

Maggs, C.A., Castilho, R., Folta, D., Henzler, C, and Wares, J. 2008. Evaluating signatures of

glacial refugia for North Atlantic benthic marine taxa. Ecology 89 (Supplement): S108-S122.

Page 124: Genetic diversity, population structure and phylogeography ...

106

Marjoram, P. and Donnelly, P. 1994. Pairwise comparisons of mitochondrial DNA sequences in

subdivided populations and implications for early human evolution. Genetics 136: 673-683.

Martin, A.R., Hall, P. and Richard, P.R. 2001. Dive behaviour of belugas (Delphinapterus

leucas) in the shallow waters of western Hudson Bay. Arctic 54: 276-283.

Matias, A.M.A., Anticamara, J.A. and Quilang, J.P., 2013. High gene flow in reef fishes and its

implications for ad-hoc no-take marine reserves. Mitochondrial DNA 24: 584-595.

McLeod, B.A., Brown, M.W., Moore, M.J., Stevens, W., Barkham, S.H., Barkham, M. and

White, B.N. 2008. Bowhead whales, and not right whales, were the primary target of 16th-to

17th-century Basque whalers in the western North Atlantic. Arctic 61:61-75.

Meschersky, I.G., Kholodova, M.V., and Zvychaynaya, E.Y. 2008. Molecular genetic study of

the beluga (Delphinapterus leucas: Cetacea, Monodontidae) summering in the southern Sea of

Okhotsk as compared to North American populations. Russian Journal of Genetics 44: 1105 –

1110.

Monteiro, S.S., Méndez-Fernandez, P., Piertney, S., Moffat, C.F., Ferreira, M., Vingada, J.V.,

López, A., Brownlow, A., Jepson, P., Mikkelsen, B. and Niemeyer, M. 2015. Long-finned pilot

whale population diversity and structure in Atlantic waters assessed through biogeochemical and

genetic markers. Marine Ecology Progress Series 536: 243-257.

Morin, P.A., Martien, K.K., Archer, F.I., Cipriano, F., Steel, D., Jackson, J. and Taylor, B.L.,

2010. Applied conservation genetics and the need for quality control and reporting of genetic

data used in fisheries and wildlife management. Journal of Heredity 101: 1-10.

Morrison, D.A. 2005. Networks in phylogenetic analysis: new tools for population biology.

International Journal for Parasitology 35: 567-582.

Nei, M. and Kumar, S. 2000. Molecular Evolution and Phylogenetics. Oxford University Press,

New York, U.S.A., 333pp.

Nichols, R. 2001. Gene trees and species trees are not the same. Trends in Ecology and

Evolution 16: 358-364.

Nichols, R.A. and Hewitt, G.M., 1994. The genetic consequences of long distance dispersal

during colonization. Heredity 72: 312-317.

Page 125: Genetic diversity, population structure and phylogeography ...

107

Nyström, V., Angerbjörn, A. and Dalén, L. 2006. Genetic consequences of a demographic

bottleneck in the Scandinavian arctic fox. Oikos 114: 84-94.

O’Corry-Crowe, G.M., Suydam, R.S., Rosenberg, A., Frost, K.J., and Dizon, A.E. 1997.

Phylogeography, population structure and dispersal patterns of the beluga whale Delphinapterus

leucas in the western Nearctic revealed by mitochondrial DNA. Molecular Ecology 6: 955-970.

O’Corry-Crowe, G.M. 2008. Climate change and the molecular ecology of Arctic marine

mammals. Ecological Applications 18 (Supplement): S56-S76.

O’Corry-Crowe, G. 2009. Beluga whale, Delphinapterus leucas. In W.F. Perrin, B. Würsig and

J.G.M. Thewissen (Eds.), Encyclopedia of Marine Mammals, Elsevier Academic Press,

Burlingtom, MA, USA.

O’Corry-Crowe, G.M., Lydersen, C., Heide-Jørgensen, M.P., Hansen, L, Mukhametov, L.M.,

Dove, O., and Kovacs, K. 2010. Population genetic structure and evolutionary history of North

Atlantic beluga whales (Delphinapterus leucas) from West Greenland, Svalbard and the White

Sea. Polar Biology 33: 1179-1194.

O’Corry-Crowe, G.M., Lucey, W., Archer, F.I. and Mahoney, B. 2015. The genetic ecology and

population origins of the beluga whales, Delphinapterus leucas, of Yakutat Bay. Marine

Fisheries Review 77: 47-58.

Palsbøll, P., Heide-Jørgensen, M.P. and Dietz, R. 1997. Genetic studies of narwhals, Monodon

monoceros, from West and East Greenland. Heredity 78: 284-292.

Palsböll, P.J., Heide-Jørgensen, M.P., and Bérubé, M. 2002. Analysis of mitochondrial control

region nucleotide sequences from Baffin Bay beluga (Delphinapterus leucas): detecting pods or

sub-populations? NAMMCO Scientific Publications 4: 39-51.

Parsons, K.M., Durban, J.W., Burdin, A.M., Burkanov, V.N., Pitman, R.L., Barlow, J., Barrett-

Lennard, L.G., LeDuc, R.G., Robertson, K.M., Matkin, C.O. and Wade, P.R. 2013. Geographic

patterns of genetic differentiation among killer whales in the northern North Pacific. Journal of

Heredity 104: 737-754.

Peakall, R. and Smouse, P.E. 2006. GENALEX 6: genetic analysis in excel. Population

genetic software for teaching and research. Molecular Ecology Notes 6: 288-295.

Page 126: Genetic diversity, population structure and phylogeography ...

108

Pelletier, A., Obbard, M.E., Mills, K., Howe, E.J., Burrows, F.G., White, B.N., and Kyle, C.J.

2012. Delineating genetic groupings in continuously distributed species across largely

homogeneous landscapes: a study of American black bears (Ursus americanus) in Ontario,

Canada. Canadian Journal of Zoology 90: 999-1014.

Pilot, M., Dahlheim, M.E. and Hoelzel, A.R. 2010. Social cohesion among kin, gene flow

without dispersal and the evolution of population genetic structure in the killer whale (Orcinus

orca). Journal of Evolutionary Biology 23: 20-31.

Posada, D. 2008. jModelTest: Phylogenetic model averaging. Molecular Biology and Evolution

25: 1253-1256.

Posada, D. and Crandall, K.A. 2001. Intraspecific gene genealogies: trees grafting into networks.

Trends in Ecology and Evolution 16: 37-45.

Postma, L.D., Petersen, S.D., Turgeon, J., Hammill, M.O., and Doniol-Valcroze, T. 2012. Beluga

whales in James Bay: a separate entity from eastern Hudson Bay belugas? DFO Canadian

Science Advisory Secretariat Research Document 2012/074. iii + 23pp.

Provan, J., and Bennett, K.D. 2008. Phylogeographic insights into cryptic glacial refugia. Trends

in Ecology and Evolution 23: 564-571.

Quakenbush, L.T., Suydam, R.S., Bryan, A.L., Lowry, L.F., Frost, K.J. and Mahoney, B.A.

2015. Diet of beluga whales (Delphinapterus leucas) in Alaska from stomach contents, March–

November. Marine Fisheries Review 77: 70-84.

Quintela, M., Skaug, H.J., Øien, N., Haug, T., Seliussen, B.B., Solvang, H.K., Pampoulie, C.,

Kanda, N., Pastene, L.A. and Glover, K.A. 2014. Investigating population genetic structure in a

highly mobile marine organism: the minke whale Balaenoptera acutorostrata acutorostrata in

the North East Atlantic. PLoS One 9: p.e108640.

Rambaut, A., Suchard, M.A., Xie, D., and Drummond, A.J. 2014. Tracer v1.6. Available from

http://beast.bio.ed.ac.uk/Tracer

Ramos-Onsins, S.E. and Rozas, J. 2002 Statistical properties of new neutrality tests against

population growth. Molecular Biology and Evolution 19: 2092-2100.

Ray, N., Currat, M. and Excoffier, L. 2003. Intra-demem molecular diversity in spatially

expanding populations. Molecular Biology and Evolution 20: 76-86.

Page 127: Genetic diversity, population structure and phylogeography ...

109

Reeves, R.R. and Mitchell, E. 1989. Status of white whales, Delphinapterus leucas, in Ungava

Bay and eastern Hudson Bay. Canadian Field-Naturalist 103: 220-239.

Reeves, R.R. and Smith, T.D. 2006. A taxonomy of world whaling. Whales, whaling, and ocean

ecosystems (JA Estes, DP DeMaster, DF Doak, TM Williams, and RL Brownell, Jr. (eds.).

University of California Press, Berkeley, pp.82-101.

Richard, P.R., Heide-Jørgensen, M.P., Orr, J.R., Dietz, R. and Smith, T.G. 2001. Summer and

autumn movements and habitat use by belugas in the Canadian High Arctic and adjacent areas.

Arctic 54: 207-222.

Richard, P. 2010. Stock definition of belugas and narwhals in Nunavut. DFO Canadian Science

Advisory Secretariat Research Document 2010/022. iv + 14p.

Rogers, A.R. and Harpending, H. 1992. Population growth makes waves in the distribution of

pairwise genetic differences. Molecular Biology and Evolution 9: 552-569.

Rogers, A.R., Fraley, A.E., Bamshad, M.J., Watkins, W.S., and Jorde, L.B. 1996. Mitochondrial

mismatch analysis is insensitive to the mutational process. Molecular Biology and Evolution 13:

895-902.

Ronquist, F., Teslenko, M., van der Mark, P., Aures, D.L., Darling, A., Höhna, S., Larget, B.,

Liu, L., Suchard, M.A., and Huelsenbeck, J.P. 2012. MrBayes 3.2: Efficient Bayesian

phylogenetic inference and model choice across a large model space. Systematic Biology 61:

539-542.

Schenekar, T. and Weiss, S. 2011. High rate of calculation errors in mismatch distribution

analysis results in numerous false inferences of biological importance. Heredity 107: 511-512.

Seutin, G., White, B.N., and Boag, P.T. 1991. Preservation of avian blood and tissue samples

for DNA analyses. Canadian Journal of Zoology 69: 82-9.

Sgrò, C.M., Lowe, A.J. and Hoffmann, A.A. 2011. Building evolutionary resilience for

conserving biodiversity under climate change. Evolutionary Applications 4: 326-337.

Shaw, A.K. and Couzin, I.D. 2013. Migration or residency? The evolution of movement

behavior and information usage in seasonal environments. The American Naturalist 181: 114-

124.

Page 128: Genetic diversity, population structure and phylogeography ...

110

Slatkin, M. and Excoffier, L. 2012. Serial founder effects during range expansion: a spatial

analog of genetic drift. Genetics 191: 171-181.

Spice, E.K., Goodman, D.H., Reid, S.B. and Docker, M.F. 2012. Neither philopatric nor

panmictic: microsatellite and mtDNA evidence suggests lack of natal homing but limits to

dispersal in Pacific lamprey. Molecular Ecology 21: 12916-2930.

St. Aubin, D.S., Smith, T.G. and Geraci, J.R. 1990. Seasonal epidermal molt in beluga whales,

Delphinapterus leucas. Canadian Journal of Zoology 68: 359-367.

Stewart, B.E., and Stewart, R.E.A. 1989. Delphinapterus leucas. Mammalian Species 336:1-8.

Stirling, I. 1997. The importance of polynyas, ice edges, and leads to marine mammals and birds.

Journal of Marine Systems: 9-21.

Stronen, A.V., Forbes, G.J., Paquet, P.C., Goulet, G., Sallows, T. and Musiani, M., 2012.

Dispersal in a plain landscape: short-distance genetic differentiation in southwestern Manitoba

wolves, Canada. Conservation Genetics 13: 359-371.

Tajima, F. 1989. Statistical-method for testing the neutral mutation hypothesis by DNA

polymorphism. Genetics 123: 585-595.

Tamura, K., Dudley, J., Nei, M., and Kumar, S. 2007. MEGA4: Molecular Evolutionary

Genetics Analysis (MEGA) software version 4.0. Molecular Biology and Evolution 24: 1596-

1599.

Tamura, K., Peterson, D., Peterson, N., Stecher, G., Nei, M, and Kumar, S. 2011. MEGA5:

Molecular Evolutionary Genetics Analysis using maximum likelihood, evolutionary distance,

and maximum parsimony methods. Molecular Biology and Evolution 28: 2731-2739.

Tamura, K. Stecher, G., Peterson, D., Filipski, A., and Kumar, S. 2013. MEGA6: Molecular

Evolutionary Genetics Analysis version 6. Molecular Biology and Evolution 30: 2725-2729.

Templeton, A.R., Routman, E. and Phillips, C.A. 1995. Separating population structure from

population history: a cladistic analysis of the geographical distribution of mitochondrial DNA

haplotypes in the tiger salamander, Ambystoma tigrinum. Genetics 140: 767-782.

Page 129: Genetic diversity, population structure and phylogeography ...

111

Turgeon, J., Duchesne, P., Colbeck, G.J., Postma, L.D., and Hammill, M.O. 2012.

Spatiotemporal segregation among summer stocks of beluga (Delphinapterus leucas) despite

nuclear gene flow: implication for the endangered belugas in eastern Hudson Bay (Canada).

Conservation Genetics 13: 419-433.

Tyack, P. 1986. Population biology, social behavior and communication in whales and dolphins.

Trends in Ecology & Evolution 1: 144-150.

Vélez-Juarbe, J. and Pyenson, N.D. 2012. Bohaskaia monodontoides, a new monodontid

(Cetacea, Odontoceti, Delphinoidea) from the Pliocene of the western North Atlantic Ocean.

Journal of Vertebrate Paleontology 32: 476-484.

Viricel, A. and Rosel, P.E. 2014. Hierarchical population structure and habitat differences in a

highly mobile marine species: the Atlantic spotted dolphin. Molecular Ecology 23: 5018-5035.

Waples, R.S., 1998. Separating the wheat from the chaff: patterns of genetic differentiation in

high gene flow species. Journal of Heredity 89: 438-450.

Waples, R.S., Punt, A.E. and Cope, J.M. 2008. Integrating genetic data into management of

marine resources: how can we do it better? Fish and Fisheries 9:423-449.

Watterson, G.A., and Guess, H.A. 1977. Is the most frequent allele the oldest? Theoretical

Population Biology 11: 141-160.

Watterson, G.A. 1985. The genetic divergence of two populations. Theoretical Population

Biology 27: 298-317.

Williams, M.A., Ambrose, S.H., van der Kaars, S., Ruehlemann, C., Chattopadhyaya, U., Pal, J.

and Chauhan, P.R. 2009. Environmental impact of the 73ka Toba super-eruption in South Asia.

Palaeogeography, Palaeoclimatology, Palaeoecology 284: 295-314.

Williams, M., 2012. Did the 73 ka Toba super-eruption have an enduring effect? Insights from

genetics, prehistoric archaeology, pollen analysis, stable isotope geochemistry, geomorphology,

ice cores, and climate models. Quaternary International 269: 87-93.

Wilson, R.E., Farley, S.D., McDonough, T.J., Talbot, S.L. and Barboza, P.S., 2015. A genetic

discontinuity in moose (Alces alces). Conservation Genetics 16: 791-800.

Page 130: Genetic diversity, population structure and phylogeography ...

112

Zhou, W., Jin, J., Wu, J., Chen, H., Yang, J., Murphy, R.W. and Che, J. 2017. Mountains too

high and valleys too deep drive population structuring and demographics in a Qinghai–Tibetan

Plateau frog Nanorana pleskei (Dicroglossidae). Ecology and Evolution 7: 240-252.

Zlojutro, M., Rubicz, R., Devor, E.J., Spitsyn, V.A., Makarov, S.V., Wilson, K., and Crawford,

M.H. 2006. Genetic structure of the Aleuts and Circumpolar Populations based on mitochondrial

DNA sequences: a synthesis. American Journal of Physical Anthropology 129: 446-464.

Zuk, M., Bastiaans, E., Langkilde, T. and Swanger, E. 2014. The role of behaviour in the

establishment of novel traits. Animal Behaviour 92: 333-344.

Page 131: Genetic diversity, population structure and phylogeography ...

113

Appendix 2.1. Summary of results from previous research studies of mtDNA in beluga

populations. Abbreviations: na = not available; StL = St. Lawrence Estuary; EHB = Eastern

Hudson Bay; EB = Foxe Basin; WHB = Western Hudson Bay; NHB = Northern Hudson Bay; NE

HB = Northeast Hudson Bay; BI = Belcher Islands; SE BI = Southeast Baffin Island; HS = Hudson

Strait, UG = Ungava Bay; WG = Western Greenland; BB = Baffin Bay; CreB = Creswell Bay;

EHA = Eastern High Arctic; CHA = Central High Arctic; BeS = Beaufort Sea; BerS = Bering Sea;

AK = Alaska; ECS = Eastern Chukchi Sea; SOK = Southern Sea of Okhotsk; SV = Svalbard; WhS

= White Sea

Study

reference

N

samples

analyzed

Sample

collection

locations

Years of

sample

collections

mtDNA

region

analyzed

N

haplotypes

identified

Type of

analysis

Highlights of

Results

Brennin et

al. 1997

95 StL, EHB,

WHB, SE

BI, WG,

HA, BeS,

ECS

na RFLP

analysis

with 10

restriction

enzymes

8 phylogeographic Two lineages

identified (eastern

and

northern/western),

and two dominant

haplotypes (in 81%

of samples).

Brown

Gladden et

al. 1997

628 StL,

BB(EHB,

WG,

CreB), SE

BI, EHB,

W/N HB,

BeS,

ECS,

BerS

1984 -1994 234bp of

control

region

sequence

39 phylogeographic Two assemblages

identified (eastern

and western), and

two dominant

haplotypes (in 52%

of samples).

O’Corry

Crowe et al.

1997

324 AK, ECS,

BeS

1977 -

1995

410bp of

control

region

sequence

29 phylogeographic Five management

stocks identified.

de March et

al. 2002

504 WG, EHA,

CHA, SE

BI,

1982 -

1997

260bp of

control

region

sequence

Unclear,

28+

phylogeographic Suggested

existence of more

stocks than

previously

described for this

area.

Page 132: Genetic diversity, population structure and phylogeography ...

114

Study

reference

N

samples

analyzed

Sample

collection

locations

Years of

sample

collections

mtDNA

region

analyzed

N

haplotypes

identified

Type of analysis Highlights of

Results

Palsböll et

al. 2002

195 WG 1984 -

1994

244bp of

control

region

sequence

19 Stock structure of

Baffin Bay

belugas

Relatively high

degree of genetic

structure, however

insufficient

phylogenetic signal

to define

management units.

Two dominant

haplotypes found

(in 84% of

samples).

de March

and Postma

2003

739 WHB,

NHB, SE

BI, HS,

UB

1984 -

1997

234bp of

control

region

sequence

35 phylogeography Multiple stocks in

two assemblages,

and three dominant

haplotypes (62%)

Meschersky

et al. 2008

626 SOK 227bp of

control

region

sequence

Unclear,

20+

phylogeography Clear differentiation

of Sea of Okhotsk

belugas with

historical

connection to

Bering-Beaufort-

Chukchi population.

O’Corry-

Crowe et al.

2010

122 WG, SV,

WhS, AK,

BeS

1994 -

2001

410bp of

control

region

sequence

16 in WG,

SV and

WhS

samples (3

dominant

haplotypes,

68%

phylogeography Three dominant

haplotypes (68%),

Mismatch

distribution analysis

and divergence

time estimates

using Isolation with

Migration (IMa)

Page 133: Genetic diversity, population structure and phylogeography ...

115

Study

reference

N

samples

analyzed

Sample

collection

locations

Years of

sample

collections

mtDNA

region

analyzed

N

haplotypes

identified

Type of analysis Highlights of

Results

Turgeon et

al. 2012

1432 FB, WHB,

NHB, BI,

NE HB,

HS, UG,

SE BI

1982 -

2006

234bp of

control

region

sequence

37 Spatiotemporal

variation in stock

composition,

allocation of

harvests

27 haplotypes

represented by two

or more individuals,

one dominant

haplotype (54%)

O’Corry-

Crowe et al.

2015

10 AK

(Yakutat

Bay)

2002 -

2008

410bp of

control

region

sequence

1 Genetic origins,

breeding

patterns, kinship,

inbreeding within

distinctive group

Single haplotype

present in other

beluga

stocks/populations

in western Arctic in

varying proportions.

Page 134: Genetic diversity, population structure and phylogeography ...

116

Appendix 2.2. Information for geographic sample collections used for the analyses of haplotype

diversity (Table 2.2) and geographic differentiation (Table 2.3).

Sample

Collection Total Sample

Size Sample

Community Year Collection

Dates Number of

Samples Sample

Collection Method

Western 118 Arviat 1985 17Jul – 23Aug 24 harvest Hudson Bay 1987 2Aug – 23Aug 22 harvest

(WHB) 1999 11Aug – 28Aug 36 harvest 2003 4Aug – 27Aug 36 harvest

Beaufort Sea (BeS)

301 Hendrickson Island

1984 July 6 harvest

1989 3Jul – 8Jul 6 harvest 1993 10Jul – 22Jul 9 harvest 1994 2Jul – 30Jul 32 harvest 1995 3Jul – 16Jul 17 harvest 1996 5Jul – 24Jul 12 harvest 2000 7Jul – 23Jul 23 harvest 2001 10Jul – 23Jul 18 harvest 2002 30Jun – 21Jul 30 harvest 2003 4Jul – 22Jul 24 harvest 2004 3Jul – 19Jul 27 harvest 2005 July 26 harvest 2006 July 35 harvest 2007 July 18 harvest 2008 July 18 harvest

Central High Arctic (CHA)

54 Cunningham Inlet

1999 20Jul – 21Jul 57 sloughed skin

Eastern 57 Grise Fiord 1984 13Sep 17 harvest

High Arctic 1985 10Sep – 24Sep 4 harvest (EHA) 1987 16Sep – 26Sep 9 harvest

1993 No data 4 harvest 1996 7Aug 1 harvest 1997 30May 1 harvest 1999 13Sep 3 harvest 2000 9Sep - 13Sep 13 harvest 2001 15Sep - 19Sep 5 harvest

Foxe Basin 72 Igloolik 1994 18Aug – 6Sep 16 harvest (FB) 1995 12Jul – 6Sep 31 harvest

1996 No data 5 harvest 1997 15Aug – 27Sep 11 harvest 2001 18Sep – 19Oct 9 harvest

James Bay 28 unknown 2001 No data 1 harvest (JB) unknown 2002 7Sep 5 harvest

Cape Jones Is. 2003 22Sep 2 harvest Cape Jones Is. 2004 8Sep 1 harvest Cape Hope Is. 2007 9Aug – 25Aug 6 tag biopsy Cape Hope Is. 2008 4Aug – 16Aug 7 tag biopsy Cape Hope Is. 2009 29Jul – 9Aug 6 tag biopsy

Page 135: Genetic diversity, population structure and phylogeography ...

117

Sample Collection

Total Sample Size

Sample Community

Year Collection Dates

Number of Samples

Sample Collection Method

Long Island 39 Long Island 2012 23Jul & 3Sep 3 harvest (LI) Middle Long 2005 6Jul 1 harvest

Island 2007 12Sep 3 harvest 2008 17Jul & 1Sep 3 harvest 2009 23Jul – 26Jul 3 harvest 2011 27 & 28Aug 4 harvest Northend Long 2003 5Aug 2 harvest Island 2004 29Aug – 11Sep 8 harvest 2005 10Aug 4 harvest 2010 3Aug – 12Aug 3 harvest Southend Long 2003 10 & 11Aug 4 harvest Island 2005 16Jul 1 harvest

Eastern 30 Nastapoka 1984 23May – 8Aug 10 harvest Hudson Bay River 1985 1Jul – 15Aug 20 harvest

(EHB)

St. Lawrence 125 various 1997 No data 11 necropsy Estuary 1998 No data 9 necropsy

(SLE) 1999 No data 11 necropsy 2000 No data 6 necropsy 2001 No data 8 necropsy 2002 No data 13 necropsy 2003 No data 11 necropsy 2004 No data 12 necropsy 2005 No data 8 necropsy 2006 No data 10 necropsy 2007 No data 15 necropsy 2008 No data 11 necropsy

Cumberland 186 Pangnirtung 1983 15Aug 5 harvest Sound 1984 16Aug – 25Aug 10 harvest (CSd) 1985 21Aug 2 harvest

1986 26 & 27Jul 19 harvest 1991 10Jul – 25Aug 11 harvest 1992 7May – 31Aug 18 harvest 1993 24Jul – 20Aug 10 harvest 1994 3Jul – 27Aug 25 harvest 1995 15May – 28Aug 17 harvest 1996 5Jun – 21Jul 24 harvest 1998 July 9 tag biopsy 1999 No data 8 harvest 2000 18Jul 1 harvest 2002 6Jul – 27Jul 19 harvest 2006 No data 2 harvest 2007 No data 2 harvest 2008 No data 4 harvest

Page 136: Genetic diversity, population structure and phylogeography ...

118

Sample Collection

Total Sample Size

Sample Community

Year Collection Dates

Number of Samples

Sample Collection Method

Belcher Islands 311 Sanikiluaq 1993 16Jun – 22Jun 10 harvest harvest 1994 18Jun – 13Sept 30 harvest

(SQH) 1995 13Jun – 10Sept 23 harvest

1996 15Jun – 25Sept 18 harvest 1997 12Jun – 30Jun 17 harvest

1998 11May – 22May 22 harvest

2002 17 May – 30May

10 harvest

2003 28Jun – 29Sept 12 harvest

2004 17Jun – 5Jul 10 harvest 2005 6May – 11Jun 17 harvest

2006 20May – 27May 5 harvest

2007 5Jun – 21Jun 15 harvest

2008 5May – 17Jun 15 harvest 2009 9Jun – 7Jul 13 harvest

2010 1May – 1Jun 12 harvest

2011 No data 24 harvest 2012 No data 17 harvest

2013 No data 39 harvest

Belcher Islands 56 Sanikiluaq 1996 2Nov 1 harvest ice 2004 17Dec 28 harvest

entrapments 2005 14Mar 1 harvest (SQE) 2008 22Nov 1 harvest

2011 Mar 13 harvest 2013 Feb 12 harvest

Page 137: Genetic diversity, population structure and phylogeography ...

119

Appendix 2.3. Results of re-analyses of summer and winter beluga sample collections using only

samples collected in the period 2000-20008 (Nsamples = 718).

Table A2.1. Summary of haplotype diversity found in only the summer and winter beluga sample collections 2000-

2008. Abbreviations: WHB, western Hudson Bay (Arviat); BeS, Beaufort Sea (Hendrickson Island); EHA, eastern

High Arctic (Grise Fiord); FB, Foxe Basin (Igloolik); JB, James Bay, LI, Long Island; EHB, eastern Hudson Bay

(Nastapoka River); SL, St. Lawrence Estuary; CSd, Cumberland Sound (Pangnirtung); SQH, Belcher Islands harvests

(Sanikiluaq); SQE, Belchers Islands ice entrapments.

Sample Collection

Number of sequences

Number of haplotypes

Number of segregating sites

Avg. number

nucleotide diff. (k)

Haplotype diversity (h)

Nucleotide diversity (π)

WHB 36 9 16 2.67 0.794 0.0044

BeS 180 24 17 2.19 0.766 0.0036

EHA 18 7 7 2.50 0.863 0.0041

FB 9 4 3 1.28 0.806 0.0021

JB 28 6 15 2.05 0.521 0.0034

LI 39 15 21 5.23 0.866 0.0086

EHB 30 7 15 3.74 0.733 0.0062

SL 74 5 14 1.87 0.356 0.0031

CSd 35 11 13 3.33 0.537 0.0055

SQH 199 30 26 3.48 0.845 0.0057

SQE 56 10 15 5.82 0.863 0.0096

Total data 718 84 38 5.59 0.942 0.0092

Page 138: Genetic diversity, population structure and phylogeography ...

120

Table A2.2. Patterns of differentiation based on genetic distance among summer and winter beluga sample

collections 2000-2008. ФST values, calculated in Arlequin ver. 3.5 (below diagonal), and ФPT values, calculated in

GenAlEx ver. 6.5 (upper diagonal), are both FST analogues based on nucleotide divergence among haplotypes.

Minimum significance levels using Bonferroni correction (Holm 1979) were P=0.0100 and P=0.032 respectively.

All pairwise comparisons were significant (P<0.0022 and P<0.008 respectively, data not shown), except for those

highlighted in grey. Abbreviations: WHB, Western Hudson Bay (Arviat); CSd, Cumberland Sound (Pangnirtung);

SQH, Belcher Islands harvests (Sanikiluaq); SQE, Belchers Islands ice entrapments; BeS, Beaufort Sea

(Hendrickson Island); CHA, Central High Arctic (Cunningham Inlet); EHA, Eastern High Arctic (Grise Fiord); FB,

Foxe Basin (Igloolik); JB, James Bay; LI, Long Island; EHB, eastern Hudson Bay (Nastapoka River); SLE, St.

Lawrence Estuary.

WHB Csd SQH SQE BeauS CHA EHA FB JB LI EHB SL

WHB * 0.054 0.087 0.384 0.462 0.066 0.188 0.020 0.339 0.296 0.604 0.716

Csd 0.027 * 0.094 0.450 0.429 0.052 0.174 0.095 0.340 0.338 0.660 0.749

SQH 0.088 0.088 * 0.404 0.399 0.109 0.176 0.141 0.276 0.279 0.617 0.709

SQE 0.110 0.113 0.056 * 0.534 0.418 0.453 0.478 0.410 0.234 0.121 0.369

BeauS 0.172 0.167 0.141 0.162 * 0.377 0.307 0.561 0.203 0.272 0.698 0.742

CHA 0.031 0.030 0.132 0.162 0.212 * 0.067 0.147 0.248 0.242 0.659 0.757

EHA 0.071 0.058 0.113 0.127 0.181 0.056 * 0.323 0.120 0.185 0.687 0.765

FB 0.038 0.082 0.198 0.239 0.268 0.061 0.153 * 0.540 0.429 0.727 0.793

JB 0.253 0.257 0.189 0.225 0.287 0.287 0.218 0.405 * 0.066 0.668 0.754

LI 0.126 0.128 0.081 0.098 0.170 0.155 0.098 0.257 0.087 * 0.441 0.614

EHB 0.190 0.191 0.170 0.151 0.238 0.242 0.184 0.323 0.309 0.155 * 0.318

SL 0.349 0.327 0.306 0.366 0.361 0.419 0.373 0.469 0.525 0.386 0.428 *

Page 139: Genetic diversity, population structure and phylogeography ...

121

Figure A2.1 Principal Coordinates Analysis (PCoA) using ΦPT genetic distance matrix for mtDNA haplotypes from

summer and winter beluga whale sample collections 2000-2008. Symbols are coloured according to previously

described eastern (blue) and western (yellow) haplotype assemblages (Brown Gladden et al. 1997) and new sample

collections (red). Abbreviations: FB, Foxe Basin; WHB, western Hudson Bay; Csd, Cumberland Sound; SQH,

Belchers Is. (Sanikiluaq) harvests; CHA, central High Arctic; EHA, eastern High Arctic; JB, James Bay; BeauS,

Beaufort Sea; SQE, Belcher Is. ice entrapments; EHB, eastern Hudson Bay; StL, St. Lawrence estuary.

JB

LI

EHBSQE

WHB

BeauS

FB

EHA

CSdSQH

StL

MD

S2 (

25

.56

% o

f va

riat

ion

)

MDS1 (44.88% of variation)

Page 140: Genetic diversity, population structure and phylogeography ...

122

Appendix 2.4. Alignment of polymorphic sites in N=83 haplotypes found in main geographic beluga sample

collections (N=1377 samples). Numbers at top read vertically indicate sequence position of variable sites. ‘Ref’ is

reference sequence (haplotype E72) used for alignment. The blue box indicates the variable positions used to define

haplotypes in previous studies (e.g. Brown-Gladden et al. 1997).

234445889111111112222222233333333445

440249456002666771124688800444446198

475057347808124501345699558

Ref TCCTTTGTTGCCACACCCTCCTTTCTACTAATCTAC

E02 ............................C.......

E03 .........................C..........

E05 ............................C.....G.

E06 ............................C......T

E07 ............................CG.....T

E08 ......................C.....CG.....T

E09 ................T........C..........

E10 ..............G.............C.......

E11 ......A.........T..........TC.......

E13 ...C..AC.A..G.....C.........C.......

E15 ...C..AC.A..G.....C....C....C.G.....

E16 ...C..AC.A.TG.....C....C....C.....G.

E17 ...C..AC.A.TG.....C....C..G.......G.

E18 ...C..AC.A.TG.....C....C..G.C.....G.

E19 ...C..AC.A.TG.....C...CC..G.C.....G.

E22 ..................C.........C.......

E23 ..........T.................C.......

E24 ....................T.......C.......

E25 .........A..........................

E28 ...............T....................

E29 ............G...............C.......

E30 ..........T.........................

E31 ...C..AC.A.TG....TC....C....C.......

E32 ...C..AC.A.T......C...CC....C.......

Page 141: Genetic diversity, population structure and phylogeography ...

123

E33 ................T.........G.........

E35 ................T...........C.......

E36 ....................T...............

E37 ....................T....C..........

E38 ......AC........T..........TC.......

E40 ......................C.....C.......

E41 ...C..AC.A.TG.....C....C....C.......

E42 ...........T........................

E50 ............G.......................

E51 ...C..AC.A.TG.....C.........C.....G.

E52 ............................C.....GT

E55 ........C...................C.......

E56 ...C..AC.A.TG.....C....C....C.G.....

E57 .....CA.........T..........TC.......

E59 ..............G.....................

E61 ......A...T.....T..........TC.......

E62 ......A.........T..........T....T...

E63 .T....AC.........T.........TC.......

E65 .........................C..C.......

E67 ..........T.....T...........C.......

E68 ................T...................

E69 ...C..AC.A.TG.....C...CC....C.......

E72 ....................................

E75 ......AC.........T.........TC.......

E77 ............G.G.............C.....G.

E79 ............G.G.............CG....G.

E80 ................T..........TC.......

E81 ...C..AC.A.TGT....C....C....C.......

E82 ...C..AC.A.TG.....C....C..G.C.......

E86 ................T..G.......TC.......

E90 ..............G............TC.......

E92 ....C.......................C.......

E94 ...C..AC.A.TG.G...C....C...TC.....G.

E97 ......A.........T.....C....TC.......

Page 142: Genetic diversity, population structure and phylogeography ...

124

E100 C...........G.G.............C.....G.

E103 ......A.........T..........TC....C..

E110 ......A........TT..........TC.......

E112 ......AC...................TC.......

E113 ..............G.............C.....G.

E115 .........A..................C.......

E116 ......A.........T...T...T..TC.......

E120 ......A.........T..........TC.....G.

E121 ......A.......G............TC.....G.

E130 ............................CG......

E138 ..................................G.

E140 ......A....................TC.....G.

E142 ......AC........T....C.....TC.......

E144 ......A....T....T..........TC.....G.

E146 ......AC.......TT..........TC.....G.

E147 ......AC........T..........TC.....G.

E148 ......A.........T...T......TC.....G.

E150 ......A.......G.T..........TC.....G.

E153 ...C..AC.A..G...T.C....C....C.G.....

E154 ...C..A..A.TG..TT.C....C....C.....G.

E155 ...C..AC.A.TG..TT.C....C....C.....G.

E156 ..T...A.........T..........TC.......

E176 ............................C..G...T

E177 ...C..AC.A.TG..T..C....C....C.....G.

E181 ......A....................TC.......

Page 143: Genetic diversity, population structure and phylogeography ...

125

Chapter 3: Fine-scale genetic structure of nearshore beluga (Delphinapterus leucas)

aggregations in the Eastern Beaufort Sea: are kin groups being impacted by harvesting?

Abstract

In many mammalian species, fission-fusion group formation can shape population

structure in response to the influence of ecological and behavioural variables. In the Arctic,

migratory whales such as belugas (Delphinapterus leucas) have patterns of behaviour and habitat

use that make them vulnerable to adverse effects from subsistence overharvesting, industrial

development and indirect impacts of climate change. Even in large, stable stocks, the annual use

of dedicated migratory pathways and philopatric aggregations in nearshore areas may result in

specific segments of the population being at a higher risk of overharvesting. In this study, the

impact of subsistence hunting on possible kin-groups of belugas in the Eastern Beaufort Sea

(EBS) was investigated using nearshore samples collected from harvest monitoring programs

from 1983 to 2008. Patterns of relatedness based on genotypes of 964 samples analyzed at 16

microsatellite loci and mitochondrial DNA control region sequences were compared among

different aggregation and harvesting areas and from ice entrapment events spanning the

distribution of the stock. Clustering methods (Bayesian approach, network analysis and

discriminant analysis of principal components (DAPC)) were also used to detect possible

structuring within Beaufort Sea belugas. Samples were heavily biased towards males (4:1), and

neither males nor females were more related than expected by chance using spatial and temporal

a priori divisions of the samples. Results across clustering methods were somewhat inconsistent,

with differing numbers of clusters identified and support for more than one cluster weak. The

exception was the DAPC clustering analysis of female EBS belugas, which had strong

Page 144: Genetic diversity, population structure and phylogeography ...

126

assignments to non-overlapping clusters for most individuals. However, the assignments to the

clusters had no discernable pattern in space or time when interpreted using sampling locations

associated with aggregation areas. The patterns observed in this study suggest that groups of

related belugas, most strongly observed in females, use this large summering area but do not

form fine-scale kin structure related to aggregation/harvesting locations in the nearshore areas.

As a result, harvesting and other disturbances in particular bays where belugas are aggregating

will not be necessarily putting a discrete genetic unit of the stock at risk. This also appears true

for natural mortality of groups of whales due to ice entrapments observed in the last few decades.

3.1 Introduction

Population structure in mammals is shaped by many factors such as dispersal, philopatry,

local adaptation and social organization (Greenwood 1980, Storz 1999, Berdahl et al. 2015). The

tendency of animals to form groups is a dynamic activity that can shape population structure

over a continuum of highly structured social hierarchies to fluid populations with highly variable

group composition (Couzin and Laidre 2009). What constitutes a group may be peculiar to a

certain species, and even within a species, group formation and size may be dictated by

environmental and behavioural variables such as food availability, the risk of predation, and

intra-group competition (Grove 2012). As with many life history strategies, the individual

benefits of spatial and temporal associations with groups need to outweigh the costs (e.g. Van

Horn et al. 2007, Kashima et al. 2013, Lardy et al. 2016).

One type of group-living strategy is termed fission-fusion which, as the name implies,

involves groups that “merge (fusion) or split (fission) as they move through the environment”

(Couzin 2006). This process is kinetic, with groups merging, splitting and then merging again as

Page 145: Genetic diversity, population structure and phylogeography ...

127

random, non-permanent units (e.g. large herbivores, Gerard et al. 2002) or as part of more stable

sub-groups (e.g. killer whale matriline units, Tavares et al. 2017). Group formation and

individual associations are most often studied in wild populations by observing (either remotely

or directly) individuals in a study area over time. Various techniques are employed, such as

group focal follows with photo-indentification of individuals (e.g. Pearson et al. 2017, Tavares et

al. 2017), observing pre-marked individuals (e.g. King et al. 2017), or GPS tracked individuals

(e.g. Krause et al. 2013, Body et al. 2015).

Characteristics of fission-fusion groups related to size, individual membership and the

degree of cohesion often vary over different spatial and temporal scales (Aureli et al. 2008). In

studies of chimpanzees (Pan troglodytes, Pan paniscus), the greatest predictor of group size and

type was the time spent moving, feeding and socializing (Lehmann et al. 2006). The larger the

group size, the longer it takes for animals to meet their physiological and social demands

effectively and this can have an impact on how animals survive in particular habitats (Dunbar et

al. 2009, Grove et al. 2012). In tree-roosting big brown bats (Eptesicus fuscus), the number and

size of groups seem to be dictated by the number and type of trees available for roosting, the

thermoregulatory benefits of roosting in larger groups, and maternal kin relationships (Metheny

et al. 2008a, b). The availability of food resources was found to be the main factor influencing

fission-fusion in spotted hyenas (Crocuta crocuta), serving to cause both group fusion (defence

of shared resources) and group fission (individual competition for food) (Smith et al. 2008).

In some species of cetaceans, particularly delphinids, fission-fusion group dynamics are

influenced by factors such as food resources, habitat and environmental preferences (e.g.

inshore/coastal vs. pelagic), avoidance of predation, sexual conflict, sociality and kin

associations (Gowans et al. 2007, Möller 2012). An understanding of how these species may be

Page 146: Genetic diversity, population structure and phylogeography ...

128

using fission-fusion to adapt to changes in these influences is an important part of conservation

considerations. In Hawaiian spinner dolphins (Stenella longirostris), fission-fusion strategies

appear to be necessary to philopatric groups of animals exploiting limited resting habitats and

prey resources around the smaller islands and atolls (Karczmarski et al. 2005, Andrews et al.

2010). These behaviours may make these groups vulnerable to human disturbances such as

ecotourism activities (Andrews et al. 2010). An example of this has been shown for a small

population of Indo-Pacific humpback dolphins (Sousa chinensis), where a shift from fission-

fusion dynamics typical of larger populations (Parra et al. 2011) to stable, cohesive social

associations was found (Dungan et al. 2016). This difference in population structure was

attributed to anthropogenic stressors impacting the behavioural evolution of the population

(Dungan et al. 2016). In bottlenose dolphins (Tursiops spp.), all populations are characterized by

fission-fusion dynamics; however, variation in socially influenced groups occurs both within and

between communities (e.g. Connor et al. 2000, Lusseau et al. 2006, Moreno and Acevedo-

Gutiérrez 2016). Various stressors, both natural and human-induced, are thought to have the

potential to alter bottlenose dolphin behaviour and social structure. These include fisheries and

aquaculture (Chivers et al. 2001, Diaz Lopez and Bernal Shiral 2008, Blasi and Boitani 2014),

wind and tide turbine farms (Louis et al. 2015), and environmental disasters (Elliser et al. 2011).

These factors were found to alter group size, community membership, habitat preference and

feeding behaviours. Studies of other cetacean species may add additional perspectives on the

potential impacts of human activities on fission-fusion dynamics, social organization and

structure, and contribute further to conservation and management planning.

In most areas of their Arctic and sub-Arctic distribution, beluga whales (Delphinapterus

leucas) follow seasonal migration patterns with movements of animals that can number in the

Page 147: Genetic diversity, population structure and phylogeography ...

129

tens of thousands (Harwood et al. 1996, Richard 2005). Globally partitioned into more than 20

putative stocks (or subpopulations) defined mostly by movement patterns and genetics (IWC

2000), at least six stocks of belugas spend a portion or the entirety of an annual cycle of

migration in Canadian waters (Richard 2010). These stocks are assessed for the purpose of

identifying conservation and management needs by the Committee on the Status of Endangered

Wildlife in Canada (COSEWIC) on the basis of Designatable Units (DUs) which are defined as

“discrete and evolutionary significant units of the taxonomic species, where significant means

that the unit is important to the evolutionary legacy of the species as a whole and if lost would

likely not be replaced through natural dispersion” (COSEWIC 2012, 2016). Seven of eight

beluga DUs found in Canadian waters have been assessed and span a wide spectrum of

vulnerability: from Endangered (St. Lawrence Estuary, Eastern Hudson Bay and Ungava Bay

populations, although the Ungava Bay population may, in fact, be extirpated) and Threatened

(Cumberland Sound population), to Special Concern (Western Hudson Bay and Eastern High

Arctic/Baffin Bay populations) and Not at Risk (Eastern Beaufort Sea stock) (COSEWIC 2004,

2014, 2016). The main threats to these whales come from anthropogenic activities such as

aboriginal subsistence harvests, industrial development, shipping, and pollution (COSEWIC

2004, 2014, Reeves et al. 2014). An eighth DU (James Bay) was recently identified, but its status

has not yet been assessed (COSEWIC 2016).

The Eastern Beaufort Sea (EBS) beluga stock is part of the Bering-Beaufort-Chukchi Sea

population of belugas (DFO 2000). Whales migrate annually from an overwintering area in the

Bering Sea and move into the Mackenzie Delta estuary (see Figure 1) in late June, forming peak

aggregations during the month of July (Norton and Harwood 1985, Harwood et al. 1996). These

recurring patterns of aggregations provide an opportunity for hunting, and therefore the EBS

Page 148: Genetic diversity, population structure and phylogeography ...

130

belugas are of cultural and subsistence importance to Aboriginal communities in the Inuvialuit

Settlement Region of the western Canadian Arctic (Harwood and Smith 2002). The harvests are

considered sustainable and the stock to be stable or increasing (DFO 2000, Harwood et al. 2002).

However, shipping and industrial development, especially oil and gas exploration and drilling, is

predicted to increase near areas of beluga “hot spots” (Fast et al. 2001, Harwood et al. 2014,

Reeves et al. 2014). These types of activities can be detrimental, especially for migratory

mammals, due to disturbance and potential loss of habitat (Beckmann et al. 2012, Seidler et al.

2014). It has also been suggested that disruption of beluga behaviour and social systems are a

contributing factor to the lack of recovery in some beluga stocks following intense exploitation

(Wade at al. 2012).

The effects of human activities have been recognized to cause disruption to kin structure

and should be considered for conservation and management planning. For example, wild boar

(Sus scrofa) populations have been found to have differing patterns of kin structure depending on

the level of hunting pressure (Poteaux et al. 2009, Lacolina et al. 2009, Podgórski et al. 2014).

Poaching of adult female African elephants (Loxodonta africana) was also found to alter kin

structure in core elephant groups, disrupting social bonds and increasing agonistic contests

between groups (Gobush and Wasser 2009). Investigation of kin structure in racoon (Procyon

lotor) populations revealed that even a species with high dispersal capabilities might be

negatively impacted at the kin structure level by anthropogenic habitat loss and fragmentation

(Dharmarajan et al. 2014). Genetic data have been applied to numerous conservation and wildlife

management challenges, particularly those that seek to maintain genetic diversity of species, sub-

species, populations, and individuals (Allendorf et al. 2013). One essential step to achieving

these population management goals is to identify what represents distinct units to conserve (e.g.

Page 149: Genetic diversity, population structure and phylogeography ...

131

Pallsbøl et al. 2006, Waples and Gaggiotti 2006) and it is common to partition wild populations

into subgroups for management purposes based on genetic structure revealed by population-

based approaches (Allendorf et al. 2013). However, individual-based analyses have recently

emerged that use the identification of genetically similar clusters of individuals to assess a

variety of ecological and conservation questions (Palsbøll et al. 2010). One such approach,

estimation of relatedness among individuals, can be used for conservation of wild populations by

helping to understand social systems (Allendorf et al. 2013) which may contribute to kin

structure or “spatial structure within populations where more related individuals are closer to

each other in space than to unrelated individuals” (e.g. Zeyl et al. 2009).

Most species of whales, including delphinids, are very social and form a variety of

kinship associations (Gowans et al. 2008, Möller 2012). The evolution of social structure is

mainly driven by mating strategies, food distribution and predation risk, with environment type

playing a significant role in motivating dispersal (Möller 2012). Differing patterns of habitat

selection have been shown to result in segregation during the summer of Eastern Beaufort Sea

belugas based on sex, age and reproductive status (Richard et al. 2001a, Loseto et al. 2006). This

segregation was linked to different resource needs of individuals and may also represent social

structure within the stock (Loseto et al. 2006). Individual-based genetic comparisons of whales

found in various areas of their distribution would add information about kin relationships related

to these habitat segregation patterns.

Eastern Beaufort Sea belugas have been previously analyzed using population-focused

approaches to distinguish the stock within a broad geographic scale (Brown Gladden et al. 1999,

O’Corry-Crowe et al. 1997) but have not been examined using an individual-based, finer-scale

analysis. In particular, microsatellite (simple sequence repeats in nuclear DNA) data can be used

Page 150: Genetic diversity, population structure and phylogeography ...

132

to compare patterns of relatedness among groups of animals to determine if individuals who

overlap in their space use are more closely related than the population as a whole. This has been

demonstrated for mammalian species such as long-tailed macaques (Macaca fascicularis) (Ruiter

and Gefen 1998), ocelots (Leopardus pardalis) (Rodgers et al. 2015), roe deer (Capreolus

capreolus) (Biosa et al. 2015), and even for high fission-fusion species such as bats (Myotis

bechsteinii) (Kerth et al. 2011), giraffes (Giraffa camelopardalis) (Carter et al. 2013) and gelada

baboons (Theropithecus gelada) (Snyder-Mackler et al. 2014).

The goal of this study is to combine the use of individual-based methods of genetic

analyses with samples obtained from annual subsistence harvesting to determine if there is a

fine-scale geographic structure within the Eastern Beaufort Sea beluga due to associations among

whales that are close relatives. For species that are difficult to observe over the long term, this

type of genetic data can be useful to help understand the influence of social systems and

dispersal on population structure (Nidiffer and Cortés-Ortiz 2015). The utility of this approach

has been demonstrated for Hudson Bay belugas (Colbeck et al. 2013) and has been informative

for determining patterns of genetic relatedness in other exploited species such as managed red

deer (Cervus elaphus) populations (Pérez-González et al. 2012) and white-tailed deer

(Odocoileus virginianus) (Grear et al. 2010).

In this study, samples are partitioned from main summer aggregation areas to

correspond with beluga “hot spots” in areas of concern (Harwood et al. 2015). We also include

samples collected from several ice entrapment events that appear to be happening more

frequently in recent years (Kocho-Schellenberg 2010). Network analyses based on relatedness

and clustering methods were used to test the hypothesis that whales forming aggregations in

different nearshore areas in the Beaufort Sea and the Amundsen Gulf are more closely related

Page 151: Genetic diversity, population structure and phylogeography ...

133

than those from a randomly distributed population. We hypothesize that belugas that are

aggregating in various bays across the overall nearshore range of the EBS stock comprise groups

that are more closely related to each other than the stock as a whole and that this kin-based

structure contributes to the fission-fusion dynamics of group formation in this stock. We

evaluated the sensitivity of our within EBS beluga stock analyses by using a comparison of EBS

results to a group of samples from a separate, distinct and small population of beluga thought to

have associations among close kin (St. Lawrence Estuary population in eastern Canada, T.

Frasier pers. comm.). We also evaluated if male and female belugas in the EBS stock show

differences in patterns of relatedness as might be expected in a species with matrilineal

philopatry to estuaries (Smith et al. 1994, Clutton-Brock and Lukas 2012). Finally, we assessed

if there is evidence of kin-groups present in the harvest of belugas from the EBS stock.

3.2 Materials and Methods

3.2.1 Study area and samples

A total of 1054 samples for this study were obtained from tissues collected during

research and subsistence harvest monitoring programs established in the Beaufort Sea in the

1970s (Harwood et al. 2002). During these programs, samples and information are collected

from hunters that use traditional whaling camps located along the coastline of the eastern

Beaufort Sea (Figure 3.1). The hunt coincides with the peak of beluga aggregations during July

in the three main areas of the Mackenzie River estuary: Shallow Bay/Niaqunnaq Bay, eastern

Mackenzie Bay and Kugmallit Bay (Harwood et al. 2014). Hunters select for larger and older

belugas, and there is a strong bias towards males (Harwood et al. 2015). In the 1980s, the ratio of

Page 152: Genetic diversity, population structure and phylogeography ...

134

males to females in the harvest was 2.0 to 1, in the 1990s 3.0 to 1, and in the 2000s 3.6 to 1

(Harwood et al. 2015).

Figure 3.1. Sampling locations for Eastern Beaufort Sea stock of beluga whales used in this study.

Symbols indicate the current hunting camps (black triangle) and main communities (white square)

participating in beluga harvest monitoring.

Page 153: Genetic diversity, population structure and phylogeography ...

135

At the same time or slightly after belugas aggregate in the Mackenzie Estuary, whales

become distributed further to the east into the eastern Beaufort Sea and Amundsen Gulf

(Harwood and Smith 2002, FJMC 2013). In this area, belugas in Darnley Bay are harvested by

hunters from the community of Paulatuk (Figure 3.1) in late July and early August. Again, the

hunt is directed towards adult male beluga, though they are younger than those harvested in the

Mackenzie Delta areas (Harwood et al. 2015). Whales harvested in Darnley Bay also are more

likely to be taken from groups of whales in pods than the whales harvested in the Mackenzie

Estuary areas.

Based on summer habitat selection, Loseto et al. (2006) found that satellite-tagged whales

were segregated among nearshore open water, offshore open water, mixed ice and closed ice.

Harvested samples used in this study are largely comparable to captured and tagged whales

having come from only the nearshore open water habitat. Thus, our harvest analyses largely

represent adult males that have not yet departed for other ice-associated foraging habitats and

females that may be moving between nearshore and offshore open water.

Additional samples for genetic analyses were obtained during sampling of ice entrapment

events in three different years in the Husky Lakes (1989, 1996, 2006). The Husky Lakes are a

series of interconnecting saline lakes located to the south and southeast of the community of

Tuktoyaktuk (Figure 3.1). Belugas often move into the Husky Lakes in the summer, apparently

to feed. However, periodically large numbers (estimated 80 – 200) (Kocho-Schellenberg 2010)

become trapped when winter ice forms before autumn migration has started. These samples of

whales may provide a more naturally occurring sample of a social group than those collected

during selective harvests.

Page 154: Genetic diversity, population structure and phylogeography ...

136

Finally, samples were obtained from a necropsy program of beach-cast belugas from the

St. Lawrence River in eastern Canada. The St. Lawrence Estuary beluga population is a small,

isolated population with limited migration at the southernmost limit of the species distribution

(DFO 2014). It is genetically distinct from all other Canadian beluga populations and has lower

levels of genetic diversity (Brown Gladden et al. 1999, COSEWIC 2014). These samples will be

used as a different population comparison and to ensure the ability of the analyses to detect

signals of relatedness among the Beaufort Sea beluga samples. For the final data analyses, a total

of 964 and 1032 beluga samples were included in the microsatellite and mitochondrial DNA

datasets, respectively (see Sections 3.3.1 and 3.3.5).

3.2.2 DNA extraction and sex identification

Samples in the field were preserved either by freezing in coolers or a salt-saturated 20%

DMSO solution (Seutin et al. 1991) and frozen upon arrival at the laboratory (Winnipeg, MB).

Total cellular DNA extractions were performed using a variety of methods including

phenol:chloroform (Amos and Holzel 1992), Qiagen spin columns (DNeasy Blood and Tissue

Kits), and the Biosprint automated platform (Qiagen Inc, Valencia, CA, USA). Molecular

determination of sex was completed using methods described in Bérubé and Palsbøll (1996) or

Shaw et al. (2005) with separation and visualization of the PCR products on a QIAxcel (Qiagen

Inc, Valencia, CA, USA) to determine sex based on length differences.

3.2.3 Microsatellite genotyping

Samples were genotyped at 16 different microsatellite loci: FCB1, FCB3, FCB4, FCB5,

FCB8, FCB10, FCB11, FCB14, FCB17 (from belugas, Buchanan et al. 1996), EV14, EV37,

Page 155: Genetic diversity, population structure and phylogeography ...

137

EV94 (from humpback whales (Megaptera novaeangliae), Valsecchi and Amos 1996),

KWM2A, KWM12A (from killer whales (Orcinus orca), Hoelzel et al. 1998), SW19 (from

sperm whales (Physeter microcephalus), Richard et al. 1996), and PPH104 (from harbour

porpoise (Phocoena phocoena), Rosel et al. 1999). Loci were amplified individually or in

duplexes in reactions with a total volume of 10µL. Reaction mixtures contained 20 – 100ng of

DNA, 1X AmpliTaq gold reaction buffer without MgCl2 (Life Technologies), 1.5mM MgCl2,

0.2mM dNTP mix, 0.125 – 0.5µM of each primer (one of each pair labelled with a fluorescent

dye and the other tailed with a poly-A sequence), and 1 unit of AmpliTaq gold (Life

Technologies). One of three different thermal cycler profiles were used, and amplification

products from individual and duplex PCRs were pooled in ratios to optimize peak intensities (see

Appendix 3.1 for details). Fragments were analyzed using an Applied Biosystems 3130xl genetic

analyzer (Life Technologies) with an internal 400HD Rox size standard. Alleles were scored

according to size in base pairs using GeneMarker ver1.95 software (SoftGenetics).

Fairly large numbers of loci (30-40) are needed to have moderate confidence around a

single pairwise estimate of relatedness (Blouin 2003), but fewer loci are often used especially if

they have a relatively large number of alleles (Van de Casteele et al. 2001). The 16 loci analyzed

in our dataset satisfy this requirement in that all loci were suitably polymorphic and individuals

had high heterozygosity (Waits et al. 2001).

3.2.4 Validity and variability of microsatellite markers

Potential errors in the raw data were assessed using GenAlEx ver. 6.5 (Peakall and

Smouse 2012) for empty data cells, non-numeric data, missing data, samples with repeated

genotypes, and samples with matching names. Specimens that were missing data at more than

Page 156: Genetic diversity, population structure and phylogeography ...

138

three loci (80% complete genotype (Morin et al. 2010)) were removed. The data were also

checked for scoring errors and the possible presence of null alleles using the software

Microchecker (van Oosterhout et al. 2004).

To assess genotyping error rate, 180 individuals were randomly selected for re-

amplification and scoring at all loci. Between four and six different beluga samples were also

included as replicates with every PCR amplification and analysis to act as a positive control for

reactions and to assess consistency across runs.

Tests for significant departures from Hardy-Weinberg proportions (H-W) and linkage

disequilibrium were performed using 10,000 iterations in the software GENEPOP web ver. 4.2

(Raymond and Rousset 1995, Rousset 2008). Significant values for multiple comparisons were

corrected using the sequential Bonferroni technique (Holm 1979).

Genetic variation within the sample groups was estimated by calculating the mean

number of alleles (NA), number of private alleles (PA), observed heterozygosity (Ho), and

unbiased heterozygosity expected under H-W proportions (He) using GenAlEx ver. 6.5 (Peakall

and Smouse 2012). Allelic richness (AR) as an unbiased measure of allele number adjusted by

sample size and FIS, the proportional excess of homozygotes relative to H-W proportions, were

calculated using FSTAT ver. 2.9.3 (Goudet 1995). The ability of the microsatellite data to

discriminate individuals was assessed by calculating the probability of identity (Pid) using

GenAlEx ver. 6.5 (Peakall and Smouse 2012). This is the estimate of the average probability that

two unrelated individuals will by chance have the same multilocus genotype.

Page 157: Genetic diversity, population structure and phylogeography ...

139

3.2.5 Relatedness within geographic areas of interest

To test for kin structuring of belugas sampled in various Beaufort Sea areas due to social

and/or cultural differences, we analyzed the data to determine if relatedness among individuals

would be higher within particular groups of samples than expected. We used a local-scale to

large-scale approach where we first grouped samples from individuals harvested on the same day

by hunters from the same whaling camp. Within location, same day samples determined whether

the sampling process (harvested animals) influenced the data due to targeting individuals

belonging to the same pod of whales (Palsbøll et al. 2002). Second, we tested if whales killed by

hunters from different whaling camps sharing the same aggregation area (the Bays) are more

related than expected by chance alone. This analysis is aimed to determine whether hunters from

particular camps were targeting certain sub-regions of the Bay that may attract related groups of

individual whales. These analyses were conducted twice, once by dividing samples up by

decade, and again with no division by period. Finally, we analyzed the data to test if individuals

killed in the same aggregation area or entrapment (as outlined in Table 1) are more related than

expected. In essence, then, all of our analyses address the question: do whales that use different

aggregation areas represent different genetic subgroups of the stock? In the final analysis, the St.

Lawrence Estuary samples served as a test of the ability of the analyses to detect relatedness by

providing results from a small, isolated population that is expected to be closely related due to

inbreeding. All analyses were performed for males only and both sexes combined. Due to the

relatively small numbers of females in the dataset, female only analyses were only carried out for

the comparison of aggregation area and entrapments.

For all of the analysis scenarios, relatedness was estimated using the package related

(Pew at al. 2014) in R ver. 3.1.3 (R Core Team 2015). Among other advantages, this program

Page 158: Genetic diversity, population structure and phylogeography ...

140

allows for the comparison of the performance of four commonly used relatedness estimators (Li

at al. (1993), Lynch and Ritland (1999), Queller and Goodnight (1989), and Wang (2002))

employing the user’s genotype data. The program will simulate genotypes of known relatedness,

calculate relatedness values using the four moment-based estimators and then plot the data for

assessment. Using this method, the Wang estimator (2002) was selected for this study, and it was

used consistently for all analyses.

Beluga data were tested to see if individuals within groups are more related than expected

by chance or if they instead represent random groups of individuals. This was done using an

iterative randomization approach where all pairwise relatedness values were first estimated

within each group, and then an average was calculated. Then, the order of the individuals in the

genotype file was randomly shuffled while keeping the rows representing each group constant.

Thus, individuals were shuffled between groups while maintaining a constant size for each

group. Then relatedness was calculated within each group again. For all of the analysis scenarios

used in this study, the number of iterations used for analyzing group relatedness using the related

software package was 200, which determined (using trials) based on the processing limits of the

computer used for analyses.

Expected values of relatedness (based on iterative random reshuffling individuals within

the dataset) for each of the group comparisons were plotted using histograms of relatedness

values estimated under a null hypothesis that the samples represent a random group of

individuals. Therefore, the larger the sample size at each location, the larger the pool of

individuals to draw from for the reshuffling. This, in turn, resulted in a tighter distribution of

values for the null hypothesis (e.g. see Figure 3.4). An observed relatedness value falling outside

the distribution would reject the null hypothesis. P-values were also generated for each

Page 159: Genetic diversity, population structure and phylogeography ...

141

comparison of observed vs. expected values, though these are not exact p-values as they are

based on simulation rather than direct calculation. Instead, they give an indication of how many

of the iterations contained average relatedness values less than or equal to the observed

relatedness value. Thus, p-values were linked to sample size at each of the location comparisons

used to generate the range of expected values of relatedness under the null hypothesis.

3.2.6 Network and clustering analyses based on pairwise relatedness of individuals

Networks based on pairwise relatedness values between individuals were analyzed for

males and females separately, for samples from the Beaufort Sea area only, and for all samples

including the St. Lawrence Estuary. The analyses were performed using the software packages

related (Pew et al. 2014) and igraph (Csardi and Nepusz 2006) in R ver. 3.1.3 (R Core Team

2015). Relatedness of individuals in the beluga dataset was again estimated using the Wang

(2002) estimator. A matrix was created with the number of rows and columns equal to the total

number of samples in the dataset. The matrix was then filled with the pairwise relatedness

estimates. However, negative values cannot be used for creating edges in a network; therefore,

all negative estimates of relatedness were truncated to zero. Finally, the matrix was labelled and

imported into igraph (Csardi and Nepusz 2006) to create and analyze the network.

A weighted (based on relatedness values) adjacency graph was created from the matrix

with the diagonal of the matrix zeroed out so as to not be included in the calculation, and only

the lower left triangle used to create a undirected graph. The column names (the sample codes

for the individual belugas) were used as the vertex (node) names in the graph.

Three different methods were used to test for clusters (or communities): the fast greedy

community method, the leading eigenvector community method, and the spinglass community

Page 160: Genetic diversity, population structure and phylogeography ...

142

method. Community structure is a feature of the network where the vertices (the nodes, or

individuals in the beluga dataset) are grouped so that there is a higher density of edges (the

connections of individuals) within the groups than between them (Clauset et al. 2004).

Community structure detection is, therefore, a data-analysis technique that highlights structure

present in large-scale network datasets (Newman 2006a). Both the fast greedy and the leading

eigenvector algorithms are hierarchical approaches that try to optimize a network quality

function, modularity (Clauset et al. 2004, Newman 2006b). Modularity is a quantitative measure

that relates a statistically meaningful arrangement of edges to true community structure (Girvan

and Newman 2002, Newman 2006a). Thus, modularity indicates structure by measuring if a

community division has more edges (or links) within a community and few edges between

communities (Clauset et al. 2004). Modularity estimates for all beluga networks were obtained

for both fast greedy and leading eigenvector community algorithms, and assignments of

individuals to each of the identified clusters found under each method were compared.

The spinglass community algorithm is a non-hierarchical approach for detecting

community structure in networks that is based on a modified q-state Potts model (a spin-glass

model) (Reichardt and Bornholdt 2004). This model interprets structure by translating similarity

measures into coupling strengths based on the interactions and spin states of the vertices – in this

case, the individual belugas comprising the dataset. The model is simulated for a given number

of iterations and the spin states of the particles, in the end, define the communities. Modularity

for this type of algorithm cannot be calculated.

Another network property related to clustering is transitivity, which occurs when two

nodes in the network are both neighbours of the same third node and because of that common

connection, have a higher probability of also being neighbours with each other (Girvan and

Page 161: Genetic diversity, population structure and phylogeography ...

143

Newman 2002). Transitivity is also known as the clustering coefficient, and the higher the value,

the more probable it is to observe a link between two nodes which are both connected to a third

one (Orman and Labatut 2011).

To assess the signals for structuring within the beluga relatedness networks, modularity

estimates for all networks were obtained for both the fast greedy and leading eigenvector

community algorithms. Transitivity for each network was also determined, and assignments of

individuals to each of the identified clusters found under the three clustering methods were

compared. One of the shortcomings of network analysis techniques is that we are unable to

perform traditional significance testing (Lusseau et al. 2006). However, the strength of the

observed modularity and transitivity values were assessed by comparing them to an expected

distribution of transitivity and modularity values for simulated datasets and relatedness networks

of unrelated individuals. The simulations to create the data were run with 200 iterations and used

allele frequencies from the original beluga dataset. The fast greedy community algorithm was

used to determine transitivity and modularity for each iteration of the simulation.

The networks and the clustering analysis results for each of the community algorithms

were imported and visualized in CytoScape ver 3.2.1 (Shannon et al. 2003). For all of the

network clustering analyses, the full networks were used. However, to assist in the visualization

of crowded networks, the networks were trimmed to include only weighted relatedness values

greater than 0.35 for figures (determined using trials of different values).

3.2.7 Bayesian and multivariate clustering analyses

Additional analyses were used to identify the most likely number of distinct genetic

clusters and assign individuals to these groups: a Bayesian model-based method implemented in

Page 162: Genetic diversity, population structure and phylogeography ...

144

STRUCTURE (Pritchard et al. 2000), and a multivariate method, the discriminant analysis of

principal components (DAPC) using adegenet (Jombart et al 2010) in R ver. 3.1.3 (R Core Team

2015).

Methods implemented in STRUCTURE use genotype data to describe and visualize

population structure based on allele frequencies assuming that a variety of conditions are met,

specifically, Hardy-Weinberg and linkage equilibrium within groups, random mating within

populations and free recombination between loci (Pritchard et al. 2010). The model uses

simulations to estimate group membership of each individual using several user-defined

parameters. It is important that these parameters be defined with sufficient rigour to best achieve

reproducibility of STRUCTURE results (Gilbert et al. 2012).

For the analyses of the entire beluga dataset (males and females, St. Lawrence Estuary

samples included), the admixture model with correlated allele frequencies was used, without

using any a priori information on sample locations or putative population origins. The model

was run with values of K (assumed number of distinct populations) set from 1 to 10 using a burn-

in period of 500,000 iterations followed by 100,000 Markov chain Monte Carlo (MCMC)

repetitions. Twenty independent runs were conducted for each value of K to check for

convergence of results and increase the precision of the parameter estimates (Gilbert et al. 2012).

The most likely number of clusters was chosen by calculating and finding the largest value for

ΔK, based on the second order rate of change of LnP(D) (the mean log likelihood of the data)

with respect to K (Evanno et al. 2005) as implemented in the software STRUCTURE

HARVESTER (Earl and von Holdt 2012).

Discriminant analysis of principal components (DAPC) employs methods to describe

genetic clusters of related individuals by maximizing differences between groups while

Page 163: Genetic diversity, population structure and phylogeography ...

145

minimizing variation within groups (Jombert et al. 2010). Again, beluga data were analyzed with

males and females separated, and both sexes combined, for the complete dataset including the St.

Lawrence Estuary as well as for only the Beaufort Sea locations.

All DAPC analyses were performed following the recommendations of Jombart (2014).

The data was first transformed using a principal components analysis and individuals then

assigned to groups using k-means clustering (max. clusters set to 10-15). During the first round

of analyses, all principal components (PCs) were retained (100-120) and the best number of

clusters to describe the data determined using the Bayesian Information Criterion (BIC). The first

DAPC was then performed on the decomposed data to assess differentiation among groups.

Individuals were assigned to clusters based on membership probabilities. However, retaining too

many PCs with respect to the number of individuals can lead to overfitting and instability of the

membership probabilities (Jombart 2014). Therefore a second round of DAPC was performed

using an optimized number of PCs for the analysis. This was determined using an a-score which

measures the trade-off between discrimination power and over-fitting. Final DAPC results were

visualized using scatter plots, and assignments of individuals to clusters were compared to

clusters assignments from all other types of analyses used for this study.

3.2.8 Mitochondrial DNA control region sequencing

Based on the results of the DAPC analyses of females, sequencing of mitochondrial DNA

was used to investigate potential structuring of maternal haplotype lineages among the main

sampling locations.

Mitochondrial DNA (mtDNA) sequences were generated using amplification reactions

designed to target a portion of the mtDNA control region. Primers Belmt-5 (5’-GAT AGA GTT

Page 164: Genetic diversity, population structure and phylogeography ...

146

TTT TGA GCC CG-3’) and Belmt-6 (5’-TCA CCA CCA ACA CCC AAA G-3’) were used in a

polymerase chain reaction (PCR) mixture containing 1x buffer, 25mM MgCl2, 10mM dNTP

mix, 20uM of each primer, 0.5units of Taq polymerase and approximately 50-200ng of template

DNA. The PCR amplifications were performed in a total volume of 50µL under the following

conditions: 95°C for10 min.; 35 cycles of 95°C for 20s, 55°C for 30s, 72°C for 45s; extension at

72°C for 10 min. Products were visualized using agarose gel electrophoresis, and successfully

amplified samples were cleaned using or QiaQuick PCR clean-up kits (Qiagen Inc. (Canada),

Toronto, ON) or RapidTips (Diffinity Genomics, D-Mark BioSciences, Toronto, ON). DNA

sequencing was performed using BigDye ver. 3.5 (ThermoFisher Scientific, Mississauga, ON)

with the Belmt-6 primer as the sequencing primer (2uM). The PCR sequencing temperature

profile was: 96°C for 1min.; 32 cycles of 96°C for 10s, 50°C for 30s, and 60°C for 4min; and an

extension at 72°C for 7min. Sequencing was performed on an Applied Biosystems 3130xl

genetic analyzer (ThermoFisher Scientific, Mississauga, ON).

DNA sequences were aligned and edited using MEGA ver. 6 (Tamura et al. 2011).

Individual sample sequence haplotypes were assigned using GenAlEx ver. 6 (Peakall and

Smouse 2006). The resulting “library” of unique haplotypes was verified each time a new

haplotype was found using alignment and visualization in the software package DNA Alignment

(Fluxus Technology Ltd. 2013)). Samples resulting in a new haplotype were also resequenced to

verify the sequence. A subset of samples was re-sequenced to estimate the error in haplotype

identification and positive control samples were used for every sequencing plate analysis to

verify the reproducibility of sequencing results.

Haplotype frequency distributions and geographic patterns were evaluated for the main

aggregation areas for all samples combined and for each sampling location by decade for

Page 165: Genetic diversity, population structure and phylogeography ...

147

evidence of both spatial and temporal differentiation. Genetic diversity and differentiation were

estimated for the main aggregation areas combined to have robust sample sizes. The number of

different haplotypes (NH), haplotype diversity (h), nucleotide diversity (π), the number of

segregating sites in each group and the average number of nucleotide differences (k) within

sample collections were determined using DnaSP ver. 5.1 (Librado and Rozas 2009). The

presence of fine-scale population structure was assessed using analysis of molecular variance

(AMOVA) between and within the sample collection groups and pairwise ΦST comparisons

using ARLEQUIN ver. 3.5.1.2 (Excoffier and Lischer 2010). AMOVA was also performed in

GenAlEx ver. 6.4 (Peakall and Smouse, 2012) using the FST analogue ΦPT and significance tested

using randomization of the data with the observed value being included as another permutation

(Peakall and Smouse 2010). The matrix of ΦPT pairwise differences produced in GenAlEx

(Peakall and Smouse 2012) was then used in a Principal Coordinates Analysis (PCoA) as a way

to visualize the patterns of genetic relationships among the sample collections from different

areas based on genetic distance.

Bonferroni correction (Holm 1979) is applied to pairwise tests of genetic differentiation

to decrease the risk of falsely rejecting the null hypothesis (of no difference), i.e. Type I error,

resulting from experiment-wise multiple testing (Narum 2006). Sequential Bonferroni

corrections address Type II errors (not rejecting the null hypothesis when it is false) for table-

wise multiple comparisons. These types of corrections have been accused of being misused

(Cabin and Mitchell 2000) and of being overly conservative and obscuring “potentially relevant

results” (Moran 2003, García 2004, Narum 2006). Instead, many authors advocate the use of

False Discovery Rate (FDR) procedures (Benjamini and Hochberg 1995) to provide better power

over falsely rejected hypotheses and allow for better interpretation of biologically meaningful

Page 166: Genetic diversity, population structure and phylogeography ...

148

patterns of genetic differentiation (e.g. Benjamini and Yekutieli 2001, García 2004, Narum

2006). Therefore, patterns of mtDNA differentiation based on genetic distance among sample

locations were evaluated using both Bonferroni correction and a Benjamini-Hochberg FDR

correction applied using software developed by Lesack and Naugler (2011).

3.3 Results

3.3.1 Validity and variability of microsatellite markers

Initially, 1054 beluga samples were analyzed. Nineteen duplicate samples were removed

from the dataset, and samples with less than 80% complete genotype at the 16 loci were also

excluded from the final dataset. This resulted in 964 samples from nine sampling locations

(Shingle Point, West Whitefish Stations, Kendall Island, Hendrickson Island, Tuktoyaktuk, East

Whitefish Station, Paulatuk, Husky Lakes, St. Lawrence River Estuary) and spanning three

decades being included in the subsequent analyses of group differences. The samples from the

three most important aggregation areas, i.e.: Shallow Bay, East Mackenzie Bay, and Kugmallit

Bay, were grouped for further analyses, but the samples from the individual entrapment events in

the Husky Lakes were kept separate to examine questions of interest in this study (Table 3.1).

A total of 17.5% of the dataset was reprocessed to estimate genotyping error. The error

rate across most loci ranged from 0.000 to 0.005 errors/allele. Microchecker analyses of the data

indicated the possible presence of null alleles and/or allelic dropout at locus KWM12A for the

2006 Husky Lakes samples and locus EV94 in the Kugmallit Bay sample group. Further

analyses in GENEPOP found a significant departure from Hardy-Weinberg proportions at locus

FCB17 in the St. Lawrence Estuary samples and for EV94 and KWM12A in the 2006 Husky

Lakes samples. The Husky Lakes and St. Lawrence samples came from whales that were

Page 167: Genetic diversity, population structure and phylogeography ...

149

emaciated (Husky Lakes) and/or from beach-cast animals that may have spent some time

decaying. Sample quality likely contributed to inconsistent DNA template quality or low

template quantity that could have affected PCR amplification at some loci (Dakin and Avise

2004). This is the likely cause here as the majority of missing data due to failed or “poor-quality”

results came from these two sample locations. Considering the effect is limited to few loci in the

overall genotypes and sample groups, all data were included in the analyses. Results of further

statistical analyses that assume Hardy-Weinberg equilibrium were carefully assessed to

determine if these data were causing any spurious results. For all sampling locations, departures

from expected Hardy-Weinberg proportions (Fis) were not significant (Table 3.2).

Tests for linkage disequilibrium (LD), estimated in GENEPOP, were significant for FCB

4/FCB17 and FCB8/FCB3 locus pairs across all populations; however, closer examination of the

results of each population revealed that this result was driven by a significant LD in Kugmallit

Bay (FCB4 and FCB17) and Paulatuk (FCB8 and FCB3). Use of these loci in other studies of

beluga datasets including different sized populations did not detect indications of linkage among

these loci (de March and Postma 2003, Turgeon et al. 2012). Therefore, the impact of this result

on the overall dataset was considered negligible.

Mean numbers of alleles varied by location and overall sample sizes (Table 3.2). Allelic

richness, adjusted by sample size, was smallest across loci for the St. Lawrence samples, as was

the observed heterozygosity. Analyses of probability of identity (P(ID)) resulted in negligible

probabilities that two individuals in the dataset would have matching genotypes due to chance

(Table 3.2) at a threshold of P(ID) < 0.0001 (Waits et al. 2001), and indicated high discriminatory

power to identify individuals with these loci.

Page 168: Genetic diversity, population structure and phylogeography ...

150

Table 3.1. Summary of samples analyzed from each location. Note that number of females and number of males do

not always total final number as PCR amplifications for sex determination failed in some samples.

Location Abbreviation N Samples N male N female Collection Years

Shallow

Bay/Niaqunnaq

Bay

NB 70 47 21 1988-2005

East

Mackenzie Bay

EM 192 130 62 1981-2006

Kugmallit Bay KB 480 398 64 1983-2008

Paulatuk PA 52 42 9 1993-2006

2006 Husky L. HA 28 26 2 2006

1989 Husky L. HB 27 20 7 1989

1996 Husky L. HC 9 5 3 1996

St. Lawrence

Estuary

SLE 106 38 41 1997-2008

TOTAL N 964 706 209

Page 169: Genetic diversity, population structure and phylogeography ...

151

Table 3.2. Measures of nuclear genetic diversity by sampling locations. N, sample size; NA, mean number of alleles

across all loci; PA, number of private alleles; AR, allelic richness; Ho, observed heterozygosity; He, unbiased expected

heterozygosity; Fis, the proportional excess of homozygotes relative to H-W proportions; Fis P-value with adjusted

nominal level for significance = 0.00039; PID, probability of identity. Values in parentheses are standard deviations.

Location N NA PA AR Ho He Fis Fis

P-value

PID

Shallow/Niaqunnaq

Bay

69 7.69

(0.64)

1 3.39 0.719

(0.044)

0.721

(0.041)

-0.029 0.941 9.3 x10-17

East Mackenzie

Bay

186 9.25

(0.75)

8 3.43 0.712

(0.040)

0.719

(0.039)

0.010 0.132 7.8 x10-17

Kugmallit Bay

470 9.69

(0.73)

13 3.42 0.708

(0.037)

0.719

(0.038)

0.015 0.009 8.1 x10-17

Paulatuk

51 7.63

(0.58)

2 3.44 0.728

(0.043)

0.708

(0.041)

0.003 0.432 3.3 x10-16

2006 Husky L.

26 6.50

(0.58)

0 3.41 0.683

(0.052)

0.716

(0.041)

0.047 0.037 5.2 x10-16

1989 Husky L.

27 7.00

(0.59)

3 3.47 0.756

(0.039)

0.725

(0.040)

-0.044 0.962 1.6 x10-16

1996 Husky L.

9 4.81

(0.36)

0 3.39 0.744

(0.052)

0.729

(0.036)

-0.024 0.677 1.4 x10-14

St. Lawrence

Estuary

103 6.00

(0.57)

3 2.82 0.634

(0.038)

0.624

(0.036)

-0.017 0.849 1.9 x10-12

Page 170: Genetic diversity, population structure and phylogeography ...

152

3.3.2 Relatedness and network clustering analyses

Fine-scale structuring of Beaufort Sea belugas based on relatedness was tested using a

priori groupings based on patterns of harvesting and ice entrapment events. All within-group

relatedness results based on various Beaufort Sea sample groupings were similar no matter how

the samples were combined. These results indicate that belugas harvested in different areas of the

eastern Beaufort Sea and near Paulatuk, as well as samples taken from entrapments in the Husky

Lakes, were not more related than expected for a random group of individuals. Samples collected

on the same day by hunters from the same whaling camps were not more related than expected

(see Figure 3.2 for Shallow Bay results as an example) indicating that pods of related whales

were not being sampled.

Hunted whales were also not found to be more related than expected when sampled by

decade within Bays (see Figure 3.3 for Shallow Bay results as an example), demonstrating that

temporal trends of relatedness were not present in the data.

Comparisons of relatedness among males only (Figure 3.4) and females only (Figure 3.5)

indicated that whales of the same sex in the harvest are not more related than expected. In

contrast to the Beaufort beluga samples, belugas from the St. Lawrence Estuary were more

related than anticipated. Relatedness estimates for St. Lawrence (SLSL) males only (Figure 3.4),

SLSL females (Figure 3.5) only, and both SLSL sexes combined (Figure 3.6) were just under

0.25 (half-siblings).

Page 171: Genetic diversity, population structure and phylogeography ...

153

Figure 3.2. Relatedness of beluga samples from Shallow/Niaquunaq Bay harvested by hunting camps on the same day

(labels, e.g. SBSB, indicate within-group comparison of a subset of the data). Samples from the Shingle Point camp

are indicated by histograms A – E and West Whitefish camp by F and G. The overall graph combines the results of

each of the same day relatedness estimates. Each histogram represents the distribution of expected values of

relatedness (based on iterative random reshuffling of individuals within the dataset) for each of the group comparisons

estimated under a null hypothesis that the samples represent a random group of individuals (i.e. they are not related).

The red arrow indicates where the observed value of relatedness lies in the distribution. An observed relatedness value

falling outside the distribution would reject the null hypothesis (see SLSL, Figure 3.4–3.6). P-values, based on

simulations, indicate how many of the calculated relatedness values were greater than or equal to the observed

relatedness value.

Page 172: Genetic diversity, population structure and phylogeography ...

154

Figure 3.3. Relatedness of beluga samples from Shallow/Niaqunnaq Bay harvested by hunting camps in the same

decade (labels, e.g. SBSB, indicate within-group comparison of a subset of the data). Samples from the Shingle Point

camp are indicated by histograms A (1980s), B (1990s), and C (2000s), and West Whitefish camp by D (1980s), E

(1990s), and F (2000s). The overall graph combines the results of each of the decadal relatedness estimates. Each

histogram represents the distribution of expected values of relatedness (based on iterative random reshuffling of

individuals within the dataset) for each of the group comparisons estimated under a null hypothesis that the samples

represent a random group of individuals (i.e. they are not related). The red arrow indicates where the observed value

of relatedness lies in the distribution. An observed relatedness value falling outside the distribution would reject the

null hypothesis. P-values, based on simulations, indicate how many of the calculated relatedness values were greater

than or equal to the observed relatedness value.

Page 173: Genetic diversity, population structure and phylogeography ...

155

Figure 3.4. Relatedness of male beluga samples (N=706) from: Shallow/Niaqunnaq Bay (NBNB) (note that the

duplication of “NB” and other location abbreviations denotes an estimation of relatedness within the aggregation area

rather than between areas); East Mackenzie Bay (EMEM); Kugmallit Bay (KBKB); Paulatuk (PAPA); 2006 Husky

Lakes (HAHA); and the St. Lawrence Estuary population (SLSL). Each histogram represents the distribution of

expected values of relatedness (based on iterative random reshuffling of individuals within the dataset) for each of the

group comparisons estimated under a null hypothesis that the samples represent a random group of individuals (i.e.

they are not related). The red arrow indicates where the observed value of relatedness lies in the distribution. An

observed relatedness value falling outside the distribution would reject the null hypothesis. P-values, based on

simulations, indicate how many of the calculated relatedness values were greater than or equal to the observed

relatedness value.

Page 174: Genetic diversity, population structure and phylogeography ...

156

Figure 3.5. Relatedness of female beluga samples (N=209) from: Shallow/Niaqunnaq Bay (NBNB) (note that the

duplication of “NB” and other location abbreviations denotes an estimation of relatedness within the aggregation area

rather than between areas); East Mackenzie Bay (EMEM); Kugmallit Bay (KBKB); Paulatuk (PAPA); 2006 Husky

Lakes (HAHA); and the St. Lawrence Estuary population (SLSL). Each histogram represents the distribution of

expected values of relatedness (based on iterative random reshuffling of individuals within the dataset) for each of the

group comparisons estimated under a null hypothesis that the samples represent a random group of individuals (i.e.

they are not related). The red arrow indicates where the observed value of relatedness lies in the distribution. An

observed relatedness value falling outside the distribution would reject the null hypothesis. P-values, based on

simulations, indicate how many of the calculated relatedness values were greater than or equal to the observed

relatedness value.

Page 175: Genetic diversity, population structure and phylogeography ...

157

Figure 3.6. Relatedness of all (male and female) beluga samples (N=964) from Shallow/Niaqunnaq Bay (NBNB) (note

that the duplication of “NB” and other location abbreviations denotes an estimation of relatedness within the

aggregation area rather than between areas); East Mackenzie Bay (EMEM); Kugmallit Bay (KBKB); Paulatuk

(PAPA); 2006 Husky Lakes (HAHA); and the St. Lawrence Estuary population (SLSL). Each histogram represents

the distribution of expected values of relatedness (based on iterative random reshuffling of individuals within the

dataset) for each of the group comparisons estimated under a null hypothesis that the samples represent a random

group of individuals (i.e. they are not related). The red arrow indicates where the observed value of relatedness lies in

the distribution. An observed relatedness value falling outside the distribution would reject the null hypothesis. P-

values, based on simulations, indicate how many of the calculated relatedness values were greater than or equal to the

observed relatedness value.

Page 176: Genetic diversity, population structure and phylogeography ...

158

Clustering analyses of weighted networks based on relatedness among individuals,

without any a priori groupings, echoed the results of the within-group relatedness comparisons.

The base network topologies themselves can be indicative of the population genetic structure

among the samples (Dyer and Nason 2004, Garroway et al. 2008). In our study, the

interconnected nodes (with each node being a beluga sample) represent the patterns of

relatedness among all of the individual whales with edge (the relationship between individuals)

length proportional to the level of relatedness. In most cases, the networks (even ‘trimmed’ to

minimize low relatedness connections, those at r < 0.35) were dense clouds of complex

connections without any obvious patterns. The only apparent genetic structure was found when

the St. Lawrence Estuary samples were included in the analyses, and two clusters are separated

in the network (Figure 3.7).

Clustering analyses of the network were employed to assign individual whales to discrete

clusters; however, the number of clusters varied depending on the clustering algorithm used, and

samples were not always grouped in the same clusters between methods. Analyses of male

samples from the Beaufort Sea sampling locations (Figure 3.8) resulted in two clusters using the

fast greedy method and three clusters using both the leading eigenvector and the spinglass

community methods.

Samples of females from the Beaufort Sea (Figure 3.9) formed more clusters than the

males: four clusters using the fast greedy method, three clusters using the leading eigenvector

method and four clusters with the spinglass community method. The leading eigenvector method

cluster sizes were 44:78:46, with the other two methods more evenly distributed: 55:45:21:47 for

the spinglass community method and 35:30:51:52 for the fast greedy method. As with the males,

samples did not assign to groups consistently, however, for the females, there was a more

Page 177: Genetic diversity, population structure and phylogeography ...

159

noticeable pattern of the same sub-groups of samples appearing together in the different clusters

found by the different methods.

Combining all samples, males and females (Figure 3.10) formed three clusters (though

most samples fell into two clusters) using the fast greedy method, three clusters using the leading

eigenvector method and nine clusters (most samples grouped into three clusters) with the

spinglass community method. Though the different clustering methods produced similar

numbers of clusters for the Beaufort Sea beluga samples, assignment of individual samples to the

clusters was not consistent across the clustering methods. Furthermore, the observed values for

transitivity (clustering coefficient) and modularity (using the fast greedy community method) fell

within the distribution of expected values generated for simulated data of unrelated individuals

for all analyses. The combination of these results suggests that there is not a strong signal for

structuring within any of the Beaufort Sea beluga relatedness networks analyzed.

Once again, the inclusion of the St. Lawrence estuary samples provided perspective on

these results as the majority of the SLE samples were consistently grouped together in a cluster

separate from the Beaufort Sea samples (Figure 3.7). However, observed transitivity and

modularity were still not different than expected for unrelated individuals. This may be due to

the overall very low level of relatedness among the Beaufort Sea samples which made up most of

the samples network.

Page 178: Genetic diversity, population structure and phylogeography ...

160

Figure 3.7. Network of male and female beluga samples from all Beaufort Sea sampling locations (Shallow/Niaqunnaq

Bay, East Mackenzie Bay, Kugmallit Bay, and Husky Lakes), and the St. Lawrence Estuary, coloured based on group

assignment: (A) to three groups found by the fast greedy method; (B) to three groups found by the leading eigenvector

method; and (C) to six groups found by the spinglass community method. For ease of visualization, all connections

with relatedness values < 0.35 were truncated to 0. Also shown are (D) the observed and expected values of transitivity

for the network and (E) the observed and expected values of modularity (based on the fast greedy method) for the

network.

Page 179: Genetic diversity, population structure and phylogeography ...

161

Figure 3.8. Network of male beluga samples (N=668) from all Beaufort Sea sampling locations (Shallow/Niaqunnaq

Bay, East Mackenzie Bay, Kugmallit Bay, and Husky Lakes), coloured based on group assignment: (A) to two groups

found by the fast greedy method; (B) to three groups found by the leading eigenvector method; and (C) to three groups

found by the spinglass community method. For ease of visualization, all connections with relatedness values < 0.35

were truncated to 0. Also shown are (D) the observed and expected values of transitivity for the network and (E) the

observed and expected values of modularity (based on the fast greedy method) for the network.

Page 180: Genetic diversity, population structure and phylogeography ...

162

Figure 3.9. Network of female beluga samples (N=168) from all Beaufort Sea sampling locations (Shallow/Niaqunnaq

Bay, East Mackenzie Bay, Kugmallit Bay, and Husky Lakes), coloured based on group assignment: (A) to four groups

found by the fast greedy method; (B) to three groups found by the leading eigenvector method; and (C) to four groups

found by the spinglass community method. For ease of visualization, all connections with relatedness values < 0.35

were truncated to 0. Also shown are (D) the observed and expected values of transitivity for the network and (E) the

observed and expected values of modularity (based on the fast greedy method) for the network.

Page 181: Genetic diversity, population structure and phylogeography ...

163

Figure 3.10. Network of all (male and female) beluga samples (N=858) from all Beaufort Sea sampling locations

(Shallow/Niaqunnaq Bay, East Mackenzie Bay, Kugmallit Bay, and Husky Lakes), coloured based on group

assignment: (A) to three groups found by the fast greedy method; (B) to three groups found by the leading eigenvector

method; and (C) to nine groups found by the spinglass community method. For ease of visualization, all connections

with relatedness values < 0.35 were truncated to 0. Also shown are (D) the observed and expected values of transitivity

for the network and (E) the observed and expected values of modularity (based on the fast greedy method) for the

network.

Page 182: Genetic diversity, population structure and phylogeography ...

164

3.3.3 Analyses with STRUCTURE

Analysis using STRUCTURE with all samples from the Eastern Beaufort Sea (EBS) and

the St. Lawrence Estuary (SLE) identified the most likely number of clusters as two (based on

the Evanno method) (Figure 3.11). The majority of individuals (99%) were strongly assigned to

one of the clusters with assignment probability Q > 0.90. There were three samples from the SLE

population that had a strong assignment to the EBS cluster (green lines in the red area, Figure

3.11).

Further STRUCTURE analysis of Beaufort Sea samples only (as the strong

differentiation between the EBS samples and the SLE samples may overwhelm subtle structure

within EBS) did not support any number of K groups above one, with individual samples equally

assigned to all clusters. However, the model-based clustering approach used in STRUCTURE

does not perform well at low levels of genetic differentiation (Fst < 0.28) and also is not

appropriate for describing relatedness among clusters (Putman and Carbone 2014).

Page 183: Genetic diversity, population structure and phylogeography ...

165

Figure 3.11. (A) Bayesian clustering assignment of individual beluga samples (N=964) into K=2 clusters based on

microsatellite data. Each vertical column represents one individual, with the length of the coloured segments

proportional to the assignment strength of that individual to one of the clusters. Samples are grouped by sampling

locations from left to right: 1) Kugmallit Bay; 2) East Mackenzie Bay; 3) Shallow/Niaqunnaq Bay; 4) Paulatuk; 5)

2006 Husky Lakes; 6) 1989 Husky Lakes; 7) 1996 Husky Lakes; and 8) St. Lawrence Estuary. Within each sample

location group, samples are ordered by sampling year. (B) Detection of the number of K groups that best fit the

beluga data performed with the Evanno method (Evanno et al. 2005) and implemented in Structure Harvester (Earl

and vonHoldt 2012).

Page 184: Genetic diversity, population structure and phylogeography ...

166

3.3.4 Discriminant Analysis of Principal Components (DAPC)

DAPC is designed to be able to describe clusters of genetically related individuals

(Jombert et al. 2010) and may perform better than STRUCTURE for detection of population

subdivision in some cases. Using this method, nine clusters were identified for male beluga

samples from the Beaufort Sea locations and three clusters identified for the female samples

(Figure 3.12). For the male samples, there was some degree of overlap among the clusters, with

six of the nine identified clusters almost completely overlapping. However, at least one cluster

(cluster 3 in Figure 3.12A) was clearly separate from the other clusters, and two others (clusters

5 and 6 in Figure 3.12A) appeared distinct. Conversely, the DAPC analysis of female Beaufort

Sea samples resulted in clearly separated clusters (Figure 3.12B). These patterns suggest that

differences in kin relationships among males and females are present within the nearshore open-

water habitat of the EBS belugas, with females more strongly clustering in related groups.

However, for both males and females, the clusters are not formed by animals sampled from a

particular aggregation area, nor from a particular period.

DAPC analyses of males and females combined had a large degree of overlap among

clusters for the EBS samples, with and without the inclusion of the St. Lawrence Estuary

samples (Figure 3.13). The SLE samples, as expected, formed a distinct cluster separated from

the nine clusters observed for the EBS samples (Figure 3.13B). When considered on their own,

the Beaufort Sea samples formed eight overlapping clusters (Figure 3.13A) in a pattern similar to

that observed in the analyses that included the SLE samples

Page 185: Genetic diversity, population structure and phylogeography ...

167

Figure 3.12. Discriminant Analysis of Principal Components (DAPC) clustering of male (A) and female (B) beluga

samples from all Beaufort Sea sampling locations (Shallow/Niaqunnaq Bay, East Mackenzie Bay, Kugmallit Bay,

Paulatuk and Husky Lakes), coloured based on group assignment.

Page 186: Genetic diversity, population structure and phylogeography ...

168

Figure 3.13. Discriminant Analysis of Principal Components (DAPC) clustering of male and female beluga samples

combined from: (A) all Beaufort Sea sampling locations (Shallow/Niaqunnaq Bay, East Mackenzie Bay, Kugmallit

Bay, Paulatuk and Husky Lakes); and (B) all Beaufort Sea sampling locations and the St. Lawrence Estuary,

coloured based on group assignment.

Page 187: Genetic diversity, population structure and phylogeography ...

169

3.3.5 Mitochondrial DNA control region sequencing

A total of 1048 beluga samples were successfully sequenced over a 702bp portion of the

mtDNA control region. Duplicate samples were identified based on nuclear DNA genotypes, and

a total of ten samples were removed from the dataset. A further six individual samples were

removed due to poor sequence quality. Two different control samples were included during

every sequencing analysis run, resulting in each control sample having 10 replicates. Both

control samples were consistently scored as the same haplotype over all runs. A total of 122

samples were re-analyzed to estimate error rate and resulted in one sample mismatch in the

haplotype score (estimated error rate 0.8%).

The final sequence dataset included a total of 1032 individuals which revealed 37

variable positions and resulted in the identification of 55 unique haplotypes (Table 3.3). The

resolution of a larger portion of mtDNA control region sequence as compared to the data used in

Chapter 2 (702bp vs. 609bp) did not alter the number of haplotypes or diversity in the

Hendrickson Island sample set (which was the Beaufort Sea sample set used for the Canada-wide

comparisons in Table 2.2). Similarly, the larger portion of mtDNA control region sequence did

not increase the number of haplotypes observed in the St. Lawrence Estuary samples, although

the discard of a few poor quality samples slightly reduced the number of haplotypes and

haplotype diversity for this location.

Page 188: Genetic diversity, population structure and phylogeography ...

170

Table 3.3. Summary of mtDNA haplotype diversity found in beluga sample collections. Abbreviations: EWF, East

Whitefish Station; HI, Hendrickson Island; KI, Kendall Island; PA, Paulatuk; SP, Shingle Point, Tuk, Tuktoyaktuk;

WWF, West Whitefish Station; Hsky, Husky Lakes; SLE, St. Lawrence Estuary. The order of highest (1) to lowest

(10) levels of haplotype diversity and nucleotide diversity are indicated in parentheses.

Sample

Collection

Number of

sequences

Number of

haplotypes

Number of

segregating

sites

Avg. number

nucleotide

diff. (k)

Haplotype

diversity (h)

Nucleotide

diversity (π)

EWF 153 25 21 2.38 0.817 (5) 0.0034 (6)

HI

298 27 19 2.27 0.776 (7) 0.0032 (7)

KI 205 32 25 2.54 0.838 (3) 0.0036 (4)

PA 64 16 15 1.87 0.763 (8) 0.0027 (8)

SP 37 12 13 3.02 0.887 (1) 0.0043 (1)

Tuk 52 12 13 1.79 0.658 (9) 0.0025 (9)

WWF 35 15 14 2.35 0.867 (2) 0.0034 (6)

Hsky2006 30 11 11 1.72 0.825 (4) 0.0025 (9)

Hsky1989 26 10 11 2.69 0.800 (6) 0.0038 (2)

Hsky1996 10 6 8 2.44 0.867 (2) 0.0035 (5)

SLE 122 11 18 2.60 0.422 (10) 0.0037 (3)

Total 1032 55 37 3.91 0.839 0.0070

Page 189: Genetic diversity, population structure and phylogeography ...

171

Overall, the haplotype diversity ranged in the sampling locations from the lowest level

observed in the St. Lawrence Estuary samples (h = 0.422) to the highest in the Shingle Point

samples (h = 0.887). Nucleotide diversity was also highest within the Shingle Point samples (π =

0.0043) but was lowest in the Tuktoyaktuk and 2006 Husky Lakes samples (π = 0.0025). Within

the western Canadian Arctic samples only, the distribution of haplotypes was similar across

locations (except for Shingle Point), and one haplotype was dominant (Figure 3.14, dark blue

colour). The St. Lawrence Estuary haplotypes were very different from the western Arctic,

though these samples were also dominated by a single haplotype (yellow colour).

As expected, AMOVA results significantly differentiated the St. Lawrence Estuary

samples from all of the western Arctic locations (Table 3.4). Overall, 56% of the molecular

variance was found within sample groups, and 44% of variation found among sample groups

(ΦPT = 0.442, p = 0.000). However, Shingle Point samples were also found to be significantly

different from the Paulatuk samples and the 2006 Husky Lakes entrapment samples (Table 3.4).

These patterns of differentiation were highlighted in the PCoA ordination of genetic distance ΦPT

(Figure 3.15), where 82% of the variation was contained in the first axis separating the St.

Lawrence Estuary samples from the western Arctic samples. Shingle Point was at the opposite

boundary of the y-axis from the Paulatuk and 2006 Husky Lake samples depicting the range of

variation in the sample collections from the western Arctic locations.

Page 190: Genetic diversity, population structure and phylogeography ...

172

Figure 3.14. Distribution of mtDNA haplotypes among sampling locations. Each coloured slice of the pie represents

a unique haplotype (N=55) and the size of the pie slice indicates the relative frequency of that haplotype in the total

sample at each location. Inset map of Canada indicates the Beaufort Sea Large Ocean Management Area (LOMA)

outlined in red. Abbreviations: SP, Shingle Point; WWF, West Whitefish Station; KI, Kendall Island; HI,

Hendrickson Island; HSKY, Husky Lakes; EWF, East Whitefish Station; TUK, Tuktoyaktuk; PA, Paulatuk; SLE,

St. Lawrence Estuary.

Page 191: Genetic diversity, population structure and phylogeography ...

173

Table 3.4. Patterns of mtDNA differentiation based on genetic distance among sample locations (Nsamples=1032). ΦST

values, calculated in Arlequin ver. 3.5 (below diagonal), and ΦPT values, calculated in GenAlEx ver. 6.5 (above

diagonal), are both FST analogues based on nucleotide divergence among haplotypes. Significant pairwise

comparisons (P<0.001, using minimum significance levels with sequential Bonferroni correction (Holm 1979) based

on a table-wise significance of P=0.05) are highlighted in grey. Abbreviations: EWF, East Whitefish Station; HI,

Hendrickson Island; KI, Kendall Island; PA, Paulatuk; SP, Shingle Point; Tuk, Tuktoyaktuk; WWF, West Whitefish

Station, Hsky, Husky Lakes entrapments in 2006, 1989, and 1996; SLE, St. Lawrence Estuary.

EWF HI KI PA SP Tuk WWF Hsky06 Hsky89 Hsky96 SLE

EWF * 0.000 0.000 0.016 0.034 0.008 0.000 0.038 0.000 0.008 0.748

HI -0.004 * 0.000 0.013 0.046 0.005 0.000 0.035 0.004 0.012 0.757

KI -0.003 -

0.000

* 0.027 0.022 0.018 0.000 0.049 0.000 0.011 0.741

PA 0.156 0.013 0.027 * 0.113 0.000 0.000 0.018 0.052 0.000 0.759

SP 0.034 0.046 0.022 0.113 * 0.092 0.050 0.148 0.000 0.066 0.732

Tuk 0.008 0.005 0.018 -

0.003

0.092 * 0.000 0.033 0.032 0.034 0.763

WWF -0.006 -

0.006

-

0.001

-

0.002

0.050 -0.008 * 0.026 0.008 0.000 0.747

Hsky06 0.038 0.035 0.049 0.018 0.149 0.033 0.026 * 0.097 0.035 0.747

Hsky89 -0.000 0.004 -

0.005

0.053 -

0.015

0.032 0.008 0.097 * 0.032 0.737

Hsky96 0.008 0.012 0.011 -

0.011

0.006 0.034 -0.003 0.035 0.032 * 0.732

SLE 0.748 0.757 0.741 0.759 0.732 0.763 0.7467 0.747 0.737 0.732 *

Page 192: Genetic diversity, population structure and phylogeography ...

174

Figure 3.15. Principal Coordinates Analysis (PCoA) using ФPT genetic distance matrix for mtDNA sequence data for

beluga sample locations. Symbols are coloured to correspond to sampling areas in Table 1: Shallow Bay – black,

Shingle Point (SP), West Whitefish Station (WWF); East Mackenzie Bay – yellow, Kendall Island (KI); Kugmallit

Bay – green, East Whitefish Station (EWF), Hendrickson Island (HI), Tuktoyaktuk (TuK); Paulatuk (PA) – pink;

Husky Lakes (Hsky) – red; St. Lawrence Estuary (SLE) – blue.

EWFHI

KI

PA

SP

Tuk

WWF

Hsky06

Hsky89

Hsky96

SLE

MD

S2 (

13

.93

% o

f va

riat

ion

)

MDS1 (82.03% of variation)

Page 193: Genetic diversity, population structure and phylogeography ...

175

Using a Benjamini-Hochberg FDR correction, Shingle Point was additionally

differentiated from Hendrickson Island and Tutoyaktuk. Kendall Island was also differentiated

from both the Paulatuk and 2006 Husky Lakes samples, and Husky Lakes 1989 was

differentiated from the 2006 Husky Lakes samples. These new results confirm the trends

revealed by the PCoA ordination of the dataFurther analysis of temporal variation within sample

locations also revealed trends in the data as illustrated by PCoA ordination of the samples

grouped by decade at each of the locations (Figure 3.16). AMOVA results based on these finer

scale groups revealed almost the same overall pattern of molecular variance as the previous

analysis (57% variation within sample groups, 43% variation among sample groups, ΦPT =

0.442, p = 0.000); however, results of individual pairwise differences also indicated a trend of

change over time from the 1980s to the 2000s (Table 3.5). The majority of the significant

differentiation occurred between the 1980s and the 2000s, especially for Husky Lakes, Kendall

Island and West Whitefish Station. However, it is important to note that the number of samples

from the 1980s that were analyzed was small compared to other decades, especially for West

Whitefish Station (N=3) (Table 3.6).

Page 194: Genetic diversity, population structure and phylogeography ...

176

Figure 3.16. Principal Coordinates Analysis (PCoA) using ФPT genetic distance matrix for mtDNA sequence data for

beluga samples grouped by location and decade of sampling. Location abbreviations: Shingle Point (SP); West

Whitefish Station (WWF); Kendall Island (KI); East Whitefish Station (EWF); Hendrickson Island (HI),

Tuktoyaktuk (TuK); Paulatuk (PA); Husky Lakes (Hsky); St. Lawrence Estuary (SLE). A representative sampling

location from each of the main aggregation areas in the Beaufort Sea is highlighted: Shallow Bay (WWF) in purple;

East Mackenzie Bay (KI) in yellow; Kugmallit Bay (HI) in red.

Page 195: Genetic diversity, population structure and phylogeography ...

177

Table 3.5. Patterns of differentiation based on mtDNA genetic distance among sample collections (Nsamples=1032) divided by decades 1980s, 1990s, and 2000s.

ΦPT values, calculated in GenAlEx ver. 6.5, are below the diagonal with associated p-values above the diagonal. Significant pairwise comparisons, using False

Discovery Rate (FDR) corrections, are highlighted in grey. Abbreviations: EWF, East Whitefish Station; HI, Hendrickson Island; KI, Kendall Island; PA,

Paulatuk; SP, Shingle Point; Tuk, Tuktoyaktuk; WWF, West Whitefish Station, Hsky, Husky Lakes entrapments in 2006, 1989, and 1996; SLE, St. Lawrence

Estuary. Patterns of results using ΦST values, calculated in Arlequin ver. 3.5 (data not shown), were identical.

Page 196: Genetic diversity, population structure and phylogeography ...

178

Table 3.6. Summary of mtDNA haplotype diversity found in beluga sample collections grouped by decade 1980s,

1990s, and 2000s. Abbreviations: EWF, East Whitefish Station; HI, Hendrickson Island; KI, Kendall Island; PA,

Paulatuk; SP, Shingle Point, Tuk, Tuktoyaktuk; WWF, West Whitefish Station; Hsky, Husky Lakes; SLE, St.

Lawrence Estuary. The order of highest (1) to lowest (23) levels of haplotype diversity and nucleotide diversity are

indicated in parentheses.

Sample

Collection

Number of

sequences

Number of

haplotypes

Number of

segregating

sites

Avg. number

nucleotide

diff. (k)

Haplotype

diversity (h)

Nucleotide

diversity (π)

EWF2000s 94 22 18 2.29 0.800 (15) 0.0033 (10)

EWF1980s 13 10 9 2.80 0.962 (1) 0.0041 (5)

EWF1990s 46 14 14 2.47 0.813 (13) 0.0035 (8)

HI2000s 218 26 19 2.19 0.756 (17) 0.0031 (12)

HI1980s 12 6 8 3.17 0.818 (11) 0.0045 (2)

HI1990s 68 20 15 2.35 0.810 (14) 0.0033 (10)

KI2000s 115 21 19 2.38 0.814 (12) 0.0034 (9)

KI1980s 18 10 10 3.40 0.928 (3) 0.0048 (1)

KI1990s 72 19 17 2.44 0.844 (9) 0.0035 (8)

PA2000s 57 15 14 2.01 0.787 (16) 0.0029 (13)

PA1990s 7 3 2 0.57 0.524 (20) 0.0008 (18)

SP2000s 15 9 10 3.11 0.914 (4) 0.0044 (3)

SP1980s 7 5 6 3.05 0.905 (5) 0.0043 (4)

SP1990s 15 9 10 3.14 0.933 (2) 0.0045 (2)

Tuk2000s 20 5 7 1.06 0.442 (21) 0.0015 (17)

Tuk1980s 10 6 9 2.87 0.844 (9) 0.0041 (5)

Tuk1990s 22 9 10 1.97 0.749 (18) 0.0028 (14)

WWF2000s 15 9 12 1.92 0.848 (8) 0.0027 (15)

WWF1980s 3 2 3 2.00 0.667 (19) 0.0029 (13)

WWF1990s 16 9 9 1.96 0.858 (7) 0.0028 (14)

Hsky2006 30 11 11 1.72 0.825 (10) 0.0025 (16)

Hsky1989 26 10 11 2.69 0.800 (15) 0.0038 (7)

Page 197: Genetic diversity, population structure and phylogeography ...

179

Sample

Collection

Number of

sequences

Number of

haplotypes

Number of

segregating

sites

Avg. number

nucleotide

diff. (k)

Haplotype

diversity (h)

Nucleotide

diversity (π)

Hsky1996 10 6 8 2.44 0.867 (6) 0.0035 (8)

SLE2000s 91 10 17 2.73 0.431 (22) 0.0039 (6)

SLE1990s 31 5 16 2.25 0.398 (23) 0.0032 (11)

Page 198: Genetic diversity, population structure and phylogeography ...

180

3.4 Discussion

Testing for patterns of relatedness among whales sampled from different harvest areas

has the potential to shed light on the presence or absence of kin structure among belugas within

stocks and populations. In the present study, however, there was no evidence of fine-scale kin

structure in beluga whales in the Beaufort Sea related to nearshore aggregation or harvesting

location. Given this result, harvesting practices that have been used in the region over the last

several decades do not appear to be impacting particular kin-groups or distinct genetic units

associated with particular geographic locations.

This contrasts with belugas studied in the Hudson Bay population of eastern Canada

where close kin, including relatives other than parents and siblings, were caught in harvests of

animals travelling together during migration (Colbeck et al. 2013). These associations of related

individuals formed among belugas that aggregated in space and time during the migratory

period; however, the kin groups (other than mothers and offspring) dissociated once animals

reached the summering areas. However, it has been suggested that kin structure may vary

between populations due to differences in local animal densities and individual space-use

strategies (Zeyl et al. 2009). Indeed, social systems of delphinids have been shown to vary within

species as a result of individual responses to variation in environments and resources (Gowans et

al. 2008). The summer distribution of belugas in the Eastern Beaufort Sea and the Amundsen

Gulf covers a much larger geographic area than the summer distribution of individual stocks of

whales in Hudson Bay (Reeves et al. 2014), so alternate space use strategies and social structure

may be possible.

Within the Beaufort Sea area, samples taken from belugas harvested in aggregation

locations were not more related than expected for individuals grouped at random. There are

Page 199: Genetic diversity, population structure and phylogeography ...

181

several possible reasons for this. First, the harvesting practices in the Beaufort Sea area select for

adult males and this bias is clearly reflected in the samples available for genetic analyses. Unlike

the study in Hudson Bay/Hudson Strait (Colbeck et al. 2013, see Supplementary Data), females,

and especially females with calves, are not proportionately represented in the Beaufort Sea

samples. The Beaufort Sea beluga harvest is strongly biased towards males (4:1 males:females),

as opposed to the more equal ratio of males to females in samples collected from Hudson

Strait/Hudson Bay harvests. Thus, associations between mother and calves, as found in Hudson

Bay summering areas (Colbeck et al. 2013), would be difficult to detect in the type of samples

available for this study. Second, with the exception of Paulatuk and the Husky Lake ice

entrapments, pods, or groups of whales, are not generally harvested at one time. This also limited

genetic analyses of samples of whales known to be associated in space and time. Third, despite

the large geographic distribution of whales in the Beaufort Sea and surrounding areas,

coordinated movements among kin, such as during migration (Colbeck et al. 2013), may not be

happening in this area or not represented by the samples available for analyses. However, the

objectives of this study were focussed on the kin-relationships in beluga aggregation areas where

the largest effort for harvesting of EBS belugas occurs. Investigations of kin-relationships during

EBS beluga migration and larger population questions will require further research with different

sample collection timing and strategies such as biopsy sampling of free-ranging whales.

The apparent lack of kin structure based on geographically aggregated groups most likely

reflects the beluga movement patterns revealed by data from satellite tagged whales. Male

belugas left the Mackenzie Estuary in the first half of July and travelled north to the permanent

pack ice or east towards the Amundsen Gulf (Richard et al. 2001b). Female belugas were also

shown to leave the Mackenzie Delta to make short-term trips to the Amundsen Gulf and back to

Page 200: Genetic diversity, population structure and phylogeography ...

182

nearshore or offshore waters of the delta. Even though the speeds at which the whales moved

were relatively slow (less than 3 km/hr), which corresponded to potential movements of about 24

km/day (Richard et al. 2001b), our analyses indicate that belugas do not even cluster in short-

term groups of related individuals in aggregations areas. Evidence of this would have been

detected in the analysis of relatedness in whales harvested in the same area on the same day.

Thus, our results suggest that patterns of movements among adult whales in the coastal regions

are not constrained by kinship and correspond to general movements of unrelated animals. This

would indicate that sociality based on kinship among adults is not a major factor influencing

spatial structure on a fine scale for EBS belugas during the period when samples were obtained.

This has also been observed in coastal river otters (Lontra canadensis), for example, where

sociality seems to arise from cooperative group foraging on high-quality prey resources with no

kin associations within these groups (Blundell et al. 2004). Similarly, studies involving giraffes

have also shown that animals that are not close kin may associate with each other solely on the

basis of similar habitat preferences (Carter et al. 2013). Another type of sociality, alliance

formation for the protection of calves, has also been proposed for EBS belugas (Loseto et al.

2006). It has been suggested that association of close kin could facilitate the cooperative

protection of young in delphinids (Gaspari et al. 2007). Again, our results provided no evidence

that such alliances are formed among related individuals. The lack of juveniles in our sample (or

samples more representative of the population as a whole) may obscure the involvement of kin in

this type of social behaviour (Loseto et al. 2006), but our results do provide an additional

perspective to this idea.

While relatedness in EBS belugas was not found using a priori divisions of the samples,

some clusters of genetically-related individuals were detected within the full set of samples,

Page 201: Genetic diversity, population structure and phylogeography ...

183

particularly for females. The lack of spatial or temporal patterns for the clusters indicates a lack

of fine-scale kin structure but may suggest that for the nearshore portion of the stock overall,

female EBS belugas form moderate bonds with other female kin and there is female philopatry to

the overall Eastern Beaufort Sea area. This would be a similar sociogenetic pattern to other

delphinids that live in fission-fusion societies (Möller 2011). The seasonal philopatry of related

groups of female belugas may have benefits such as the maintenance of detailed knowledge of

food distribution, avoidance of predation, and support from matrilineal kin for breeding success

(Clutton-Brock and Lukas 2012).

The belugas sampled from ice-entrapments in the Husky Lakes are from a different

habitat, different period and are a more natural group sample from the EBS beluga stock than the

harvest samples. Also, the timing of the entrapments would correspond to the time when animals

would be in migratory groups starting their westward fall migration back to wintering areas

(Richard et al. 2001b). In Hudson Bay belugas, groups of whales migrating together were found

to be networks of related individuals (Colbeck et al. 2013). However, our results based on

entrapment samples were not different from the July harvest samples. For each of the entrapment

years sampled, there was no evidence of relatedness nor any clusters in the network analyses that

corresponded to the individual entrapment events. These results were unexpected. It seemed

likely that the attraction of this area for belugas revolve around unique foraging opportunities in

that the Husky Lakes are rich in many fish species (e.g. lake trout, whitefish, cod) (Inuvialuit

Land Administration 2011, Kocho-Schellenberg 2010). Though the benefits of such a feeding

area are high, the risk of ice entrapment is also high. However, avoidance of ice entrapments has

been suggested to be learned from older, experienced whales (Wade et al. 2012) which might

mitigate the risk in this situation. This does not seem to be true for the belugas sampled from the

Page 202: Genetic diversity, population structure and phylogeography ...

184

Husky Lakes. It is possible that the lack of complete sampling of all individuals involved in the

entrapment may have biased our analyses of group relatedness. However, the analyses of average

relatedness in mass-stranded pods of common dolphins (Delphinus delphis) also did not reveal

closer kin relationships than samples taken from a random collection of animals from different

groups (Viricel et al. 2008). This finding was extrapolated to overall common dolphin

populations, and the authors concluded that social organization in this delphinid species is not

based on kinship. The results of our Husky Lakes beluga analyses, together with the analyses of

harvested beluga samples, support a similar view of Eastern Beaufort Sea belugas. Social

structure, at least among adult whales in the stock, is most likely rooted in habitat selection and

foraging opportunities (an additional hypothesis suggested by Loseto et al. 2006) without the

added influence of kin-biased association. A similar result was found with the analyses of

narwhals (Monodon monoceros) sampled from an ice entrapment in the eastern Canadian Arctic

(Watt et al. 2015). Among the portion of samples collected, genetic relatedness and fatty acid

signatures were not correlated, suggesting that foraging and kin-based social groups are not

linked for narwhal.

This study was the first to apply measures of relatedness, network analyses, and Bayesian

methods to detect distinct genetic clusters within the EBS beluga stock. The use of beluga

samples from the St. Lawrence Estuary as a contrasting population revealed the types of patterns

that could be detected using these methods from a small, isolated beluga population.

Furthermore, the STUCTURE analysis revealed the ability to detect individual outliers in the

data (Figure 3.11). Three individual beluga samples from the SLE population appeared to have a

strong assignment to the EBS population.These are likely samples of belugas that “strayed” into

the St. Lawrence Estuary from an eastern Canadian beluga population, probably from the

Page 203: Genetic diversity, population structure and phylogeography ...

185

overwintering area in Baffin Bay. Incidents of strays (usually young adults) and observations of

extralimital belugas are not uncommon (Brown Gladden et al. 1997, Brown Gladden et al. 1999,

O’Corry-Crowe et al. 2015), but no apparent strays were detected in the EBS samples.

However, the patterns of mitochondrial DNA haplotype distributions do suggest a

matrilineal driven philopatry for EBS belugas (Brown-Gladden et al. 1999), especially to the

core summer aggregation areas within the Mackenzie Delta where large numbers of animals are

seasonally predictable and are most regularly harvested (i.e. Kendall Island, Hendrickson Island,

East Whitefish Station, Tuktoyaktuk). The mtDNA haplotypes and frequencies in these areas

were not significantly different across hunting camp locations. However, in the areas near the

edges of the nearshore range of the stock (Shingle Point, Paulatuk and Husky Lakes), variability

in haplotype frequencies was observed that resulted in some significant differences in pairwise

genetic comparisons involving samples from these locations. This variability could be linked to

changes in group structure as compared to core nearshore areas around the Mackenzie Delta area

that may be consistent with fission-fusion dynamics related to the distribution and availability of

prey and beluga feeding behaviour (Lehmann et al. 2006, Dunbar et al. 2009, Grove 2011).

Studies of migrating beluga whales found that feeding behaviour in the nearshore areas was more

common during migration than in summer aggregations where behaviours other than feeding

(e.g. moulting, calving, protections from predation/disturbance) may be more important

(Quakenbush et al. 2015). This matches the observation that EBS belugas do not appear to feed

extensively in the core aggregation areas near the Mackenzie River estuary, but they are known

to feed in east Amundsen Gulf (nearer to Paulatuk) and during migration (Harwood et al. 2014),

as well as likely within the Arctic Archipelago as inferred from deep-diving males in Vicount-

Melville Sound (Richard et al. 2001b). The timing of sampling for the Husky Lakes (early winter

Page 204: Genetic diversity, population structure and phylogeography ...

186

entrapments) and Shingle Point (late summer/early fall) would coincide with beluga groups that

have formed during the start of migrations out of the eastern Beaufort Sea areas towards the

wintering grounds to the west (Richard et al. 2001b).

Whether or not there is a matrilineal social structuring that is occurring as groups form to

feed and/or migrate in the EBS beluga stock is hard to determine. It is challenging to collect

detailed data about association patterns among individuals for animals that cannot be easily and

consistently observed over time (Whitehead 1997). Belugas, which spend so much time out of

sight under water and ice, move rapidly and over long distances, and gather in large groups

during times that they are accessible, definitely fall into this category. Association data, in

combination with genetic data, would be needed to clarify the role of social structure in the

fission-fusion dynamics of EBS belugas.

The results of this study do provide the first individual-based socio-genetic information

about the Eastern Beaufort Sea beluga stock and add to our knowledge about patterns of social

structure revealed by satellite tracking, habitat selection and foraging ecology (Richard et al.

2001b, Harwood et al. 2014, Loseto et al. 2006). The patterns observed here suggest that clusters

of related belugas, particularly in females, use this large summering area but do not form fine-

scale kin structure related to aggregation ‘hotspots’ in the nearshore areas. The overall results

from mtDNA sequencing in this study also refute an indication of “microgeographic variation

within the Mackenzie Delta” (Brown Gladden et al. 1997) that suggested annual fidelity to

particular sites. It is most likely the difference in samples sizes that added this clarification. This

could be considered good news for the maintenance of genetic diversity and perhaps for the

resilience of the stock to the cumulative impacts of subsistence harvesting, increased disturbance

from industrial activities and tourism, and shifts in prey species and/or distribution due to

Page 205: Genetic diversity, population structure and phylogeography ...

187

changing climate (Wade et al. 2012, Reeves et al. 2014). Disturbances from these activities in

particular bays where belugas aggregate will not put a discrete genetic unit of the stock at risk.

The same is apparent for natural mortality of groups of whales due to ice entrapments, though

this inference rests on samples that may not be characteristic of entrapments in general. The

selection of large adult whales, particularly adult males, during the harvests also appears likely to

have minimal impact on the stock as it was the female belugas that were found to form discrete

clusters. It is this female group that may be at greater risk of disturbance and harm. However,

removals of any specific class of individuals with specialized roles from a population can have

negative impacts on the transmission of cultural knowledge, reproductive success, and social

cohesion (e.g. Coltman et al. 2003, Williams and Lusseau 2006, Gobush and Waser 2009). For

example, preference for large males in the trophy hunt of bighorn sheep (Ovis canadensis)

caused directional selection for greater breeding success by the smaller males (Coltman et al.

2003). This may not be true for selective subsistence harvesting. Factors other than animal size,

in particular, the level of harvesting pressure, plus elements such as population structure and the

composition of the harvest relative to age at maturity may have a greater influence on selective

evolutionary changes (Mysterud 2011).

Animals that can be flexible in the degree of fission-fusion group formation during the

exploitation of different habitats are the most likely to adapt to effects of climate change (Dunbar

et al. 2009). The last 20 years has seen trends of later sea ice advance, earlier sea ice retreat and

shorter ice season duration in the Arctic, including the Beaufort Sea (Stammerjohn et al. 2012).

Belugas whales are considered to be moderately sensitive to climate change (Laidre 2008) and

behavioural flexibility, especially in response to sea ice changes and trophic web changes that

affect prey densities and distributions, will be important. Ecosystem regime shifts in the Bering

Page 206: Genetic diversity, population structure and phylogeography ...

188

Sea, where EBS belugas overwinter, did not have long-term effects on body size or survivorship

(Luque and Ferguson 2009). Their resilience may have been, in part, due to the whales’ ability to

adjust their feeding behaviour patterns (Luque and Ferguson 2009). Recently, genetic analyses of

mtDNA haplotypes in two decades of samples of belugas provided evidence to link short-term

behavioural changes in migration patterns and seasonal habitat use to changes in sea ice

conditions (O’Corry-Crowe et al. 2016). Our current results did not reveal any short- or long-

term dramatic changes in haplotypes or frequencies within sampling locations; however, sample

sizes were heavily skewed towards the last ten years. The general trend that was observed

encourages the continued use of genetic analyses to track potential changes in group structure

related to habitat use and feeding patterns among EBS belugas. Even though we were not able to

discern kin-based or mtDNA haplotype patterns for fine-scale clustering of beluga whales in the

nearshore EBS area, the genetic information from this study will support continuing conservation

efforts and harvest management of Eastern Beaufort Sea belugas.

3.5 Acknowledgements

This study would not have been possible without the many people involved in harvest

monitoring programs and beluga sampling in the Beaufort Sea and surrounding areas, especially

the hunters of the Inuvialiut Settlement Region. Tim Frasier (St. Mary’s University, Halifax)

provided invaluable training and guidance for the analysis of relatedness and testing of

clustering. We thank Veronique Lésage, DFO Quebec, for providing the St. Lawrence estuary

samples and all of the individuals involved in that beluga necropsy and sampling program.

Laboratory assistance was provided by Denise Tenkula and Susie Bajno. We are indebted to Lois

Harwood for sharing her wealth of knowledge about Beaufort Sea belugas. Funding for this

Page 207: Genetic diversity, population structure and phylogeography ...

189

study was obtained from Fisheries and Oceans Canada (DFO), especially the Genomics Research

and Development (GRDI) program, the DFO Ecosystem Research Initiative (ERI), and the

Fisheries Joint Management Committee (FJMC).

3.6 References

Allendorf, F.W., Luikart, G., and Aitken, N. 2013. Conservation and the genetics of populations,

second ed. Wiley-Blackwell Publishing, Oxford, UK, xviii + 602p.

Amos, W., and Hoelzel, A.R. 1992. Applications of molecular genetic techniques to the

conservation of small populations. Biological Conservation 61:133–144.

Andrews, K.R., Karczmarski, L., Au, W.W., Rickards, S.H., Vanderlip, C.A., Bowen, B.W.,

Gordon Grau, E. and Toonen, R.J., 2010. Rolling stones and stable homes: social structure,

habitat diversity and population genetics of the Hawaiian spinner dolphin (Stenella longirostris).

Molecular Ecology 19: 732-748.

Aureli, F., Schaffner, C.M., Boesch, C., Bearder, S.K., Call, J., Chapman, C.A., Connor, R.,

Fiore, A.D., Dunbar, R.I., Henzi, S.P. and Holekamp, K. 2008. Fission-fusion dynamics: new

research frameworks. Current Anthropology 49: 627-654.

Beckmann, J.P., Murray, K., Seidler, R.G., and Berger, J. 2012. Human-mediated shifts in

animal habitat use: sequential changes in pronghorn use of a natural gas field in Greater

Yellowstone. Biological Conservation 147: 222–233.

Benjamini, Y., and Hochberg, Y. 1995. Controlling the false discovery rate: a practical and

powerful approach to multiple testing. Journal of the Royal Statistical Society Series B 57: 289–

300.

Benjamini, Y., and Yekutieli, D. 2001. The control of the false discovery rate in multiple testing

under dependency. Annals of Statistics 29: 1165–1188.

Berdahl, A., Torney, C.J., Schertzer, E. and Levin, S.A. 2015. On the evolutionary interplay

between dispersal and local adaptation in heterogeneous environments. Evolution 69: 1390-1405.

Bérubé, M., and Palsbøll, P. 1996. Identification of sex in cetaceans by multiplexing with three

ZFX and ZFY specific primers. Molecular Ecology 5: 283–287.

Biosa, D. Grignolio, S., Sica, N., Pagon, N., Scandura, M., and Apollonio, M. 2015. Do relatives

like to stay closer? Spatial organization and genetic relatedness in a mountain roe deer

population. Journal of Zoology 296: 30–37.

Page 208: Genetic diversity, population structure and phylogeography ...

190

Blasi, M.F., and Boitani, L. 2014. Complex social structure of an endangered population of

bottlenose dolphins (Tursiops truncatus) in the Aeolian Archipelago (Italy). PLoS ONE 9:

e114849.

Blouin, M. S. 2003. DNA-based methods for pedigree reconstruction and kinship analysis in

natural populations. Trends in Ecology and Evolution 18: 503-511.

Blundell, G.M., Ben-David, M., Groves, P., Bowyer R.T., and Geffen, E. 2004. Kinship and

sociality in coastal river otters: are they related? Behavioral Ecology 15: 705–714.

Body, G., Weladji, R.B., Holand, Ø. and Nieminen, M. 2015. Fission-fusion group dynamics in

reindeer reveal an increase of cohesiveness at the beginning of the peak rut. Acta Ethologica 18:

101-110.

Brown Gladden, J.G., Ferguson, M.M. and Clayton, J.W., 1997. Matriarchal genetic population

structure of North American beluga whales Delphinapterus leucas (Cetacea: Monodontidae).

Molecular Ecology 6:1033-1046.

Brown Gladden, J.G., Ferguson, M.M., Friesen, M.K., and Clayton, J.W. 1999. Population

structure of North American beluga whales (Delphinapterus leucas) based on nuclear DNA

microsatellite variation and contrasted with the population structure revealed by mtDNA

variation. Molecular Ecology 8: 347–363.

Brown Gladden, J.G., Brodie, P.F. and Clayton, J.W. 1999. Mitochondrial DNA used to identify

an extralimital beluga whale (Delphinapterus leucas) from Nova Scotia as originating from the

St. Lawrence population. Marine Mammal Science 15: 556-558.

Buchanan, F.C., Friesen, M.K., Littlejohn, R.P., and Cayton, J.W. 1996. Microsatellites from

the beluga whale, Delphinapterus leucas. Molecular Ecology 5: 571–575.

Cabin, R.J., and Mitchell, R.J. 2000. To Bonferroni or not to Bonferroni: when and how are the

questions. Bulletin of the Ecological Society of America 81: 246–248.

Carter, K.D., Seddon, J.M., Frère, C.H., Carter, J.K., and Goldizen, A.W. 2013. Fission-fusion

dynamics in wild giraffes may be driven by kinship, spatial overlap and individual social

preferences. Animal Behaviour 85: 385–394.

Chivers, B.L., and Corkeron, P.J. 2001. Trawling and bottlenose dolphins’ social structure.

Proceedings of the Royal Society of London B: Biological Sciences 268: 1901-1905.

Clauset , A., Newman, M.E.J., and Moore, C. 2004. Finding community structure in very large

networks. Physical Review E 70(6), p.066111.

Clutton-Brock, T.H., and Lukas, D. 2012. The evolution of social philopatry and dispersal in

female mammals. Molecular Ecology 21: 472–492.

Page 209: Genetic diversity, population structure and phylogeography ...

191

Colbeck, G.J., Duschesne, P., Postma, L.D., Lésage, V., Hammill, M.O., and Turgeon, J. 2013.

Groups of related belugas (Delphinapterus leucas) travel together during their seasonal

migrations in and around Hudson Bay. Proceedings of the Royal Society of London B:

Biological Sciences 280: 1752–1762.

Coltman, D.W., O’Donoghue, P., Jorgenson, J.T., Hogg, J.T., Strobeck, C., and Festa-Bianchet,

M. 2003. Undesireable evolutionary consequences of trophy hunting. Nature 426: 655–658.

Connor, R.C., Wells, R.S., Mann, J., and Read, A.J., 2000. The bottlenose dolphin, social

relationships in a fission-fusion society. Cetacean Societies: Field Studies of Dolphins and

Whales. Mann, J., Connor, R.C., Tyack, P.L, and Whitehead, H. (Eds.),University of Chicago

Press, USA., pp.91-125.

COSEWIC. 2004. COSEWIC assessment and update status report on the beluga whale

Delphinapterus leucas in Canada. Committee on the Status of Endangered Wildlife in Canada.

Ottawa. ix + 70 p.

COSEWIC. 2012. Definitions and abbreviations. [Online:

http://www.cosewic.gc.ca/eng/sct2/sct2_6_e.cfm].

COSEWIC 2014. COSEWIC assessment and update status report on the beluga whale (St

Lawrence Estuary population) Delphinapterus leucas in Canada. Committee on the Status of

Endangered Wildlife in Canada. Ottawa

COSEWIC. 2015. Committee on the Status of Endangered Wildlife in Canada

Assessment Process, Categories and Guidelines.

COSEWIC. 2016. Designatable Units for Beluga Whales (Delphinapterus leucas) in Canada.

Committee on the Status of Endangered Wildlife in Canada. Ottawa. 73 pp.

Couzin, I.D. and Laidre, M.E. 2009. Fission–fusion populations. Current Biology 19: R633-

R635.

Csardi, G., and Nepusz, T. 2006. The igraph software package for complex network research.

Interjournal, Complex Systems 1695.

Dakin, E.E., and Avise, J.C. 2004. Microsatellite null alleles in parentage analysis. Heredity 93:

504–509

de March, B.G.E and Postma, L.D. 2003. Molecular stock discrimination of belugas

(Delphinapterus leucas) hunted in eastern Hudson Bay, northern Quebec, Hudson Strait, and

Sanikiluaq (Belcher Islands), Canada, and comparisons to adjacent populations. Arctic 56: 111-

124.

Page 210: Genetic diversity, population structure and phylogeography ...

192

Dent, E.A., and vonHoldt, B.M. 2012. STRUCTURE HARVESTER: a website and program for

visualizing STRUCTURE output and implementing the Evanno method. Conservation Genetics

Resources 4: 359–361.

DFO. 2000. Eastern Beaufort Sea beluga. DFO Stock Status Report E5–38.

DFO. 2014. Status of beluga (Delphinapterus leucas) in the St. Lawrence River estuary. DFO

Canadian Science Advisory Secretariat Science Advisory Report 2013/076.

Dharmarajan, G., Beasley, J.C, Fike, J.A., and Rodes Jr., O.E. 2014. Effects of landscape,

demographic and behavioural factors on kin structure: testing ecological predictions in a

mesopredator with high dispersal capability. Animal Conservation 17: 225–234.

Díaz López, B. and Bernal Shirai, J.A. 2008. Marine aquaculture and bottlenose dolphins’

(Tursiops truncatus) social structure. Behavioural Ecology and Sociobiology 62: 887-894.

Dunbar, R.I.M, Korstjens, A.H., and Lehmann, J. 2009. Time as an ecological constraint.

Biology Reviews 84: 413–429.

Dungan, S.Z., Wang, J.Y., Araujo, C.C, Yang, S.-C., and White, B. 2016. Social structure in a

critically endangered Indo-Pacific humpback dolphin (Sousa chinensis) population. Aquatic

Conservation in Marine and Freshwater Ecosystems 26: 517-529.

Dyer, R.J., and Nason, J.D. 2004. Population graphs: the graph theoretic shape of genetic

structure. Molecular Ecology 13: 1713–1727.

Elliser, C.R. and Herzing, D.L., 2011. Replacement dolphins? Social restructuring of a resident

pod of Atlantic bottlenose dolphins, Tursiops truncatus, after two major hurricanes. Marine

Mammal Science 27: 39-59.

Evanno, G., Regnaut, S., and Goudet, J. 2005. Detecting the number of clusters of individuals

using the software STRUCTURE: a simulation study. Molecular Ecology 14: 2611–2620.

Falush, D., Stephens, M., and Pritchard, J.K. 2003. Inference of population structure using

multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164: 1567–

1587.

Fast, H., Mathias, J, and Banias, O. 2001. Directions toward marine conservation in Canada’s

western Arctic. Ocean Coastal Management 44: 183–205.

FJMC (Fisheries Joint Management Committee). 2013. Beaufort Sea Management Plan.

Amended fourth edition. 44 p.

Fluxus Technology Ltd. 2013. DNA Alignment version 1.3.3.2. www.fluxus-

engineering.com/dnaalignment

Page 211: Genetic diversity, population structure and phylogeography ...

193

García, L.V. 2004. Escaping the Bonferroni iron claw in ecological studies. Oikos. 105: 657–

663.

Garroway, C.J., Bowman, J., Carr, D., and Wilson, P.J. 2008. Applications of graph theory to

landscape genetics. Evolutionary Applications 1: 620–630.

Gaspari, S., Azzellino, A., Airold, S., and Hoelzel, A.R. 2007. Social kin associat associations

and genetic structuring of striped dolphin populations (Stenella coeruleoalba) in the

Mediterranean Sea. Molecular Ecology 16: 2922–2933.

Gerard, J.F., Bideau, E., Maublanc, M.L., Loisel, P. and Marchal, C. 2002. Herd size in large

herbivores: encoded in the individual or emergent? The Biological Bulletin 202: 275-282.

Gilbert, K.J., Andrew, R.L., Bock, D.G., Franklin, M.T., Kane, N.C., Moore, J.S., Moyers, B.T.,

Renault, S., Rennison, D.J., Veen, T., and Vines, T.H. 2012. Recommendations for utilizing and

reporting population genetic analyses: the reproducibility of genetic clustering using the program

STRUCTURE. Molecular Ecology 21: 4925–4930.

Girvan, M., and Newman, M.E.J. 2002. Community structure in social and biological networks.

Proceedings of the National Acadademy of Sciences USA 99: 7821–7826.

Gobush, K.S., and Wasser, S.K. 2009. Behavioural correlates of low relatedness in African

elephant core groups of a poached population. Animal Behaviour 78: 1079–1086.

Goudet, J. 1995. FSTAT (version 1.2): a computer program to calculate F-statisitcs. Journal of

Heredity 86: 485-486.

Gowans, S., Wursig, B., and Karczmarski. 2008. The social structure and strategies of

delphinids: predictions based on an ecological framework. Advances in Marine Biology 53: 195–

294.

Grear, D.A., Samuel, M.D., Scribner, K.T., Weckworth, B.V., and Langenberg, J.A. 2010.

Influence of genetic relatedness and spatial proximity on chronic wasting disease infection

among female white-tailed deer. Journal of Applied Ecology 47: 532–540.

Greenwood, P.J., 1980. Mating systems, philopatry and dispersal in birds and mammals. Animal

Behaviour 28: 1140-1162.

Grove, M. 2012. Space, time, and group size: a model of constraints on primate social foraging.

Animal Behaviour 83:411–419.

Harwood, L.A., Innes, S., Norton, P., and Kingsley, M.C.S. 1996. Distribution and abundance of

beluga whales in the Mackenzie estuary, southeast Beaufort Sea, and west Amundsen Gulf

during late July 1992. Canadian Journal of Fisheries and Aquatic Sciences 53: 2262–2273.

Page 212: Genetic diversity, population structure and phylogeography ...

194

Harwood, L.A., Norton, P., Day, B., and Hall, P.A. 2002. The harvest of beluga whales in

Canada’s western Arctic: hunter-based monitoring of the size and composition of the catch.

Arctic 55: 10–20.

Harwood, L.A., and Smith, T.G. 2002. Whales of the Inuvialuit Settlement Region in Canada’s

western Arctic: and overview and outlook. Arctic 55: 77–93.

Harwood, L.A., Iacozza, J., Auld, J.C., Norton, P., and Loseto, L. 2014. Belugas in the

Mackenzie River estuary, NT, Canada: Habitat use and hotspots in the Tarium Niryutait Marine

Protected Area. Ocean and Coastal Management 100: 128–138.

Harwood, L.A., Kingsley, M.C.S., and Pokiak, F. 2015. Monitoring beluga harvests in the

Mackenzie Delta and near Paulatuk, NT, Canada: harvest efficiency and trend, size and sex of

landed whales, and reproduction, 1970 – 2009. Canadian Manuscript Report of Fisheries and

Aquatic Sciences 3059.

Hoelzel A.R., Dahlheim, M., and Stern, S.J. 1998. Low genetic variation among killer whales

(Orcinus orca) in the eastern North Pacific and genetic differentiation between foraging

specialists. Journal of Heredity 89: 121–128.

Holm, S. 1979. A simple sequentially rejective multiple test procedure. Scandanavian Journal of

Statistics 6: 65–70.

Iacolina, L., Scandura, M., Bongi, P., and Apollonio, M. 2009. Non-kin associations in wild boar

social units. Journal of Mammalogy 90: 666–674.

Inuvialuit Land Administration. 2011. Husky Lakes Special Cultural Area.

IWC. 2000. Status of Monodontid whales: white whale. Journal of Cetacean Research and

Management 2 (SUPPL.), 243–250.

Jombart, T. 2010. Discriminant analysis of principal components: a new method for the analysis

of genetically structured populations. BMC Genetics 11: 94.

Jombart, T. 2014. A tutorial for Discriminant Analysis of Principal Components (DAPC) using

adegenet 1.4–1.

Karczmarski, L., Würsig, B., Gailey, G., Larson, K.W. and Vanderlip, C., 2005. Spinner

dolphins in a remote Hawaiian atoll: social grouping and population structure. Behavioural

Ecology 16: 675-685.

Kashima, K., Ohtsuki, H. and Satake, A. 2013. Fission-fusion bat behavior as a strategy for

balancing the conflicting needs of maximizing information accuracy and minimizing infection

risk. Journal of Theoretical Biology 318: 101-109.

Page 213: Genetic diversity, population structure and phylogeography ...

195

Kerth, G., Perony, N., and Schweitzer, F. 2011. Bats are able to maintain long-term social

relationships despite the high fission-fusion dynamics of their groups. Proceedings of the Royal

Society B 278: 2761–2767.

King, W.J., Festa-Bianchet, M., Coulson, G. and Goldizen, A.W. 2017. Long-term consequences

of mother-offspring associations in eastern grey kangaroos. Behavioral Ecology and

Sociobiology 71: 77.

Kocho-Schellenberg, J.E. 2010. Understanding the evolution of beluga entrapment co-

management in the Inuvialuit Settlement Region using social network analysis. Master’s Thesis,

Faculty of Environment, Earth and Resources, University of Manitoba, Winnipeg, 138pp.

Krause, J., Krause, S., Arlinghaus, R., Psorakis, I., Roberts, S. and Rutz, C. 2013. Reality mining

of animal social systems. Trends in Ecology & Evolution 28: 541-551.

Laidre, K.L., Stirling, I., Lowry, L.F., Wiig, O., Heide-Jørgensen, M.P., and Ferguson, S.H.

2008. Quantifying the sensitivity of Arctic marine mammals to climate-induced habitat change.

Ecological Applications 18: S97–S125.

Lehmann, J., Korstjens, A.H., and Dunbar, R.I.M. 2006. Fission-fusion social systems as a

strategy for coping with ecological constraints: a primate case. Evolution and Ecology 21: 613–

634.

Lesack, K., and Naugler, C. 2011. An open-source software program for performing Bonferroni

and related corrections for multiple comparisons. Journal of Pathology Informatics 2: 52.

Li, C.C, Weeks, D.E, and Chakravarti, A. 1993. Similarity of DNA fingerprints due to chance

and relatedness. Human Heredity 43: 45-52.

Loseto, L.L., Richard, P., Stern, G.A., Orr, J., and Ferguson, S.H. 2006. Segregation of Beaufort

Sea beluga whales during the open-water season. Canadian Journal of Zoology 84: 1743-1751.

Loseto, L., Wazny, T., Cleator, H., Ayles, B., Cobb, D., Harwood, L., Michel, C., Nielsen, O.,

Paulic, J., Postma, L., Ramlal, P., Reist, J., Richard, P., Ross, P.S., Solomon, S., Walkusz, W.,

Weilgart, L., and Williams, B. 2010. Information in support of indicator selection for monitoring

the Tarium Niryutait Marine Protected Area (TNMPA). DFO Canadian Science Advisory

Secretariat Science Research Document 2010/094. vi + 47 p.

Louis, M., Gally, F.. Barbraud. C., Béesau, J., Tixier, P., Simon-Bouhet, B., Le Rest, K., and

Guinet, C. 2015. Social structure and abundance of coastal bottlenose dolphins, Tursiops

truncatus, in Normano-Breton Gulf, English Channel. Journal of Mammalogy 96: 481-493.

Luque, S.P., and Ferguson, S.H. 2009. Ecosystems regime shifts have not affected growth and

survivorship of eastern Beaufort Sea belugas. Oecologia 160: 367–378.

Page 214: Genetic diversity, population structure and phylogeography ...

196

Lusseau, D., Wilson, B., Hammond, P.S., Grellier, K., Durban, J.W., Parsons, K.M., Barton,

T.R., and Thompson, P.M. 2006. Quantifying the influence of sociality on populations structure

in bottlenose dolphins. Journal of Animal Ecology 75: 14–24.

Lynch, M., and Ritland, K. 1999. Estimation of pairwise relatedness with molecular markers.

Genetics 152: 1753–1766.

Metheny, J.D., Kalcounis-Rueppell, M.C., Bondo, K.J. and Brigham, R.M. 2008a. A genetic

analysis of group movement in an isolated population of tree-roosting bats. Proceedings of the

Royal Society of London B: Biological Sciences 275: 2265-2272.

Metheny, J.D., Kalcounis-Rueppell, M.C., Willis, C.K., Kolar, K.A. and Brigham, R.M. 2008b.

Genetic relationships between roost-mates in a fission–fusion society of tree-roosting big brown

bats (Eptesicus fuscus). Behavioral Ecology and Sociobiology 62: 1043-1051.

Möller, L.M. 2012. Sociogenetic structure, kin associations and bonding in delphinids.

Molecular Ecology 21: 745–764.

Moran, M.D. 2003. Arguments for rejecting the sequential Bonferroni in ecological studies.

Oikos 100: 403–405.

Moreno, K., and Acevedo-Gutiérrez, A. 2016. The social structure of Golfo Dulce bottlenose

dolphins (Tursiops truncatus) and the influence of behavioural state. Royal Society Open

Science doi: 10.1098/rsos.160010.

Morin, P.A., Martien, K.K., Archer, F.I., Cipriano, F., Steel, D., Jackson, J. and Taylor, B.L.

2010. Applied conservation genetics and the need for quality control and reporting of genetic

data used in fisheries and wildlife management. Journal of Heredity 101: 1-10.

Mysterud, A. 2011. Selective harvesting of large mammals: how often does it results in

directional selection? Journal of Applied Ecology 48: 827-834.

Narum, S.R. 2006. Beyond Bonferroni: less conservative analyses for conservation genetics.

Conservation Genetics 7: 783–787.

Newman, M.E.J. 2006a. Modularity and community structure in networks. Proceedings of the

National Academy of Sciences USA 103: 8577–8582.

Newman, M.E.J. 2006b. Finding community structure in networks using the eigenvectors of

matrices. Physical Review E 74(3): 036104.

Nidiffer, M.D., and Cortés-Ortiz, L. 2015. Intragroup relatedness in two howler monkey species

(Alouatta pigra and A. palliate): implications for understanding social systems and dispersal.

American Journal of Primatology 77: 1333–1345.

Page 215: Genetic diversity, population structure and phylogeography ...

197

O’Corry-Crowe, G., Mahoney, A.R., Quakenbush, L., Whiting, A., Lowry, L., and Harwood, L.

2016. Genetic profiling links changing sea-ice to shifting beluga whale migration patterns.

Biology Letters 12: 20160404.

O'Corry-Crowe, G., Lucey, W., Archer, F.I. and Mahoney, B. 2015. The genetic ecology and

population origins of the beluga whales, Delphinapterus leucas, of Yakutat Bay. Marine

Fisheries Review 77: 47-58.

O’Corry-Crowe, G.M., Suydam, R.S, Rosenberg, A., Frost, K.J., and Dizon, A.E. 1997.

Phylogeography, population structure and dispersal patterns of the beluga whale Delphinapterus

leucas in the western Nearctic revealed by mitochondrial DNA. Molecular Ecology 6: 955–970.

Orman, G. and Labatut, V. 2011. A comparison of community detection algorithms on artificial

networks. Discovery Science, 2009, Porto, Portugal, Springer 5808, pp.242–256.

Palsbøll P., Heide-Jørgensen, M.P., and Bérubé, M. 2002. Analysis of mitochondrial control

region nucleotide sequences from Baffin Bay beluga (Delphinapterus leucas): detecting pods or

sub-populations? NAMMCO Scientific Publications 4: 39–50.

Palsbøll, P.J., Bérubé, M., and Allendorf, F.W. 2006. Identification of management units using

population genetic data. Trends in Ecology and Evolution 22: 11–16.

Palsbøll, P.J., Peery, M.Z., and Bérubé, M. 2010. Detecting populations in the ‘ambiguous’ zone:

kinship-based estimation of population structure at low genetic divergence. Molecular Ecology

Resources 10: 797–805.

Parra, G.J., Corkeron, P.J. and Arnold, P. 2011. Grouping and fission–fusion dynamics in

Australian snubfin and Indo-Pacific humpback dolphins. Animal Behaviour 82: 1423-1433.

Peakall, R., and Smouse, P.E. 2012. GenAlEx 6.5: genetic analysis in Excel. Population genetic

software for teaching and research – an update. Bioinformatics Applications Note 28: 2537–

2539.

Pearson, H.C., Markowitz, T.M., Weir, J.S. and Würsig, B. 2017. Dusky dolphin

(Lagenorhynchus obscurus) social structure characterized by social fluidity and preferred

companions. Marine Mammal Science 33: 251-276.

Pérez-González, J., Frantz, A.C., Torres-Porras, J., Castillo, L., and Carranza, J. 2012.

Population structure, habitat features and genetic structure of managed red deer populations.

European Wildlife Research 58: 933–943.

Pew, J., Muir, P.H., Wang, J., and Frasier, T. 2014. related: and R package for analysing

pairwise relatedness from codominant molecular markers. Molecular Ecology Resources 15:

557-561.

Page 216: Genetic diversity, population structure and phylogeography ...

198

Podgórski, T., Scandura, M., and Jędrzejewska, B. 2014. Next of kin next door – philopatry and

socio-genetic population structure in wild boar. Journal of Zoology 294: 190–197.

Poteaux, C., Baubet, E., Kaminski, E., Brandt, S. Dobson, F.S., and Baudoin, C. 2009. Socio-

genetic structure and mating system of a wild boar population. Journal of Zoology 278: 116–125.

Pritchard, J., Stephens, M, and Donnelly, P. 2000. Inference of population structure using

multilocus genotype data. Genetics 155: 945–959.

Putman, A.I, and Carbone I. 2014. Challenges in the analysis and interpretation of microsatellite

data for population genetic studies. Ecology and Evolution 4: 4399–4428.

Quakenbush, L.T, Suydam, R.S., Bryan, A.L., Lowry, L.F., Frost, K.J., and Mahoney, B.A.

2015. Diet of beluga whales, Delphinapterus leucas, in Alaska form stomach contents, March-

November. Marine Fisheries Review 77: 70-84.

Queller, D.C., and Goodnight, K.F. 1989. Estimating relatedness using molecular markers.

Evolution 43: 258–275.

R Core Team. 2015. R: A language and environment for statistical computing. R Foundation for

Statistical Computing, Vienna, Austria.

Raymond M., and Rousset, F. 1995. GENEPOP (version 1.2): population genetics software for

exact tests and ecumenicism. Journal of Heredity 86: 248–249.

Reichardt, J., and Bornholdt, S. 2004. Detecting fuzzy community structures in complex

networks with a Potts model. Physical Review Letters 93: 218701.

Reeves, R.R., Ewins, P.J., Agbayani, S., Heide-Jørgensen, M.P., Kovacs, K.M., Lydersen, C.,

Suydam, R., Elliott, W., Polet, G., van Dijk, Y., and Blijleven, R. 2014. Distribution of endemic

cetaceans in relation to hydrocarbon development and commercial shipping in a warming Arctic.

Marine Policy 44: 375–389.

Reichardt, J., and Bornholdt, S. 2006. Statistical mechanics of community detection. Physical

Review E 74: 016110.

Richard, K.R., Dillon, M.C., Whitehead, H., and Wright, J.M. 1996. Patterns of kinships in

groups of free-living sperm whales (Physeter macrocephalus) revealed by multiple molecular

genetic analyses. Proceedings of the National Academy of Sciences USA 93: 8792–8795.

Richard, P.R., Martin, A.R., and Orr, J.R. 2001a. Summer and autumn movements of belugas of

the Beaufort Sea Region. Arctic 54: 223–236.

Richard, P.R., Heide-Jørgensen, M.P., Orr, J., Dietz, R., and Smith, T.G. 2001b. Summer and

autumn movements and habitat use by belugas in the Canadian high Arctic and adjacent waters.

Arctic 54: 207–222.

Page 217: Genetic diversity, population structure and phylogeography ...

199

Richard, P.R. 2005. An estimate of the Western Hudson Bay beluga population size in 2004.

DFO Canadian Science Advisory Secretariat Research Document 2005/017. Ii + 29 p.

Richard, P.R. 2010. Stock definition of belugas and narwhals in Nunavut. DFO Canadian

Science Advisory Secretariat Research Document 2010/022. iv + 14 p.

Ritland, K. 1996 Estimators for pairwise relatedness and inbreeding coefficients. Genetics

Resources 67: 175–186.

Rodgers, T.W., Giacalone, J., Heske, E.J., Janečka, J.E., Jansen, P.A., Phillips, C.A., and

Schooley, R.L. 2015. Socio-spatial organization and kin structure in ocelots from integration of

camera trapping and noninvasive genetics. Journal of Mammalogy 96: 120–128.

Rosel, P., France, S.C., Wang, J.Y., and Kocher, T.D. 1999. Genetic structure of harbour

porpoise Phocoena phocoena populations in the northwest Atlantic based on mitochondrial and

nuclear markers. Molecular Ecology 8 (Supplementary Issue S1): S41–S54.

Rousset, F. 2008. Genepop'007: a complete reimplementation of the Genepop software for

Windows and Linux. Molecular Ecology Resources 8: 103–106.

Seutin, G., White, B.N., and Boag, P.T. 1991. Preservation of avian blood and tissue samples for

DNA analyses. Canadian Journal of Zoology 69: 82–90.

Shaw, C.N., Wilson, P.J., and White, B.N. 2003. A reliable molecular methods for gender

determination for mammals. Journal of Mammalogy 84: 123–128.

Seutin, G., White, B.N., and Boag, P.T. 1991. Preservation of avian blood and tissue samples for

DNA analyses. Canadian Journal of Zoology 68: 82-90.

Shannon, P., Markiel, A., Ozier, O., Baliga, N.S., Wang, J.T., Ramage, D., Amin, N.,

Schwikowski, B, and Ideker, T. 2003. Cytoscape: a software environment for integrated models

of biomolecular interaction networks. Genome Research 13: 2498–2504.

Seidler, R.G., Long, R.A., Berger, J., Bergen, S., and Beckmann, J.P. 2014. Identifying

impediments to long-distance mammal migrations. Conservation Biology 29: 99–109.

Smith, T.G., Hammill, M.O., and Martin, A.R. 1994. Herd composition & behaviour of white

whales (Delphinapterus leucas) in two Canadian estuaries. Meddeleser om. Grønland,

Biosciences 39: 175–186.

Smith, J.E., Kolowski, J.M., Graham, K.E., Dawes, S.E. and Holekamp, K.E. 2008. Social and

ecological determinants of fission–fusion dynamics in the spotted hyaena. Animal Behaviour 76:

619-636.

Page 218: Genetic diversity, population structure and phylogeography ...

200

Snyder-Mackler, N., Albert, S.C., and Bergman, T.J. 2014. The socio-genetics of a complex

society: female gelada relatedness patterns mirror association patterns in a multilevel society.

Molecular Ecology 23: 6179–6191.

Stammerjohn, S., Massom, R, Rind, R., and Martinson, D. 2012. Regions of rapid sea ice

change: an inter-hemispheric seasonal comparison. Geophysical Research Letters 39: L06501.

Storz, J.F. 1999. Genetic consequences of mammalian social structure. Journal of Mammalogy,

80: 553-569.

Tavares, S.B., Samarra, F.I. and Miller, P.J. 2017. A multilevel society of herring-eating killer

whales indicates adaptation to prey characteristics. Behavioral Ecology 28: 500-514.

Turgeon, J., Duchesne, P., Colbeck, G.J., Postma, L.D., and Hammill, M.O. 2012.

Spatiotemporal segregation among summer stocks of beluga (Delphinapterus leucas) despite

nuclear gene flow: implication for the endangered belugas in eastern Hudson Bay (Canada).

Conservation Genetics 13: 419-433.

Valsecchi, E., and Amos, W. 1996. Microsatellite markers for the study of cetacean populations.

Molecular Ecology 5: 151–156.

van Oosterhout, C., Hutchinson, W.F., Wills, D.P., and Shipley, P. 2004. MICRO-CHECKER:

software for identifying and correcting genotyping errors in microsatellite data. Molecular

Ecology Notes 4: 535-538.

Van de Casteele, T., Galbusera, P., and Matthysen, E. 2001. A comparison of microsatellite-

based pairwise relatedness estimators. Molecular Ecology 10: 1539–1549.

Van Horn, R.C., Buchan, J.C., Altmann, J. and Alberts, S.C. 2007. Divided destinies: group

choice by female savannah baboons during social group fission. Behavioral Ecology and

Sociobiology 6: 1823-1837.

Viricel, A., Strand, A.E., Rosel, P.E., Ridoux, V., and Garcia, P. 2008. Insights on common

dolphin (Delphinus delphis) social organization from genetic analysis of a mass stranded pod.

Behavioral Ecology and Sociobiolgy 63: 173–185.

Wade, P.R., Reeves, R., and Mesnick, S.L. 2012. Social and behavioural factors in cetacean

responses to overexploitation: are odontocetes less “resilient” than mysticetes? Journal of Marine

Biology Article ID 567276, 15 pages, doi:10.1155/2012/567276.

Waits, L.P., Luikart, G., and Taberlet, P. 2001. Estimating the probability of identity among

genotypes in natural populations: cautions and guidelines. Molecular Ecology 10: 249-256.

Wang, J. 2002. An estimator for pairwise relatedness using molecular markers. Genetics 160:

1203–1215.

Page 219: Genetic diversity, population structure and phylogeography ...

201

Waples, R.S., and Gaggiotti, O. 2006. What is a population? An empirical evaluation of some

genetic methods for identifying the number of gene pools and their degree of connectivity.

Molecular Ecology 15: 1419-1439.

Watt, C.A., Petersen, S.D., and Ferguson, S.H. 2015. Genetics and fatty acids assist in

deciphering narwhal (Monodon monoceros) social groupings. Polar Biology 38:1971-1981.

Williams, R., and Lusseau, D. 2006. A killer whale social network is vulnerable to targeted

removals. Biology Letters 2: 497–500.

Zeyl, E., Aars, J., Ehrich, D., and Wiig, Ø. 2009. Families in space: relatedness in the Barents

Sea population of polar bears (Ursus maritimus). Molecular Ecology 18: 735-749.

Page 220: Genetic diversity, population structure and phylogeography ...

202

Appendix 3.1. Multiplex amplification reaction mixtures, amplification conditions, and pooling

ratios for beluga microsatellite genotypes in this study.

Panel 1: FCB4 (6FAM) and FCB5 (VIC) duplex

Component Volume for 10µL reaction Final concentration

Sterile water 5.2µL

10X reaction buffer (no

MgCl2)

1.0µL 1X

MgCl2 (25mM) 0.6µL 1.5mM

dNTPs (10mM each) 0.2µL 0.2mM

Primer FCB5a (20µM) 0.15µL 0.375µM

Primer FCB5b (20µM) 0.15µL 0.375µM

Primer FCB4a (20µM) 0.25µL 0.5µM

Primer FCB4b (20µM) 0.25µL 0.5µM

AmpliTaq Gold (Life

Technologies)(5U/µL)

0.2µL 1.0U

Template DNA (10-

100ng/µL)

2.0µL 20-200ng

Thermal cycler profile:

1 cycle: 95°C for 11min

25 cycles: 95°C for 45s; 63°C for 45s; 72°C for 45s

1 cycle: 72°C for 15min

Hold at 4°C

Page 221: Genetic diversity, population structure and phylogeography ...

203

Panel 2: Pool 5µL reaction 2A with 5µL reaction 2B

Reaction 2A: FCB1 (NED) and SW19 (VIC) duplex

Component Volume for 10µL reaction Final concentration

Sterile water 5.0µL

10X reaction buffer (no

MgCl2)

1.0µL 1X

MgCl2 (25mM) 0.6µL 1.5mM

dNTPs (10mM each) 0.2µL 0.2mM

Primer FCB1a (20µM) 0.25µL 0.5µM

Primer FCB1b (20µM) 0.25µL 0.5µM

Primer SW19a (20µM) 0.25µL 0.5µM

Primer SW19b (20µM) 0.25µL 0.5µM

AmpliTaq Gold (Life

Techologies)(5U/µL)

0.2µL 1.0U

Template DNA (10-

100ng/µL)

2.0µL 20-200ng

Thermal cycler profile:

1 cycle: 95°C for 11min

10 cycles: 95°C for 45s; 48°C for 45s; 72°C for 45s

25 cycles: 95°C for 45s; 53°C for 45s; 72°C for 45s

1 cycle: 72°C for 15min

Hold at 4°C

Page 222: Genetic diversity, population structure and phylogeography ...

204

Reaction 2B: FCB8 (NED) and FCB10 (6FAM) duplex

Component Volume for 10µL reaction Final concentration

Sterile water 5.0µL

10X reaction buffer (no

MgCl2)

1.0µL 1X

MgCl2 (25mM) 0.6µL 1.5mM

dNTPs (10mM each) 0.2µL 0.2mM

Primer FCB8a (20µM) 0.25µL 0.5µM

Primer FCB8b (20µM) 0.25µL 0.5µM

Primer FCB10a (20µM) 0.25µL 0.5µM

Primer FCB10b (20µM) 0.25µL 0.5µM

AmpliTaq Gold (Life

Techologies)(5U/µL)

0.2µL 1.0U

Template DNA (10-

100ng/µL)

2.0µL 20-200ng

Thermal cycler profile:

1 cycle: 95°C for 11min

25 cycles: 95°C for 45s; 60°C for 45s; 72°C for 45s

1 cycle: 72°C for 15min

Hold at 4°C

Page 223: Genetic diversity, population structure and phylogeography ...

205

Panel 3: Pool 5µL reaction 3A (dilute with 10µL water) with 5µL reaction 3B (dilute with 50µL water)

Reaction 3A: FCB14 (VIC) and KWM2A (VIC) duplex

Component Volume for 10µL reaction Final concentration

Sterile water 5.4µL

10X reaction buffer (no

MgCl2)

1.0µL 1X

MgCl2 (25mM) 0.6µL 1.5mM

dNTPs (10mM each) 0.2µL 0.2mM

Primer FCB14a (20µM) 0.05µL 0.125µM

Primer FCB14b (20µM) 0.05µL 0.125µM

Primer KWM2Aa (20µM) 0.25µL 0.5µM

Primer KWM2Ab (20µM) 0.25µL 0.5µM

AmpliTaq Gold (Life

Techologies)(5U/µL)

0.2µL 1.0U

Template DNA (10-

100ng/µL)

2.0µL 20-200ng

Thermal cycler profile:

1 cycle: 95°C for 11min

10 cycles: 95°C for 45s; 48°C for 45s; 72°C for 45s

25 cycles: 95°C for 45s; 55°C for 45s; 72°C for 45s

1 cycle: 72°C for 15min

Hold at 4°C

Page 224: Genetic diversity, population structure and phylogeography ...

206

Reaction 3B: FCB3 (NED) and KWM12A (6FAM) duplex

Component Volume for 10µL reaction Final concentration

Sterile water 5.4µL

10X reaction buffer (no

MgCl2)

1.0µL 1X

MgCl2 (25mM) 0.6µL 1.5mM

dNTPs (10mM each) 0.2µL 0.2mM

Primer FCB3a (20µM) 0.25µL 0.5µM

Primer FCB3b (20µM) 0.25µL 0.5µM

Primer KWM12Aa (20µM) 0.05µL 0.125µM

Primer KWM12Ab (20µM) 0.05µL 0.125µM

AmpliTaq Gold (Life

Techologies)(5U/µL)

0.2µL 1.0U

Template DNA (10-

100ng/µL)

2.0µL 20-200ng

Thermal cycler profile:

1 cycle: 95°C for 11min

5 cycles: 94°C for 30s; 56°C for 30s; 72°C for 30s

5 cycles: 94°C for 30s; 51°C for 30s; 72°C for 30s

35 cycles: 95°C for 30s; 48°C for 30s; 72°C for 30s

1 cycle: 72°C for 15min

Hold at 4°C

Page 225: Genetic diversity, population structure and phylogeography ...

207

Panel 4: Pool 5µL reaction 4A with 5µL reaction 4B

Reaction 4A: FCB11 (NED)

Component Volume for 10µL reaction Final concentration

Sterile water 5.45µL

10X reaction buffer (no

MgCl2)

1.0µL 1X

MgCl2 (25mM) 0.6µL 1.5mM

dNTPs (10mM each) 0.25µL 0.25mM

Primer FCB11a (20µM) 0.25µL 0.5µM

Primer FCB11b (20µM) 0.25µL 0.5µM

AmpliTaq Gold (Life

Techologies)(5U/µL)

0.2µL 1.0U

Template DNA (10-

100ng/µL)

2.0µL 20-200ng

Thermal cycler profile:

1 cycle: 95°C for 11min

25 cycles: 95°C for 45s; 61°C for 45s; 72°C for 45s

1 cycle: 72°C for 15min

Hold at 4°C

Page 226: Genetic diversity, population structure and phylogeography ...

208

Reaction 4B: FCB17 (VIC) and EV37 (6FAM) duplex

Component Volume for 10µL reaction Final concentration

Sterile water 5.0µL

10X reaction buffer (no

MgCl2)

1.0µL 1X

MgCl2 (25mM) 0.6µL 1.5mM

dNTPs (10mM each) 0.2µL 0.2mM

Primer FCB17a (20µM) 0.25µL 0.5µM

Primer FCB17b (20µM) 0.25µL 0.5µM

Primer EV37a (20µM) 0.25µL 0.5µM

Primer EV37b (20µM) 0.25µL 0.5µM

AmpliTaq Gold (Life

Techologies)(5U/µL)

0.2µL 1.0U

Template DNA (10-

100ng/µL)

2.0µL 20-200ng

Thermal cycler profile:

1 cycle: 95°C for 11min

25 cycles: 95°C for 45s; 55°C for 45s; 72°C for 45s

1 cycle: 72°C for 15min

Hold at 4°C

Page 227: Genetic diversity, population structure and phylogeography ...

209

Panel 5: Pool 5µL reaction 5A with 5µL reaction 5B (dilute with 20µL water)

Reaction 5A: EV14 (NED) and EV94 (6FAM) duplex

Component Volume for 10µL reaction Final concentration

Sterile water 5.2µL

10X reaction buffer (no

MgCl2)

1.0µL 1X

MgCl2 (25mM) 0.6µL 1.5mM

dNTPs (10mM each) 0.2µL 0.2mM

Primer EV14a (20µM) 0.15µL 0.375µM

Primer EV14b (20µM) 0.15µL 0.375µM

Primer EV94a (20µM) 0.25µL 0.5µM

Primer EV94b (20µM) 0.25µL 0.5µM

AmpliTaq Gold (Life

Techologies)(5U/µL)

0.2µL 1.0U

Template DNA (10-

100ng/µL)

2.0µL 20-200ng

Thermal cycler profile:

1 cycle: 95°C for 11min

10 cycles: 95°C for 45s; 48°C for 45s; 72°C for 45s

25 cycles: 95°C for 45s; 55°C for 45s; 72°C for 45s

1 cycle: 72°C for 15min

Hold at 4°C

Page 228: Genetic diversity, population structure and phylogeography ...

210

Reaction 5B: PPH104 (VIC)

Component Volume for 10µL reaction Final concentration

Sterile water 5.45µL

10X reaction buffer (no

MgCl2)

1.0µL 1X

MgCl2 (25mM) 0.6µL 1.5mM

dNTPs (10mM each) 0.25µL 0.25mM

Primer PPH104a (20µM) 0.25µL 0.5µM

Primer PPH104b (20µM) 0.25µL 0.5µM

AmpliTaq Gold (Life

Techologies)(5U/µL)

0.2µL 1.0U

Template DNA (10-

100ng/µL)

2.0µL 20-200ng

Thermal cycler profile:

1 cycle: 95°C for 11min

5 cycles: 94°C for 30s; 56°C for 30s; 72°C for 30s

5 cycles: 94°C for 30s; 51°C for 30s; 72°C for 30s

35 cycles: 95°C for 30s; 48°C for 30s; 72°C for 30s

1 cycle: 72°C for 15min

Hold at 4°C

Page 229: Genetic diversity, population structure and phylogeography ...

211

CHAPTER 4: Mitochondrial genome diversity and phylogenetic patterns among Canadian

belugas (Delphinapterus leucas), with comparisons to narwhal (Monodon monoceros)

mitogenomes

Abstract

For many years, Sanger sequencing portions of the control region (CR) in mitochondrial

DNA (mtDNA) of belugas (Delphinapterus leucas) has provided valuable insight for

conservation and population studies for these whales. However, the advantages of increasing the

overall amount of mtDNA sequence and including mtDNA protein-coding genes for

investigating stock structure and phylogenies has been demonstrated in other species, including

other cetaceans. In this study, a next-generation workflow was developed to sequence complete

mitochondrial genomes (mitogenomes) for population studies of beluga and narwhal (Monodon

monoceros). A total of 106 belugas and 94 narwhals from across each species’ Canadian range

were sequenced, which resulted in complete mitogenomes of 16,385bp and 16,381bp,

respectively, for all samples. Genetic diversity and phylogenetic analyses of beluga mitogenomes

supported previous results found with CR sequence, but more phylogenetic clades were inferred

consistently across methods, and overall node support was much higher. Within beluga clades,

clusters of samples were found that affirm the identification of management units and the

inferred evolutionary relationships of haplotypes using previous methods. Preliminary tests of

selection detected the presence of negative, or purifying, selection on all protein-coding genes of

belugas, particularly for the ND1, CO1 and CO2 genes. However, no signals of positive selection

were detected with the analytical methods used. In contrast to belugas, narwhal mitogenomes

continued to exhibit much lower levels of genetic diversity and offered no improvements

Page 230: Genetic diversity, population structure and phylogeography ...

212

compared to previous approaches for resolving phylogenetic relationships among geographic

sample collections. This could be due to insufficient numbers of samples for the analyses. For

both species, especially narwhals, mitogenomic analyses of more samples and samples from

across the species’ global range would improve the interpretation of phylogeographic

relationships, provide better resolution of clades for the estimation of divergence times, and

increase the possibility of finding signals of adaptive selection.

4.1 Introduction

The technological power to characterize an increasing amount of genetic variation, both

neutral and adaptive, in natural populations has changed dramatically in the last five decades

(Allendorf 2017). For at least the last 30 years, molecular markers have been used to investigate

a wide range of population genetic hypotheses for beluga whales (Delphinapterus leucas).

Among the earliest studies, beluga population structure and divergence were examined using

restriction enzyme digestion of mitochondrial DNA (mtDNA) and visualization with Southern

blotting (Southern 1975) to provide the first evidence of two distinct beluga stocks within

Hudson Bay (Helbig et al. 1989). An early study of nuclear variation among different groups of

belugas used immunological techniques (microcomplement fixation) and starch gel

electrophoresis of allozymes, but the approach had limited success for resolving questions of

population sub-structure (Lint et al. 1990). DNA fingerprint analyses with three minisatellite

probes compared belugas from the eastern Beaufort Sea and the St. Lawrence Estuary and

revealed a dramatically lower level of genetic variation in the St. Lawrence Estuary population,

most likely due to a genetic bottleneck from which the population has failed to recover

(Patenaude et al. 1990). Nuclear DNA markers continued to be developed, and variation among

Page 231: Genetic diversity, population structure and phylogeography ...

213

beluga populations across a broad geographic range was assessed using the analyses of Major

Histocompatability Complex (MHC) locus DQβ (Murray et al. 1995). The analyses found low

levels of variation overall but did find the High Arctic belugas to be different from all other

beluga populations. Given that these whales shared high frequencies of a particular allele at this

locus with narwhal, also a High Arctic species, the study also concluded that evidence of positive

selection at this locus could be an environmental adaptation to different pathogens encountered

at higher latitudes (Murray et al. 1995).

However, by the mid-1990s, nuclear DNA microsatellites were developed for belugas

(Buchanan et al. 1996) and these became the nuclear markers of choice for genetic investigations

such as population and conservation studies, inferring demographic histories, and the

identification of individual relationships and kin groups (e.g. Maiers (Postma) et al. 1996, Brown

Gladden et al. 1999, O’Corry-Crowe et al. 2010, Turgeon et al. 2012, Colbeck et al. 2012). But

for many beluga conservation studies, analyses of mtDNA has proven more informative for

questions about sub-population structuring, phylogeography, monitoring the impacts of

harvesting on individual stocks, and the genetic origins of populations (e.g. Brennin et al. 1997,

Brown Gladden et al. 1997, de March and Postma 2003, Turgeon et al. 2012, O’Corry-Crowe et

al. 2015). These studies have all relied on the sequencing of differing lengths of control region

(CR) sequence. The hypervariable nature of this portion of mtDNA sequence made it well-suited

for detecting recent mutations inferring, in particular, matriarchal phylogeographic trees,

population sub-structure, dispersal, and historical biogeography (Avise and Ellis 1986, Avise et

al. 1987, Avise 1989, Avise et al. 1989).

As costs become increasingly more affordable to analyze large numbers of samples,

genomic data are now being used to address a wide range of conservation problems such as

Page 232: Genetic diversity, population structure and phylogeography ...

214

defining conservation units, assessing the past and present connectivity among and within

species, and assessing adaptive potential (Corlett 2017). In particular, sequencing of complete

mitochondrial DNA genomes (mitogenomes) is allowing comparisons to previous studies that

used a portion or portions of the mitogenome, and subsequently refining conclusions drawn from

those studies. For example, the analyses of complete mitogenome sequences in Eurasian brown

bears (Ursus arctos) had “far greater resolving power than shorter sequences” and provided new

conclusions about population structuring, phylogeography, and demographic history (Keis et al.

2013). Mitochondrial genomes from green turtles (Chelonia mydas) were able to resolve shared

control region haplotypes and improved genetic signal for identifying cryptic population

structure among rookeries (Shamblin et al. 2012). Furthermore, mitogenome sequencing may

have the power to perform mixed stock analysis for the assignment of foraging turtles to natal

sites. Previously undocumented haplotype diversity was found with expanded mitochondrial

genome sequencing of red foxes (Vulpes vulpes), which clarified population structure and

population origins of red foxes in western North America (Volkmann et al. 2015).

Complete mitogenome sequencing offers further advantages for population and

conservation studies of wild species. Analyses of full mitogenomes can investigate diversity

within coding regions of the genome, in particular, the 13 genes associated with the oxidative

phosphorylation system (OXPHOS) that is important for the production of cellular energy

(Ballard and Pichaud 2014). As changes to this metabolic pathway due to mutations in the

protein coding genes can result in fitness and evolutionary consequences, mtDNA sequence

substitutions can play an important role in adaptive evolution within species (da Fonseca et al.

2008). Relating these mutational changes with environmental stressors such as temperature, diet,

changes in hormone levels, and chemical exposure (Ballard and Pichaud 2014) can detect signals

Page 233: Genetic diversity, population structure and phylogeography ...

215

of local adaptation within species and populations. Sequence analyses of nine mitochondrial

genes involved in the OXPHOS pathway of Atlantic salmon (Salmo salar) provided evidence of

both purifying and positive selection in ND1, ND3 and ND4 genes that were correlated with a

non-random pattern over a wide latitudinal range (Consuegra et al. 2015). Since some of the

mutations were specific to Arctic populations, the selection on the mtDNA genome was

hypothesized to be related to local metabolic adaptation to cold temperatures (Consuegra et al.

2015). Changes in latitude are not the only environmental gradients over which mtDNA-linked

adaptation can occur. In Chinese snub-nosed monkeys (Rhinopithicus species), evidence of

positive selection in mtDNA ND2 and ND6 genes in one particular species (Rhinopithicus

roxellana) was found to be associated with high altitude and cold temperatures (Yu et al. 2011).

In cetacean species, whole mitogenome sequences have been used to investigate a wide

range of molecular ecology questions including population diversity (e.g. sperm whales

(Physeter macrocephalus) Alexander et al. 2013), phylogenetic relationships (e.g. Delphinidae,

Vilstrup et al. 2011; fin whales (Balaenoptera physalus) Archer et al. 2013), and evolutionary

origins (e.g. Arnason et al. 2004, killer whales (Orcinus orca) Foote et al. 2011a). Whole

mitogenome analyses of killer whales, arguably one of the most studied cetaceans, have revealed

many new insights into the natural history and ecology of this species. Phylogenetic inferences

for killer whales were greatly improved by moving from low diversity control region sequences

to whole mitogenomes (Morin et al. 2010) and strengthened the identification of geographic

ecotypes that formed distinct clades associated with resource specializations (Foote et al. 2011b,

Moura et al. 2015). Evidence of functional adaptation in killer whales, related to metabolic

performance for the cold environment of the Antarctic pack ice, has also been found from the

analyses of mitogenome coding genes (Foote et al. 2011c).

Page 234: Genetic diversity, population structure and phylogeography ...

216

Currently, whole mitogenome sequencing has not been explored as a tool for beluga

population ecology and conservation research. Based on studies in other cetacean species, whole

mitogenome sequencing of beluga samples will offer more power to investigate genetic

population diversity, structure, and signals of selection. In this study, the main goal is to assess

the potential advantages of using mitogenomes for population genetic studies of belugas in

Canada. To do this, I have the following objectives:

1. To develop a next-generation sequencing workflow customized for beluga and

narwhal sample templates to generate complete mitochondrial genomes for

population studies;

2. To characterize the genetic diversity, differentiation and phylogenetic relationships of

beluga sample mitogenome haplotypes and compare these results to similar analyses

using only mtDNA control region (CR) haplotypes (as described in Chapter 2);

3. Identify nonsynonymous and synonymous mutations across mtDNA protein-coding

genes in beluga samples to detect indications of selection. This involves comparing

intraspecific polymorphism across beluga sample collections and using between-

species divergence comparisons to narwhal. Based on a previous study (Murray et al.

1995), a comparison of mitogenome protein-coding sequence from high Arctic beluga

samples (Grise Fiord) to samples collected from the most southern Canadian

population of belugas (St. Lawrence Estuary) is the most likely to detect a signal of

positive selection.

Page 235: Genetic diversity, population structure and phylogeography ...

217

4.2 Materials and Methods

4.2.1 Sample selection

Beluga whale samples in Canada have been collected for many years through harvest

monitoring programs, research programs such as satellite tagging studies, and occasional

opportunistic sampling of carcases and sloughed skin. Samples in this study have been accessed

from archived tissues (primarily skin and muscle) dating back to the 1980s and held at the

Freshwater Institute, Winnipeg, Manitoba (DFO Central and Arctic). Samples taken in the field

were either frozen or preserved in a salt-saturated 20% DMSO solution (Seutin et al. 1991) and

frozen upon arrival at the lab.

Geographic samples for this study were chosen to cover the full Canadian range of

belugas and include representative samples for each genetically distinct stock identified using

CR mitochondrial DNA sequences (Figure 4.1). All samples used for mitogenome sequencing

were selected, when possible, from July or August to ensure sampling of summer aggregations.

The one exception was the inclusion of beluga samples from ice entrapments around the Belcher

Islands, which were winter samples and considered to be from whales “resident” to southern

Hudson Bay. Within the Beaufort Sea, additional samples were chosen from different

aggregation areas where local subsistence harvest or ice entrapment mortality occurs. These

samples will allow for testing of spatial hypotheses of genetic structure and phylogenetic patterns

from a broad, Canada-wide scale to a local, within-stock scale.

Page 236: Genetic diversity, population structure and phylogeography ...

218

Figure 4.1. Sampling locations and sample sizes for beluga and narwhal mitochondrial genome

sequences. Total number of samples for beluga, N=106; total number of samples for narwhal, N=96.

Abbreviations after location name indicates putative stock (identified in Richard 2010) (see legends).

Beluga Stocks: EBS = E. Beaufort Sea CHA = Central High Arctic EHA = E. High Arctic CSd = Cumberland Sound FB = Foxe Basin (unknown) WHB = W. Hudson Bay SQH = Belcher Islands harvest SQE = Belcher Islands ice entrapments JB = James Bay LI = Long Island (unknown) SLE = St. Lawrence Estuary EHB = E. Hudson Bay

Narwhal Stocks: SI = Somerset Island PCH = Parry Channel JSD = Jones Sound AI = Admiralty Inlet ES = Eclipse Sound EB = E. Baffin Island RB = Repulse Bay

Page 237: Genetic diversity, population structure and phylogeography ...

219

Specific samples for each stock were chosen on the basis of CR haplotypes identified

using 702bp of mtDNA sequence (Chapter 2), and included both haplotypes that were unique to

a particular stock (see Appendix 4.1) and haplotypes that were indicated by phylogenetic

analyses to be ancestral haplotypes (found in high overall proportions and/or sampled in almost

every stock i.e. E02, E11, E72, E120, E16, E18). This selection strategy was used to encompass

the widest possible range of mutational motifs previously found among Canadian belugas (e.g. as

in Achilli et al. 2012).

For all locations except the central High Arctic (Cunningham Inlet), beluga tissue

samples for genetic analyses were taken from dead animals or from satellite tagging programs

where there was confidence that replicate sampling of individuals did not occur. The

Cunningham Inlet samples were collected from sloughed beluga skin washed onto shore by high

wave action. These tissues were all previously analyzed using 15 microsatellite loci and replicate

samples of individuals identified using GenAlEx ver. 6 (Peakall and Smouse 2006). Only skin

samples from unique individuals from this location were included in analyses for this study.

Narwhal samples, representing seven of the eight potential summer stocks in Canada

(Higdon and Ferguson 2017, Figure 4.2), were also sequenced for inter-species comparisons with

the beluga mitogenome dataset (Figure 4.1). Narwhals and belugas are the only extant members

of the odontocete family Monodontidae and are estimated to have diverged approximately 6.28

Mya (McGowan et al. 2009). In contrast to belugas and most other cetaceans, narwhals have a

very low level of variation in CR mitochondrial DNA sequence that is thought to stem from

recent expansion from a small founding population (Palsböll et al. 1997, de March et al. 2003).

This low diversity of mtDNA is not unique to narwhal and has been observed in other species of

whales with matrilineal population structure such as pilot whales (Globicephala sp.), sperm

Page 238: Genetic diversity, population structure and phylogeography ...

220

whales and killer whales (Whitehead et al. 1998). It has also frustrated past attempts to

genetically distinguish summering stocks of narwhals in Canada that are part of the Baffin Bay

narwhal population (de March et al. 2003).

Narwhal samples have also been collected in Canada over a similar timeframe as belugas,

and are also available mostly due to harvest monitoring programs and research programs

involving satellite tagging studies. Tissue samples collected for genetic analyses were also

preserved in the field either by freezing or in a salt-saturated 20% DMSO solution (Seutin et al.

1991) and frozen upon arrival at the lab.

Page 239: Genetic diversity, population structure and phylogeography ...

221

Figure 4.2. Summer aggregation ranges of putative narwhal stocks in Canada (from Higdon and Ferguson

2017).

Page 240: Genetic diversity, population structure and phylogeography ...

222

4.2.2 Complete mitochondrial genome sequencing

For this thesis, a novel workflow was developed to generate complete beluga and narwhal

mitochondrial genome sequences using an Ion Torrent Personal Genome Machine (PGM™)

system (ThermoFisher Scientific/Life Technologies, Carlsbad, CA, USA). During the time data

were collected, a complete mitochondrial genome had not been produced or published for

belugas; however, a complete mitogenome was available in GenBank for narwhal (accession

#AJ554062).

Total cellular DNA extractions from whale skin samples were performed using DNeasy

Blood and Tissue kits (Qiagen Inc., Valencia, CA, USA) according to the manufacturer’s

protocols. Primers were developed to amplify the entire mitochondrial genome using two long-

range PCR amplifications based on the narwhal mitogenome sequence from GenBank and

designed using GenBank Primer-BLAST software. Primers were selected within the Cytb and

CO1 gene coding regions and were designated Delmtg-1 (forward primer: CAA TAC ACT ACA

CAC CAG ACA CC; reverse primer: GCT AGG ACA GGT AGT GAT AGT AGG) and

Delmtg-2 (forward primer: CTT GTCCCT TTA ATA ATC GGA GCC; reverse primer: TCT

GGT GTG TAG TGT ATT GCT AGG). These primer sets were used to amplify two long-range

PCR amplicons (target amplicon sizes 8.2kb for Delmtg-1 and 8.6kb for Delmtg-2) using the

SequalPrep™ Long PCR Kit with dNTPs (ThermoFisher Scientific/Life Technologies).

Amplification reaction mixtures were prepared in 20μL volumes containing 2μL 10X reaction

buffer, 0.4μL dimethyl sulphoxide (DMSO), 1μL 10X enhancer B, 0.36μL long polymerase

(1.8U), 14.24 μL DNase-free water, and 1 μL of primer set (500nM final concentration). The

PCR reactions were performed under the following conditions: 30 cycles of 94⁰C for 10s, 59⁰C

(for Detmtg-1 primer set) or 55⁰C (for Delmtg-2 primer set) for 30s, 68⁰C for 8min; and a final

Page 241: Genetic diversity, population structure and phylogeography ...

223

extension of 72⁰C for 5min. Amplification products were visualized on 0.8% agarose gels,

compared to a 1kb DNA ladder, and assessed for quality and expected size. Successfully

amplified samples were purified using QIAquick PCR Purification kits (Qiagen Inc.) using

manufacturer protocols and eluted into 50μL of nuclease free water. After purification,

individual amplicon concentrations were determined using a NanoDrop UV-Vis

Spectrophotometer (ThermoFisher Scientific) and the Delmtg-1 and Delmtg-2 amplicons were

pooled in equimolar amounts (250ng of each amplicon for a total of 500ng template DNA) for

library construction.

Amplicon libraries were prepared using Ion Shear Plus Fragment Library kits

(ThermoFisher Scientific/Life Technologies) using the manufacturer’s recommended protocols.

Enzymatic shear times were optimized for each set of samples prepared, but in most reactions

were 7 minutes (range 5–8 minutes). Fragmented amplicons were purified using a Serapure

protocol (Faircloth and Glenn 2011) and libraries prepared using manufacturer protocols for end

repair and ligation of barcodes to identify individual samples during PGM sequencing. The

unamplified library was size selected using 2% agarose gel electrophoresis (E-Gel system,

ThermoFisher Scientific/Life Technologies) and a 50bp DNA ladder to yield library fragments of

approximately 200bp. Resulting 200bp fragment libraries were amplified in approximately 50μL

reaction volumes (47μL master mix containing 40μL Platinum PCR mix, 2μL Library

amplification primer mix, and 5μL nuclease-free water; plus 3-5μL fragmented sample) using

the following conditions: 1 cycle of 95⁰C for 5min.; 15 cycles of 95⁰C for 15s, 58⁰C for 15s,

70⁰C for 1min; and a final hold at 4⁰C. Amplification products were assessed using 2% agarose

gel electrophoresis, purified with Serapure, and size-selected again (as above) to isolate 200bp

amplified libraries. Individual sample libraries were analyzed to determine relative quality,

Page 242: Genetic diversity, population structure and phylogeography ...

224

confirm fragment size profiles, and quantify concentration using a BioAnalyzer (Agilent

Technologies, Santa Clara, CA, USA) with the High Sensitivity DNA Kit (Agilent

Technologies). Each sample library was diluted to 100pM concentration and samples (generally

32 samples, range 16-37) pooled in equimolar amounts into a final 100pM barcoded library pool.

Sequencing template preparation of the diluted library pool was performed with an Ion

OneTouch 2 Instrument using the Ion PGM template OT2 200 (ThermoFisher Scientific/Life

Technologies) according to the manufacturer’s protocols with 2.5-3μL of library pool. At the end

of template preparation, 2μL of the unenriched, template-positive Ion Sphere Particles (ISPs)

were assessed for the percentage of template ISPs using an Ion Sphere Quality Control kit

(ThermoFisher Scientific/Life Technologies) and Qubit 2.0 Fluorometer (ThermoFisher

Scientific/Life Technologies) according to the recommended protocols. All PGM sequencing

reactions in this study contained 19–23% template ISPs, all within the recommended range of

10-30% ( Ion PGM Template OneTouch 2 200 kit User Guide, ThermoFisher Scientific/Life

Technologies). Sequencing of templated ISPs was conducted using an Ion PGM 200 or Ion PGM

Hi-Q Sequencing kit and an Ion 316 or 318 Chip on an Ion Torrent Personal Genome Machine

(PGM) (ThermoFisher Scientific/Life Technologies) following the manufacturer protocols.

4.2.3 Complete mitochondrial genome assembly

The raw sequencing reads from the PGM were quality filtered, and adapters and barcodes

were trimmed using post-sequencing processing with the PGM Torrent Suite Software ver.

4.2.1(ThermoFisher Scientific/Life Technologies). The extracted reads were assembled using the

SeqMan NGen, SeqMan Pro, and MegAlign Pro programs contained in the LaserGene software

package from DNAStar (Madison, WI, USA).

Page 243: Genetic diversity, population structure and phylogeography ...

225

To create a reference sequence for belugas, PGM data for 16 beluga samples representing

each of the putative stocks were assembled using the de novo genome workflow in SeqMan

NGen. Based on the results, the sample with the fewest number of contigs of large size was

chosen (sample 98SLE003) to then be reassembled and aligned to the narwhal GenBank

mitogenome sequence (accession #AJ554062) using the reference-guided genome workflow in

SeqMan NGen with a minimum match percentage of 90%. The resulting reference sequence

assembly was analyzed in SeqMan Pro where mismatches and sequencing errors were manually

corrected, and a draft beluga reference sequence created. The original PGM data for the draft

reference sequence was then reassembled to the new reference so that any remaining errors could

be manually corrected. No structural variants were detected. As further confirmation that there

were no unresolved insertions, the unassembled sequence reads from the reference sample data

were de novo assembled in SeqMan NGen. The contigs that resulted from this analysis had very

thin coverage, and alignment with the beluga consensus reference using MegAlign Pro

confirmed they were not insertion elements. Reassembly of the original raw PGM reference

sample data to the new beluga consensus sequence produced an assembly with relatively few

(<50) consensus disagreements, mostly due to homopolymeric errors and misassembly of small

repeat elements. These positions were manually edited from the sequence, and it was saved as

the final beluga consensus reference sequence for reference-guided genome assembly of all

remaining beluga samples. All resulting beluga mitogenome sequences were aligned with the

beluga mitogenome reference using MegAlign Pro and edited for errors. Raw PGM data for all

narwhal samples were assembled using the NGen reference-guided assembly workflow with the

GenBank narwhal mitogenome sequence as a reference. All resulting narwhal consensus

Page 244: Genetic diversity, population structure and phylogeography ...

226

sequences were aligned with the GenBank reference using MegAlign Pro to check for errors

which were subsequently corrected.

A published complete beluga mitochondrial genome (derived from a beluga originating

in Russia) was published in early March 2017 (Kim et al. 2017). This sequence was obtained

from GenBank (accession #KY444734) and included with all beluga samples in this study and

aligned using MEGA ver. 6 (Tamura et al. 2013). Based on the published beluga mitogenome,

all samples in this study were again edited to remove any errors in the sequence, especially

homopolymer errors, which are known to be a problem associated with the flow-based

chemistries used by the PGM (Loman et al. 2012). Narwhal sequences were also aligned with the

GenBank narwhal reference using MEGA ver. 6 (Tamura et al. 2013), and sequences again re-

examined for potential errors. All errors were corrected and final sequence alignments for each

species retained for data analyses.

Individual sample sequences were refined with further inspection and haplotypes were

identified using GenAlEx ver. 6.5 (Peakall and Smouse 2012). GenAlEx data review options

highlighted polymorphic sites in the sequences and ambiguous positions in individual samples

(“R”, “Y”, or spurious deletions), which were then identified and edited in the MEGA

alignments (ver. 6, Tamura et al. 2013) of all sequences. Through iterations of this process, final

alignments of complete mitogenomes for all samples were created using MEGA and haplotypes

assigned using GenAlEx.

4.2.4 Mitogenome sequence variability and differentiation among sample collections

Mitogenome haplotype frequencies and distributions were evaluated and compared to the

patterns revealed using CR region sequences. The number of polymorphic sites, parsimony

Page 245: Genetic diversity, population structure and phylogeography ...

227

informative sites, different haplotypes (NH), haplotype diversity (h), nucleotide diversity (π), and

the average number of nucleotide differences (k) within sample collections were determined

using DnaSP ver. 5.1 (Librado and Rozas 2009). The genetic differentiation among all Canada-

wide geographic samples was assessed using analysis of molecular variance (AMOVA) between

and within the sample collection groups and pairwise ΦST comparisons using Arlequin ver. 3.5

(Excoffier and Lischer 2010). Tajima’s D statistic (Tajima, 1989) and Fu’s FS test statistic (Fu

1997) were used to assess departures from an evolutionary model of neutral mutation. These

tests are slightly different as Tajima’s D uses information from the mutation frequency and Fu’s

FS uses information from the haplotype distribution (Ramos-Onsins and Rozas 2002). Both

statistics were determined using Arlequin ver. 3.5 and significance was tested using 10,000

coalescent simulations. Significantly negative Tajima’s D tests and Fu’s FS tests are considered

to be indicative of population expansion.

4.2.5 Phylogenetic analyses of complete mitogenome sequences

Unrooted median-joining phylogenetic networks using complete mitogenome sequence

haplotypes representing the entire Canada-wide beluga sample collections i.e. with Hendrickson

Island samples only representing the Eastern Beaufort Sea (EBS) stock (N=77), and for the EBS

samples only (N=38), were constructed using Network ver. 5.0 (www.fluxus-engineering.com).

Different network parameters were explored using default values (no weighting), weighting of

variable positions based on transversion:transition ratios of 1:1 and 3:1, and by “down”

weighting hypervariable positions within the control region (Fluxus Engineering 2015). For both

networks, different epsilon values were set after determining the maximum unweighted pairwise

difference from a mismatch distribution analysis of the samples. For the Canada-wide network,

Page 246: Genetic diversity, population structure and phylogeography ...

228

epsilon values of 10 (recommended, Fluxus Engineering 2015), 50 and 160 (maximum

unweighted pairwise difference) were used. For the EBS only samples, epsilon values of 10 and

40 (maximum unweighted pairwise difference) were used. For all networks, the “Greedy FHP”

method for distance calculation (Foulds et al. 1979) was selected and so was the MP (maximum

parsimony) option that identifies and eliminates unnecessary median vectors and links. The

resulting network was drawn and modified for clarity using Network Publisher ver. 2.0

(http://www.fluxus-engineering.com/nwpub.htm).

Evolutionary histories among beluga and narwhal samples were also investigated using

phylogenetic trees based on complete mitogenome sequences, which also allows for the

characterization of potential population subdivision through the identification of lineages with a

common ancestor (Nichols 2001). The most appropriate substitution model for the data was

determined using methods contained within MEGA ver. 6 (Tamura et al. 2013). The optimal

model was selected based on the lowest Akaike Information Criterion (AIC) and Bayesian

Information Criterion (BIC) scores and was considered to best describe the substitution pattern in

the data.

Neighbour-Joining (NJ) trees were used to infer evolutionary histories among all beluga

samples (N=106) and all narwhal samples (N=93) and included outgroup mitogenome sequences

from the Indo-Pacific finless porpoise (Neophocaena phocaenoides) and the narrow-ridged

finless porpoise (Neophocaena asiaorientalis) obtained from GenBank (accession numbers

NC_021461 and NC_026456 respectively). The evolutionary distance for both trees was

computed using the TN93 model (Tamura and Nei 1993) based on model selection results in

MEGA ver. 6 analyses. A bootstrap consensus tree inferred from 1000 replicates was retained

and major clades with bootstrap support of 80% or greater identified.

Page 247: Genetic diversity, population structure and phylogeography ...

229

Phylogenetic analysis by maximum likelihood (ML) (an approach that infers an

evolutionary tree that makes the data most likely, Hall 2011) was implemented in the online

version of PhyML ver. 3.0 (Guindon et al. 2010). All beluga sample mitogenome sequences

(N=106) were submitted, and the substitution model TN93 (Tamura and Nei 1993) was selected.

The equilibrium frequencies, the transition:transversion ratio and the proportion of invariable

sites were all estimated. The number of substitution rate categories was 4, and the gamma shape

parameter was estimated by the program. Tree searches used a BIONJ tree as a starting tree, and

the tree topology improvement was assessed using SPR (subtree pruning and regrafting

topological moves, Hordijk and Gascuel 2005). This method is recommended for datasets larger

than 50 sequences and where a weak phylogenetic signal is expected (Guindon 2012). Branch

support was estimated using the approximate likelihood ratio test (“aLRT SH-like” option)

(Anisimova and Gascuel 2006) and using bootstrap resampling of 1000 random replicates.

A Bayesian inference (BI) approach was also used to estimate a phylogenetic tree;

however, the tree that is produced in this analysis is the one that is most likely given the data and

the substitution model selected (Hall, 2011). The BI analyses of beluga samples (N=106) were

conducted using MrBayes ver. 3.2.5 (Ronquist et al. 2012) executed on the CIPRES

(Cyberinfrastructure for Phylogenetic Research) Science Gateway ver. 3.1 (Miller et al. 2010).

Analyses used two parallel independent runs each consisting of four simultaneous chains (one

“heated chain” and three “cold” chains). The optimal substitution model for the data, the NST

parameter of the likelihood model, was set to ‘2’ which encompasses the K2, HKY, T3P and

TN93 models (Ronquist et al. 2011). Similarly, the rates parameter was set to ‘invgamma’,

which indicates that a proportion of the sites are invariable and that the rate for the remaining

sites should be drawn from a gamma distribution. This distribution is approximated using four

Page 248: Genetic diversity, population structure and phylogeography ...

230

categories, and the proportion of invariable sites was estimated from a uniform distribution. All

priors for the phylogenetic model were left at the default parameters; therefore, no molecular

clock was assumed, rate heterogeneity was modelled by the analyses, and any combination of

base frequency was given equal prior weight. The two concurrent runs of 10,000,000 generations

were sampled every 1000 generations to assess for convergence using an evaluation of the

standard deviation of the split frequencies between runs and a Potential Scale Reduction Factor

(PRSF) convergence diagnostic (Ronquist et al. 2011). The effective sample sizes (ESS) of

parameters sampled from the MCMC analysis and the overall trace data were also evaluated

using a 25% burn-in applied by Tracer ver. 1.6 (Rambaut et al. 2014). A 50% majority-rule

consensus tree was then constructed to summarize the posterior probabilities for each identified

clade.

4.2.6 Mitogenome codon substitution patterns and evidence of selection

Sequences containing only the concatenated mtDNA protein-coding genes (PCG) were

created for both the beluga and narwhal datasets. In each case, the GenBank reference sequence

(accession #KY444734 for beluga and accession #AJ554062 for narwhal) was imported into the

SeqBuilder program contained in the LaserGene software package from DNAStar (Madison, WI,

USA). After the PCG annotations had been applied, the non-coding portions of the sequence

were edited out and the resulting concatenated sequence exported as a new PCG reference. The

beluga and narwhal samples were aligned to their respective PCG reference sequence using

MEGA ver. 6 and edited to remove the non-coding regions. All beluga samples and ten narwhal

samples representing six putative narwhal stocks were combined into a final dataset with

identical haplotypes removed so only unique haplotypes were included. Nucleotide diversity (π)

Page 249: Genetic diversity, population structure and phylogeography ...

231

of the mtDNA genes in beluga and the interspecific nucleotide divergence (Dxy , the average

number of nucleotide substitutions per site between species) between beluga and narwhal

mitogenomes were calculated using pairwise comparisons made with DnaSP ver. 5.1 (Librado

and Rozas 2009).

Natural selection and its influence on the evolution of protein-coding genes (PCGs) is

commonly studied by measuring the ratio ω = dN/dS or KA/KS, or the rates of non-synonymous

(dN or Ka = the mean number of nucleotide substitutions per 100 nonsynonymous sites) to

synonymous nucleotide substitutions (dS or Ks= the mean number of nucleotide substitutions per

100 synonymous sites) (Nei and Gojobori 1986). It is assumed that ω > 1 indicates positive

selection where replacement substitutions increase fitness, ω = 1 indicates neutrality, and ω < 1

indicates negative, or purifying, selection (Nozawa et al. 2009). Using DnaSP ver. 5.1, KA/KS (ω)

estimates were determined for each of the H-strand mtDNA protein coding genes in beluga to

determine if there was evidence of selective pressure on a particular gene or set of genes.

A codon-based Z-test of selection, implemented in MEGA ver. 6 (Tamura et al. 2013),

was used to perform a distance-based sequence analysis of the beluga clades to estimate the

average number of synonymous and nonsynonymous substitutions that have occurred in the

PCGs. Estimates of dN and dS were calculated, along with the variance of the difference between

these two quantities using the bootstrap method (1000 replicates), to test the null hypothesis that

dN = dS (neutrality) (Nei and Gojobori 1986) using a Z-test (Nei and Kumar 2000). Rejection of

this null hypothesis could indicate positive selection (dN > dS) or negative selection (dN < dS).

The McDonald-Kreitman (MK) test (McDonald and Kreitman 1991), executed in the

DnaSP program, was also used to test for indications of adaptive protein evolution in the beluga

clades identified by phylogenetic analyses. In this test, the ratio of nonsynonymous (amino-acid

Page 250: Genetic diversity, population structure and phylogeography ...

232

altering) to synonymous changes within species are compared to the ratio of nonsynonymous to

synonymous changes between species (Meiklejohn et al. 2007). Under a prediction of neutrality

(no selection), the ratio of replacement (nonsynonymous) to silent (synonymous) changes

observed between species should equal the ratio of replacement to silent changes between

species (McDonald and Kreitman 1991). Departures from this neutrality can be evaluated using

the Neutrality Index (NI) (Rand and Kann 1996, Meiklejohn et al. 2007), where NI = 1.0

indicates neutrality, NI < 1.0 indicates negative selection (detected as an excess of amino acid

divergence), and NI > 1.0 indicates positive selection (detected as an excess of amino acid

polymorphism) (Meiklejohn et al. 2007). Narwhal mitogenome sequences (N=10) were used for

interspecific comparisons to the beluga clades.

Many methods that use dN/dS or KA/KS ratios to assess selection pressure in particular

DNA regions may lack the statistical power to detect positive selection, as only a few sites may

be affected (Kosakovsky Pond and Frost 2005). To perform this type of analysis, methods that

specifically identify signals of positive selection were used in the software package HyPhy

(Kosakovsky Pond et al. 2005) found at the Datamonkey server (Poon et al. 2009). This analysis

has the advantage of including the evolutionary history of the sequences, in the form of a

phylogenetic tree, along with site-by-site analysis over multi-gene data sets (Kosakovsky Pond et

al. 2005) that enables the identification of both codons and lineages under selection (Poon et al.

2009).The PhyML program was used to infer a maximum-likelihood (ML) tree for the

concatenated protein-coding gene sequences of the 87 (N=86 Canadian samples + reference

sequence) unique beluga mitogenome haplotypes (same settings used for the complete

mitogenome tree and aLRT-SH branch support). These sequences and the ML tree were used as

inputs for the “Positive Selection” option in HyPhy with several approaches for estimating the

Page 251: Genetic diversity, population structure and phylogeography ...

233

ratios of nonsynonymous and synonymous changes (dN/dS): the multirate fixed effects likelihood

(FEL) method that estimates dN/dS for each site; the single likelihood ancestor counting (SLAC)

method that counts the numbers of synonymous and nonsynonymous substitutions along the

phylogeny (Kosakovsky Pond and Frost 2005); and the branch site REL (BSR) method that

allows evolutionary rate variation along branches and sites simultaneously (Kosakovsky Pond et

al. 2011). These methods were chosen on the basis of computational demands for the beluga

dataset, but with enough of a range of approaches to allow for consensus-based inference of the

results and to avoid false positives (Kosakovsky Pond et al. 2007).

4.3 Results

4.3.1 Complete mitogenome sequencing workflow for beluga and narwhal

A total of 115 beluga samples and 105 narwhal samples, representing broad (and fine-

scale for belugas) geographic distribution over the Canadian range of each whale, were

processed using the next-generation sequencing workflow developed for this study. Each step of

the analysis pipeline was refined to address limitations in laboratory infrastructure while at the

same time optimizing sample throughput and quality control measures. For each sequencing run,

bases were filtered to include only those with quality scores greater than or equal to Q20. These

quality scores, or Phred scores (Ewing et al. 1998), reflect the probability of base call error

(Huse et al. 2007). A Phred quality score of 20 indicates a 0.01 probability of an incorrect base

call (or 99% base call accuracy). While these scores can give a general sense of base-calling

error rate, the information from quality scores does not describe error patterns (Li et al. 2004).

These patterns were considered in the final assembly of consensus sequences for each sample.

As a result, of the 115 beluga samples processed, raw data (reads) for 108 samples were

Page 252: Genetic diversity, population structure and phylogeography ...

234

available for assembly. The number of narwhal samples was reduced from 105 samples

sequenced to a total of 96 samples for assembly.

In general, the number of reads covering the same regions of the genome (read depth),

the evenness of the read coverage along the entire genome, the length of the reads, and the read

quality are the major indicators of the quality of the assembled, reconstructed genome sequences

(e.g. Loman et al. 2012). Across all runs, the quality parameters for beluga and narwhal samples

were very good, with large numbers of reads of tightly distributed and consistent read lengths

(Table 4.1). Beluga and narwhal sample sequencing depth varied across the full mitogenome and

ranged from 200- to over 3500-fold coverage (Figure 4.3a), with the exception of a few samples

that had read depth that ranged from 1- to 120-fold and incomplete coverage across the full

mitogenome (Figure 4.3b). Samples with poor read depth or coverage were discarded (N=2

beluga; N=3 narwhal). For both beluga and narwhal sample mitogenome assemblies, insertions

and deletions of single nucleotides were the most common errors in the sequencing reads. These

homopolymer errors are known to be the dominant type of error in PGM sequencing (e.g. Loman

et al. 2012, Bragg et al. 2013, Zhang et al. 2015). All of these errors (approximately 65 in total,

or 0.4% of total nucleotides) were edited out of the final sequences as described in Section 4.2.3,

leaving only polymorphic variable sites. Overall, a total of 106 complete beluga mitogenome

sequences comprised of 16,386bp were retained for further analyses. For narwhal, the complete

mitogenomes (16,381bp) of 93 samples were retained.

Page 253: Genetic diversity, population structure and phylogeography ...

235

Table 4.1. Mitogenome sequencing run summary statistics for beluga and narwhal samples

(averaged across sequencing runs/samples).

Statistic Beluga Narwhal

Median read length (bp) 215 202

Total number of reads 4.1 million 1.9 million

% Useable reads 61% 56%

Average coverage depth 1500X 1500X

Page 254: Genetic diversity, population structure and phylogeography ...

236

Figure 4.3. Examples of read coverage against reference mitogenome sequence. (A) High

threshold coverage typical of most samples included in this study; (B) Poor (below) threshold

coverage for samples not included in analyses.

(A)

(B)

Page 255: Genetic diversity, population structure and phylogeography ...

237

4.3.2 Mitogenome sequence data analysis

Multiple alignments of samples for each species were created with GenBank

mitogenomes as reference sequences to assess variability. For beluga samples, one variable

position could not be resolved with confidence. This position (at 15,476), in the control region,

was deleted from both the reference and Canadian samples, resulting in a final mitogenome

sequence length for beluga of 16,385bp. The Canadian narwhal samples contained two deleted

positions compared to the GenBank reference, both again occurring in the control region

(positions 15,469 and 15,483). Each of these positions was deleted from the reference to align it

with the Canadian samples, resulting in a final narwhal mitogenome sequence of 16,381bp.

Overall, there were 305 variable mitogenome positions observed in the Canadian beluga

samples (N=106) and 151 variable positions in the narwhal sample mitogenomes (N=94) (Table

4.2). For belugas, 240 of the variable sites were found in mtDNA coding regions, resulting in

173 synonymous changes (72.1%) and 67 nonsynonymous (replacement) changes (27.9%).

Narwhal samples had 118 of the total variable positions in coding regions, with 86 sites resulting

in synonymous changes (72.9%) and 32 positions with variations leading to replacement changes

(27.1%). The patterns of variation over the complete mitogenomes for each species were

different (Figure 4.4), most notably in the control region. Beluga mitogenomes for the samples in

this study had 33 variable positions in 916bp of control region sequences (3.6% variability).

Narwhal, as found in previous studies (Palsbøll et al. 1997, de March et al. 2003), had much

lower variability than beluga in the control region, with 12 variable positions in 913bp of

sequence (1.3% variability).

Page 256: Genetic diversity, population structure and phylogeography ...

238

Table 4.2. Complete mitogenome genetic variability among beluga sample collections and beluga clades (as identified in Section 4.3.3).

Abbreviations: h = Haplotype Diversity; h SD = Standard Deviation of h; π = Nucleotide Diversity; π SD = Standard Deviation of π; k = Average

Number of Nucleotide Differences; D = Tajima’s Neutrality Test; Fs = Fu’s Neutrality Test. *ns = not significant and sig. = significant; **na = test

not available, at least 4 sequences required.

Sample Collection N

Sample

Poly-

morphic

Sites

Parsimony

Informative

Sites

N

Haplo-

types

h h

SD

π π

SD

k D Fs

Grise Fiord 11 25 34 10 0.982 0.046 0.001 0.000 19.06 -0.256 ns*

(P>0.10)

-0.709 ns

(P>0.10)

Cunningham Inlet 5 48 30 5 1.000 0.126

0.002 0.000 25.20 0.707 ns

(P > 0.1)

0.787 ns

(P>0.10)

Igloolik 5 37 2 5 1.000 0.126 0.000 0.000 15.20 -1.082 ns

(P>0.10)

0.193 ns

(P>0.10)

E. Beaufort Sea 38 81 53 29 0.983 0.010 0.001 0.000 21.31 0.385 ns (P>0.10) -4.488 ns

(0.1>P>0.05)

Pangnirtung 10 53 42 9 0.978 0.003 0.001 0.000 17.89 -0.221 ns

(P>0.10)

-0.383 ns

(P>0.10)

St. Lawrence 3 15 0 3 1.000 0.272 0.001 0.000 10.00 na** na

SQE ice

entrapments

10 180 149 8 0.956 0.059 0.005 0.001 84.93 1.633 ns (P>0.10) 4.593 ns

(P>0.10)

SQE harvests 5 41 2 5 1.000 0.126 0.001 0.001 16.80 -1.101 ns

(P>0.10)

0.315 ns

(P>0.10)

Long Island 3 146 0 3 1.000 0.272 0.006 0.003 97.33 na na

James Bay 4 40 4 4 1.000 0.177 0.001 0.001 20.67 -0.545 ns

(P>0.10)

1.15 ns

(P>0.10)

E. Hudson Bay 4 150 2 4 1.000 0.177 0.005 0.002 75.33 -0.831 ns

(P>0.10)

2.506 ns

(P>0.10)

W. Hudson Bay 8 175 163 7 0.964 0.077 0.005 0.001 75.85 0.640 ns (P>0.10) 3.116 ns

(P>0.10)

Clade A1 37 50 17 30 0.986 0.010 0.000 0.000 4.72 -2.183 sig.

(P = 0.003)

-27.43 sig.

(P = 0.000)

Clade A2 9 41 34 7 0.944 0.070 0.001 0.000 15.17 0.027 ns (P>0.10) 1.15 ns

(P>0.10)

Clade B 44 81 35 38 0.994 0.006 0.000 0.000 7.18 -2.206 sig.

(P = 0.002)

-25.034 sig.

(P = 0.000)

Clade C 13 37 15 11 0.962 0.050 0.001 0.000 8.72 -1.1907 ns

(P>0.10)

-2.46 ns

(0.1>P>0.05)

Page 257: Genetic diversity, population structure and phylogeography ...

239

Sample Collection N

Sample

Poly-

morphic

Sites

Parsimony

Informative

Sites

N

Haplo-

types

h h

SD

π π

SD

k D Fs

All Beluga 106 305 220 86 0.996 0.002 0.003 0.000 48.84 -0.543 ns

(P>0.10)

-21.91 sig.

(P = 0.003)

All Narwhal 94 151 93 56 0.977 0.007 0.001 0.000 15.32 -1.618 sig.

(P = 0.021)

-19.73 sig.

(P = 0.001)

Page 258: Genetic diversity, population structure and phylogeography ...

240

Figure 4.4. Sequence variation observed within the beluga (A) and narwhal (B) mitochondrial

genomes. Graphs for both the total number of substitutions (S) and nucleotide diversity (π) were

created by considering windows of 200bp (step size = 100bp) centred at the midpoint.

Page 259: Genetic diversity, population structure and phylogeography ...

241

A total of 86 unique haplotypes were identified in the 106 beluga samples analyzed in

this study (Appendix 4.1). The majority of these haplotypes (69.8%) were observed in single

samples, with shared haplotypes generally found in samples from the same or closely related

stocks (Table 4.2). The greatest number of shared haplotypes (5/14) were found in samples from

the Eastern Beaufort Sea stock which had the largest number of samples analyzed from a single

stock (N=38). A further 3/14 shared haplotypes were observed in samples from the Belcher

Island ice entrapments. However, some samples with shared haplotypes were notable. Samples

from Igloolik, which were taken from migratory whales thought to be from the High Arctic

stocks, had haplotype 19 which was also found in a sample from Grise Fiord (as expected), but

also one with haplotype 16 which was also found in a sample from the Belcher Islands ice

entrapment. The most unexpected result occurred with haplotype 77 which was observed in three

samples, one from the St. Lawrence Estuary and two samples from western Hudson Bay. This

was also the only instance where samples with the same mitogenome haplotype did not share the

same control region (CR) haplotype found in previous analyses (Chapter 2, this thesis). In fact,

each of the three samples had different CR haplotypes (Appendix 4.1). This result indicates an

error, either in the CR haplotype identification or the mitogenome sequencing. It is not possible,

with the current analyses, to determine where the source of the error is.

Comparison of mitogenome sequences to CR mtDNA sequences for both species

provides a better assessment of the differences in variability between beluga and narwhal (Table

4.4). Though far fewer haplotypes have been observed in narwhals using CR sequence as

compared to beluga, the number of samples that have been sequenced is also considerably less -

approximately 1:6. However, the mitogenome analyses of a similar number of samples, both

selected from broad geographic ranges, still indicate that mtDNA diversity is much lower in

Page 260: Genetic diversity, population structure and phylogeography ...

242

narwhal as compared to beluga. While the number of haplotypes observed per sample between

the two species is more equitable (1:1.68 samples in narwhal; 1:1.23 samples in beluga), the

number of polymorphic sites in the narwhal samples is half the number found in beluga samples.

Within the geographic sample collections of beluga, the number of polymorphic sites and

diversity in mitogenomes had a wide range that was similar to the patterns observed in studies

examining CR mtDNA sequence. Samples from the St. Lawrence Estuary had the fewest number

of polymorphic sites, few parsimony informative sites (i.e. a variable position that occurs in at

least two of the sample sequences), and the smallest average number of nucleotide differences

(Table 4.2). Surprisingly, it was not the sample collections with the largest sample sizes (Grise

Fiord, Eastern Beaufort Sea, and Pangnirtung) that had the greatest amount of diversity. Samples

from the Belcher Islands (SQ) ice entrapments and western Hudson Bay, with sample sizes of

N=10 and N=8 respectively, had the largest number of variable positions, parsimony informative

sites and number of haplotypes. However, the most diversity was observed in the Long Island

samples, which included only three samples but had 146 variable positions, the highest

nucleotide diversity, and the largest average number of nucleotide differences. This result

supports previous conclusions (Chapter 2) that beluga samples harvested from Long Island are

from a mixture of different beluga stocks in the south and eastern Hudson Bay area.

Significantly negative D and Fs values are an indication of demographic population

expansion. Neither the Tajima’s D statistic nor Fu’s Fs was significant for any of the beluga

sample collections (all probabilities > 0.05) (Table 4.2). These results are not able to reject a null

hypothesis of a constant population size, although neutrality tests such as these have been shown

to have weak power when sampling occurs too early (before substantial population growth has

occurred) or too late (after the population has reached a steady size) (Fu, 1997).

Page 261: Genetic diversity, population structure and phylogeography ...

243

Table 4.3. Summary of beluga mitogenome haplotypes found in multiple samples.

Haplotype Sample ID Sample Collection Putative Stock

1 11SQE1001 Belcher Islands ice entrapments Southern Hudson Bay or James Bay

11SQE1006 Belcher Islands ice entrapments Southern Hudson Bay or James Bay

6 05KI013 Kendall Island Eastern Beaufort Sea

89HSKY027 Husky Lakes ice entrapment Eastern Beaufort Sea

89TUK009 Tuktoyaktuk Eastern Beaufort Sea

7 05PA016 Paulatuk Eastern Beaufort Sea

92TUK015 Tuktoyaktuk Eastern Beaufort Sea

8 94HI015 Hendrickson Island Eastern Beaufort Sea

04KI002 Kendall Island Eastern Beaufort Sea

92SP006 Shingle Point Eastern Beaufort Sea

11 11SQE1002 Belcher Islands ice entrapments Southern Hudson Bay or James Bay

11SQE1004 Belcher Islands ice entrapments Southern Hudson Bay or James Bay

16 97IG251 Igloolik Migratory (unknown sources)

04SQE012 Belcher Islands ice entrapments Southern Hudson Bay or James Bay

19 00GF1034 Grise Fiord Eastern High Arctic

01IG1004 Igloolik Migratory (unknown sources)

33 89HSKY015 Husky Lakes ice entrapment Eastern Beaufort Sea

99EWF007 East Whitefish Station Eastern Beaufort Sea

93WWF001 West Whitefish Station Eastern Beaufort Sea

41 00GF1044 Grise Fiord Eastern High Arctic

99CI026 Cunningham Inlet Central High Arctic

45 95PG539 Pangnirtung Pangnirtung

02PG002 Pangnirtung Pangnirtung

51 04SQE019 Belcher Islands ice entrapments Southern Hudson Bay or James Bay

05SQE111 Belcher Islands ice entrapments Southern Hudson Bay or James Bay

54 94HI003 Hendrickson Island Eastern Beaufort Sea

93SP010 Shingle Point Eastern Beaufort Sea

Page 262: Genetic diversity, population structure and phylogeography ...

244

Haplotype Sample ID Sample Collection Putative Stock

59 84GF005 Grise Fiord Eastern High Arctic

84GF006 Grise Fiord Eastern High Arctic

77 98SLE003 St. Lawrence Estuary St. Lawrence Estuary

03AR1074 Arviat Western Hudson Bay

03AR1056 Arviat Western Hudson Bay

Page 263: Genetic diversity, population structure and phylogeography ...

245

Table 4.4. Comparison of mtDNA control region (CR) sequence information to complete

mitochondrial genomes for beluga and narwhal.

Data Information Beluga Narwhal

Control region (CR) sequence (bp) 609* 501**

Number of sequences 2500 433

Total number of variable sites 39 16

Number of CR haplotypes 83 22

Number of haplotypes per sample 1:30 1:20

complete mitogenome sequence

(bp) 16,385 16,381

Number of mitogenomes 106 94

Total number variable sites 305 151

Number of haplotypes 86 56

Number of haplotypes per sample 1:1.23 1:1.68

*CR data for beluga from Chapter 2,

this thesis. **CR data for narwhal from de March et al. 2003.

Page 264: Genetic diversity, population structure and phylogeography ...

246

4.3.3 Phylogenetics of complete mitogenome sequences of beluga and narwhal

Complete mitochondrial genomes were used to infer detailed phylogenies for beluga

samples and a general phylogeny for the narwhal samples. Neighbour-joining (NJ) trees

constructed for beluga and narwhal reflected the differences in mtDNA diversity observed in

samples from these species (Section 4.3.2). The narwhal NJ tree revealed strong bootstrap

support between the narwhal samples and outgroups of beluga and finless porpoise (Figure 4.5).

However, there was little support of clades forming among any of the narwhal geographic

sample collections, including the samples from Repulse Bay which are currently considered to be

a separate stock and population as compared to the Baffin Bay (all other locations) narwhal (de

March et al. 2003, Petersen et al. 2011).

This phylogenetic result for narwhal is contrasted by the NJ tree inferred from beluga

mitogenome sequences (Figure 4.6). Three major clades were identified with 100% bootstrap

support (A, B, and C) and within clade A, two clades are identified with bootstrap support

greater than 80% (83% support for clade A1 and 88% support for clade A2). The main clades

generally correspond to the haplogroups identified with CR mtDNA sequences (Chapter 2):

clade A overlaps with Haplogroup 1A and is dominated by Hudson Bay, High Arctic and

Cumberland samples; clade B with Haplogroup 1B, dominated by Eastern Beaufort Sea samples;

and clade C with haplogroup 2 dominated by St. Lawrence Estuary, Eastern Hudson Bay, and

Belcher Islands (SQ) entrapment samples. However, Haplogroup 1A, which was found in high

proportions across all geographic sample collections except the St. Lawrence Estuary, has been

split into two clades with the mitogenome analyses. Clade A1 is a smaller cluster of samples,

including mostly Eastern Beaufort Sea samples, the beluga reference sequence and two

Pangnirtung samples. Clade A2 continues to reflect a broad geographic collection of samples

Page 265: Genetic diversity, population structure and phylogeography ...

247

similar to Haplogroup 1A. However, within clade A2, the remaining eastern Beaufort Sea

samples group into two distinct clusters with relatively high bootstrap support (90% and 69%).

Similarly, there are several groups of samples that cluster together in clade B which

correspond to geographic sample collections. Cluster B1 is composed of almost all Grise Fiord

samples, with a single Cunningham Inlet sample (also from the High Arctic) included. Cluster

B2 has a similar composition to clade A1 with a group of Eastern Beaufort Sea samples

connected to a pair of Pangnirtung samples. Eastern Beaufort Sea samples comprise almost the

entire cluster B3, with a single sample from western Hudson Bay included. Cluster B4 is the

most diverse group in this clade, with samples from a broad geographic distribution.

Phylogenetic trees inferred using a Maximum-Likelihood (ML) and Bayesian Inference

approach (BI) resulted in trees with the same patterns as the NJ tree (Figure 4.7). The major

clades A1, A2, B, and C, were supported with high support (>70% ML/ >90% BI). Furthermore,

samples from the same geographic collections formed more distinct clusters within clade B. All

of the Eastern Beaufort Sea samples and one Western Hudson Bay sample clustered together in

B2, separate from the Pangnirtung samples. Cluster B3 contained eastern Beaufort Sea, High

Arctic and Western Hudson Bay samples, and cluster B4 was composed of all Hudson Bay

samples.

Nucleotide diversity was similar for all of the major clades (Table 4.2). Both Tajima’s D

and Fu’s FS tests of neutrality were significantly negative (P<0.005) for clade A1 and clade B.

These results strengthen the significantly negative FS neutrality tests for samples from Hudson

Bay, the High Arctic and Cumberland Sound observed with CR mtDNA sequences (Table 2.6,

Chapter 2) and provided further evidence of recent demographic population expansion in these

areas. Similarly, neither CR mtDNA Haplogroup 2 nor mitogenome clade C had significantly

Page 266: Genetic diversity, population structure and phylogeography ...

248

negative neutrality tests, and thus a null hypothesis of a constant population size cannot be

rejected.

Given that the mitogenome haplotypes represent mostly individuals within sample

collections in this study, exact tests of differentiation based on mitogenome haplotype

frequencies (determined in Arlequin ver. 3.5) are not particularly informative for population

structure. However, general patterns with larger groupings (i.e. narwhal compared to beluga, and

comparison of beluga clades) using pairwise FST allowed for some general comparisons to the

patterns of differentiation among sample collections with CR mtDNA (Table 2.3, Chapter 2). All

narwhal samples were strongly differentiated from all beluga groups (P = 0.000). Among the

beluga clades, all clades (A1, A2, B and C) were also significantly differentiated from each other

(P = 0.000). Clade C was most similar to the eastern Hudson Bay and St. Lawrence Estuary

sample collections, again indicating the congruence of this clade with Haplogroup 2 found using

CR mtDNA sequences. Similarly, clade A2 was most similar to the mid-Canadian Arctic sample

collections of Cunningham Inlet, Igloolik, and Pangnirtung which aligns Haplogroup 1A. Clades

A1 and B were generally different from all of the individual sample collections and may reflect a

refinement of Haplogroup 1B into two groups. Given that these two clades also were the ones

identified by neutrality tests to have undergone recent population expansion, this may add

support the patterns of post-glacial expansion from different refugial populations into contact

zones discussed in Chapter 2.

The median-joining (MJ) phylogenetic network for the broad Canada-wide geographic

beluga mitogenomes highlights a deep divergence among clades A, B and C (Figure 4.8). Clades

A and B, which are separated by 28 nucleotide changes, show a closer evolutionary relationship

than clades B and C, which are separated by 138 nucleotide differences. These are much larger

Page 267: Genetic diversity, population structure and phylogeography ...

249

differences than what was observed among Haplogroups A1, A2 and B in the CR mtDNA

network of the same geographic distribution of samples (Figure 2.8, Chapter 2). In this network,

the majority of the haplotypes differed by one nucleotide change, with the largest distance being

three nucleotide differences between Haplogroup 2 (corresponding to mitogenome clade C) and

the Haplogroups 1A and 1B (mitogenome clades A and B). Thus, the CR mtDNA haplogroups

from Chapter 2 and the mitogenome clades in this study have the same patterns, but on a

different scale.

The MJ phylogenetic network for the Eastern Beaufort Sea (EBS) samples confirms the

presence of three clusters of samples revealed by the ML and BI trees (Figure 4.9). The samples

that cluster in group B2 are separated by 27 nucleotide differences from the clusters in clade A2

and A1, which are separated by six nucleotide differences. However, within these clusters, there

are no fine-scale geographic patterns related to locations where beluga samples were collected.

Page 268: Genetic diversity, population structure and phylogeography ...

250

Figure 4.5. Bootstrap (1000 replicates) consensus

Neighbour-Joining tree inferred from complete

narwhal mitochondrial genomes. Representative

samples of beluga and two species of finless

porpoise (IP = Indo-Pacific; NR= Narrow-Ridged)

were used as outgroups. Nodes with bootstrap

support >60% (red numbers at nodes) are indicated.

Coloured symbols indicate samples from the same

locations.

Page 269: Genetic diversity, population structure and phylogeography ...

251

B1

A2

A1

A

C

B

B4

B3

B2

Figure 4.6. Bootstrap (1000 replicates) consensus

Neighbour-Joining tree inferred from complete

beluga mitochondrial genomes. Representative

samples of narwhal and two species of finless

porpoise (IP = Indo-Pacific; NR= Narrow-

Ridged) were used as outgroups. Branch support

>60% indicated by red numbers at nodes. Beluga

mtDNA clades identified with high bootstrap

support (>80%) are identified. Clusters within

Clade B are numbered. Coloured symbols

indicate samples from the same locations.

Page 270: Genetic diversity, population structure and phylogeography ...

252

Figure 4.7. Maximum Likelihood tree inferred from complete beluga mitochondrial genomes split into

two parts to highlight sample membership in clades. Significant support for the major clades are indicated

by red numbers adjacent to nodes. The first number indicates % bootstrap support based on 1000

replicates using ML approach, and the second number indicates Bayesian posterior probabilities resulting

from Bayesian Inference approach. Coloured symbols indicate samples from the same location.

Part A. Clade C:

71/100

To

Figure

Part B

C

Page 271: Genetic diversity, population structure and phylogeography ...

253

Figure 4.7, Part B. Clade A (A1 and A2) and Clade B. Clusters within Clade B are numbered 1 –

4 (Coloured symbols indicate samples from the same location):

B To

Figure

Part A

100/100

98/96

88/96

A2

A1

74/95

B1

B2

B3

B4

100/100

Page 272: Genetic diversity, population structure and phylogeography ...

254

Figure 4.8. Median-joining phylogenetic network for Canada-wide geographic beluga mitochondrial

genome haplotypes (Table 4.1). Hendrickson Island samples only were used from E. Beaufort Sea stock

(total sample N=9). Each circle (node) represents an individual haplotype and the size of the node is

proportional to the overall frequency of occurrence of the haplotype in the complete dataset. Nodes are

coloured according to geographic origin of the samples (for location abbreviations, see Figure 4.1). Red

numbers indicate the number of nucleotide changes between haplotypes. The network was reconstructed

using Network ver. 5 (Fluxus Technology 2016).

Page 273: Genetic diversity, population structure and phylogeography ...

255

Figure 4.9. Median-joining phylogenetic network for Eastern Beaufort Sea beluga mitochondrial genome

haplotypes (Table 4.1). Each circle (node) represents an individual haplotype and the size of the node is

proportional to the overall frequency of occurrence of the haplotype in the overall dataset. Nodes are

colored according to geographic origin of the samples. Red numbers indicate the number of nucleotide

changes between haplotypes. The network was reconstructed using Network ver. 5 (Fluxus Technology

2016).

Page 274: Genetic diversity, population structure and phylogeography ...

256

4.3.4 Mitogenome codon diversity and signatures of selection

Concatenated mitogenome sequences for beluga protein-coding genes (PCGs) on the

heavy strand resulted in 12 genes distributed in 10,823bp of sequence (Figure 4.10). The greatest

amount of nucleotide diversity in gene codons was found in the ATP8 gene, and the least amount

of diversity was in the ND4L gene (Table 4.5). Ratios of the rate of nonsynonymous

substitutions to the rate of synonymous substitutions (KA/KS estimates) for each of the 12 beluga

PCGs indicate the widespread influence of negative, or purifying selection (ω < 1) on all

mitochondrial genes. These effects were strongest in the ND1, CO1 and CO2 genes.

The McDonald-Krietman test (with the use of 10 narwhal mitogenome sequences as an

outgroup for interspecific analyses) indicated slight departures from a neutral prediction for the

ratio of amino acid replacement variation to synonymous variation within species compared to

the ratio of replacement to synonymous divergence between species (Table 4.6). The Neutrality

Index was slightly less than 1.0 for clades A1, A2 and clade B, but was slightly larger than 1.0

for clade C and the beluga samples overall. None of these departures was statistically significant

using a Fisher exact test (P>0.05) (Table 4.6).

A codon-based Z-test of selection rejected a null hypothesis of neutrality (dN = dS) for

clade A2 only, with a significantly negative value (-3.026, P=0.003) indicating that this beluga

mtDNA clade has undergone negative (purifying) selection.

The results of all tests for positive selection on beluga mtDNA sites and phylogenetic

branches of the beluga ML tree using the HyPhy software (Kosakovsky Pond et al. 2005) did not

indicate statistically significant evidence of positive selection on branches (branch REL method)

or sites (single likelihood ancestor counting method or SLAC, multirate FEL).

Page 275: Genetic diversity, population structure and phylogeography ...

257

Figure 4.10. Linear map of complete beluga mitochondrial genome. Protein coding regions are indicated

by green arrows. All protein coding genes (N=13) are present on the heavy (H) strand, with the exception

of ND6, which is located on the light (L) strand. Multi-colored boxes below each strand indicate location

of open reading frames (ORFs) indicating codons (excluding stop codons) of the top strand (first three

lines) and the bottem strand (second three lines). Mitogenome information from Kim et al. (2017) was

imported into Lasergene SeqBuilder software (DNASTAR) and the vertebrate mtDNA coding table was

applied to create map.

Page 276: Genetic diversity, population structure and phylogeography ...

258

Table 4.5. Intra- and interspecific diversity and divergence in mtDNA protein-coding genes of beluga

compared to narwhal. Abbreviations: π = intraspecific nucleotide diversity of beluga genes; Dxy =

interspecific nucleotide divergence of beluga genes compared to narwhal genes; π(a)/π(s) = intraspecific

ratio of nonsynonymous to synonymous mutations at nonsynonymous and synonymous sites in beluga

genes; KA/KS = interspecific ratio of rates of nonsynonymous to rates of synonymous mutations at

nonsynonymous and synonymous sites in beluga genes compared to narwhal genes.

Gene Length (bp) # of Codons π Dxy π(a)/π(s) KA/KS

ND1 957 319 0.003 0.060 0.032 0.055

ND2 1041 347 0.004 0.068 0.041 0.072

CO1 1551 517 0.003 0.068 0.049 0.039

CO2 684 228 0.003 0.053 0.121 0.051

ATP8 201 67 0.006 0.056 0.120 0.248

ATP6 681 227 0.004 0.063 0.182 0.113

CO3 785 261 0.004 0.068 0.061 0.102

ND3 346 115 0.003 0.094 0.117 0.091

ND4L 297 99 0.001 0.065 0.029 0.076

ND4 1378 458 0.003 0.075 0.094 0.078

ND5 1821 606 0.004 0.065 0.243 0.087

CYTB 1140 380 0.003 0.067 0.194 0.079

Table 4.6. McDonald-Kreitman test to evaluate departures from selective neutrality in mtDNA protein-

coding genes among beluga clades. Interspecific divergence was estimated using comparison to narwhal

samples (N=10). Abbreviations: PN = Number of non-synonymous substitutions fixed within beluga; PS =

Number of synonymous substitutions fixed within beluga; dN = Number of non-synonymous substitutions

fixed between species; dS = Number of synonymous substitutions fixed between species; ns = not

significant.

Beluga

Clade

Number of

Samples

PN PS dN dS Neutrality

Index (NI)

P-value

(Fisher exact

test)

Clade A1 32 14 71 144 557 0.763 0.396 ns

Clade A2 6 12 67 145 553 0.683 0.301 ns

Clade B 39 19 82 143 552 0.894 0.678 ns

Clade C 10 17 58 141 552 1.147 0.652 ns

All Beluga 87 61 212 126 501 1.146 0.475 ns

Page 277: Genetic diversity, population structure and phylogeography ...

259

4.4 Discussion

This is the first study to use complete mitochondrial genome sequences to study

populations of beluga whales and move beyond the use of low coverage markers (e.g. restriction

fragment analyses and control region mtDNA sequence) for the investigation of questions about

population structure, stock discrimination, and phylogenetics (e.g. Brennin et al. 1997, Brown

Gladden et al. 1997, de March and Postma 2003, Turgeon et al. 2012, O’Corry-Crowe et al.

2015). The laboratory protocols and workflow specific to beluga and narwhal templates

developed for this thesis provide a means to produce large amounts of mitogenome data for

significant numbers of samples using an Ion Torrent PGM next-generation sequencer (NGS) in a

relatively cost- and time-effective manner. The recent publication of an Illumina HiSeq 4000

(Illumina, San Diego, CA, USA) reference mitogenome sequence for beluga (Kim et al. 2017)

makes the assembly of whole mitogenome sequences even more reliable for use in a variety of

beluga studies to infer evolutionary relationships and patterns. However, the results of this study

do provide some cautions about using these methods for population mitogenomic analyses of

beluga samples.

As compared to other NGS sequencing platforms, Ion Torrent data is known to have

several shortcomings (Lui et al. 2012, Loman et al. 2012, Quail et al. 2012). First and foremost is

the high rate of insertion and deletion (indel) errors that occur in homopolymeric regions, which

are segments of sequence that are characterized by runs of the same base (Merriman et al. 2012,

Bragg et al. 2013). In these regions, the PGM reads have been found to have many

inconsistencies in homopolymers larger than eight bases long, and problems generating reads at

all for homopolymer regions longer than 14 bases (Quail et al. 2012). It is important to note,

though, that problems with long homopolymer regions occur with every NGS platform,

Page 278: Genetic diversity, population structure and phylogeography ...

260

especially for GC-rich templates (Ross et al. 2013). The largest impact of these errors for

population genomic studies is on the identification of nucleotide variants leading to

polymorphism (Bragg et al. 2013), which is the basis for identifying haplotypes as was done in

this study for belugas and narwhals. However, higher read coverage can compensate for these

types of errors and make it easier to distinguish true variants from PGM indels (Bragg et al.

2013). For the beluga and narwhal samples sequenced for this study, read depth and coverage

were very high; thus, the identification of indel errors was quite clear. Also, as these were mostly

intraspecific analyses, insertion and deletions were not likely to be found in the alignments of

same-species mitogenomes and thus could be eliminated with confidence. True indels were more

important for the alignments including the finless porpoise (Neophocaena sp.) outgoups during

phylogenetic analyses and were examined carefully.

Since sequence editing for both beluga and narwhal samples was done with an alignment

of all samples, conservative decisions on the veracity of variable positions were made

consistently across all samples. This should have maintained the integrity of haplotype

identification, phylogenetic comparisons, and the identification of synonymous and

nonsynonymous substitutions during analyses for the objectives of this study, but the pursuit of

future population mitogenomic analyses will require more quality controls. The use of replicates

is an essential approach for the identification of high frequency indel errors that could be

mistaken for polymorphisms (Bragg et al. 2013). Indeed, the inconsistencies for the set of three

diverse geographic samples that shared the same mitogenome haplotype, but had three different

CR haplotypes (Appendix 4.1), were difficult to interpret in the absence of any knowledge of the

error rate for mitogenome sequences in the study. Most of the population mitogenomic literature

reviewed for this study did not contain explicit reporting on sequence error rates. The one

Page 279: Genetic diversity, population structure and phylogeography ...

261

exception reported a sequence error rate based on full mitogenome replicate sequencing of

approximately 5% of the total number of samples (Morin et al. 2010). An examination of

complete or near-complete mitogenome sequences for domestic animals publicly available from

GenBank found a considerable error rate (14.5%) and underscores the need for rigorous error

assessment of data (Shi et al. 2014). In addition to replicates, detailed phylogenetic analysis of

raw data may also allow for the detection of mitogenome sequence errors due to incomplete

mutation patterns, poor quality template, contamination and sequencing artefacts (Duleba et al.

2015).

The main objective of this study was to assess the potential of complete mitogenome

sequences to improve the resolution of evolutionary relationships among beluga samples

throughout the species’ Canadian range. This type of genomic information may have the power

to “improve traditional conservation genetic inferences and provide qualitatively novel insights”

(Shafer et al. 2014) for belugas. Though the highest rate of sequence variability was still located

in the traditional control region (CR) sequence mtDNA marker (Figure 4.4), variation over the

complete mitogenome provided greater resolution of the beluga sample collection haplotype

diversity (Appendix 4.1, Table 4.2) and the consistent identification of clades in all the inferred

trees across different phylogenetic approaches (Figures 4.6 and 4.7). Furthermore, there was

higher statistical support for most nodes in the consensus tree. The mitogenome phylogenetic

relationships of geographic samples confirmed and, in many cases, strengthened the

identification of beluga stocks and their evolutionary connections described by CR mtDNA

sequence haplotypes (Chapter 2). Clusters of samples from the High Arctic (B1), the Eastern

Beaufort Sea (B2) and Hudson Bay (B4) were distinguished within clade B (Figure 4.7), and the

Page 280: Genetic diversity, population structure and phylogeography ...

262

most likely source stock identified for migratory sample collections (e.g. Igloolik clustered with

High Arctic samples in clade A1).

The use of whole mitogenome sequencing identified three distinct clusters within the

EBS beluga samples that were in three different clades: A1, A2 and B2 (Figure 4.7). This result

indicates the presence of fine-scale structure within the stock, but the clusters did not correspond

to spatially distinct sample collections. This lack of geographic mtDNA genome structure is a

similar result to the CR mtDNA sequence analyses of EBS samples in Chapter 3 of this thesis.

However, clustering analysis based on nuclear DNA of female samples resulted in three clusters

of genetically-related individuals found in the overall nearshore area (Figure 3.12B) that support

the idea that female EBS belugas form moderate bonds with other females and that there is

female philopatry to the EBS area. The phylogenetic results based on complete mitogenome

sequences further suggest that three distinct maternal lineages may also contribute to these

philopatric patterns.

In the broad Canada-wide geographic analyses of CR mtDNA sequences (Chapter 2), two

main haplogroups were consistently identified using different phylogenetic approaches. The

larger of these two groups, containing haplogroup 1 subgroups 1A and 1B, was characterized by

polytomies in the gene trees and star-like patterns in the median-joining network, most likely

resulting from recent, low levels of sequence divergence in areas of contact zones formed from

expansions of multiple post-glacial refugial populations (Chapter 2). Given the fairly long

generation time of belugas (13 years per generation), this period of recent expansion spans

relatively few generations on an evolutionary timescale (de March and Postma 2003). This most

likely has contributed to the short branches in the phylogenetic tree for haplotypes that belong to

these haplogroups, which is also a sign of incomplete lineage sorting (Maddison 1997, Maddison

Page 281: Genetic diversity, population structure and phylogeography ...

263

and Knowles 2006). It has been found that increasing the amount of mtDNA sequence can

clarify phylogenies in recently evolved species, but the amount of sequence needed to resolve an

internode depends on both the length of the internode and its depth in the phylogeny (DeFilippis

and Moore 2000). Whole mitogenome sequences generated for beluga samples in this study did

not completely resolve the polytomies, but given the short time frame for clade and lineage

divergence, some phylogenetic relationships among Canadian belugas may not be resolved with

this marker (e.g. Roos et al. 2011, Liedigk et al. 2015). However, with mitogenomic analyses,

the number of polytomies was reduced in beluga phylogenetic trees, and an increased number of

more geographically divergent clusters of samples were identified (Figure 4.7). Further

exploration of these mitogenomic patterns with increased sample sizes may continue to improve

the resolution and accuracy of phylogenetic comparisons among belugas. An increase in both the

number of loci used to estimate the phylogeny and the number of individuals sampled can yield

more accurate estimates of phylogenetic relationships despite incomplete lineage sorting

(Maddison and Knowles 2006). However, for shallower trees, sampling more individuals was

found to have the most impact (Maddison and Knowles 2006).

The median-joining (MJ) phylogenetic network of mitogenome haplotypes had

phylogeographic clustering patterns consistent with the CR mtDNA haplotype network in

Chapter 2 (Figure 2.8). However, the mitogenome network emphasized the divergence between

the three main clades, or haplogroups, of belugas (Figure 4.8). Once again, the samples from the

St. Lawrence Estuary, eastern Hudson Bay, and the Belcher Island ice entrapments (clade C, or

Haplogroup 2) were most distantly connected to the other two groups. The main improvement is

the distinction between clades A and B (CR Haplogroups 1A and 1B), which is much more

pronounced in the mitogenome network. However, mitogenome haplotype networks for belugas

Page 282: Genetic diversity, population structure and phylogeography ...

264

may have the most potential for resolving phylogenetic patterns at a finer geographic scale. CR

mtDNA haplotypes for Eastern Beaufort Sea (EBS) belugas (N=1032) were dominated by three

main haplotypes (Chapter 3, Figure 3.14). Overall haplotype diversity for these samples was

0.839, and they had an overall average number of nucleotide differences (k) of 3.91 (Table 3.3).

With complete mitogenome sequences, haplotype diversity in EBS beluga samples (N=38) was

0.978 and a k = 21.31 (Table 4.4). But while CR mtDNA haplotypes revealed no apparent

structure (Chapter 3, Figure 3.15), the mitogenome network for this sample collection revealed

two highly divergent maternal lineages, with evidence of a third lineage (Figure 4.6). This

pattern was also found in all of the phylogenetic trees (Figures 4.6 and 4.7) where clusters of

EBS belugas were formed within clades A1, A2 and B2/B3. This type of improved network

resolution and identification of discrete lineages based on mitogenomes has also been observed

in bobwhites (Colinus virginianus) (Halley et al. 2015) and brown bears (Keis et al. 2012). In

bobwhites, the neutrality tests and tests of adaptive evolution for mitogenome data were not

significant, indicating that the divergent maternal lineages that were identified in the MJ network

survived from pre-expansion to post-glacial expansion (Halley et al. 2015). Such analyses of an

increased sample size of EBS samples may also prove to be informative.

In contrast to belugas, whole mitogenome analyses of narwhal samples were not very

informative for identifying geographic clusters of samples that correspond to putative stock

designations. In fact, mitogenome phylogenetic analysis did not support differentiation among

different populations represented by northern Hudson Bay narwhal samples and Baffin Bay

narwhals (de March et al. 2003, Peterson et al. 2011). Control region sequencing analyses of

samples from these populations did not reveal private haplotypes, but instead, the population

differentiation was based on statistically significant differences in the ratios of haplotypes (de

Page 283: Genetic diversity, population structure and phylogeography ...

265

March et al. 2003). Given the resolution of highly shared narwhal CR haplotypes into more

diverse mitogenome haplotypes in this study, it would be worthwhile to increase the number of

samples and the geographic scale of samples for narwhals to see if patterns do emerge. Several

studies have shown that sampling protocols (sampling intensity and site design) can have a

significant influence on the results of genetic clustering analyses, especially when genetic

gradients are present (e.g. Schwartz and McKelvey 2009, Koen et al. 2013). In particular,

phylogenetic interpretations based on mtDNA may be biased “when limited sampling occurs in a

species harbouring multiple mtDNA types” (Ballard and Rand 2005). Mitogenome samples over

a broader geographic, perhaps global, distribution of narwhals would provide a better assessment

of mitogenome diversity in this species. Also, the CR sequence region could be removed from

narwhal mitogenomes to focus on the analysis of the most informative regions, which have been

shown to vary among taxa (Duchêne et al. 2011). As revealed in Figure 4.4, variation for the

narwhal samples peaked in regions spanning positions 4600-4900 (ND2 gene region) and

positions 12,500-12,800 (ND5 gene region). Analyses for indications of positive selection in

these regions would also be worthwhile. Evidence of adaptive evolution has been found in the

ND genes, particularly ND2 and ND5, in a number of mammalian species (da Fonseca et al.

2008). In sable (Martes zibellina, Malyarchuk et al. 2014) and African elephants (Loxondonta

sp., Finch et al. 2014), selection in the ND genes was suggested to be linked to metabolic

adaptations for particular environmental conditions and habitats.

Sperm whales, a more globally distributed cetacean with large populations, were found to

have overall low mitogenome diversity in samples representing the worldwide species diversity

(Alexander et al. 2012). The results were consistent with hypotheses that these whales underwent

a bottleneck or selective sweep in the recent past (72,800 to 137,400 years ago). Sperm whales

Page 284: Genetic diversity, population structure and phylogeography ...

266

occupy a specialized niche, and the characteristic deep foraging dives of sperm whales may have

influenced positive selection on the mtDNA protein-coding genes as a physiological and

environmental adaptation for this dive behaviour (Alexander et al. 2012). This supports a

hypothesis of a selective sweep as opposed to a population bottleneck due to commercial

whaling, though changes in prey availability may have also affected populations at some point in

the past (Alexander et al. 2012). A similar hypothesis about the influence of a selective sweep on

mtDNA diversity may apply to narwhals. This Arctic species of whale displays a lack of

behavioural plasticity, with highly predictable and consistent movement patterns, diving

behaviour, and habitat selection (Heide-Jørgensen et al. 2015). This has created a much narrower

ecological niche for narwhals as compared to belugas. This, along with a closer association with

ice and distribution at higher latitudes (thus colder temperatures), could be influencing adaptive

selection in narwhal mtDNA as has been proposed in early studies of MHC variability in High

Arctic whales (Murray et al. 1995) and similar to Antarctic killer whales (Foote et al. 2011c).

Despite its almost ubiquitous use for studies of molecular diversity, mtDNA sequencing,

which represents the analysis of a single locus, is not without weaknesses as a marker for

population genetic studies (e.g. Galtier et al. 2009a). Several of the hallmarks that have

distinguished mtDNA as the preferred marker for phylogenetic and evolutionary studies have

been challenged. For example, the assumption that mtDNA is solely maternally inherited has

been revised (Wolff et al. 2013). Instead, the paternal contribution of mtDNA gets cumulatively

diminished by a number of mechanisms and is thus limited to leakage effects that can still lead to

distinct mtDNA lineages within a zygote (Wolff et al. 2013). Also, mtDNA was thought to be

clonally inherited; however, genetic hitchhiking on the mitochondrial genome has been linked to

genetic variability, as well as selective sweeps of advantageous mutations (Ballard and Dean

Page 285: Genetic diversity, population structure and phylogeography ...

267

2001). Finally, the idea that mtDNA is effectively neutral needs to be considered in the context

of adaptive evolution which has been shown to influence within-species mtDNA diversity across

a wide range of species (Bazin et al. 2006). All of these factors need to be kept in mind when

pursuing mitogenome analyses, and armed with this revised perspective, whole mtDNA

sequence analysis for molecular ecology studies may offer new types of insights about

mitogenome evolution, adaptation, and processes associated with variation (Galtier et al. 2009a).

This study was not specifically designed to address questions about adaptive selection,

but basic approaches using the ratio of rates of nonsynonymous to rates of synonymous

substitutions were used to search for signals of selective pressure on the mitochondrial protein

coding genes (PCGs) in beluga samples. The results for each of the 12 beluga PCGs indicate the

widespread influence of negative, or purifying, selection (ω < 1) on all mitochondrial genes, with

the strongest signals in the ND1, CO1 and CO2 genes (Table 4.5). Given the importance of these

genes for the synthesis of cellular energy (in the form of ATP), purifying selection is essential to

remove deleterious mutations and maintain mitochondrial gene function (e.g. Meiklejohn et al.

2007, Castellana et al. 2011) and has much greater influence on mtDNA evolution than positive

selection (Bazin et al. 2006). Thus, it was not unexpected that given the objectives and design of

this study, no evidence of positive selection was found for the beluga clades (Table 4.6). Given

the similar migratory patterns of most of the Canadian beluga stocks, there is likely not enough

environmental variability represented by the samples analyzed to influence local physiological

adaptations in the mitogenome. Plus, as reviewed by Ballard and Melvin (2010), it may be too

early to determine “whether adaptive mutations in mtDNA are the rule or the exception to the

rule”. As with narwhal, comparisons of PCGs in mitogenomes of belugas from the global

distribution of the species, and less conservative analyses than were used in this study (such as

Page 286: Genetic diversity, population structure and phylogeography ...

268

analyses of physio-chemical changes in proteins resulting from amino acid replacements,

implemented in TREESAAP (Wooley et al. 2003), would offer a greater chance of detecting

signals of positive selection (e.g. Foote et al. 2011c, Malyarchuk et al. 2014, Finch et al. 2014,

Filipi et al. 2015). In addition, more detailed analyses of mutation rates in mtDNA PCGs will

need to consider the variability in mtDNA mutation rates across lineages and at different codon

positions (Galtier et al. 2009b, Duchêne et al. 2011), though the need for partitioning may not be

required if the most informative genes only are used (Duchêne et al. 2011).

Ion Torrent costs continue to decrease, chemistries and technologies continue to improve

sequence coverage and error biases, and the workflow is evolving to become more efficient and

less time-consuming. Analyses of complete mitochondrial genomes for larger numbers of

samples should continue to be pursued for population studies of belugas and narwhals. At the

intra-population level, the increased amount of sequence, including the coding regions, offer the

potential for increased resolution of fine-scale structure and stronger inference of the

evolutionary histories among different groups (e.g. Morin et al. 2010, Keis et al. 2013). This will

allow for continued evaluation of the geographic areas used to define putative beluga and

narwhal stocks as the units for conservation and management. Shallow haplotype trees that are

geographically unstructured, such as the NJ tree inferred for the Canadian narwhal samples, can

be a result of recent population expansion and increase, recent or current gene flow, or possibly

due to purifying selection (Zink et al. 2003). Mitogenomic comparisons of populations across the

full global distribution of the species may provide better identification of ancestral haplotypes,

better resolution of lineage sorting, and the estimation of divergence times. Furthermore, a wider

geographic range of samples may also allow for comparisons of populations with a wider

Page 287: Genetic diversity, population structure and phylogeography ...

269

ecological variation that may have a greater potential to detect signals of selection and

adaptation.

4.5 Acknowledgements

Most of the samples for genetic analyses in this study were collected by hunters from

communities in the Inuvialuit Settlement Region (ISR), Nunavut, and Nunavik during harvest

monitoring programs funded by the Fisheries Joint Management Committee (FJMC), the

Nunavut Wildlife Management Board (NWMB), the Nunavik Inuit Land Claims Agreement

(NICLA), and Fisheries and Oceans Canada. Countless scientific studies would not be possible

without these partnerships. The efforts of many dedicated individuals to coordinate these sample

collections and other sampling endeavours, the shipping and sorting of sample kits, archiving

materials in an organized system and maintaining sample information databases are immensely

appreciated. Denise Tenkula (DFO Central and Arctic) and Craig McFarlane provided laboratory

assistance for the development of the Ion Torrent mitogenome sequencing workflow and sample

processing. Funding was provided by Fisheries and Oceans Canada (DFO) Central and Arctic

Region, the DFO Genomics Research and Development Initiative (GRDI) and the DFO Nunavut

Implementation Fund (NIF).

4.6 References

Alexander, A., Steel, D., Slikas, B., Hoekzema, K., Carraher, C., Parks, M., Cronn, R. and Baker,

C.S. 2013. Low diversity in the mitogenome of sperm whales revealed by next-generation

sequencing. Genome Biology and Evolution 5: 113-129.

Allendorf, F.W. 2017. Genetics and the conservation of natural populations: allozymes to

genomes. Molecular Ecology 26: 420-430.

Page 288: Genetic diversity, population structure and phylogeography ...

270

Anisimova, M., and Gascuel, O. 2006. Approximate likelihood-ratio test for branches: a fast,

accurate, and powerful alternative. Systematic Biology 55: 539-552.

Archer, F.I., Morin, P.A., Hancock-Hanser, B.L., Robertson, K.M., Leslie, M.S., Bérubé, M.,

Panigada, S. and Taylor, B.L. 2013. Mitogenomic phylogenetics of fin whales (Balaenoptera

physalus spp.): genetic evidence for revision of subspecies. PLoS One 8: p.e63396.

Arnason, U., Gullberg, A. and Janke, A. 2004. Mitogenomic analyses provide new insights into

cetacean origin and evolution. Gene 333: 27-34.

Avise, J.C. and Ellis, D. 1986. Mitochondrial DNA and the evolutionary genetics of higher

animals and discussion. Philosophical Transactions of the Royal Society of London B:

Biological Sciences 312: 325-342.

Avise, J.C., Arnold, J., Ball, R.M., Bermingham, E., Lamb, T., Neigel, J.E., Reeb, C.A. and

Saunders, N.C. 1987. Intraspecific phylogeography: the mitochondrial DNA bridge between

population genetics and systematics. Annual Review of Ecology and Systematics 18: 489-522.

Avise, J.C. 1989. Gene trees and organismal histories: a phylogenetic approach to population

biology. Evolution 43: 1192-1208.

Avise, J.C., Bowen, B.W. and Lamb, T. 1989. DNA fingerprints from hypervariable

mitochondrial genotypes. Molecular Biology and Evolution 6: 258-269.

Ballard, J.W.O. and Rand, D.M. 2005. The population biology of mitochondrial DNA and its

phylogenetic implications. Annual Review of Ecology, Evolution and Systematics 36: 621-642.

Ballard, J.W.O. and Melvin, R.G., 2010. Linking the mitochondrial genotype to the organismal

phenotype. Molecular Ecology 19: 1523-1539.

Ballard, J.W.O. and Pichaud, N. 2014. Mitochondrial DNA: more than an evolutionary

bystander. Functional Ecology 28: 218-231.

Bazin, E., Glémin, S. and Galtier, N. 2006. Population size does not influence mitochondrial

genetic diversity in animals. Science 312: 570-572.

Bragg, L.M., Stone, G., Butler, M.K., Hugenholtz, P. and Tyson, G.W. 2013. Shining a light on

dark sequencing: characterising errors in Ion Torrent PGM data. PLoS Computational Biology 9:

p.e1003031.

Page 289: Genetic diversity, population structure and phylogeography ...

271

Brennin, R., Murray, B.W., Friesen, M.K., Maiers (Postma), L.D., Clayton, J.W., and White,

B.N. 1997. Population genetic structure of beluga whales (Delphinapterus leucas): mitochondrial

DNA sequence variation within and among North American populations. Canadian Journal of

Zoology 75: 795-802.

Brown Gladden, J.G., Ferguson, M.M, and Clayton, J.W. 1997. Matriarchal genetic population

structure of North American beluga whales Delphinapterus leucas (Cetacea: Monodontidae).

Molecular Ecology 6: 1033-1046.

Brown Gladden, J.G., Ferguson, M.M., Friesen, M.K. and Clayton, J.W. 1999. Population

structure of North American beluga whales (Delphinapterus leucas) based on nuclear DNA

microsatellite variation and contrasted with the population structure revealed by mitochondrial

DNA variation. Molecular Ecology 8: 347-363.

Castellana, S., Vicario, S. and Saccone, C. 2011. Evolutionary patterns of the mitochondrial

genome in Metazoa: exploring the role of mutation and selection in mitochondrial protein–

coding Genes. Genome Biology and Evolution 3: 1067-1079.

Colbeck, G.J., Duchesne, P., Postma, L.D., Lesage, V., Hammill, M.O. and Turgeon, J., 2013.

Groups of related belugas (Delphinapterus leucas) travel together during their seasonal

migrations in and around Hudson Bay. Proceedings of the Royal Society of London B:

Biological Sciences 280: 20122552.

Consuegra, S., John, E., Verspoor, E. and De Leaniz, C.G. 2015. Patterns of natural selection

acting on the mitochondrial genome of a locally adapted fish species. Genetics Selection

Evolution 47: 58.

Corlett, R.T. 2017. A bigger toolbox: biotechnology in biodiversity conservation. Trends in

Biotechnology 35: 55-65.

da Fonseca, R.R., Johnson, W.E., O'Brien, S.J., Ramos, M.J. and Antunes, A. 2008. The adaptive

evolution of the mammalian mitochondrial genome. BMC Genomics 9: 119.

de March, B.G.E and Postma, L.D. 2003. Molecular stock discrimination of belugas

(Delphinapterus leucas) hunted in eastern Hudson Bay, northern Quebec, Hudson Strait, and

Sanikiluaq (Belcher Islands), Canada, and comparisons to adjacent populations. Arctic 56: 111-

124.

Page 290: Genetic diversity, population structure and phylogeography ...

272

de March, B.G.E., Tenkula, D.A, and Postma, L.D. 2003. Molecular genetics of narwhal

(Monodon monoceros) from Canada and West Greenland (1982-2001). Canadian Science

Advisory Secretariat Research Document 2003/080, i + 19pp.

Duchêne, S., Archer, F.I., Vilstrup, J., Caballero, S. and Morin, P.A. 2011. Mitogenome

phylogenetics: the impact of using single regions and partitioning schemes on topology,

substitution rate and divergence time estimation. PLoS One 6: p.e27138.

Duleba, A., Skonieczna, K., Bogdanowicz, W., Malyarchuk, B. and Grzybowski, T. 2015.

Complete mitochondrial genome database and standardized classification system for Canis lupus

familiaris. Forensic Science International: Genetics 19: 123-129.

Ewing, B., Hillier, L., Wendl, M.C. and Green, P. 1998. Base-calling of automated sequencer

traces using Phred. I. Accuracy assessment. Genome Research 8: 175-185.

Excoffier, L. and Lischer, H.E. 2010. Arlequin suite ver 3.5: a new series of programs to perform

population genetics analyses under Linux and Windows. Molecular Ecology Resources 10: 564-

567.

Faircloth, B., and Glenn, T. 2011. https://ethanomics.files.wordpress.com/2012/08/serapure_v2-

2.pdf

Filipi, K., Marková, S., Searle, J.B. and Kotlík, P. 2015. Mitogenomic phylogenetics of the bank

vole Clethrionomys glareolus, a model system for studying end-glacial colonization of Europe.

Molecular Phylogenetics and Evolution 82: 245-257.

Finch, T.M., Zhao, N., Korkin, D., Frederick, K.H. and Eggert, L.S. 2014. Evidence of positive

selection in mitochondrial complexes I and V of the African elephant. PLoS One 9: p.e92587.

Fluxus Engineering, 2015. Network 4.6 User Guide http://www.fluxus-

engineering.com/Network4600_user_guide.pdf

Foote, A.D., Morin, P.A., Durban, J.W., Willerslev, E., Orlando, L. and Gilbert, M.T.P. 2011a.

Out of the Pacific and back again: insights into the matrilineal history of Pacific killer whale

ecotypes. PLoS One 6: p.e24980.

Foote, A.D., Vilstrup, J.T., De Stephanis, R., Verborgh, P., Abel Neilsen, S.C., Deaville, R.,

Kleivane, L., Martin, V., Miller, P.J., Øien, N.,…., and Piertney, S.B. 2011b. Genetic

differentiation among North Atlantic killer whale populations. Molecular Ecology 20: 629-641.

Page 291: Genetic diversity, population structure and phylogeography ...

273

Foote, A.D., Morin, P.A., Durban, J.W., Pitman, R.L., Wade, P., Willerslev, E., Gilbert, M.T.P.

and da Fonseca, R.R. 2011c. Positive selection on the killer whale mitogenome. Biology Letters

7: 116-118.

Foulds, L.R., Hendy, M.D., and Penny, D. 1979. A graph theoretic approach to the development

of minimal phylogenetic trees. Journal of Molecular Evolution 13: 127-149.

Fu, Y.-X. 1997. Statistical tests of neutrality against population growth, hitchhiking and

background selection. Genetics 147: 915-925.

Galtier, N., Nabholz, B., Glémin, S. and Hurst, G.D.D. 2009a. Mitochondrial DNA as a marker

of molecular diversity: a reappraisal. Molecular Ecology 18: 4541-4550.

Galtier, N., Jobson, R.W., Nabholz, B., Glémin, S. and Blier, P.U. 2009b. Mitochondrial whims:

metabolic rate, longevity and the rate of molecular evolution. Biology Letters 5: 413-416.

Guidon, S. 2012. PhyML – Manual.

http://www.atgcmontpellier.fr/download/papers/phyml_manual_2012.pdf)

Guindon, S., Dufayard, J.F., Lefort, V., Anisimova, M., Hordijk, W. and Gascuel, O. 2010. New

algorithms and methods to estimate maximum-likelihood phylogenies: assessing the

performance of PhyML 3.0. Systematic Biology 59: 307-321.

Hall, B. 2011. Phylogenetic Trees Made Easy. Sinauer Associates Inc. Publishers,

Massachusetts, U.S.A., 282pp.

Heide‐Jørgensen, M.P., Nielsen, N.H., Hansen, R.G., Schmidt, H.C., Blackwell, S.B. and

Jørgensen, O.A. 2015. The predictable narwhal: satellite tracking shows behavioural similarities

between isolated subpopulations. Journal of Zoology 297: 54-65.

Helbig, R., Boag, P.T. and White, B.N., 1989. Stock identification of beluga whales

(Delphinapterus leucas) using mitochondrial DNA markers: Preliminary results. Musk-Ox 37:

122-128.

Higdon, J.W., and Ferguson, S.H. 2017. Database of aerial surveys and abundance extimates for

beluga whales (Delphinapterus leucas) and narwhals (Monodon monceros) in the Canadian

Arctic. Canadian Technical Report of Fisheries and Aquatic Sciences 3211: v + 48p.

Hordijk, W., and Gascuel, O. 2005. Improving the efficiency of SPR moves in phylogenetic tree

search methods based on maximum likelihood. Bioinformatics 21: 4338-4347.

Page 292: Genetic diversity, population structure and phylogeography ...

274

Huse, S.M., Huber, J.A., Morrison, H.G., Sogin, M.L. and Welch, D.M. 2007. Accuracy and

quality of massively parallel DNA pyrosequencing. Genome Biology 8: R143.

Keis, M., Remm, J., Ho, S.Y., Davison, J., Tammeleht, E., Tumanov, I.L., Saveljev, A.P.,

Männil, P., Kojola, I., Abramov, A.V. and Margus, T. 2013. Complete mitochondrial genomes

and a novel spatial genetic method reveal cryptic phylogeographical structure and migration

patterns among brown bears in north‐western Eurasia. Journal of Biogeography 40: 915-927.

Kim, J.H., Lee, Y.R., Koh, J.R., Jun, J.W., Giri, S.S., Kim, H.J., Chi, C., Yun, S., Kim, S.G.,

Kim, S.W. and Kim, H.K. 2017. Complete mitochondrial genome of the beluga whale

Delphinapterus leucas (Cetacea: Monodontidae). Conservation Genetics Resources: DOI

10.1007/s12686-017-0705-5.

Koen, E.L., Bowman, J., Garroway, C.J. and Wilson, P.J. 2013. The sensitivity of genetic

connectivity measures to unsampled and under-sampled sites. PLoS One 8: p.e56204.

Kosakovsky Pond, S.L.K. and Frost, S.D., 2005. Not so different after all: a comparison of

methods for detecting amino acid sites under selection. Molecular Biology and Evolution 22:

1208-1222.

Kosakovsky Pond, S.L.K., Frost, S.D.W., and Muse, S.V. 2005. HyPhy: hypothesis testing using

phylogenies. Bioinformatics 21: 676-679.

Li, M., Nordborg, M. and Li, L.M., 2004. Adjust quality scores from alignment and improve

sequencing accuracy. Nucleic Acids Research 32: 5183-5191.

Librado, P. and Rozas, J. 2009. DnaSP v5: A software for comprehensive analysis of DNA

polymorphism data. Bioinformatics 25: 1451-1452.

Lint, D.W., Clayton, J.W., Lillie, W.R. and Postma, L. 1990. Evolution and systematics of the

beluga whale, (Delphinapterus leucas) and other odontocetes: A molecular approach. In T.G.

Smith, D.G. St. Aubin, and J.R. Geraci (Eds.), Advances in Research on the Beluga Whale,

Delphinapterus leucas. Canadian Bulletin of Fisheries and Aquatic Sciences 224: 7-22.

Liu, L., Li, Y., Li, S., Hu, N., He, Y., Pong, R., Lin, D., Lu, L. and Law, M. 2012. Comparison

of next-generation sequencing systems. BioMed Research International, doi:

10.1155/2012/251364.

Page 293: Genetic diversity, population structure and phylogeography ...

275

Loman, N.J., Misra, R.V., Dallman, T.J., Constantinidou, C., Gharbia, S.E., Wain, J. and Pallen,

M.J. 2012. Performance comparison of benchtop high-throughput sequencing platforms. Nature

Biotechnology 30:434-439.

Maddison, W.P., 1997. Gene trees in species trees. Systematic Biology 46: 523-536.

Maddison, W.P. and Knowles, L.L. 2006. Inferring phylogeny despite incomplete lineage

sorting. Systematic Biology 55: 21-30.

Maiers (Postma), L.D., Friesen, M.K., Wiens, A.V., and Clayton, J.W. 1996. Use of DNA

microsatellites in beluga whale (Delphinapterus leucas) population genetics. Canadian Technical

Report or Fisheries and Aquatic Sciences 2115: iv + 17p.

Malyarchuk, B., Derenko, M. and Denisova, G. 2014. A mitogenomic phylogeny and genetic

history of sable (Martes zibellina). Gene 550: pp.56-67.

McDonald, J.H. and Kreitman, M. 1991. Adaptive protein evolution at the Adh locus in

Drosophila. Nature 351: 652-654.

McGowen, M.R., Spaulding, M. and Gatesy, J. 2009. Divergence date estimation and a

comprehensive molecular tree of extant cetaceans. Molecular Phylogenetics and Evolution 53:

891-906.

Meiklejohn, C.D., Montooth, K.L. and Rand, D.M. 2007. Positive and negative selection on the

mitochondrial genome. Trends in Genetics 23: 259-263.

Merriman, B., Torrent, I., Rothberg, J.M. and R&D Team. 2012. Progress in ion torrent

semiconductor chip based sequencing. Electrophoresis 33: 3397-3417.

Miller, M.A., Pfeiffer, W. and Schwartz, T. 2010, Creating the CIPRES Science Gateway for

inference of large phylogenetic trees. In Gateway Computing Environments Workshop (GCE),

2010 (pp. 1-8).

Morin, P.A., Archer, F.I., Foote, A.D., Vilstrup, J., Allen, E.E., Wade, P., Durban, J., Parsons,

K., Pitman, R., Li, L. and Bouffard, P. 2010. Complete mitochondrial genome phylogeographic

analysis of killer whales (Orcinus orca) indicates multiple species. Genome Research 20: 908-

916.

Moura, A.E., Nielsen, S.C.A, Vilstrup, J.T., Moreno-Mayar, J.V., Gray, H., Natoli, A., Möller,

L., and Hoelzel, A.R. 2013a. Recent diversification of a marine genus (Tursiops spp.) tracks

habitat preference and environmental change. Systematic Biology 62: 865-877.

Page 294: Genetic diversity, population structure and phylogeography ...

276

Moura, A.E., Nielsen, S.C.A, Vilstrup, J.T., Moreno-Mayar, J.V., Gray, H., Natoli, A., Möller,

L., and Hoelzel, A.R. 2013b. Data from: Recent diversification of a marine genus (Tursiops spp.)

tracks habitat preference and environmental change. Dryad Digital Repository.

http://dx.doi.org/10.5061/dryad.k501d

Moura, A.E., Kenny, J.G., Chaudhuri, R.R., Hughes, M.A., Reisinger, R.R., De Bruyn, P.J.N.,

Dahlheim, M.E., Hall, N. and Hoelzel, A.R. 2015. Phylogenomics of the killer whale indicates

ecotype divergence in sympatry. Heredity 114: 48-55.

Morin, P.A., Archer, F.I., Foote, A.D., Vilstrup, J., Allen, E.E., Wade, P., Durban, J., Parsons,

K., Pitman, R., Li, L. and Bouffard, P. 2010. Complete mitochondrial genome phylogeographic

analysis of killer whales (Orcinus orca) indicates multiple species. Genome Research 20: 908-

916.

Murray, B.W., Malik, S. and White, B.N., 1995. Sequence variation at the major

histocompatibility complex locus DQ beta in beluga whales (Delphinapterus leucas). Molecular

Biology and Evolution 12: 582-593.

Nei, M. and Gojobori, T. 1986. Simple methods for estimating the numbers of synonymous and

nonsynonymous nucleotide substitutions. Molecular Biology and Evolution 3: 418-426.

Nei, M. and Kumar, S. 2000. Molecular evolution and phylogenetics. Oxford University Press,

Inc., New York, USA.

Nichols, R. 2001. Gene trees and species trees are not the same. Trends in Ecology and

Evolution 16: 358-364.

Nozawa, M., Suzuki, Y. and Nei, M. 2009. Reliabilities of identifying positive selection by the

branch-site and the site-prediction methods. Proceedings of the National Academy of Sciences

USA 106: 6700-6705.

O’Corry-Crowe, G., Lydersen, C., Heide-Jørgensen, M.P., Hansen, L., Mukhametov, L.M.,

Dove, O. and Kovacs, K.M. 2010. Population genetic structure and evolutionary history of North

Atlantic beluga whales (Delphinapterus leucas) from West Greenland, Svalbard and the White

Sea. Polar Biology 33: 1179-1194.

O’Corry-Crowe, G.M., Lucey, W., Archer, F.I. and Mahoney, B. 2015. The genetic ecology and

population origins of the beluga whales, Delphinapterus leucas, of Yakutat Bay. Marine

Fisheries Review 77: 47-58.

Page 295: Genetic diversity, population structure and phylogeography ...

277

Pál, C., Papp, B. and Lercher, M.J. 2006. An integrated view of protein evolution. Nature

Reviews Genetics 7: 337-348.

Palsbøll, P., Heide-Jørgensen, M.P. and Dietz, R. 1997. Genetic studies of narwhals, Monodon

monoceros, from West and East Greenland. Heredity 78: 284-292.

Patenaude, N.J., Quinn, J.S., Beland, P., Kingsley, M. and White, B.N., 1994. Genetic variation

of the St. Lawrence beluga whale population assessed by DNA fingerprinting. Molecular

Ecology 3: 375-381.

Peakall, R., and Smouse, P.E. 2012. GenAlEx 6.5: genetic analysis in Excel. Population genetic

software for teaching and research - an update. Bioinformatics 28: 2537-2539.

Petersen, S.D., Tenkula, D. and Ferguson, S.H. 2011. Population Genetic Structure of Narwhal

(Monodon monoceros). DFO Canadian Science Advisory Secretariat Research Document

2011/021. vi + 20 p.

Poon, A.F., Frost, S.D. and Pond, S.L.K. 2009. Detecting signatures of selection from DNA

sequences using Datamonkey. Bioinformatics for DNA Sequence Analysis 53:163-183.

http://www.datamonkey.org/help/tutorial

Quail, M.A., Smith, M., Coupland, P., Otto, T.D., Harris, S.R., Connor, T.R., Bertoni, A.,

Swerdlow, H.P. and Gu, Y. 2012. A tale of three next generation sequencing platforms:

comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers. BMC Genomics

13: 341.

Rambaut, A., Suchard, M.A., Xie, D., and Drummond, A.J. 2014. Tracer v1.6. Available from

http://beast.bio.ed.ac.uk/Tracer

Ramos-Onsins, S.E. and Rozas, J. 2002 Statistical properties of new neutrality tests against

population growth. Molecular Biology and Evolution 19: 2092-2100.

Richard, P. 2010. Stock definition of belugas and narwhals in Nunavut. DFO Canadian Science

Advisory Secretariat Research Document 2010/022. iv + 14p.

Ronquist, F., Huelsenbeck, J., and Teslenko. 2011. Mr. Bayes ver. 3.2 manual

(http://mrbayes.sourceforge.net/mb3.2_manual.pdf)

Page 296: Genetic diversity, population structure and phylogeography ...

278

Ronquist, F., Teslenko, M., van der Mark, P., Aures, D.L., Darling, A., Höhna, S., Larget, B.,

Liu, L., Suchard, M.A., and Huelsenbeck, J.P. 2012. MrBayes 3.2: Efficient Bayesian

phylogenetic inference and model choice across a large model space. Systematic Biology 61:

539-542.

Ross, M.G., Russ, C., Costello, M., Hollinger, A., Lennon, N.J., Hegarty, R., Nusbaum, C. and

Jaffe, D.B. 2013. Characterizing and measuring bias in sequence data. Genome Biology 14:

R51.

Schwartz, M.K. and McKelvey, K.S. 2009. Why sampling scheme matters: the effect of

sampling scheme on landscape genetic results. Conservation Genetics 10: 441-452.

Shafer, A.B., Wolf, J.B., Alves, P.C., Bergström, L., Bruford, M.W., Brännström, I., Colling, G.,

Dalen, L., De Meester, L., Ekblom, R. and Fawcett, K.D. 2015. Genomics and the challenging

translation into conservation practice. Trends in Ecology and Evolution 30: 78-87.

Seutin, G., White, B.N., and Boag, P.T. 1991. Preservation of avian blood and tissue samples

for DNA analyses. Canadian Journal of Zoology 69: 82-9.

Shamblin, B.M., Bjorndal, K.A., Bolten, A.B., Hillis-Starr, Z.M., Lundgren, I., Naro-Maciel, E.,

and Nairn, C.J. 2012. Mitogenomic sequences better resolve stock structure of southern Greater

Caribbean green turtle rookeries. Molecular Ecology 21: 2330-2340.

Shi, N.N., Fan, L., Yao, Y.G., Peng, M.S. and Zhang, Y.P. 2014. Mitochondrial genomes of

domestic animals need scrutiny. Molecular Ecology 23: 5393-5397.

Southern, E.M. 1975. Detection of specific sequences among DNA fragments separated by gel

electrophoresis. Journal of Molecular Biology 98: pp.503 -508.

Tamura, K. and Nei, M., 1993. Estimation of the number of nucleotide substitutions in the

control region of mitochondrial DNA in humans and chimpanzees. Molecular Biology and

Evolution 10: 512-526.

Tajima, F. 1989. Statistical-method for testing the neutral mutation hypothesis by DNA

polymorphism. Genetics 123: 585-595.

Tamura, K., Stecher, G., Peterson, D., Filipski, A. and Kumar, S. 2013. MEGA6: molecular

evolutionary genetics analysis version 6.0. Molecular Biology and Evolution 30: 2725-2729.

Page 297: Genetic diversity, population structure and phylogeography ...

279

Turgeon, J., Duchesne, P., Colbeck, G.J., Postma, L.D., and Hammill, M.O. 2012.

Spatiotemporal segregation among summer stocks of beluga (Delphinapterus leucas) despite

nuclear gene flow: implication for the endangered belugas in eastern Hudson Bay (Canada).

Conservation Genetics 13: 419-433.

Vilstrup, J.T., Ho, S.Y., Foote, A.D., Morin, P.A., Kreb, D., Krützen, M., Parra, G.J., Robertson,

K.M., de Stephanis, R., Verborgh, P. and Willerslev, E. 2011. Mitogenomic phylogenetic

analyses of the Delphinidae with an emphasis on the Globicephalinae. BMC Evolutionary

Biology 11: 65.

Volkmann, L.A., Statham, M.J., Mooers, A.Ø. and Sacks, B.N. 2015. Genetic distinctiveness of

red foxes in the Intermountain West as revealed through expanded mitochondrial sequencing.

Journal of Mammalogy 96: 297-307.

Whitehead, H. 1998. Cultural selection and genetic diversity in matrilineal whales. Science 282:

1708-1711.

Woolley, S., Johnson, J., Smith, M.J., Crandall, K.A. and McClellan, D.A. 2003. TreeSAAP:

selection on amino acid properties using phylogenetic trees. Bioinformatics 19: 671-672.

Yu, L., Wang, X., Ting, N. and Zhang, Y. 2011. Mitogenomic analysis of Chinese snub-nosed

monkeys: Evidence of positive selection in NADH dehydrogenase genes in high-altitude

adaptation. Mitochondrion 11: 497-503.

Zhang, B., Penton, C.R., Xue, C., Wang, Q., Zheng, T. and Tiedje, J.M. 2015. Evaluation of the

Ion Torrent Personal Genome Machine for gene-targeted studies using amplicons of the

nitrogenase gene nifH. Applied and Environmental Microbiology 81: 4536-4545.

Page 298: Genetic diversity, population structure and phylogeography ...

280

Appendix 4.1. Haplotypes for beluga samples based on complete mitochondrial genome sequences

(16,385bp) as compared to control region mtDNA (609bp) haplotypes. Samples that have identical

mitogenome haplotypes are color-coded, with different colors indicating different haplotype matches. See

Figure 4.1 for sample collection abbreviations.

Sample Sample collection Mitogenome

haplotype CR Region haplotype

Sample collection month

KY444734.1_2017 reference 4 na na

00GF1034 EHA 19 E72 84GF028 EHA 48 E110 84GF005 EHA 59 E80

00GF1042 EHA 61 E80 September samples

84GF006 EHA 59 E80 00GF1043 EHA 40 E11 00GF1036 EHA 27 E24 00GF1039 EHA 75 E86 84GF020 EHA 74 E80 85GF001 EHA 38 E02

00GF1044 EHA 41 E97

99CI001 CHA 64 E11 99CI013 CHA 31 E02 99CI026 CHA 41 E97 July samples

99CI045 CHA 60 E80 99CI092 CHA 28 E72

01IG1044 FB 19 E72 01IG1073 FB 14 E22 97IG087 FB 66 E120 Aug, Sept, Oct

97IG251 FB 16 E02 samples

95IG445 FB 15 E123

02HI009 EBS 69 L144 04HI022 EBS 55 L011 94HI015 EBS 8 L113 94HI029 EBS 42 L147 July samples

05HI011 EBS 70 L142 94HI003 EBS 54 L011 94HI018 EBS 25 L151 94HI008 EBS 57 L120 06HI014 EBS 32 L002

05KI013 EBS 6 L151 04KI004 EBS 47 L120 July samples

04KI012 EBS 71 L147 04KI002 EBS 8 L113

05PA001 EBS 9 L151 05PA007 EBS 53 L011 July samples

05PA016 EBS 7 L113

Page 299: Genetic diversity, population structure and phylogeography ...

281

Sample Sample collection Mitogenome

haplotype CR Region haplotype

Sample collection month

89HSKY016 EBS 63 L147 89HSKY018 EBS 68 L120 July samples

89HSKY019 EBS 56 L011 89HSKY027 EBS 6 L151

94EWF009 EBS 62 L011 99EWF007 EBS 33 L113 July samples

99EWF010 EBS 39 L147 99EWF013 EBS 10 L151

92SP006 EBS 8 L113 93SP007 EBS 65 L120 93SP008 EBS 36 L151 July samples

93SP009 EBS 39 L147 93SP010 EBS 54 L011

89TUK009 EBS 6 L151 92TUK005 EBS 72 L147 July samples

92TUK010 EBS 43 L120 92TUK015 EBS 7 L113

92WWF001 EBS 58 L011 93WWF001 EBS 33 L113 July samples

93WWF002 EBS 30 L151 93WWF004 EBS 67 L147

95PG041 CSd 35 E02 95PG539 CSd 45 E75 95PG012 CSd 21 E37 95PG076 CSd 24 E02 96PG024 CSd 12 E02 July samples

01PG1059 CSd 22 E79 02PG002 CSd 45 E75

02PG1054 CSd 13 E72 02PG1198 CSd 5 E10 08PG002 CSd 29 E77

02SLE005 SLE 78 E154 98SLE003 SLE 77 E180 na

99SLE105 SLE 84 E180

04SQE015 SQE 79 E69 04SQE018 SQE 87 E177 2004 December

04SQE003 SQE 82 E94 samples

04SQE002 SQE 86 E16 04SQE012 SQE 16 E02 04SQE019 SQE 51 E11

11SQE1001 SQE 1 E06 11SQE1002 SQE 11 E10 2011 March

Page 300: Genetic diversity, population structure and phylogeography ...

282

Sample Sample collection Mitogenome

haplotype CR Region haplotype

Sample collection month

11SQE1004 SQE 11 E10 samples

11SQE1006 SQE 1 E06

05SQ1111 SQH 51 E11 09SQ1175 SQH 26 E02 11SQ1294 SQH 17 E29 May or June samples

12SQ1117 SQH 3 E07 12SQ1308 SQH 2 E06

05LI7103 LI 52 E11 07LI9008 LI 73 E57 August samples

11LI017 LI 85 E32

02JB4148 JB 16 E02 03JB5071 JB 46 E116 July or early Sept

04JB6093 JB 49 E57 samples

09JB003 JB 50 E11

84EHB006 EHB 37 E02

85EHB011 EHB 81 E11 July or August

samples

85EHB032 EHB 80 E18 85EHB038 EHB 83 E16

87AR008 WHB 23 E09 03AR1074 WHB 77 E22 03AR1056 WHB 77 E02 87AR025 WHB 44 E11 August samples

99AR1007 WHB 20 E72 99AR1016 WHB 76 E11 03AR1042 WHB 34 E72 87AR026 WHB 18 E11

Page 301: Genetic diversity, population structure and phylogeography ...

283

Chapter 5: Summary, conclusions, and directions for future research

5.1 Introduction

The role of genetics in conservation biology recognizes that “future evolutionary

adaptation depends on the existence of genetic variation” (Milligan et al. 1994). The tools

available, including empirical approaches in the lab and theoretical applications for data analysis,

to detect and interpret that variation in natural populations have changed dramatically over the

last 50 years (Allendorf 2017, Charlesworth and Charlesworth 2017). As I consider the last 25

years I have spent using genetic approaches to study belugas (Delphinpaterus leucas), the ability

of researchers to adapt to rapid changes in laboratory technology, increasingly more complex

quantitative methods and computational resources, and to integrate information from whole

ecosystems and across scientific disciplines is no less remarkable.

At their core, strategies for the conservation and management of wild populations still

include the use of molecular data to: reveal patterns of extant genetic diversity; define population

and management units; provide information about the extent of historical population isolations

and subsequent patterns of geographic expansions; collect information about species biology

important to conservation; and understand evolutionary processes (e.g. Moritz 1999, 2002,

Paetkau 1999, Frankham 2003). This thesis took advantage of evolving empirical and theoretical

genetic approaches to examine diversity among belugas from across the Canadian range of the

species and spanning more than 25 years of sampling. The overall goal was to improve the

resolution of spatial genetic differentiation among and within putative Canadian beluga stocks

(as defined in Richard 2010), refine inferences about past responses of belugas to climate

Page 302: Genetic diversity, population structure and phylogeography ...

284

changes, test new approaches to detect fine-scale structure within seasonal aggregations, and

develop new tools to infer the adaptive potential of these whales.

5.2 Key findings and future research directions

5.2.1 Conservation and management units for beluga whales in Canadian waters

In the 1980’s, before the introduction of the first Perkin Elmer Cetus© thermalcycler, the

isolation and restriction enzyme digestion of high quality mtDNA enabled the first genetic

delineation of beluga stocks in Canada (Helbig et al. 1989). After the introduction of the

polymerase chain reaction (PCR) (Saiki et al. 1985, Mullis et al. 1989), this technique, along

with laborious manual Sanger sequencing (Sanger et al. 1977), allowed for the first control

region (CR) mtDNA study of belugas across Canada (Brown Gladden et a. 1997). In this present

thesis, automated, high throughput sequencing allowed for the most extensive analysis to date of

CR mtDNA genetic population structure for over 2500 beluga whales sampled in Canadian

waters.

This study resulted in the identification of ten genetically distinct groups of belugas that

correspond to different geographic seasonal sample collections (mostly summer). The results

also improved the resolution of genetic diversity among and within summering groups of belugas

in Canada as compared to previous studies (e.g. Brennin et al. 1997, Brown Gladden et al. 1997,

de March and Postma 2003, Turgeon et al. 2012).The information was incorporated into an

update of Designatable Units (DUs) for belugas in Canada (COSEWIC 2016) where eight DUs

were proposed, including: Eastern Beaufort Sea (EBS), Eastern High Arctic-Baffin Bay (EHA-

BB), Cumberland Sound (CS), Ungava Bay (UB), Western Hudson Bay (WHB), Eastern Hudson

Bay (EHB), St. Lawrence Estuary (STL), and James Bay (JB). The full analyses in this thesis

Page 303: Genetic diversity, population structure and phylogeography ...

285

continued to support the identification of these Designatable Units for belugas in Canada, but

also indicate finer scale genetic structure in the EHA-BB and WHB DUs. Specifically, results

suggest the presence of additional beluga DUs in the central Canadian High Arctic (i.e.

Cunningham Inlet area) and a local, migratory DU around the Belcher Islands in Hudson Bay.

Complex mixtures of genetically distinct groups that overlap spatially and temporally in

areas such as Hudson Bay (and possibly the High Arctic and Cumberland Sound) present

challenges for ongoing efforts to define boundaries for conservation and management purposes.

As recommended in the beluga DU report (COSEWIC 2016), comprehensive analyses in these

areas, and the full Canada-wide distribution of belugas, with nuclear DNA microsatellite markers

would complement the results from mtDNA sequencing. However, the utility of new genetic

markers, such as SNPs and other genome-wide markers, should also be investigated (Davey et al.

2011). More representative population genetic information (i.e. larger number of samples,

sampling efforts to target specific locations, larger genetic surveys of genome-wide information),

along with new ways to analyze data, such as network analysis (as used in Chapter 3) and

seascape genetics that integrates other data and approaches (e.g. Amaral et al. 2012, Selkoe et al.

2016), may help provide new information required for stock assessment and management goals.

Seascape genetics, a marine equivalent to landscape genetics, is an approach that uses spatial

analyses and statistics for population genetic data to help clarify the relationship of dispersal and

gene flow of an organism to particular environments/habitats (i.e. in the case of belugas, the

seascape) (Selkoe et al. 2016). Information from genetic markers, location data (latitude and

longitude coordinates) for beluga samples used for genetic analyses, temporal information such

as date or season, aerial survey information on distributions and aggregations, physical

oceanographic information, and other habitat feature information could be linked in a GIS

Page 304: Genetic diversity, population structure and phylogeography ...

286

framework to form layers that could allow for the interpretation of boundaries between

conservation and management units of belugas.

5.2.2 Phylogeography of Canadian belugas

Early DNA sequencing technologies used for previous phylogenetic studies of belugas

resulted in relatively short fragments of sequences for analyses (234 – 410bp, Chapter 2

Appendix 2.1). Phylogeographic analyses based on 609bp of sequence in this thesis supported

the identification of three main groups of haplotypes (instead of two groups, Brown Gladden et.

al 1997) that clustered together based on ancestral nodes with multiple descendant haplotypes.

Furthermore, the geographical distribution of these haplogroups offered a different model for the

location of glacial refugia and the patterns of historical expansion and recolonizations by

Canadian stocks of belugas as compared to previous studies (Brown Gladden et al. 1997, de

March et al. 2003). The most divergent lineages were found at the east, west and southern edges

of the Canadian distribution of belugas, with the central area characteristic of a contact zone

displaying an admixture of lineages. Neutrality tests and mismatch distribution analyses

indicated population expansions in the recent past for most haplogroups, with estimated

expansion times generally coinciding with patterns of glacial retreat (Dyke 2004). The results

presented in this thesis found levels of genetic diversity and suggested a phylogeographic history

across the Canadian range of belugas that may indicate a capacity among beluga lineages for

climate change resilience (e.g. Laidre et al. 2008).

Molecular genetic approaches have become well established for the investigation of

phylogeographic patterns (Hickerson et al. 2010), with Arctic marine taxa offering unique

opportunities to evaluate evolutionary processes in response to environmental changes (Weider

Page 305: Genetic diversity, population structure and phylogeography ...

287

and Hobæk, 2000). Species such as belugas allow for the study of a variety of connected

ecological and evolutionary phenomena (such as dispersal, founder effects, and post-glacial

range expansions), especially over a latitudinal gradient (Weider and Hobæk, 2000). Coordinated

efforts among global beluga population genetic researchers could offer extremely valuable

ecological and evolutionary insights for the species. The next step for phylogeographic studies

and assessing responses to climate change of Canadian belugas should focus on integrating the

molecular data with information from fossil records such as those from the Champlain Sea

(Huntington 2006, 2008), including genetic analyses using ancient DNA techniques (e.g. de

Bruyn et al. 2011), and spatial modelling approaches (Gavin et al. 2014). Species distribution

modelling, which correlates species presence with environmental and ecological variables

through space and time (Jiménez-Valverde et al. 2008, Kearney and Porter 2009), merged with

genetic information for belugas could contribute to the resolution of conservation and

management units and also link to the seascape genetics approach identified in Section 5.2.1.

5.2.3 Relatedness and fine-scale stock structure for nearshore aggregations of Beaufort Sea

belugas

In highly mobile animals such as whales, population structure may be quite cryptic and

can develop from non-random associations of related kin (e.g. Pilot et al. 2010, Möller 2012).

Analyses to determine patterns of individual and kin-relationships among belugas have only

recently been employed (Colbeck et al 2013). In this thesis, genetic analyses of relatedness using

microsatellite genotypes of samples obtained from annual subsistence harvesting did not reveal

fine-scale geographic structure of kin-groups within nearshore aggregations of Eastern Beaufort

Sea (EBS) beluga. Given these results, harvesting practices that have been used in the region

Page 306: Genetic diversity, population structure and phylogeography ...

288

over the last several decades do not appear to be impacting particular kin-groups or distinct

genetic units associated with particular geographic locations.

However, even though there were no spatial or temporal patterns, both control region

sequences and whole mitogenome analyses did suggest that for the nearshore portion of the

stock, female EBS belugas form moderate bonds with other female kin and there is female

philopatry to the overall Eastern Beaufort Sea area. Given these results, and the evidence for

segregation within the EBS belugas based on habitat use (Loseto et al. 2006), it would be

interesting to pursue studies of social associations with a more representative sample of the

stock. Association studies of marked individuals, combined with genetics, can provide very

powerful insights for social structure (e.g. Kerth et al. 2011, Snyder-Mackler et al. 2014,

Morinha et al. 2017). Biopsy sampling of free-ranging animals is commonly used for studies of

social structure in whales (e.g. Richard et al. 1996, Andrews et al. 2010); however, cultural

sensitivities discourage its use for belugas. New technologies may again provide alternatives to

harvest sampling for genetic investigations of the full range of the EBS beluga stock.

Recently, environmental DNA (eDNA) from seawater has been used for population

genetic analyses of whale sharks (Rhincodon typus) (Sigsgaard et al. 2016). Traditionally, eDNA

has been used as a tool to detect the presence/absence of aquatic species, including whales (Foote

et al. 2012), but some proponents foresee its application for conservation genetics and

phylogeography despite continuing concerns over false positives and other limitations (Bohmann

et al. 2014). It is possible that it will evolve even further as a molecular genetics tool. Belugas, in

particular, could be a good candidate species for further investigations of eDNA as a tool for

population genetic analyses in the marine environment. The fact that these whales form

predicable summer aggregations in large numbers in shallow river estuaries, and the unique

Page 307: Genetic diversity, population structure and phylogeography ...

289

(among cetaceans) characteristic that most of these whales are undergoing an annual skin moult

in these locations (Boily 1995), provides an opportunity to test and optimize a number of

methods for eDNA investigations. First, whales are accessible and are visible, allowing for

known presence or absence detections. Second, because of the moult, there is a high expectation

for sloughed skin cell material to be present in the water. Second, sampling could be done at

increasing distances from an aggregation of whales to test the influence of dilution, current, and

water mixing on detection results. Finally, genetic resources are readily available for beluga for

the design of assays of eDNA samples to detect a variety of genetic information. Once these

types of technological issues, along with the establishment of stringent quality control protocols

for false positives and negatives, have been established, population and stock-scale questions

may be able to be addressed. For example, a survey of haplotype frequencies from eDNA

sampling of beluga aggregations, as was done for whale shark aggregations (Sigsgaard et al.

2016), among river estuaries in Hudson Bay could be compared to existing results from skin

samples obtained through harvesting to evaluate the eDNA results. Based on these results, eDNA

samples from areas where harvest or biopsy sampling is not possible (e.g. Seal River, Churchill

River, Nelson River along Western Hudson Bay; along the northern Ontario coastline and into

James Bay) may be particularly interesting for refining the identification of unique genetic

groups of belugas in Hudson Bay.

5.2.4 Mitogenomics as a tool for investigating beluga population genetics and molecular ecology

Studies evaluating the use of whole mitogenome sequences for the study of population

genetic diversity and phylogenetics in belugas and narwhals have not previously been conducted.

Though analyses of whole mitogenomes has been employed for inferring phylogenetic

Page 308: Genetic diversity, population structure and phylogeography ...

290

relationships for more than 15 years, especially in mammals (Curole and Kocher 1999), more

recent advances in next-generation sequencing technologies have improved sequencing speed

and sample throughput at costs affordable for population-genetic studies (Schuster 2008). In this

thesis, the comparison of whole mitogenome sequences, generated using next-generation

sequencing protocols, for beluga and narwhal samples representing range-wide Canadian stocks

corroborated results of control region mtDNA analyses. Furthermore, finer-scale patterns of

phylogenetic relationships among beluga samples were resolved, but whole mitogenome

diversity failed to improve phylogenetic results of narwhals as compared to control region

mtDNA sequences. Future work should be done to examine possible advantages of phylogenies

from only the most informative regions of the mitogenome sequence (Duchêne et al. 2011).

Also, coalescent-based population genetic analyses (such as available in BEAST (Bayesian

Evolutionary Analysis by Sampling Trees), Bouckaert et al. 2014) for more precise estimates of

divergence times should be conducted using mitogenome phylogenies. The analysis of multiple

loci, specifically nuclear microsatellite loci, may also help to resolve the shallow phylogenies

found for beluga and narwhal samples and provide estimates of recent divergence times. The use

of multiple unlinked loci is often recommended for better estimation of phylogenetic trees for

species with recent, rapid radiation (e.g. Sánchez-Gracia and Castresana 2012).

Perhaps the greatest potential for mitogenomic studies of natural populations is for the

investigation of adaptive variation, particularly positive selection for variation related to

particular environments (Primmer 2009). In this study, preliminary analyses for indications of

selection among beluga lineages revealed widespread signals of negative, purifying selection, but

no evidence of positive, adaptive selection was found. Further research involving greater

numbers of samples that encompass a more diverse, global range of populations may have a

Page 309: Genetic diversity, population structure and phylogeography ...

291

greater chance of detecting variation that may be linked to adaptation within mtDNA protein-

coding genes. This approach should not be limited to just mtDNA data. Geographic cline

analysis has recently been combined with complete genome-wide genetic variation among

ecotypes of monkeyflowers (Mimulus aurantiacus) (Stankowski et al. 2017). The results

demonstrate the value of a using a spatially explicit framework to study genome-wide patterns of

divergence within a species.

5.3 Conclusion

This thesis examined DNA diversity among Canadian beluga whales sampled across a

broad spatial and temporal scale, and at various scales of sequence analyses (i.e. partial mtDNA

control region sequence, multiple nuclear microsatellite loci, and whole mitogenome sequences).

Understanding the genetic diversity of natural populations and the evolutionary forces that shape

it, the relationship of diversity to census population sizes, and how diversity is linked to adaptive

potential of a population, are critical for effective conservation strategies (Leffler et al. 2012).

The results of this thesis provided important information for the identification of conservation

and management units of belugas, including: Designatable Units (DUs) for conservation

assessment purposes (COSEWIC 2016), evolutionarily-related groups to interpret patterns of

refugial use and post-glacial expansion in response to climate changes, and kin-groups to

determine the potential impacts of subsistence harvesting practices. The development of an Ion

Torrent workflow allowed for testing a mitogenomic approach to examine neutral and adaptive

variation among beluga samples representing ancestral and distinct control region lineages from

across the Canadian distribution of the species. This may be a first step towards adopting

conservation genomics investigations of belugas to survey mtDNA genetic diversity in more

Page 310: Genetic diversity, population structure and phylogeography ...

292

detail. If this information is then linked to the ecological and life-history strategies of a

population or stock, it can provide a basis for predictions about potential responses to

environmental disturbance, population health, fitness, and other ecosystem interactions (Hughes

et al. 2008, Romiguier et al. 2014). This would be valuable for developing and updating

conservation strategies and management policies.

5.4 References

Allendorf, F.W. 2017. Genetics and the conservation of natural populations: allozymes to

genomes. Molecular Ecology 26: 420-430.

Amaral, A.R., Beheregaray, L.B., Bilgmann, K., Boutov, D., Freitas, L., Robertson, K.M.,

Sequeira, M., Stockin, K.A., Coelho, M.M. and Möller, L.M. 2012. Seascape genetics of a

globally distributed, highly mobile marine mammal: the short-beaked common dolphin (genus

Delphinus). PLoS One 7: p.e31482.

Andrews, K.R., Karczmarski, L., Au, W.W., Rickards, S.H., Vanderlip, C.A., Bowen, B.W.,

Gordon Grau, E. and Toonen, R.J., 2010. Rolling stones and stable homes: social structure,

habitat diversity and population genetics of the Hawaiian spinner dolphin (Stenella longirostris).

Molecular Ecology 19: 732-748.

Bohmann, K., Evans, A., Gilbert, M.T.P., Carvalho, G.R., Creer, S., Knapp, M., Douglas, W.Y.

and De Bruyn, M. 2014. Environmental DNA for wildlife biology and biodiversity

monitoring. Trends in Ecology Evolution 29: 358-367.

Boily, P., 1995. Theoretical heat flux in water and habitat selection of phocid seals and beluga

whales during the annual molt. Journal of Theoretical Biology 172: 235-244.

Bouckaert, R., Heled, J., Kühnert, D., Vaughan, T., Wu, C.H., Xie, D., Suchard, M.A., Rambaut,

A. and Drummond, A.J. 2014. BEAST 2: a software platform for Bayesian evolutionary

analysis. PLoS Computational Biology 10: e1003537.

Brennin, R., Murray, B.W., Friesen, M.K., Maiers (Postma), L.D., Clayton, J.W., and White,

B.N. 1997. Population genetic structure of beluga whales (Delphinapterus leucas): mitochondrial

DNA sequence variation within and among North American populations. Canadian Journal of

Zoology 75: 795-802.

Page 311: Genetic diversity, population structure and phylogeography ...

293

Brown Gladden, J.G., Ferguson, M.M, and Clayton, J.W. 1997. Matriarchal genetic population

structure of North American beluga whales Delphinapterus leucas (Cetacea: Monodontidae).

Molecular Ecology 6: 1033-1046.

Charlesworth, B. and Charlesworth, D. 2017. Population genetics from 1966 to 2016. Heredity

118: 2-9.

Colbeck, G.J., Duschesne, P., Postma, L.D., Lésage, V., Hammill, M.O., and Turgeon, J. 2013.

Groups of related belugas (Delphinapterus leucas) travel together during their seasonal

migrations in and around Hudson Bay. Proceedings of the Royal Society of London B:

Biological Sciences 280: 1752–1762.

COSEWIC. 2016. Designatable Units for Beluga Whales (Delphinapterus leucas) in Canada.

Committee on the Status of Endangered Wildlife in Canada. Ottawa. 73 pp.

Curole, J.P. and Kocher, T.D. 1999. Mitogenomics: digging deeper with complete mitochondrial

genomes. Trends in Ecology and Evolution 14: 394-398.

de Bruyn, M., Hoelzel, A.R., Carvalho, G.R. and Hofreiter, M. 2011. Faunal histories from

Holocene ancient DNA. Trends in Ecology and Evolution 26: 405-413.

de March, B.G.E and Postma, L.D. 2003. Molecular stock discrimination of belugas

(Delphinapterus leucas) hunted in eastern Hudson Bay, northern Quebec, Hudson Strait, and

Sanikiluaq (Belcher Islands), Canada, and comparisons to adjacent populations. Arctic 56: 111-

124.

Dyke, A.S., 2004. An outline of North American deglaciation with emphasis on central and

northern Canada. Developments in Quaternary Sciences 2: 373-424.

Foote, A.D., Thomsen, P.F., Sveegaard, S., Wahlberg, M., Kielgast, J., Kyhn, L.A., Salling,

A.B., Galatius, A., Orlando, L. and Gilbert, M.T.P., 2012. Investigating the potential use of

environmental DNA (eDNA) for genetic monitoring of marine mammals. PLoS One 7:

p.e41781.

Frankham, R. 2003. Genetics and conservation biology. Comptes Rendus Biologies 326: 22-29.

Gavin, D.G., Fitzpatrick, M.C., Gugger, P.F., Heath, K.D., Rodríguez‐Sánchez, F., Dobrowski,

S.Z., Hampe, A., Hu, F.S., Ashcroft, M.B., Bartlein, P.J. and Blois, J.L. 2014. Climate refugia:

joint inference from fossil records, species distribution models and phylogeography. New

Phytologist 204: 37-54.

Page 312: Genetic diversity, population structure and phylogeography ...

294

Harington, C.R. 2006. Félix: a late Pleistocene white whale (Delphinapterus leucas) skeleton

from the Champlain Sea deposits at Saint-Félix-de-Valois, Québec. Géographie physique et

Quaternaire 60: 183-198.

Harington, C.R. 2008. The evolution of Arctic marine mammals. Ecological Applications 18

(Supplement): S23-S40.

Helbig, R., Boag, P.T. and White, B.N., 1989. Stock identification of beluga whales

(Delphinapterus leucas) using mitochondrial DNA markers: Preliminary results. Musk-Ox 37:

122-128.

Hickerson, M.J., Carstens, B.C., Cavender-Bares, J., Crandall, K.A., Graham, C.H., Johnson,

J.B., Rissler, L., Victoriano, P.F. and Yoder, A.D. 2010. Phylogeography’s past, present, and

future: 10 years after. Molecular Phylogenetics and Evolution 54: 291-301.

Hughes, A.R., Inouye, B.D., Johnson, M.T., Underwood, N. and Vellend, M. 2008. Ecological

consequences of genetic diversity. Ecology Letters 11: 609-623.

Kearney, M. and Porter, W. 2009. Mechanistic niche modelling: combining physiological and

spatial data to predict species’ ranges. Ecology Letters 12: 334-350.

Kerth, G., Perony, N., and Schweitzer, F. 2011. Bats are able to maintain long-term social

relationships despite the high fission-fusion dynamics of their groups. Proceedings of the Royal

Society of London B: Biological Sciences 278: 2761–2767.

Laidre, K.L., Stirling, I., Lowry, L.F., Wiig, Ø., Heide-Jørgensen, M.P. and Ferguson, S.H. 2008.

Quantifying the sensitivity of Arctic marine mammals to climate‐induced habitat change.

Ecological Applications 18 (Supplement): S97-S125.

Leffler, E.M., Bullaughey, K., Matute, D.R., Meyer, W.K., Segurel, L., Venkat, A., Andolfatto,

P. and Przeworski, M. 2012. Revisiting an old riddle: what determines genetic diversity levels

within species? PLoS Biology 10: p.e1001388.

Loseto, L.L., Richard, P., Stern, G.A., Orr, J., and Ferguson, S.H. 2006. Segregation of Beaufort

Sea beluga whales during the open-water season. Canadian Journal of Zoology 84: 1743-1751.

Milligan, B.G., Leebens-Mack, J. and Strand, A.E. 1994. Conservation genetics: beyond the

maintenance of marker diversity. Molecular Ecology 3: 423-435.

Möller, L.M. 2012. Sociogenetic structure, kin associations and bonding in delphinids.

Molecular Ecology 21: 745–764.

Page 313: Genetic diversity, population structure and phylogeography ...

295

Moritz, C. 1999. Conservation units and translocations: strategies for conserving evolutionary

processes. Hereditas 130: 217-228.

Moritz, C. 2002. Strategies to protect biological diversity and the evolutionary processes that

sustain it. Systematic Biology 51: 238-254.

Mullis, K., Faloona, F., Scharf, S., Saiki, R.K., Horn, G.T. and Erlich, H.1986. Specific

enzymatic amplification of DNA in vitro: the polymerase chain reaction. Cold Spring Harbor

Symposia on Quantitative Biology 51: 263-273.

Paetkau, D. 1999. Using genetics to identify intraspecific conservation units: a critique of current

methods. Conservation Biology 13: 1507-1509.

Pilot, M., Dahlheim, M.E. and Hoelzel, A.R. 2010. Social cohesion among kin, gene flow

without dispersal and the evolution of population genetic structure in the killer whale (Orcinus

orca). Journal of Evolutionary Biology 23: 20-31.

Primmer, C.R. 2009. From conservation genetics to conservation genomics. Annals of the New

York Academy of Sciences 1162: 357-368.

Richard, P. 2010. Stock definition of belugas and narwhals in Nunavut. DFO Canadian Science

Advisory Secretariat Research Document 2010/022. iv + 14p.

Romiguier, J., Gayral, P., Ballenghien, M., Bernard, A., Cahais, V., Chenuil, A., Chiari, Y.,

Dernat, R., Duret, L., Faivre, N. and Loire, E. 2014. Comparative population genomics in

animals uncovers the determinants of genetic diversity. Nature 515:261-263.

Richard, K.R., Dillon, M.C., Whitehead, H. and Wright, J.M. 1996. Patterns of kinship in groups

of free-living sperm whales (Physeter macrocephalus) revealed by multiple molecular genetic

analyses. Proceedings of the National Academy of Sciences USA 93: 8792-8795.

Saiki, R.K., Scharf, S., Faloona, F., Mullis, K.B., Horn, G.T., Erlich, H.A. and Arnheim, N.

1985. Enzymatic amplification of b-globin genomic sequences and restriction site analysis for

diagnosis of sickle cell anemia. Science 230: 1350-1354.

Sánchez-Gracia, A. and Castresana, J. 2012. Impact of deep coalescence on the reliability of

species tree inference from different types of DNA markers in mammals. PLoS One 7:

p.e30239.

Page 314: Genetic diversity, population structure and phylogeography ...

296

Sanger, F., Nicklen, S. and Coulson, A.R. 1977. DNA sequencing with chain-terminating

inhibitors. Proceedings of the National Academy of Sciences USA 74: 5463-5467.

Selkoe, K.A., Aloia, C.C., Crandall, E.D., Iacchei, M., Liggins, L., Puritz, J.B., von der Heyden,

S. and Toonen, R.J. 2016. A decade of seascape genetics: contributions to basic and applied

marine connectivity. Marine Ecology Progress Series 554: 1-19.

Schuster, S.C. 2008. Next-generation sequencing transforms today's biology. Nature Methods 5:

16-18.

Sigsgaard, E.E., Nielsen, I.B., Bach, S.S., Lorenzen, E.D., Robinson, D.P., Knudsen, S.W.,

Pedersen, M.W., Al Jaidah, M., Orlando, L., Willerslev, E. and Møller, P.R. 2016. Population

characteristics of a large whale shark aggregation inferred from seawater environmental

DNA. Nature Ecology and Evolution 1: DOI: 10.1038/s41559-016-0004.

Snyder-Mackler, N., Albert, S.C., and Bergman, T.J. 2014. The socio-genetics of a complex

society: female gelada relatedness patterns mirror association patterns in a multilevel society.

Molecular Ecology 23: 6179–6191.

Turgeon, J., Duchesne, P., Colbeck, G.J., Postma, L.D., and Hammill, M.O. 2012.

Spatiotemporal segregation among summer stocks of beluga (Delphinapterus leucas) despite

nuclear gene flow: implication for the endangered belugas in eastern Hudson Bay (Canada).

Conservation Genetics 13: 419-433.

Weider, L.J. and Hobæk, A. 2000. Phylogeography and Arctic biodiversity: a review. Annales

Zoologici Fennici 34: 217-231.