Top Banner
Constraining Astronomical Populations with Truncated Data Sets Brandon C. Kelly (CfA, Hubble Fellow, [email protected]) 03/19/22 Brandon C. Kelly, [email protected]
13

Constraining Astronomical Populations with Truncated Data Sets

Jan 02, 2016

Download

Documents

Constraining Astronomical Populations with Truncated Data Sets. Brandon C. Kelly (CfA, Hubble Fellow, [email protected]). Goal of Many Surveys: Understand the distribution and evolution of astronomical populations. How does the growth of supermassive black holes change over time? - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Constraining Astronomical Populations with Truncated Data Sets

Constraining Astronomical Populations with Truncated Data

Sets

Constraining Astronomical Populations with Truncated Data

Sets

Brandon C. Kelly (CfA, Hubble Fellow, [email protected])

Brandon C. Kelly (CfA, Hubble Fellow, [email protected])

04/20/23 Brandon C. Kelly, [email protected]

Page 2: Constraining Astronomical Populations with Truncated Data Sets

Goal of Many Surveys: Understand the distribution and evolution of astronomical

populations

Goal of Many Surveys: Understand the distribution and evolution of astronomical

populations

04/20/23 Brandon C. Kelly, [email protected]

But all we can observe (measure) is the light (flux density) and location of sources on the sky!But all we can observe (measure) is the light (flux density) and location of sources on the sky!

• How does the growth of supermassive black holes change over time?

• How was the stellar mass of galaxies assembled?• What is the distribution of black hole spin for

supermassive black holes? How does this evolve?

Page 3: Constraining Astronomical Populations with Truncated Data Sets

A motivating exampleA motivating example

• Recent advances in modeling of stellar evolution have made it possible to relate a galaxy’s physical parameters (e.g., mass, star formation history) to its measured fluxes

• Opens up possibility of studying evolution of galaxy population, and, in particular, evolution in the distribution of their physical quantities, and not just their measurable ones.

04/20/23 Brandon C. Kelly, [email protected]

Page 4: Constraining Astronomical Populations with Truncated Data Sets

Simple vs. Advanced ApproachSimple vs. Advanced Approach

Simple but not Self-consistentSimple but not Self-consistent• Derive ‘best-fit’ estimates

for quantities of interest (e.g., mass, age, BH spin)

• Do this individually for each source

• Infer distribution and evolution directly from the estimates

• Provides a biased estimate of distribution and evolution

• Derive ‘best-fit’ estimates for quantities of interest (e.g., mass, age, BH spin)

• Do this individually for each source

• Infer distribution and evolution directly from the estimates

• Provides a biased estimate of distribution and evolution

Advanced and Self-ConsistentAdvanced and Self-Consistent• Derive distribution and

evolution of quantities of interest directly from observed distribution of measurable quantities

• Circumvents fitting of individual sources independently

• Self-consistently accounts for uncertainty in derived quantities and selection effects (e.g., flux limit)

• Derive distribution and evolution of quantities of interest directly from observed distribution of measurable quantities

• Circumvents fitting of individual sources independently

• Self-consistently accounts for uncertainty in derived quantities and selection effects (e.g., flux limit)

04/20/23 Brandon C. Kelly, [email protected]

Page 5: Constraining Astronomical Populations with Truncated Data Sets

The Posterior Distribution: How to quantitatively relate the distribution of physical quantities to measurable ones

The Posterior Distribution: How to quantitatively relate the distribution of physical quantities to measurable ones

• Define p(y|x) as the measurement model, it relates the physical quantities, x, to the measured ones, y

• Define p(x|θ) as the model distribution for the physical quantities

• The posterior probability distribution of the values of x (physical quantities) and θ (parameterizes distribution of x), given the values of y (measured quantities) for the n data points:

04/20/23 Brandon C. Kelly, [email protected]

p(x,θ | y)∝ p(y | x)p(x |θ)p(θ)

Page 6: Constraining Astronomical Populations with Truncated Data Sets

Incorporating the flux limit (truncation)Incorporating the flux limit (truncation)

1. If there is a flux limit (data truncation), denote Det(y) to be the selection function (probability of detection as a function of y). We need to normalize the posterior by the detection probability as a function of θ, Det(θ):

2. The probability distribution of the physical (missing) quantities for each source, x, and the parameters for the distribution of x, θ, given the n observed values of y, is then

04/20/23 Brandon C. Kelly, [email protected]

Det(θ) = Det(y)∫ p(y |θ)dy

= Det(y) p(y | x)p(x |θ)∫ dx[ ]∫ dy

p(x,θ | y)∝ p(θ) Det(θ)[ ]−n

p(y i | x i)p(x i |θ)i=1

n

Page 7: Constraining Astronomical Populations with Truncated Data Sets

But, there are some computational complications…

But, there are some computational complications…

• Expected fluxes are a highly non-linear, non-monotonic function of the physical parameters– Leads to multiple modes in p(y|x), and thus in the

posterior

• Calculation of expected flux for a given physical parameter set is very computationally intensive, based on running a complex computer model for stellar evolution– Typical to run model on a grid first, and then use a look-up

table

04/20/23 Brandon C. Kelly, [email protected]

Page 8: Constraining Astronomical Populations with Truncated Data Sets

ExampleExample

04/20/23 Brandon C. Kelly, [email protected]

Posterior Probability Distribution

Page 9: Constraining Astronomical Populations with Truncated Data Sets

Additional problems when there is truncation (e.g., a flux limit)

Additional problems when there is truncation (e.g., a flux limit)

• No simple way to calculate Det(θ):

• Naïve method: Simulate a sample given the model, θ, and count the fraction of sources that are detected

• Unfortunately, this stochastic integral introduces error in Det(θ), and posterior is unstable to even small errors in Det(θ)

04/20/23 Brandon C. Kelly, [email protected]

Det(θ) = Det(y) p(y | x)p(x |θ)∫ dx[ ]∫ dy

Page 10: Constraining Astronomical Populations with Truncated Data Sets

Example: Estimating a Luminosity Function (Distribution)

Example: Estimating a Luminosity Function (Distribution)

• Simulate galaxy luminosities from a Schechter function (i.e., a gamma distribution)

• Keep L > LLIM = L* (~ 30% detection fraction)

• Estimate Det(α,L*) stochastically:– For each (α,L*) simulate a sample

of 1000 and 10,000 luminosities– Keep those for which L > LLIM

04/20/23 Brandon C. Kelly, [email protected]

p(L |α ,L*)∝ Lαe−L /L*

,α = 0, L* =1

Page 11: Constraining Astronomical Populations with Truncated Data Sets

Statistical and computational problems, and directions for future work

Statistical and computational problems, and directions for future work

• Need to have more efficient algorithms– Modern and future surveys will produce tens to hundreds of thousands of

data points with several parameters (e.g., flux densities) each, how to efficiently do statistical inference (e.g., MCMC)?

– Potential algorithms need to handle multimodality in the posterior/likelihood function

• Need to efficiently and accurately compute the multi-dimensional integral for the detection probability– Alternatively, need to efficiently account for uncertainty in a more efficient

but less accurate integration method, e.g., stochastic integration

• Need to have an accurate and efficient method for interpolating the output from computationally intensive computer models (e.g., stellar evolution)

– Statistical emulators should help here

04/20/23 Brandon C. Kelly, [email protected]

Page 12: Constraining Astronomical Populations with Truncated Data Sets

Example: The Quasar Black Hole Mass Function (Distribution)

Example: The Quasar Black Hole Mass Function (Distribution)

04/20/23 Brandon C. Kelly, [email protected]

LuminosityLuminosity

Flux LimitFlux Limit

LuminosityLuminosity

Intrinsic Distributionof MeasurablesIntrinsic Distributionof Measurables

SelectionEffectsSelectionEffects

Observed Distributionof MeasurablesObserved Distributionof Measurables

Intrinsic DistributionOf Derived QuantitiesIntrinsic DistributionOf Derived Quantities

Black Hole MassBlack Hole Mass

Eddington RatioEddington RatioEmission LineWidthEmission LineWidth

Emission LineWidthEmission LineWidth

Page 13: Constraining Astronomical Populations with Truncated Data Sets

Example on Real Data: The Quasar Black Hole Mass Function (Distribution)

Example on Real Data: The Quasar Black Hole Mass Function (Distribution)

04/20/23 Brandon C. Kelly, [email protected]

From Kelly et al. (2010, ApJ, 719, 1315)From Kelly et al. (2010, ApJ, 719, 1315)