Lecture Notes in Physics Edited by'J. Ehlers, M0nchen, K. Hepp, ZLirich, R. Kippenhahn, MLinchen,H. A. WeidenmiJIler, Heidelberg, and 1 Zittartz, K61n Managing Editor: W. Beiglb6ck, Heidelberg 71 Problems of Stellar Convection Proceedings of the Colloquium Nr. 38 of the International Astronomical Union, Held in Nice, August 16-20, 1976 ml Edited by E. A. Spiegel and J. P. Zahn Springer-Verlag Berlin Heidelberg New York 1977
370
Embed
Problems of Stellar Convection: Proceedings of the Colloquium Nr. 38 of the International Astronomical Union, Held in Nice, August 16–20,1976
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Lecture Notes in Physics Edited by'J. Ehlers, M0nchen, K. Hepp, ZLirich, R. Kippenhahn, MLinchen, H. A. WeidenmiJIler, Heidelberg, and 1 Zittartz, K61n Managing Editor: W. Beiglb6ck, Heidelberg
71
Problems of Stellar Convection Proceedings of the Colloquium Nr. 38 of the International Astronomical Union, Held in Nice, August 16-20, 1976
ml
Edited by E. A. Spiegel and J. P. Zahn
Springer-Verlag Berlin Heidelberg New York 1977
Editors
Edward A. Spiegel Astronomy Department Columbia University New York, New York 10027 /USA
Jean-Paul Zahn Observatoire de Nice Le Mont Gros 0 6 3 0 0 Nice/France
ISBN 3-540-08532-? Springer-Verlag Berlin Heidelberg New York ISBN 0-387-08532-7 Springer-Verlag New York Heidelberg Berlin
This work is subject to copyright. All rights are reeerved, whether the whole or part of the material is concerned, specifically those of translation, re- printing, re-use of illustrations, broadcasting, reproduction by photocopying machine or similar means, and storage in data banks.
As president of the organizing committee of this meeting I was granted the honor of
opening the conference. But despite appearances I was only a figurehead that Jean-Paul
Zahn somehow decided to set up, Whatever hismotivatlon was, his execution was excellent
and my first remark must be an expression of my admiration for the marvelous job that
he and his associates have done in providing all the spiritual, intellectual, and
material advantages that we found waiting for us in Nice. Let me assure you though,
that a token president is not without uses and I wish I had known this before accepting
the job. I spent the days before and during the conference running routine errands,
carrying luggage, and being reprimanded for some of the minor things that inevitably
must go wrong in many large gatherings, I was even scolded because the name of some-
one who had not said he was coming was omitted from the list of participants. And when
the time was running short during the meeting, I was obliged, in a statesmanlike gesture,
to cUt my scheduled one-hour talk to eight minutes. But rank has its privileges and mine
was to be informed of the guiding principles behind the organization of the conference.
~ermit me now to share these with you.
It happened that the first day of the conference coincided with that of a large po-
litlcal convention (in another place,happily), and that suggested a convenient meta-
phor for describing the divergence of viewpoints among the participants. Let us there-
fore discuss the politics of stellar convection theory.
At the extreme right of the convective political spectrum are those who want to
write down the full equations and solve them. The ultra-conservatives, as I shall call
them, have virtue but no results that apply directly to stars.
At the other extreme of the convection spectrum are the radicals who want to write
down an algorithm for computing stellar structure that contains adjustable parameters
which can be fit to well known cases. In an extreme version of this we would write :
R = II where R is the radius and R± is an adjustable parameter. If we fit the parameter
to the sun we get R = 7 x 10 ]I cm and the resulting formula turns out to describe a
large number of stars tolerably well. I think it is fair to say that no one at the con-
ference was this radical, but it would be hard to deny that there have been things in
the literature that have these overtones.
But let me come to the political views represented by the actual participants. I
cannot be too specific since many participants have sometimes yielded to expediency and
shifted ground shamelessly. Nigel Weiss is a case in point. This paragon of the right
has recently (with Gough) written a paper on stellar mixing-length theory in what must
be the greatest fall from grace in recent memory. Having viewed this behaviour with
alarm let me point with pride to the spectrum of opinion represented here.(The infrared
has been filtered out.)
In these proceedings we have a coverage of this spectrum from mixing-length theory to
computations on the full equations ( for limited parameter ranges ). Naturally, this
represents more than most astrophysicists need to know about convection. Some will merely
read this introduction, expecting to find out where the best current approach is des-
cribed, hoping that this will be consistent with the constraint that results are to be
found in a finite time ........... say months, I have,of course, anticipated this need,
but am not sure I can meet it.
Douglas Gough and I spent the summer in a Cambridge drought trying to prepare a
statement that will answer such a specific question. Naturally, I didn't have time to
give the results of our lucubrations in my spoken introductory remarks. Nor did Gough
manage to fit them into his lecture on standard mlxing-length theory. That does not
mean that we could not put it all into one of our manuscripts. But which paper should it
be in? The solution is that we have prepared a joint appendix which I am told will appear
somewhere below.
Our conclusion is that a non-local mixing-length theory seems to be the best that
one can do at present. Unfortunately, this is not a precise a~tement and we simply
give an outline of how such an approach might be made and try to give an indication of
the physical assumptions needed. There are other ways to go about this, and our aim is
merely to suggest the level of sophistication in mixing-length theory that we think may
be warranted in stellar models.
I have indicated the spread in the approaches to convection discussed below as a
kind of abscissa. There is also an ordinate which represents a spectrum of complications
that arise in convection theory in specific kinds of stars on stages of evolution, or
refer to effects that are usually presented but ignored in first approximation. If we
must look mostly to the left to get usable results for stellar structure theory, it is
equally true that we usually turn to the right for guidance about how to handle these
special effects. For even if the solutions of the conservatives are not directly usable
for stars, they can be extended to include compressibility, rotation, magnetic fields,
compositional inhomogeneities, penetration, and, if we would just take the trouble, coup-
ling to pulsation. The hope is that what a special effect does to a conservative's solu-
tion it will probably do to a radical's model. This half-truth in practical terms means
that by seeing what rotation does to Boussinesq convection in two-dimensional or modal
convection, you may build enough intuition to make a cogent argument about what it does
to stellar convection. For example, when stellar model-builders want to decide what to
do about semiconvection, let them read Huppert's article on thermohaline convection. No
doubt many astrophysicists will not care for this general viewpoint unless it happens
to lead to answers according with what they need to coax their models into agreement
with observations. Eric Graham's discussion is a good case in point.
Graham has numerical solutions for fullyc~mpressible three-dlmensional convection
in a layer several pressure (and density) scale heights thick. Apart from a charming
tendency to swirl aboutj his flows look startlingly llke Boussinesq convection, and
he detects no sign of scale heights influencing his dynamics. Radicals will probably
ignore this result. What else can they do?
Lest I seem to give too much credit to the conservatives let me point out their
main fault : they rarely include effects in their calculations that are motivated by
purely astrophysical convection problems, but rather study traditional effects. If they
want to prove me wrong about this let one or more of them do a proper Boussinesq calcu-
lation of the URCA convection problem suramarlzed here by Giora Shaviv. This example does
not have the double entendre of something like rotation that interests meteorologists
also. So much for the ordinate.
In these proceedings we shall also leave the phase I have been describing to have a
look at recent trends in turbulence theory. Those who have followed this subject at all
know that it too has something of a political spectrum and some of the extreme conser-
vatives of turbulence report here on current approaches. Urlel Frisch will translate the
right wing's latest credo, fractal dimensions, into terms the leftist can understand,
and Yves Pomeau will tell us about aperiodic oscillations. These both refer to forms of
mathematics that may help us to see what turbulence is. Pomeau's talk is concerned with
systems of o.d.e.~s that give periodic solutions except in certain parameter ranges
where they go into aperiodic, almost random hehaviour, The suspicion has been around
for many years that this behaviour may have the mathematical ingredients that give tur-
bulence its stochastic features and, lately, attempts to formulate this idea precisely
have been mounted. But even if this does not turn out to work, it does not hurt to know
about aperiodic oscillators in other contexts. The funny behaviour of the solar cycle
during the reign of Louis XIV may have been a manifestation of such an aperiodic
oscillator of interest to this audience.
This has been a lengthy introduction yet it has not told you the full range of topics
to be covered. I hope that it gives you a flavour of what to expect in looking over the
proceedings. I am told that all the contributed papers have been refereed and so the pro-
lixity stops here. There is not even a concluding oration to be reported. Of course, I
happen to have a manuscript called "Convection in Stars III..." that might have served,
but that is destined for other things. However, a brief summary of developments before
this meeting is in Gough's report for IAU Commission Mestel and it is reprinted here
with bibliography. Its adequae~ as a summary may be a measure either of the rate of pro-
gress in this subject or of Gough's perspicacity.
HISTORICAL REMINISCENCES OF THE ORIGINS OF
STELLAR CONVECTION TKEORY
(1930- 1945)
Ludwig Biermann
Max-Planck-Institut fur Physik und Astrophysik
Munich, Germany
To set the stage for the report to follow +) , let me start with a
quotation from A.S. Eddington's classical work "The Internal Con-
stitution of the Stars", published exactly half a century ago.
Eddington wrote: "We shall not enter further into the historic problem
of convective equilibrium since modern researches show that the hypo-
thesis is untenable. In stellar conditions the main process of trans-
fer of heat is by radiation and other modes of transfer may be
neglected." (paragraph 69, p. 98).
Only 19 years earlier Robert Emden had outlined for the sun a very
different picture; in paragraph 4, chapter 18, of "Gaskugeln" - also
a classical treatise which influenced research well into the thir-
ties - he had stated with equal conviction that the energy radiated
into space from the photosphere could be brought there almost ex-
clusively only by convection, and that the granulation depicts the
cross section of the ascending hotter and the descending darker
currents. Emden's manuscrip£ had been read in the proof stage by his
brother-in-law Karl Schwarzschild, who advised Emden on a number of
points. It seems worth noting that Schwarzschild's somewhat earlier
work on the radiative equilibrium of the sun's atmosphere was discussed
in Emden's monograph (in chapter 16, par. 13) ++) .
+) The text includes a number of points, which actually came up in the later discussions during the colloquium or, in one case, during the IAU assembly in Grenoble. The author is indebted to many colleagues, in particular to D. Gough, L. Mestel, M. Schwarzschild and N. Weiss for important comments.
++) Whether Schwarzschild, who had proven that radiative transport prevailed in the sun's photospheric layer and had formulated the quantitative criterion for the stability of such layers, was in com- plete agreement with Emden, is not quite clear, though Emden's text does convey this impression.
Part of the change of the scientific argument during the intervening
time (1907-1926) was of course due to Jean's and L~demann and Newa]]'s
discovery that at the pressures and temperatures prevailing in the
stellar interiors, all the molecules and atoms would be broken up into
almost bare atomic nuclei and electrons and that, as a consequence,
radiative transport was recognized to be most efficient, in Eddington's
scheme of 1926 almost too efficient. In retrospect it looks however
as if Eddington, in laying the foundations of the theory of radiative
equilibrium in stars, did apparently not fully appreciate the power
of even very slow turbulent convection in transporting energy in
stellar interiors; that he neglected the possible influence of the
surface conditions was perhaps related to the fact that they were,
at the time when Eddington wrote his book, essentially inapplicable
to most stars. +)
This review will be concerned mainly with three developments, which
took place between 1930 and 1946: I) the application to stellar in-
teriors of the convective transport equation developed in hydro-
dynamics, which led to the proof that the adiabatic temperature
gradient is, in the case of thermal instability, a very good appro-
ximation there; 2) the influence of the surface boundary condition in
determining the extent of outer convection zones, which in certain
circumstances may comprise the whole star; 3) the introduction of the
scale height as a measure of the mixing length used in the transport
equation. Their connection with some further developments which may
come up during the present conference can be sketched only very
briefly.
To begin with item I, we note first that observations and measure-
("turbulent) mass ments of transport processes by non-stationary • "
motions in the earth' atmosphere and in the oceans had led, between
1915 and 1925, to a reasonably successful theoretical scheme. G.I.
Taylor (1915) and W. Schmidt (1917) noted++)that the quotient of the
+) Concerning both points it is instructive to reread his discussion of the point source model (ICS, § 91). It should be added, however, that Eddington (as far as the present author was able to find out) fully accepted the change of position which occurred during the period under review in this report. ++) Apparently, as a result of the war conditions, independently, as can be judged from their papers of 1915 and 1917.
flux of some quantity - like the momentum, the salinity or heat - and
the gradient of the content per cm 3 or per gram of the same quantity
led to consistent values of this coefficient, for which W. Schmidt
introduced the term "Austausch" (gr cm-lsec-1). In the special case
of heat flow ("Scheinleitung") the excess of the actual temperature
gradient over its adiabatic value, A?T, multiplied with the specific
heat for constant pressure, has to be used, because only small pressure
differences arise if the Mach number is <<I, as is usually true in the
earth~atmosphere. For interpreting the observed va~s of the "Austausch"
concepts developed by G.%. Taylor and by L. Prandtl n in particular the
"mixing length" were most useful.(These had been introduced first for
the case of dynamical instability.)These concepts relied to some ex-
tent of considerations analogous to those of kinetic theory. The
analogue of the molecule of kinetic theory is an element of the fluid,
which (having detached from its surroundings as a consequence of the
given instability due either to a super-adiabatic temperature gra-
dient or to the dynamical situation, for instance shear) moves as a
whole over some distance until it mixes again with the surrounding
fluid, in Prandtl's original thought the mixing length i was determined
by the geometry of the situation, e.g. the distance from the nearest
boundary or the diameter of the unstable region, and this was used
in the first application to stellar interiors. The somewhat later idea,
to link the mixing length with the scale height, will be discussed
below.
TO determine the velocity it was considered that pressure equilibrium
is only slightly disturbed, whereas - the flow of heat by conduction
or radiation being usually much slower - the temperature of an ele-
ment of the fluid is determined by the adiabatic gradient, such that
in case of thermal instability a rising element is less dense and
hotter than the surrounding fluid, and as a consequence is accelerated.
These considerations led t~ an expression for the velocity (v) of the
moving elements, which can be written in the form (g = acceleration
of gravity)
2 AT g£2 A?T v ~ gi • --~ T
For the convective transport of heat (H K = erg/cm2sec, Cp = specific
heat per gr for constant pressure, p = density) wewrite
H K ~Cp •piv • AVT
v 2
~pv • ~ R
(p - pressure =(R /D)pT; R - gas constant, ~ - mean molecular weight),
suppressing again a constant of order unity.
In meteorology the largest scale for which the transport equation has
been used with at least qualitative success (by A. Defant, see
W. Schmidt 1925), is the meridional transport of heat from the equa-
torial to the polar regions, which raises the average temperatures
of our own latitudes quite noticeably, the meridional heat flux ex-
ceeding the "solar constant" by roughly two powers of ten (as it must).
In this case the elements are low pressure systems which, due to their
rotation, have a certain stability; it happens, that the mixing length
and the pressure, but of course not the temperature and the density,
are comparable to those in the outermost layers of the sun's hydrogen
convection zone.
The application to stellar interiors was contained in a G6ttingen thesis
of 1932 (L. Biermann 1933), which owed a great deal to Prandtl's ad-
vice. For the convective zone around the centre, due to the conversion
of hydrogen into helium (supposed to be highly temperature sensitive
as is true for the C-N cycle), the mixing length was taken to be of
the order of 10.OO0 km, and the flux to be transported assumed to be
of the order of 1012erg/cm2sec, about 15 times that on the sun's sur-
face. For the third power of the velocity a combination of the equa-
tions given above leads to
3 HKg£ R* v p
which was found to be or order 1011 (cm/sec) 3. This then leads to
IAVTI ~ 10 - 8 or ~IO-51VTI
Though there is some incertainty about the coefficients of order
unity contained in the equations given above, it is clear that the
result last stated, that a relatively very small excess over the adia-
batic gradient is sufficient to carry all the flux, is quite insensi-
tige to errors made by the phenomenological theory used here. Two
years later Thomas G. Cowling, who used a slightly different approach
and formalism, recovered the same result. Since 1933/34 it has become
standard practice to use, in theoretical models of star's interiors,
the adiabatic temperature gradient whenever that required for radiative
transport exceeds the adiabatic one +) . In this form the stability cri-
terion was formulated first by Karl Schwarzschild for the solar atmo-
sphere (1906); in meteorology an equivalent criterion had been in use
already many years earlier.
Concerning item 2, we note first, that during the years 1934-38, an
attempt was made to explore the question whether partially or wholly
convective stellar models (not only such with a central convection zone)
lead to a more complex picture of the overall constitution of the stars
than the one given by Eddington (as was finally found to be the case).
Such models would result from a higher luminosity than the radiative
ones, for the same radius and opacity if the luminosity could be re-
garded as an (effectively) free parameter. Near the surface two cir-
cumstances, the second of which was not at once fully appreciated have
to be taken into account: the existence, in all stars with not too high
surface temperature, of convective zones due to the partial ionization
of hydrogen and helium, which had first been investigated by Uns~id in
1930, and (second) the necessity of using in the photosphere of every
star the radiative transport equation together with that of hydro-
static equilibrium, which means that the pressure must be of order
g/c, < being the opacity (gr-lcm 2) of the photosphereic layers - a
relation which of course could be written down with better accuracy.
The importance of this boundary condition for the case under discussion
(L. Biermann 1935) was emphasized by T.G. Cowling in 1936, but its
application was held back for some years by the (then) poor knowledge
of the value of the photospheric opacity for all stars later than
spectral class A.
+) ~The first such model being Cowiing's point source model of 1935.
Shortly before the discovery by Rupert Wildt that the negative hydrogen
ion is the main source of the photospheric opacity in such stars, an
attempt was made (L. Biermann 1938) to use approximate values for the
photospheric opacity based on work of Pannekoek which fortunately led
to approximately correct answers. It was found (a result which was
soon confirmed on the basis of Wildt's pioneering work of 1939, and
subsequent work of others) that in the surface of the sun and similar
stars (with photospheric pressures of the order of IO5dyn/cm 2, such
that the radiation pressure is only ~ 10-5p) convection was likely
to be an efficient mechanism of transport of heat not only in the
main parts of the hydrogen convection zone, but also in its outer
layers; as a consequence the adiabatic gradient should at least appro-
ximately be established in the hydrogen convection zone up to its
outer boundary in the middle or deep photosphere.
In order to i l l u s t r a t e the importance of this result le t us look at a diagram taken from the author's paper of 1938. This diagram, with the
logarithm of the total pressure (P = p+pR ) and of radiation pressure
(pR) as coordinates, shows besides the adiabats ( f u l l l ines +) ) the
lines of constant ratio PR to P (Eddington I-8, weak lines), further-
more the "dominant" ionization potential ~++) and lines for fixed
ratio of ~ to kT, of which the one for ~ = 0 corresponds to the limit
of degeneracy ("Entartungsgrenze"). Near the bottom are shown the
photospheric values of the pressure (for given effective temperature)
for three values of the surface gravity, including that for the sun.
The crossed line marked
log ad
indicates the zone of rapid increase (inwards) of the opacity and as a
consequence of the radiative temperature gradient, and the incipient
decrease of the adiabatic gradient (inwards) due to the additional
degrees of freedom (using the terminology of the kinetic theory of the
+) The branching above logT ~ 5 corresponds to then existing uncertain- ties regarding the chemical composition, especially the value of Z (in the terminology in use now).
++) For which the degree of ionization is ~I/2 according to Saha's formula.
J. !
! ..
...
!
i I
J ...
. !
! |
,i
!
~r
,q
~ i.
.i
~
"V
%
11
specific heat) resulting from the increasing degree of ionization. Above
= 50 eV, the ratio of the radiative gradient to the adiabatic gra-
dient
(d log PR/d log P)R
(d log PR/d log P)ad
is essentially given by the value of 1-B, such that for the special
solution corresponding to Eddington's model, for given mass and lumi-
nosity, and for larger values of I-8, the radiative gradient is smaller
than the adiabatic one and Schwarzschild's stability criterion is ful-
filled. For smaller values of 1-8 the radiative gradient increases,
such that the instability limit is reached soon and the adiabatic
gradient (d log PR/d log P ~ 8/5 for stars of about the sun's mass,
for which I-8 <<I) is the smaller one and convection prevails. +)
It is thus see~)that the surface boundary condition effectively results
in such a relation between PR and P that the total pressure increases
inwards at first considerably faster than the radiation pressure; in
the regions with temperatures around some 104 degrees, I-B has a mini-
mum and the radiative gradient is much larger than the adiabatic one,
the opacity being approximately proportional (I-8) -1 according to
Kramers' law (in that temperature region aotually still larger), where-
as the adiabatic gradient is approximately constant (~ 8/5). With the
increase of I-8 along the adiabates inwards, the stability limit is
gradually approached and in the case of the sun finally reached at
T ~ 106 (with modern values of the chemical composition Eddington's
model would show I-8 ~IO -3 for the sun). It is therefore clear that
in the sun and in similar stars the outer convection zone must comprise
a substantial number of scale heights until the radiative temperature
gradient decreases below the adiabatic one, such that the inner boun-
dary of the hydrogen convection zone should be at a temperature
T ~> 106 and a depth of ~ 1OO OOO km. This result has been confirmed
by the much more accurate computations of recent years. Dwarf stars
of later spectral class should have still deeper convection zones,
+)The presentation attempts to retrace the steps, by which the complete stellar models with deep outer convection zones and the fully convective models were actually found. For a more detailed review see L. Biermann 1945.
12
and it was proposed in 1938 that late type giants might even be fully
coDvective, a result which was recovered many years later by Hayashi.
In retrospect, it is easily seen why all these possibilities had not
been noticed earlier: that convective transport could be efficient up
to almost photospheric layers, was a rather remote possibility on the
background of the earlier theory - though not on that of Emden - and
a quantitative discussion of the power of convective transport was
hindered by the lack of reliable knowledge of the photospheric pressure
(which had been highly underestimated before 1938/39).
The largest uncertainty in transferring the mixing length theory to
astrophysics - our item 3 - is evidently connected with the value of
to be used, under the different circumstances. For the work done in
1938 described above the observed size of the solar granulation had
been taken as a guide; this choice +) whioh fortunately did not intro-
duce serious errors was until 1943 replaced by the answer which has
been the basis of all subsequent work, and which is to use the local
scale height, defined either by the density gradient or that of the
pressure, as measure of ~, such that £ is given:
I I ~Av-~J or Iv--f~j"
Since there should be a nondimensional factor of order unity, which re-
quires separate d£scussion, the two expressions are under most circum-
stances equivalent. The idea behind this choice is that in any case the
largest elements should travel farthest and reach the highest velocity
but that an element of the fluid, after having travelled over a density
scale height, should have changed ~ts shape to such an extent that it
is likely to break up into smaller fragments and to mix with the sur-
rounding fluid. This approximation is of course precisely in the spirit
of Prandtl's original ideas on the subject, and had most probably been
discussed with him. On a slightly different background, an equivalent
proposal had been made by E. 0pik already in 1938~ +)
+)An at least approximate determination of the size distribution func- tion of the solar granulation became possible recently (J.W. Harvey, M. Schwarzschild, 1975), whereas for red giants the observational si- tuation is still less clear (M. Schwarzschild, 1975).
++) The work reported here under items 1) and 2) had remained unknown to ~pik until his work of 1938 was completed, cf. his "Note added in proof" (~pik 1938). For a more recent review of the general problem see M. Schwarzschild, 1961).
13
To conclude these reminiscences I would first like to mention that the
results discussed under item 2), led for the sun to a proposal concern-
ing the old question, why a sunspot is dark, the answer being, that the
strong magnetic fields observed in the umbra should inhibit the convec-
tive transport in the layers underneath the spot+); it has been pursued
by a number of authors up to the present and may come up again at this
colloquium.
Of the various formalisms suggested to improve upon the one given above
for the heat transport, the scheme proposed by E. B~hm-Vitense 1953, 1970
1958 became the most widely used. Much more recently R. Ulrich has pro-
posed still further refinements, which are particularly useful at rela-
tively low densities. All attempts to determine the exact relationship
between the mixing length and the scale height (and/or other parameters)
have not been really satisfactory so far, though comparisons with obser-
vational data on the integral properties of stars (including their
chemical composition) and on the position of the instability strip in
the Herzsprung-Russell Diagram have been used with some success; only
for the sun its known age provides an independent parameter and thereby
a check that the mixing length theory leads to essentially reliable re-
sults (D.O. Gough, N.O. Weiss 1976). It seems that only a deductive
theory of stellar convection would offer the chance to go beyond the
present essentially phenomenological approach used hitherto; at least
one contribution at the present colloquium will, I understand, deal
is the mass density of the gas of quasiparticles and
(7)
r (8) F m = J m-ufd5f~
is the mass flux of the gas. The last term on the right of equation (6)
represents the mass exchanged with the background fluid by entrainment and erosion.
One may compute other moment equations, but we shall not do that here. We
should however mention that the number of moments goes up faster than in ordinary
kinetic theory or transfer theory. Quantities like fluxes of enthalpy and mechanical energy arise and there is the all-important turbulent stress tensor:
Ti j = f mui~jfd5 ~ (9)
Once a hierarchy of moment equations has been written down [a skeleton version
has recently been studied by Stellingwerf (private connnunication)] the problem of
closing it off must be faced. A possible approach, resembling the moment method,
is to decide on an approximation for f and use that guess, for that is all it is
at present, to get approximate expressions for the higher moments in terms of the
lower moments. Once this is done, a last problem of principle remains. One must
81
still decide how to describe that part of the fluid that does not move in
quasipartieles. Should this be thought of as a zero fluctuation condensate of the
quasiparticle gas? Or should one describe the background as an ordinary laminar
fluid acted on by the stresses and so forth generated by the quasiparticles? The
latter course seems decidedly preferable to us, especially for treating penetrative
convection, where most of the matter may be in the background fluid. If that is
accepted, the next course of action is to write the dynamical equations for the
background fluid including the mass, momentum, and energy sources indicated by the
moment equations of the quaslparticle gas. Then, in principle, one has a complete
set of equations for the dynamics of a star with turbulent convection, but for the
present without rotation or magnetic field.
Now we have tO come to the kay question: is this what has to be done or are
we to be saved from it by a 'real theory' starting from the full fluid equations?
We think that the inediate prospects for a sound fluid dynamical approach are not
bright. And even the approximations to such an approach as are on the horizon
promise to be far more demanding computationally than the scheme summarized here.
At present, untold computing hours are being lavished on stellar models using
a mlxlng-length theory whose reliability is untested off the main sequence. It
seems to us that if this situation is to continue it would be well to take the
mixlng-length theory seriously. In particular, one should be clear on the turbulence
model one is using and not simply alter the standard formulae according to whim,
as is often done in the literature. We are not saying that alternative general
structures to that given here may not be preferable. Nor are the procedures we
outline meant to be rigid. The present version of a mixing-length procedure is a
synthesis of ingredients existing in the literature and we have done no more than
put it together to show that a cogent discussion of mlxing-length theory is
possible. We have especially tried to show where the physics is missing and to
indicate a framework for including it. The resulting equations are in principle
capable of dealing with many of the problems of current interest, such as the
nonlinear interaction of pulsation and convection. Those coping with such questiOns
are all too familiar with many of the problems we have raised. But they, as we
ourselves, have sometimes dealt with these problems piecemeal and have not tried to
put them into context by working with a concrete general model. We are claiming
here that the specification of such a model is possible and desirable and that if
one can be constructed, stellar convection theory may begin to seem more rational.
We thank the SRC and the NSF for supporting our work on this subject.
62
REFERENCES
Biermann, L. 1932, Zs f. Ap, 5, 117
Frlsch, U. 1977, these proceedings
Gough, D. O. 1977, Ap J., ~!~, 196
Morton, B. R., Taylor, G. I. and Turner, J. S. 1956, P~0q. Roy. Soc., A23~, 1
Oplk, E. J. 1950, MNRAS, !~, 559
Prandtl, L. 1932, Beitr. z. Phys. d. Freien Arm., !9, 188
Priestley, C. H. B. 1953, Austr. J. Phxs., 6, 279
Priestley, C. H. B. 1954, Austr. J. Phys., Z, 202
Priestley, c. H. B. 1959~ Turbulent transfer in the lower atmosphere (University
of Chicago Press)
Siedentopf~ H.
Spiegel, E. A.
Turner, J. S.
Turner, J. S.
Ulrich, R. K.
Ulrich, R. K.
Ulrich, R. K.
1933, A toN', 247, 297
1963, Ap J., !~, 216
1963, J. Fluid Mech., !~, 1
1973, Buoyancy effects in fluids (Cambridge University Press)
1970a, Ap Sp. Sci., !, 71
1970b, Ap Sp. Sci., !, 183
1976, AP a . , ~ Z , 56~
OBSERVATIONS BEARING ON THE THEORY OF STELLAR CONVECTION II
Erika B~hm-Vitense
University of Washington, Seattle, Wa.,U.S.A.
and
University of C~ttingen
Germany
SUMMARY
It is shown that the best way to get information about efficiency of convective
energy transport in the hydrogen convection zones in stars other than the sun is pre-
sently contained in continuous energy distributions of A and F stars. Scans show that
convective energy transport must be much more efficient than thought hitherto. The scans
indicate that rapid rotation enhances convective energy transport.
Figure I lists all the observations, we could think of, which are influenced by the
outer convection zone and which can therefore give us information for our convection
theory.
I. STELLAR STRUCTURE AND EVOLUTION
Already Hoyle and Sehwarzsehild (1955) pointed out that with increasing efficiency
of convection the radius of a star shrinks.
If we describe the efficiency of convection by the one parameter £, the characteris-
tic length or the mixing length of the convective flow, and if we further assume that
the ratio £/H = ~ is depth independent, where H is the pressure scale height - an assump-
tion that may not be valid, see for instance B~hm (1958), Schwarzschild (1974) - then
this one parameter ~ can in principle be determined from the observed radius of the star.
Unfortunately, the radius depends also on the initial abundances Z, Y and CNO (Iben 1963)
as well as on the age of the star. From Iben's study we see that for given T a ~Iog L e
0.3 is obtained for A log ~ ~ 0.5, but one also finds A log L % 0.3 for a change in
abundances Y and Z by a factor of 2. A similar change in log L is obtained if opacities
are computed with different approximations. Different authors therefore obtain different
values of ~ from the radii of the sun and the stars, ranging from ~ = 0.4 to e = 2.
From the position of the main sequence we can therefore presently only conclude that
log e = 0 ~ 0.5.
l m
ixin
g t
o &
fr
cQ
de
ep
la
ers
tim
e sc
ale
o
f e
vo
luti
on
on
M.S
. ~
- (b
lue
stragElers]
Li,
Be,
B
qe-
C,
N,
O,
J H
e
wh
ite
dw
arf
abu
nd
ance
s
CONVECTION
stratification i~
deeper l
ayers
stratification
(K o
f a
dia
ba
t)
in
surf
ac
e
lay
ers
rad
ius
%-
shap
e o
f th
e K
- tu
rn o
ff
tra
ck
po
siti
on
of
red
g
ian
t b
ran
ch
~-
Tcenter
white dwarfs
(conli.g time) ~
F v
(0
4-
on
set
of
con
vec
tio
n ~
-
B-V
gap
on
M.S
red
b
ou
nd
ary
of
cep
hei
d
stri
p
Tce
nte
r ~f
th
e su
n
ne
utr
ino
s)
sola
r v
elo
cit
y
fie
ld
dam
pin
g of
sola
r o
scil
lati
on
s (G
old
reic
h
+ K
eele
y)
ce
nte
r to
ll
mb
v
ari
ati
on
o
f su
n
ve
loc
ity
~
--
.~
field
T
4--
chro
mo
sph
eric
heating
I em
issi
on
CoII,Mgll
absorption
He 10830
stellar wind
(Vro
t)
curve of growth
osc
illa
tio
ns
--P
~'~
of sun
Vturb
or grad W
solar intensity
fluctuations
+ F,
(T)
->line profiles
llne intensities
va
ria
bil
ity
o
f red
sup
erg
ian
ts
"~
(Se
hw
arz
sch
ild
)
FIGORE !
Ways in wh
ich the effects of t
he outer convection zo
nes ap
pear
in astronomical observations and which
therefor~ can
give us
information ab
out convection theory.
65
The same difficulties apply to studies of the evolutionary tracks which again
depend on Z, Y and = and also C, N, 0 abundances. In fact one would like to know
in order to find the other parameters°
An additional difficulty is encountered when studying H.Ro diagrams:In order to
compare theoretical and observed tracks we have to relate the theoretical parameters
T e and g to the observed ones: color and m or M . The relation between the color and v v
Te, g depends not only on Z, and the CNO abundances, but also on the influence of con-
vection on the observed energy distribution, i.e. the changes of the T (T) relation in
the surface layers. This could possibly be important on the whole evolutionary tracks
for stars of spectral types F and later.
There are several indications that the colors are indeed influenced by surface
convection :
(a) Theoretical color Computations for giants in radiative equilibrium giving
U - B as a function of Z for a given value of B -V show a maximum of U - B for
Z ~ 0o3 ° Z (B~hm-Vitense and Szkody 1974). This maximum is not well seen in the oh- ® servations (Wallerstein et al. 1966).
(b) For intermediate Z values we find a discrepancy between the observed and com-
puted M 4 = (B - V) - (V - r) index (Mannery et al. 1968) for giants in radiative equi-
librium (B~hm-Vitense and Szkody 1974).
(c) Canterna (1976) finds similar problems ~or his metallicity index C - M.
All these discrepancies show that for a given energy distribution in the red there
is less energy observed in the blue and violet region than predicted by the radiative
equilibrium models. Scaled solar Bilderberg models, i.e. models with a decreased tem-
perature gradient in the deeper layers, would ease the problem. It might be emphasized
however, that the discrepancies only exist for giants with 0.I Ze ~ Z < Ze,not for very
metal poor stars. It could,therefore, also be related to an error in the line blanke-
ting computation, for instance a wrong value for the microturbulence.
While the decreased ultraviolet flux is in itself an interesting problem and might
well tell us something about convective overshoot in stars with different Z, it also
tells us that we have to study and understand this convective overshoot before we
can deduce the "observed" evolutionary tracks in the L, Te diagram and proceed to de-
re,mine Z, Y, C, N, O, the age t, and finally e .
We then conclude that the stellar evolution computations require a knowledge of
rather than provide one.
66
II. SURFACE PHENOMENA
A. DIRECT MEASUREMENT OF VELOCITY FIELDS
The most direct observations of convection are the velocity fields which for stars
can only be observed by broadened line profiles. If the velocity field changes over one
mean free path of a photon it will lead to a broadening of the line absorption coeffi-
cient thereby increasing the line width and the equivalent width. If the velocity chan-
ges only over much larger scales it will lead to a broadening of the intensity profile
only and not change the equivalent widths of the lines. According to the influence on
the equivalent width we describe the two effects as micro or macro-turbulence (or possi-
bly rotation). However, we have to keep in mind that an increase in the equivalent width
could also be due to other effects than small scale velocity fields. Also there is no
reason why we should have only small scale or large scale turbulence, in fact we expect
a continuous turbulence spectrum of all scales. (For isotropictu~ulence see the dis-
cussions by H. and U. Friseh 1975, by G. Traving 1975 and by E. Sedlmayr 1975). There-
fore, we have to be careful with the interpretation of the socalled microturhulence.
If we believe that the microturbulence as determined from the equivalent widths by means
of curve of growth analysis really is a measure for the small scale velocity field,
which could be either due to turbulence or to laminar velocity gradients, then we find
the picture first compiled by Wright 1955, see Figure 2.
Generally the microturbulence increases for decreasing densities in stellar atmos-
pheres as is expected for convective velocities: Since the convective flux F c Q. V 3
p = density, V = convective velocity, we expect for a given F ~ F, where F is the total C
energy flux, that V = p-I/3. We do however not observe the decreas£of the microturbu-
fence for hot stars for which F << F is expected. c
A compilation of more recent data especially for giants has been given and dis-
cussed by Reimers (|976).
For supergiants Rosendhal (1970) and van Paradijs (1973) have more. recent studies,
qualitatively confirming the results shown by Wright (]955).
For main sequence stars Baschek and Reimers (1969) did a detailed investigation
especially in order to study possible differences between A m and normal A stars. Chaffee
(1970) extended the study to cooler stars. Andersen (1973) repeated the determination
with the new Fel oscillator strengths (Garz and Kock 1969). The result is shown in Fi-
gure 3. An increase in Vturb is found when going from late F to early F stars in quan-
titative agreement with convective velocities obtained for ~ = |. However, for earlier
stars the expected decrease in velocities is not found. For later stars an increase in
Vturb is suggested - though not observed for the sun - also in contradiction to expec-
tations from convection theory.
67
I / , - -
1I - -
111 - -
I V - -
V - -
X X~&A =x~
A ~
O OO
AA & •
X
O 0 O •
Oo
• & 4,
1
o@
o o
@
• •
o
@
I I I i I I ! I ! 1 I I 05 B0 B2 B5 A0 A5 F 0 F 5 GO G 5 K 0 K 5 M 0
Veloci~ (km./~0
• 1-2
• 2-3
o 4-5
A 5 - 7
• 7 - 1 0
A 10-15
15-20
X 20-30
X > 3 0
FIGURE 2:
Microturbulent velocities for stars of different spectral types and luminosity classes as given by Wright (1955).
68
70
At• A ~ •
Q &
i % o16
! ...... I ......... I ..... I I f
O O ~ OO
• • ® @_..
I 1 I ,,I,, I 1 t 0.7 0.8 0,9 :~.o
FIGURE 3 :
Microturbulent velocities for main sequence stars of different effective temperatures according to Andersen (1973). The open symbols refer to previous investigations by Baschek and Reimers (1969) and Chaffee (1970) using old oscillator strengths, the filled symbols to Anderson's determination with the Garz and Kock (1969) oscillator strengths. V, ~ refer to A m stars.
69
Another puzzle is provided by the measurements of Allen and G~eenstein (1960) and
Wallersteln (1962) showing that in Pop. II dwarfs Vturb ~ 0, a result which is certain-
ly not expected from convection theory , but these studies will have to be repeated with
new Fel oscillator strengths in order to be sure. Reimers (1976) attributes the increase
of Vturb for late type stars to possible measuring errors. Baschek and Reimers (1969)
suggest that for the A stars the high Vturb is caused by a large number of pulsation
modes similar to the ones studied recently by Lucy (1976) for e Cyg.
In short, measured values of Vturb sometimes do and sometimes do not agree even
qualitatively with exceptations from convection theory, indicating either that our
expectations are sometimes quite wrong, or more likely, that the measured microturbu-
lence has quite often nothing to do with convective velocities. How then do we know
when they do and when they do not ?
Even more difficult is the judgement of the observed depth dependence of the micro-
turbulence (Huang and Struve 1952, Rosendahl 1970). In general there does not seem to
be any observed contradiction to the assumption that for other stars the depth depen-
dence of Vturb is similar to the one observed for the sun.
B. INDIRECT MEASUREMENTS OF VELOCITY FIELDS BY MEANS OF ATMOSPHERES
AND CORONAE
(a) Chromospheric emission :
It is general belief that for the formation of classical solar type chromospheres
a velocity field is a necessary condition. We do not know whether it is also a suffi-
cient condition. The different strengths and the age dependence of the Call K 2
emissions for otherwise similar stars show the importance of a second parameter, pro-
bably the magnetic field. The absence of chromospheric emission may therefore not be
proof of the absence of a velocity field, only, if for a given spectral type we never
find chromospheric emission, I would believe this to indicate the absence of efficient
convection.
Chromospheres in cooler stars are seen by means of CaII K 2 and MgII h and k
emission or by the ]0830 line of HeI in absorption. O.C. Wilson (1976) has made ex-
tensive studies of the CaII K 2 emission in G and K stars. His results are shown in
Figure 4. In the same graph I have also plotted the bluest stars that have been ob-
served to show Mgll emission and I0830 HeI absorption according to Zirin (1975). There
appears to be a line in the HR diagram on the blue side of which the chromospheric ac-
tivity seems to stop. In the low luminosity part is not quite clear to me whether the
Call K 2 emission stops for somewhat more red stars than the Mgll emission. If so, it
~uld be an effect of the larger abundance and ionization energy of Mgll. If they
stop at the same time itshould indicate a cause different from ionization.
In the same graph I have also plotted the reddest Pop. I Cepheids according to
Sandage and Tammann (1974).
70
:E
- 6
- 4
- 2
0
2
4
GO ' I ' ' ' I ~ ' ' l l l l l ' I ' ' " ............... i ......... ' I
• • o " °" ° ~ ° . " "" ":" " -
• " " ° ".i ea=e/7 ; ° , . . . . . . - / -
o ° o . . , . . . = ~ .
/ . : . x o . . . . ~ :. . . . . . '~4,~n,:.~.:
/ - : ~ ~ . . . .
. , . - " X ~ . " , ' . : : . , . . . , , , ,
'2.00 0.40 0.80 1.20 1.60 B-V
+ S T A R S W i T H M g l l E M I S S I O N x B L U E S T S T A R S W I T H H e A B S O R P T I O N L I N E S
• R E D C E P H E I D S
FIGURE 4:
The color magnitude diagram for G and K stars with CalI K 2 emission (ooee) taken from Wilson (1976). We have added an additional point for Procyon (FOIV). We have also added ++ for the bluest stars observed to have MgII h and k emission and xx for the bluest stars showing He absorption lines (Zirin 1975). Also shown are the positions for Cephelds close to the red boundary of the instability strip. The straight line roughly marks the boundary for stars with or without observed signs of solar type chromospheres.
For the higher luminosities the red boundary of the instability strip appears to
agree roughly with the boundary line for Call and Mgll emission. There is of course
Call K 2 emission observed for some Cepheids and also for @ Cyg but this is supposedly
due to shockwaves created by pulsation. The agreement of these two boundary lines is
not surprising since we see no other reason fo= the breakdown of the pulsational in-
stability but the onset of efficient convection which reduces Frad, thereby reducing
the driving force.
Since the theoretical line for the onset of efficient convective energy transport
depends on ~ we can check which e should be chosen to make the theoretical and observed
boundary lines agree. Assuming that ~ is the same for all stars in this region - an
assumption which has been criticized by Schwarzsehild (1974) - we found agreement for
=0.5, 2, 3 or 5. This can be seen from Figure 5 (B~hm-Vitense and Nelson 1976).
(If e should not be same for all stars, then £ = R 2 appears to be also a possibility
except for la supergiants.)
As already noted earlier, the extension of the instability strip boundary reaches
the main sequence at about FO or B - V ~ 0.3. So F stars would be expected to have
efficient convection, while A stars would not, but they appear in the extension of
the instability strip, as mentioned earlier.
(h) Stellar rotation:
It has been suggested that stellar rotation will be braked by means of the stellar
wind and the magnetic field. (See for instance Kippenhahn ]972; further references are
given there.) Since stellar winds for later type stars are due to the presence of co-
ronae which are linked to stellar convection zones,the decrease of the rotational ve-
locities for the F stars may also mark the onset of efflc~ant convection. In Figure 6
(B~hm-Vitense and canterna 1974) we show the dependence oi the rotational velocities
on B - V for main sequence stars for different clusters. For field stars there is a
drop in v sin i for B - V ~ .25~ a second drop seems to appear for B - V ~ 0.4. r
For some of the clusters the drop at B - V = 0.40 is the more pronounced one. Apparant-
ly the final drop in v sin i does not occur where we expect convection to set in but r
only for cooler stars. It seems to occur at temperatures where the hydrogen and helium
convection zones merge.
C. TEMPERATURE INHOMOGENEITIES
Convective temperature inhomogeneities are expected to be largest for F stars.
We might look for evidence in the integrated light.
In ~igure 7 we compare continuum energy distribution of stars whose surface is
assumed to be half covered with an atmosphere with T e ~ ~ 100 ° and half with T e = 8340°.
The average would be 7500 °. The resulting energy distrib ion would appear as that of
an atmosphere with T = 7750 °. The inhomogeneous atmos ~ere~ therefore~ would re- e
72
I I ' H I I, I I ' I ~ i
0 • J- , H~z
5
c:) o,°~, //o "(~-)
..o/ I T ,I , I
7 0 0 0 6 0 0 0 5 0 0 0
,°, °°*' / /
• .00a L 0
| , ~ l . - - ~ ! 6 0 0 0 5000 ?000
FIGURE 5 :
Taken from B6hm-Vitense and Nelson (1976) this figure shows a comparison of theoretical and observed (---- or - - ) boundary lines for efficient
convection in the luminosity T e diagram (T e = T'-400 +_ 150~. Different values for the ratio of the mixing length £ to the pressure scale height H were
assumed, symbols are given in the graph. To obtain the points in Figure a we required that £ -- < 21 D for a con-
sistent theory, where D is the thickness of the unstable layer. The theoreti- cal ( .... ) and observed boundary lines agree roughly for £ = H.
For the points in Figure b we assumed that ~ < DF, where D F is the ex- tent of the zone where F c > 10 -2 - F, where F is the total flux and F c is the convective flux. No agreement between observed and theoretical boundary line can be found for any value of </H.
In Figure c we required i= D I at the boundary, where D 1 is the extent of layer for which V - V' > O. I, with the usual notation. No agreement be- tween theoretical and observed boundary can be found.
In Figure d we required £ = D at the boundary, where D m is the extenc
of the layer for which V - V' = 0.~ (V - V')max, where (? - V')ma x is the maximum value of V - V' in the convection zone under consideration. No agree- ment can be found between observed and theoretical boundary lines.
For details of the derivation see B6hm-Vitense and Nelson (19761.
73
3 0 0
2 0 0
I00 ¢.l (g
3o °"
= 2 0 0 b~
I00
30 O"
2 0 0
I00
(0) o F IELD STARS
!%~', O e 0 0
"~ e • •O
(c) PRAESEPE
ee = e •
e ~ • e e ~ o
e • o e • •
• • •d l , e O N ,, ,* ~,
' (e ) PLEIADES "
• ; ; .
e
. o • •
o ~ ' = ' I " I t I I | ,
O 0 0.1 0 .2 0.5 0 .4 0 .5
J I ] I I
(b ) COMA BERENICES
e • • •
(d) HYADES-
• ..- .
"., ,,. ,- • e e • e o
• •
(f)
e
e
= PER
0
B-V
o• e
%
I ! [ ...... d ~ • I
O.I 0.2 0.3 0 .4 0.5 0.6
FIGURE 6 :
The rotational velocity for field stars and for stars in different
clusters. TWO discontinuities in the velocity distribution of field stars are
suggested, one at B - V ~ 0.25 and the other at B - V = 0.40. For the clusters
the decrease of vsin i at B - V % 0.40 is more pronounced. The figure is
based on measurements by (a) : Abt and Hunter 1962 and Slettiebak 1955; (b)
and (d): Kraft 1965; (e) : Dickens et al. 1968; (e) : Andersen et al. 1966;
(f) : Kraft 1967.
74
o
un
0~1 I
I0.0
10.5 -
II . 0 -
HOMOGENEOUS RADIATIVE T, .-' 7 6 0 0
I I I
T e ~ 8 0 0 0 (,% log Fv:+ 0 .12)
i I I I 3.6 3.7 3.8 3.9
log X
FIGURE 7 :
The continuum energy distribution of a star whose surface is half cover-
ed with an atmosphere with T e = 6800 ° and half with T e = 8340 ° is compared
with the continuum energy distribution of stars with T e = 8000 ° (shifted
by Alog F v = 0.12) and T e = 7600 ° . The energy distribution of the inhomo-
geneous star is indistinguishable from that of a homogeneous star with
T e ~ 7750 ° .
75
semble one with a temperature about 250 a higher than its actual T e
In Figure 8 (from B~hm-Vitense 1972) we compare line profiles of a homogeneous
star and an inhomogeneous one with AT = ~ ]000 °. The T e and the corresponding frac-
tions of the surface area on the inhomogeneous star were chosen in such a way that in
Hy the~lines agree for both the homogeneous and the inhomogeneous stars. The B - V
colors then turn out to agree also. One finds that, because of the larger contribution
from the hot component, the spectral lines for the inhomogeneoua star are generally
weaker by ~ ]0% than those for the homogeneous one. The effect is especially strong
for the Ca lines,which are reduced by about 30%. One is reminded of the A stars oc- m
curring in this temperature range which show weak Ca lines, however for the A m stars
the other metallic lines appear generally stronger.
D. THE INFLUENCE OF CONVECTIVE ENERGY TRANSPORT ON THE CONTINUOUS
ENERGY DISTRIBUTION
The major influence of convection on the stellar structure is due to the convec-
tive energy transport which reduces the temperature gradient. If ~is happens in visible
layers then we might be able to see changes in the continuum flux distribution. F stars
are especially promising since the instability sets in rather high regions of the at-
mosphere and we have a pronounced minimum in the continuous absorption coefficient K C
longward of X = 3647 ~, where we can see down into the atmosphere tot = 2.
In Figure 9 I have plotted temperature stratifications for various depth dependen-
ces of the radiative flux F r = F - F c. Also shown are the corresponding energy distri-
butions in the visual region for stars with T ~ 7900 °. e
The curve shifted upwards (separate scale) shows the depth dependence of the ra-
diative flux as computed with the mixing length theory of convection.
The model with the steepest decrease in F r is a scaled solar, the socalled Bilder-
berg model. In an earlier study we had found (Bbhm-Vitense 1970) that the UBV colors
of main sequence F stars could not be obtained for radiative equilibrium models but
that scaled Bilderherg models would do fine. The other models were computed in order
to see whether we could reproduce the observed energy distributions with less ex-
treme reductions of the radiative flux. Little change from radiative equilibrium F
is obtained for these models.
It is interesting to note the very sensitive dependence of the surface tempera-
tures on the depth dependence of the convective flux. A low surface temperature as
obtained for one of the models should be apparent in the line spectrum which should
possibly look generally like that of a metallic line star.
From mixing length theory we expect a rather abrupt onset of convection for
T e ~ 8000 a. When using scaled Bilderberg models for convective stars we find a color
change of A(B - V) = 0.07 (B~hm-Vitense 1971). A similar value was also obtained by
Matsushlma and Travis (1973) with their nonlocal theory of convection if they use
76
,,, .... ~ i ..... ~ ...... f, +, , , i ~ ~ " 7
,.o,~// ? .oo~// ? '~W/ ~ "°° ¢I I /,7--°? ..~,y-t,:oo~ it~oo t k oo.
.<. ,&,oo~;;;~oooo ¢ A,oo-- ,X/,,o~t I x / , "" / .,[.__.J/,,oo, -Ti-"J,,oo, ~ ItD,oo,-; iJ/-,,oo, t
, I l l l ,~. I'~, 25, . . . . . , , , , , ,,,1" , f 9 ~,1", I , i l l , , , i l , , , ' l ..7,,o+jf ~,,ooo+;/~ ~ ~-, ,<>o_+'-, '~;t • ' 7 h , , - ,4/.-o • ,,~/~,,oo. t-J~,,°°, t__,Yk,,°°. :-__5/~,,°°. t " -,~--~,, ,. 9 < ~ : ~ ..... Ro:: ~ , ~ G , : - ,', , , :'9:1 O' .02 .04 ,06 .08 0 ,OZ .04 .06 .08 0 .02 .04 .06 .OB 0 .Oa .04 .06 .08 1.0
,,~,[~]
FIGURE 8 :
The line profiles of convective stars with AT e = ~ iooo ° are compared with those of a homogeneous (i.e. AT e = O) convective star and a star in radi- ative equilibrium, The inhomogenous star shows generally somewhat weaker lines especially of Calcium. The figure was taken from B6hm-Vitense 1970.
77
! o m *< t -
0 w !
0 K
16
/ °
_~~....__../ (o) • 6-
" I O ~ ! . 0 5 I O 4 , , - - P - " I . I I I I I i
5x lO "4 I0 " z 0.1 0 .67 3,5 21.1
-~ ,-~'
- ----"~-x~¢,,-- -
4 ,0" ,0~\ 10 (b) 2 I I I II I\ I m
3 ~lSJlO - 4 tO "2 0.1 0 .67 3.5 21.,
I.C ~ C _o ~
3.6 3.7 ~.e 3.9 log ),
FIGURE 9 :
In part (a) we give the temperature stratlfications for several models, in part (b) the corresponding depth dependences of the radiative flux and in part (c) the corresponding continuum energy distributions. The ooo in part (a) refer to the radiative equilibrium model for the stable layer and mixing length model for the convective layers. The corresponding depth dependence of the radiative flux is shown by the displaced curve (b) (with sparate scale). The long dashes refer to the scaled Bilderberg model. The other curves refer to trial models that were computed in order to see whether we could also ob- tain a reduction in the energy emitted Just longward of the Balmerjump with a smaller decrease of the radiative flux in layers ~ > I. This does not seem to be possible.
78
= | ( which however,leads to difficulties with the observed solar center to limb
variation). This abrupt color change should he observed as a gap in the observed B - V.
The presence of a gap at 0.2 < B - V < 0.3 for field stars was noticed by Mendoza
(1956). He also noticed that this gap is not present for Pleiades stars. Figure ]0
shows that this gap is indicated more or less pronounced in different clusters (B~hm-
Vitense and Can terna 1974), though its position changes slightly for different clusters.
We suspect that rotation my have influence on the onset of convection. If this inter-
pretation of the gap is correct, then field stars with B - V < 0.22 should be in ra-
diative equilibrium, those with B - V > 0.29 should be influenced by strong convec-
tion, leading to the color change.
Figure 11 shows the result of observations for field stars by Oke (1964), for
Hyades stars by eke and Conti (1965), and by Baschek and Oke (1965) for A m stars.
In the same Figure we compare these scans - corrected for the change of absolute
calibration (Oke and Schild 1970 and Hayes 1975) and for line absorption - with con-
tinuum energy distributions for radiative equilibrium models and for scaled Bilder-
berg models.
The right hand side of Figure 11 shows the result for the Hyades stars. Except
for the small deviations around 4000 ~ the scans show good agreement with radiative
equilibrium models (solid lines) for B - V < 0.2 and agreement with the scaled Bilder-
berg models (dashed curves)for B - V > 0.3 as we expected from the study of the colors.
The left hand side shows the results for the field stars, which display the gap
very clearly at B - V = 0.22. Unexpectedly we see the influence of convection already
for ~ Arl with B - V = 0.14.
Figure 12 shows the results for A stars. The cooler ones clearly fit convective m
models and not radiative ones,
We have made additional scanner observations of main sequence H~ades stars with
different rotational velocities (B~hm-Vitense and Johnson 1977). In Figure 13 we see
the results. Weather and instrumental problems reduced the accuracy of our Hyades
observations, but we can notice some interesting results: our bluest star is p Tau
with B - V = 0.24) i.e., at a B - V where the field stars show the gap. Unfortunately
we have only rather poor measurements for that star so our conclusions are somewhat
shaky but it seems this star shows effects of convective energy transport.
For 57 Tau we have plotted both Oke's and our 7all and spring measurements in
order to give an impression of the uncertainties in the observation which Conti and
eke estimate for their measurements to be of the order of om.02. For the Hyades our
uncertainties may be larger. We think that also for 57 Tau th~ ccnvective energy dis-
tribution fits better than the radiative one, though it is not quite conclusive.
For the field stars our scanner results are seen on the left hand side of Figure 13.
For stars with B - V ~ 0.3, all these stars clearly show the decrease in the UV as
given by scaled Bilderberg models in indicating very efficient convective energy
7 9
"°"o,° . t .... . 1
, , . , , . . . . . . . O if, , ., ' F ." .. • "..:" |2 FIELD . "" _ .
The two color diagrams for field stars and different star clusters. This
figure is taken from B6hm-Vitense and Canterna (1975) and is based on mea- surements by (a): Johnson and Morgan 1953; (b): Johnson et al. 1966; (c): Str~mgren and Perry 1965; (d): Johnson and Knuckles 1955; (e): Johnson |952;
(f): Johnson et al. 1962; (g): Johnson and Mitchell 1958; (h): Mitchell 1960. The gap for O.22 < B - V < 0.29 is very pronounced for field stars. It
is present in most of the clusters, though not visible for the Pleiades.
80
3.6
3.8
4.0
(.3 4 .2 +
~ 4.4 0
4.6 O4 I
4.8
5.0
5.2' 3 .6
- "e -
~ ' - [ ~ 8 7 0 0 q . h= A I , O . O I ~e( "r ~%= "~ . ~-..'~-~kq. ~ , ~-~ ~ )r "%~ ~ . K Teu. 7 7 . ~ O o o ~ i ' ~ , ° . . . . , ~ . - - : ~_ ~ . ~AT, 0,4 - ~ - . ~ , , o . ' ~ ~ ~ ' ~ ~8 ,ooo - ~ . ~ . _ - , ~ . ~ . . . . _
o. s . , . ~ ~ , .E ( o - s , , , . . . 7 , o o ° . . ~ l ' r ~ . . . ~Oo- FO, 0.33 ~ ~=-ru , v.~%_
" ' - AVERAGE" 7 6 Tou, 115 I I 1 I~ I I LINE 1 F 0 , 0 . 3 2 I
3.7 3.8 3.9 4.(J3.6 3.7 3.8 3.9 4.0 log k
FIGURE 11:
Shows scanner observations ( .... ) by Oke (1964) and by Baschek and Oke (1965) corrected for line blanketing and corrected for the new calibration by Oke and Schild (1970) and Hayes and Latham (1975). The points (-) and • de- monstrate the difference obtained for the continuum with different measured line blanketing corrections. Also shown are the computed continuum energy distributions for radiative equilibrium models ( ) and for scaled Bilderberg models ( .... ). For the Hyades stars, shown in the right hand column, radiative equilibrium models can represent the observed distribution rather well if B - V < 0.22. For larger values of B - V a reduction of the flux for
~ 4000 ~ is ~pparent indicating a flat temperature gradient in the layers T ~ i. For the field stars shown on the left hand side the violet flux is reduced already for 8Ari with B - V = O.14 and I Ps¢ with B - V = O.19. The energy distributions can be represented quite well with scaled Bilderberg models, i.e. models with an unexpectedly large convective energy transport in layers with ~ ~ I.
The number given beside the star name gives the rotational velocity vsin i, the number beside the spectral type gives B - V.
81
4,0
4"11 4.2
4.3
4.4
° 0 ÷ 4
~ 4.6 _o ~. 4.7 N
I 4 . 8
~e ~dO0, 60 Leo
900 15 " 0.20
m , , , . , . o
f.. 7~0 63, Tou 0.30
4,9 ~ -OC--" -- -- 1 _-- -. 5.0 ~q .
rUMo 5.1 r . 0.35
5 2 / I ! 3.6 3.7
log k
| I" 3.8 3.9
FIGURE 12 :
Shows the A m star scanner observations by Baschek and Oke (1965), cor-
rected for line blanketing and the new calibration, (notation as in Figure 11). The bluest star, 60 Leo, can in the average be well represented by a
radiative equilibrium model. (A discontinuity might be suggested at a wave- length, where a discontinuity in the OI continuous ~ occurs). In 15 Vul with
B - V = o.20 some convective energy transport may be present. For 63 Tau and T UMa the flux reduction in the violet is even stronger than predicted by the scaled Bilderberg model, however, the line corrections may be some-
what uncertain. For A m stars convection appears to become important for about the same
B - V as for normal stars.
82
transport in the top layers of the convection zone. For the field stars in the gap
(0.22 < B - V < 0°29) we can almost match the observed energy distribution with ra-
diative equilibrium models except for the sharp downturn just longward of the Balmer-
jump. It seems they try to have convection like Hyades stars but do not quite make it.
I do not understand this difference between the Hyades and the field stars.
In the last Figure 14 we have plotted the T , derived for the different stars e
by these comparisons of scans and computed energy distributions as a function of their
B - V given in the literature. Also given are the spectral types and the VrSin i. The
filled symbols indicate stars matching convective models, the open ones radiative
equilibrium energy distributions. Stars for which the decision could not be clearly
made are given in brackets. They are mostly the field stars in the gap. If we leave
out these uncertain ones, then we see two sequences, one for radiative and one for
convective energy distributions. The stars with high VrSin i occur exclusively on the
convective branch. I would interpret Figure 14 as telling us that generally convection
will become efficient for B - V > 0.22, however fast rotation will cause an earlier
onset of efficient convection leading to a reddening of the star by A(B - V) = 0.07
as given by the scaled Bilderberg models.
Could the decrease of the flux longward of 3647 ~ be a direct result of rapid ro-
tation without involving convection? Collins's results (1965) show that such an effect
can only be expected if the star rotates close to the Roche Limit and if at the same
time we look almost equator on, i.e.~ sin i ~ ]. For A stars this should lead to
v sin i ~ 350 km/sec, which is much larger than the observed values. r
Also indicated in Figure ]4 are the values for the A m stars. After correcting
B - V for the additional line blanketing - Baschek determined A(B - V) ~ 0.05-0.07 -
they fall on the same two sequences defined by the normal stars. There does not seem to
be any difference with respectto convectiono I might mention that we c~zmot reproduce
Baschek's and Oke's scanner measurements for any of the A m stars which we measured,
namely ]5 Vul, T UMa, 60 Leo, even though we do reproduce the energy distributions of
normal stars~ except for some minor discrepancies for 45 Tau. We are inclined to con-
clude that the A stars are all variable on time scales of the order of decades, a m
timeseale that reminds one of the solar cycle. We are presently checking on the
variability. This result is only preliminary.
III. SUMMARY
We have pointed out that stellar evolution computations presentl F need a good con-
vection theory rather than give us relevant information. The measured micro-and macro-
turbulent velocities may tell us something about convection, but we do not really know
when. Temperature inhomogeneities are hard to measure. The continuum energy distribution
in the UV for stars with B - V ~ 0.30 clearly shows the effect of a reduced temperature
83
5.6
5.;
4.8
o ~
L~ 4.4
l
4.0
, , , , ~ ' , i , , , ~ , , I , - ~__~a3oo~ ~...~-.~. ~o.o
~ooo. 5"~ CAq, ,s0 ~ "
;;'._;o~%~oo -%-~,<25 ~ o ; ~"',.. ,.u.~,.5 ~ ~ • /'.~ - . ~ , u - " " - - ' l = ~ F 2 0 . 5 6 " ~ - ' ~ " ~ " ~ A , ¢ , - , : ;n -'~OO.W'x%&.~STTou 100
i l i I i i i ¢ ~ I i I ~ ' ' , , , ~ ' ~ , 3.6 3.7 3.8 3.9 4 .0 " 3 6 3 7 3,8 3,9 4 .0 '3.6 3,7 3,8 3,9 4 , 0
~og X
FIGURE 13:
Shows the scanner observations by B~hm-Vitense and P. Johnson (1977). For 57 Tau we compare 2 sets of measurements of B6hm-Vitense and Johnson with those of Baschek and Oke to give an impression of the uncertainties. ~Indieates stars for which we have only 1 or 2 usable scans. Otherwise nota- tion as in Figure 11.
The scans confirm our previous conclusions. The stars in the "gap" i.e. with 0.22 ~ B- V ~ 0.29 show evidence,of convection, the field stars as well as the Hyades stars. HD 27429, a rapidly rotating star, shows an especially large reduction of the blue and violet flux.
Gives the relation between T e and B - V as determined from the compari- son of computed and observed energy distributions. Filled symbols indicate matches with scaled Bilderberg models, i.e. convective models, open symbols
matches with radiative equilibrium models. Dubious cases are given in brackets, they always appear twice, once matched with a convective model and once with
a purely radiative model. The spectral types and the rotational velocities VrSin i are given for each star. Leaving out the dubious cases we find that
for B - V ~ 0.30 all stars are convective. For smaller B - V we find two branches: One for radiative equilibrium stars and one for convective ones.
The high VrSin i occur all at the convective branch, indicating that rapid rotation appears to enforce convective energy transport in the late A stars
rather than to inhibit it as was previously suggested (B6hm-Vitense and Canter-
na 1975).
This conclusion rests on the interpretation of the somewhat uncertain energy distributions of p Tau, 69 Tau and 57 Tau.
+r refers to radiative equilibirium A m stars, +c to convective A m stars. After correcting B - V for the additional line blanketing they fall on the same two branches as the non A m stars.
85
gradient in layers ~ ~ 2/3 indicating an unexpectedly large convective energy flux in
these layers. Stars with B - V < 0°2 and VrSin i < |00 km/sec mostly appear to be in
radiative equilibrium. Stars with 0.10 < B - V < 0.30 may be convective, they are al-
ways convective if they have large v sin i. r
The point B - V = 0.30 also marks the extension of the red boundary line of the
instability strip to the main sequence. We take this as an additional evidence that
this boundary line actually marks the onset of convection in the HR diagram. With local
mixing length theory I/H = ] leads to agreement between the theoretical and observed
boundary lines for convection, neither larger nor smaller values for I/H will do.
The author's research described in this review was made possible by an NSF grant,
which is gratefully acknowledged. I am also grateful for a U.S. senior scientist award
from the "Alexander yon Humboldt Stiftung".
REFERENCES:
Abt, H. and Hunter, J.: 1962, Ap. J. 136, 381 Allen, L.H., Greenstein, J.L.: 1960, Ap. J. Suppl. ~, 139 Andersen, P.: 1973, P.A.S.P. 85, 666 Andersen, C.M., Stoeckley, R. and Kraft,R.P.: 1966, Ap. J. 143, 299 Baschek, B. and Oke, J.B.: 1965, Ap. J. 141, 1404 Baschek, B. and Reimers, D.: 1969, Astron. and Astrophys. ~, 240 B6hm, K.H.: 1958, Zs. f. Ap. 46, 245 B6hm-Vitense, E.: 1970a, Astr~n. and Astrophys, 8, 283 B6hm-Vitense, E.: 197Ob, Astron. and Astrophys. 8, 209 B6hm-Vitense, E. and Szkody, P.: 1974, Ap. J~ 19~, 607 B6hm-Vitense, E. and Canterna, R.: 1975, Ap. J, 194, 629 B6hm-Vitense, E. and Nelson, G.: 1976, Ap. J. in press B6hm-Vitense, E. and Johnson, P.: 1977, in preparation Canterna, R.: 1976, private communication Chaffee, R.H.: 1970, Astron. and Astrophys. 4, 291 Chandrasekhar. ~. :1961>,Hydrodynamic and Hydromagnetic Stability (Oxford,
Clarendon Press) p. 135 Collins, G.W.: 1965, Ap. J. 142, 265 Dickens, R.J., Kraft, R.P.~ and Krzeminski, W.: 1968, A . J. 73, 6 Frisch, H. and U.: 1975, Physique des Mouvements dans les Atmospheres
Stellaires (Centre National de la Recherche Scientifique, Paris 1976)
Garz, T. and Kock, M.: 1969, Astron. and Astrophys. ~, 274 Hayes, D.S. and Lantham, D.W.: 1975, Ap. J. 197, 593 Hoyle, F. and Schwarzschild, M.: 1955, Ap. J. Suppl. 13 Huan9, S. and Struve, O.: 1952, Ap. J..116, 410 Iben, I.: 1963, Ap. J. 138, 452 Johnson, H.L.: 1952, Ap. J. 11_~6, 640 Johnson, H.L. and Morgan,W.W.: 1953, Ap. J. 117, 313 Johnson, H.L. and Knuckles, C.F.: 1955, Ap. J. 122, 209 Johnson, H.L. and Mitchell, R.I.: 1958, Ap. J. 128, 31 Johnson, H.L., Mitchell, R.I. and Iriarte, B.: 1962, Ap. J. 136, 75 Johnson, H.L., Mitchell, R.I., Iriarte, B.;and Wisniewski, W.: 1966, Comm.
Lunar and Planet. Lab. No. 63 Kippenhahn, R.: 1972, In Stellar Chromospheres, Proceedings of ~ASA Collo-
quium.
86
Slettebak, A.: 1955, Ap. J. 121, 653 Str~mgren, B. and Perry, C.: 1965, unpublished report, Institute of Advanced
Study, Princeton, N.J. Traving, G.: 1975, Physique des Mouvements dans les Atmospheres Stellaires
(Centre National de la Recherche Scientifique, Paris 1976) Wallerstein, G.: 1962, Ap. J. Suppl. ~, 407 Wallerstein, G. and Helfer, H.L.: 1966 Ap. J. 71, 350 Wilson, 0.C.: 1976, Ap. J. 205, 823 Wright, K.O.: 1955, Transactions of the IAU, IX, 739 Zirin, H .: 1975, Reprint, Hale Observatories (Carnegie Institution of
Washington, California Institute of Technology, BBSO No. O150)
Kraft, R.P.: 1965, Ap. J. 142, 681 Kraft, R.P.: 1967, Ap. J. 148, 129 Lucy, L.B.: 1976, Ap. J. 206, 499 Mannery, E.J., Wallerstein, G. and Welch, G.A.: 1968, Ap. J. 73, 548 Matsushima, S. and Travis, L.D.: 1973, Ap. J. 181, 387 Mendoza, E.E.: 1956, Ap. J. 123, 54 Mitchell, R.I.: 1960, Ap. J. 132, 68 Oke, J.B.: 1964, Ap. J. 140, 689 Oke, J.B. and Conti, P.S. : 1966, Ap. J. 143, 134 Oke, J.B. and Schild, R.E.: 1970, Ap. J. 161, 1015 Paradijs, J. van: 1973, Astron. and Astrophys. 23, 369 Reimers, D.: 1976,Physique des Mouvements dans les Atmospheres Stellaires.
(Centre National de la Recherche Scientifique, Paris 1976) Rosendhal, J.: 1910, Ap. J. 16__~0, 627 Sandage, A. and Tammann, G.A.: 1971, Ap. J. 167, 293 Schwarzschild, M.: 1975, Ap. J. 195, 137 Sedlmayr, E.: 1975, Physique des Mouvements dans les Atmospheres Stellaires.
(Centre National de la Recherche Scientifique, Paris 1976)
DYNAMICAL INSTABILITIES IN STARS
P.LEDOUX
Institut d'Astrophysique
Unlversit~ de Liege
SUMMARY
The linear dynamical instability at the origin of convec-
tion in stars is reviewed and shown to depend essentially on the sign
of
A = ] d__~p ! dp p dr rlp dr
which is the usual argument of convection criteria. The case of two
or more superadiabatic regions separated by subadiabatic ones might
well deserve more detailed attention.
Once this instability is partially removed by the setting
in of convection its effects must be balanced by dis ~pation terms
if a stationary state is to result. This yields the value of a Ray-
lelgh number.
If energy generation is included in the non-c0nservative
terms, possibilities are somewhat enriched including a case of dyna-
mical instability in presence of A<0 (usually stable) but very small
in absolute value.
I. THE GENERAL PROBLEM
In the context of this conference we are not interested in
dynamical instability towards purely radial mQdes since convection
cannot manifest itself through these modes. We are thus left with the
problem of the response of the star to non radial perturbatius which
88
we shall assume very small to allow a linear treatment.
Of course, non linear effects may be of very great interest
and importance and are at least partially included in some of the ap-
proaches to steady convection as, for instance, in the mixing length
theory or various numerical attempts usually with other slm~lifylng
hypothesis and simple geometry (of. Spiegel ]971, 1972 Nordlund, 1976).
One of the most recent and most direct attaeksin general stellar cir-
cumstances is due to Deupree (1974, 1975 a-b, 1976). As far as I am
aware it has not revealed new instabilities such as may occur for
instance in metastable situations. It has not either restricted the
domain of significant dynamical instabilities but it has yielded
interesting information on the development of these instabilities
such for instance as a strong asymmetry between upward and downward
motions.
The study of non radial stellar oscillations does not go
far back. If we exclude Lord Kelvin's discussion of the homogeneous
incompressible sphere (Thomson, ]863) and some more or less timid
references by Moulton (|909) and Shapley (1914), the first significant
paper is that of Pekeris (|938) in which he solved the problem of the
non radial perturbation of the homogeneous compressible s~he~e.
Pekeris used the usual separation in time and spherical
coordinates of the Euclidian perturbations f~(pt,0', T',~') and of
the radial component of the displacement ~r
f'(r,O,~,t) - f'(r)pm(cosS)eim~e i~t -£~m~£
He showed thatp for each value of the degree ~ of the sphe~cal har-
monic~ apart from a positive spectrum with an accumulation point at
infinity corresponding to the pressure modes (or p modes, a~), there
89
existed a negative spectrum corresponding to gravity modes (or g
modes, a~) with an accumulation point at zero, all these modes being
(2~+I) degenerate with respect to m. An illustration of the distribu-
tion of the ~2 as function of ~ and the order of the modes can be
found for instance in Ledoux (1974).
Of course~ in this case, all the g modes (o~<0) are unstable
but I don't think that the connection with convection was pointed out.
Note that Pekerls choice of dependent variable (~= dlv ~) led him to
miss the so called fundamental mode (or f mode or Kelvin mode) which,
in this case, is exactly the same as the unique mode (for a given £7
of the incompressible sphere (~=O).
The next important paper is that of CowllnE (1941) in which
he tackled the case of the general polytrope of index n(p = Kpn+l/n).
In this case, the general problem is of the fourth order as it does
not split into ~wo second order differential equations which can he
solved successively as for the homogeneous model. However Cowling
noted that, except for the lowest modes and lowest values of £, the
perturbation of the gravitational field can 5e neglected without
serious effects. With this approximation, the problem reduces again
to the second order and, as Cowling showed, can be assimilated to a
St=rm-Liouville problem for large enough u 2 (high p modes) or small
enough I c21 (high g modest, the f mode for each ~ falling la between
the corresponding p and g spectra.
Furthermore if the generalized ratio of specific heats F 1
(or ~ in a pure gas) satisfies the inequalities
n+l r1> n (I)
all the o 2 are positive (stable g modes : g+) while, if 8
90
r < n+l i n (2)
they are all negative leading to instability (unstable g modes : g ).
But the criterion for convective instability in terms of
A = l dp l ap (3) 0 dr FlP dr
can be written very generally, even in the relativistic case (Thorne,
1966, Kovetz, 1967, Islam, 1970)
A >0 (4)
and becomes in a polytrope, with the usual notation,
A = 1O dEd--~0 (n -n_+i)111 >0 ( 5 )
or since (dO/dE) is negative
n+l (6) rl < n
2 which is identical to condition (2). Thus all the Og are negative
(dynamical instability towards non radial perturbations) provided the
criterion for convective instability (4) or (5) be satisfied every-
where in the star.
The negative g spectrum in the homogeneous model may be
interpreted in the same way since in that case
A I dp >0 Flp dr
gl
But apparently, it was only slowly (Ledoux, 1949), at least
in astrophysics, that the connection between dynamically unstable g
modes and the sign of A became to be recognized and that convection
became to be considered as the end effect of this instability.
In the general case, it is not difficult to have A appear
explicitly in the equations which can be written
~P _ O' ~r dp -- - -- + .... d i v ~r (7)
p p p dr
0~i ~ rip div ~r = ~2~r- grad (@' + ) + ~ A P .....
= r3 -I ! grad o ( ~ - i div ~)' - i_~a di~) i~ p o P
p' + rip ( °' + A~r) ~ ~ p(~- !div ~)' o is p
(s)
(9)
F 2 p iaCp p
A~' = 4~Gp' (11)
where
I kPikA - + -- ~ iVk e =£N E P
represents the total heat liberated per second by nuclear reactions
and viscosity.
One may note that A is related to the Brunt-V~is~l~ frequency
N by
N 2 ,, - gA
02
and to
S = r2 - i I d1~ I dT (12) r 2 p dr T dr
by
a = 4 - 3____/~ s + ! d£ (13) 8 ~ dr
if 8 is the ratio of the gas pressure ~o the total pressure.
The adiabatic approximation (right hand members neglected
in (8), (9), (I0)) is sufficient to discuss dynamical stability and
it should enable us to understand the correlation noted above between
the sign of A and the stability of the g modes.
If we neglect ~ we can for i~stance write a second order
differential equation for ~- ~r/r :
where
9 d~ 6 i do I d
dr 2 ( ) dr
÷ ¢ { 020 _ t ( t + 1) o 2 A8 + 6
rip r 2 0 2 r 2
d dr (PI~)
+ 3 i do I d 0r 2 r ~ ~r'r + ( ) dr (Plp)) ( ) r~p d-7 < ) } - o (14)
_0r2 )
For high p modes,o 2 large, the equation simplifies very
much and shows the acoustic character of these modes. However, they
are without interest in the present context since, in realistic stel-
93
far conditions, all the o 2 are always positive. But in the same way P
if one considers high g modes, I ~21 small so that terms proportional
to 02 can be neglected, the only term left over which contains 02
is the second one in the coefficient of ~ in (14), i.e.
~(~+l) A_~
r2 ~2
and the problem is essentially of the Sturm-Liouvllle type with a
parameter %~ 1/o 2 . It is well known in that case that if A keeps
the same sign throughout, the elgenvalue (l=I/=2)~fll be of the oppo-
site sign. In other words~if A is positive (convective instability),
at least the small u 2 will be negative. But it has been accepted g
generally that all the 02 will have this same sign, because it is g
difficult to see how the sign of 02 could change in going from small
to larger values if the sign of its coefficient is constant. Anyway
fairly recently Grisvard, Souffrin and Zerner (1972) using the second
order system
dv =(~( ~ + 1) Pr2 i p2tr l d--{ c 2 - r l ~ ~ w = aw (15)
dw (s2+ Ag) P d--r = ~ v = by (16)
equivalent to (14) managed to prove without any assumption as to the
order of magnitude of ~2 that the latter are all real positive if A
is everywhere negative. Thus a necessary condition for dynamical in-
stability is that A be positive at least in some part of the star.
In that case, the authors succeeded in establishing an upper limit for
the modulus of any negative c 2 g
94
I o~t < Max (Ag) (17)
where the maximum is taken on the region where A>O. They showed also
that 1o21 increases with £, i.e. when the horizontal wave length g
decreases.
However if the investigation of Grisvard et al is free of
any asymptotic restriction, it still treats the perturbation of the
gravitational potential 4' as negligible. Would the previous results
hold if 4' is not neglected, especially for the lowest modes and har-
monics? Lebovltz (1965 a-h, 1966) tackled this general problem on the
basis of the variational principle established earlier by Chandrasek-
her (1964)~ which expresses o 2 as the extremum of integrals exten-
ded to the whole configuration. This enabled him to show first that if
A<0 everywhere (convective stability) all the o 2 are positive which
is thus a sufficient condition for dynamical stability towards non
radial perturbations. It is also a necessary condition as he showed
later, since the existence of a region however small with A>O entails
negative eigenvalues. This implies also that one can find solution~
of the differential equations with appreciable amplitudes in that re-
gion only.
In this respect~the work of Ledoux and Smeyers (1966) was
more or less complementary since they pointed out indeed that when A
changes sign in O~r~R the asymptotic form (02 small, g modes) of equa-
tion (14) has s turning point in A = 0 and that the g spectrum splits
then into two : one of positive eigenvalues (a~ 20) corresponding to
modes oscillating in space with appreciable amplitudes in the convec-
tively stable region (A<O) and decaying exponentially in the unstable
region (A>O) while the other spectrum of negative eigenvalues (o~ <0)
95
corresponds to modes oscillating in the convectlvel F unstable
region (A>O) and decaying in the stable one.
Using equations (15) and (16), Scuflaire (1974, cf. also
Osakl 1975) showed that these properties of the unstable modes sub-
sist even for the first few modes~ i.e. they oscillate only in the
unstable region. However~ it may happen that stable modes oscillate
in a slightly superadlabatic region, as for instance in a condensed
model where the stable g modes characteristic of the central stable
core may continue to oscillate in an external convection zone.
2. MULTIPLE UNSTABLE ZONES
The unstable modes are of greatest interest here and an
interesting problem arises as to their behaviour if there are two
or more regions with A>O separated by stable zones (A<O). Tassoul
and Tassoul (1968) in a discussion of asymptotic g modes suggested
that there should be as many distinct unstable g spectra as there
are unstable regions.
However, this is not obvious since even in the simplest
case of two turning points (for instance two convectlvely unstable
regions separated by a stable one) the usual analysis of a single
turning point in terms of Bessel functions cannot be simply repea-
ted (Langer, 1959) at each of the two turning points. A solution
should rather be sought in terms of Weber functions allowing to cross
the two turning points at once.A considerable amount of literature
exists on the subject and a very recent paper by Olver (1975) opens
the way to straightforward applications. They may bring to our
attention, at least in special cases~ unstable solutions whlch~In
the above case, may have large and comparable amplitudes in the two
unstable regions with a relatively minor reduction of this amplitude
96
across the stable region especially if the latter is fairly narrow
with a density gradient only slightly subadlabatlc. Such modes could
be particularly efficient in mixing the whole star. This conjecture
is somewhat supported by some simplified or numerical investigations.
For instance, Goosens and Smeyers (1974) have found more or less
"accidental resonances" between some of the stable g modes of two
stable regions separated by an unstable one (Just the opposite of
the case considered above) giving rise to a stable mode with large
amplitudes in both stable regions decreasing only moderately in bet-
ween in the unstable region.
Other examples have been treated by students in Liege
which lead to slmilar conclusions. Consider for instance, the case
of a heterogeneous incompressible model composed of superposed layers
of different densities presenting two unstable discontinuities,
(Pin - Pext )<0~ separated by a stable one. The behavlour of the
elgenfunctlons associated with the two negative eigenvalues can
depend drastically on the closeness of these eigenvalues. In general~
i.e. as long as those two o 2 are not very close~ each of the elgen-
functions has a sln~le maximum at the unstable interface with which
it is associated. Kowever when the parameters of the discontinuities
are varied~ one of these solutions may acquire a secondary strong ma-
ximum at the other discontinuity, the minimum between the two remai-
ning apprecla~le ~r1~n t~n~s elgenvalue becomes very close to that cor-
responding to the other discontinuity.
3. THE EFFECTS OF THE DYNAMICAL MOTIONS AND THE REDUCTION OF THE
SUPERADIABATIC GRADIENT
Many of the examples where dynamically unstable modes have
been found are artlfleiel because the superadiabaticity (A) has been
fixed a priori at a much larger value than is ever llkely to occur
97
in stars. In such cases, the time-scales of tke growing motions
which are proportional to (A) -I/2 are rather short. Of course in the
end, these violent motions lead to the establishment of a convective
zone through which A is reduced to a very small value Just sufficient
to allow the residual energy no longer transferred by radiation to
be carried by convective currents. This implies important readjust-
ments in the internal structure of the star including transfer of
mass to deeper layers and various feed-back effects which may in-
crease considerably the extent of the convection zone with respect
to the initial superadlabatlc region. For instance, in going from
an homogeneous compressible sphere to a polytrope n - 3/2(FI=5/3)~
the energy released, if the mass and the radius are those of the sun,
is of the order of 0.i GM~ /R G = 5.1047 ergs in a short time of the
order of one hour. Of course this is an extreme case, but even if the
energy release is reduced by a factor I0 I0 and the duration increa-
sed by a few orders of magnitude, it would still remain a fairly
spectacular phenomenon which was considered at one time (Biermann~
1939; Schatzman, 1946) significant for the interDretatlon of novae.
In any case~ I suppose that there are no serious doubts
that the readjustments contemplated above would lead in the end to
the same model as the one that could be built a priori using the
usual method of having a convective adiabatic zone initiated at the
point where the radiative gradient becomes exactly equal to the adia-
batic one (A ~ 0).
In fact things are a little more subtle as the reduction
of the superadlabatlcity must proceed only so far that the subsisting
excess provides the necessary buoyancy force to balance the energy
dissipated by viscosity and conduction (radiative or otherwise). Of
course, when A and 1o21 are still large, the effects of these dissi-
98
pation terms could Be evaluated hy a pertuT~atlon metkod (similar
to that used for vibrational stability when ~2>O) yleldlug a damping
coefficient ~' correcting the adiabatic time dependence By a factor
-olt e
But one knows how cumbersome (cf. Ledoux 1974), the ex-
pression of ~' is. yo illustrate the situatlo~, it seems Better to
revert to a simple case as, for instance, the plane layer with cons-
tant coefficients. Let ~ and n represent respectively the thermome-
tric conductivity a=d the kinematic coefficient of viscosity. Assu-
ming a time depe~d~nce e st, one finds for the g modes, if A is posi-
tive and larger than K,~ or <+n
s = gA) z2- 2 (lg) k 2
where k~ and k z (k 2 = k~ + k ) are respectively the horizontal and
the vertical wave number. In this case, dlssipatio~ simply hinders
a little convection hut does not affect it very much.
On the other hand, when convection is established, the
effects of the dissipation forces are of the same order as those of
the residual buoyancy and the above formula is no longer significant.
In that case and if one includes the rate of energy Ke~eratlon c
(significant in the deep interior), one gets an equation equivalent
to that of Defouw (1970) with
~)g E L T = - _ = T ' L0 O
where v represents the sensitivity of ~ to T,~ =(dlog~)0/dlo~ T.
After separating a secular root s 3 = -n k 2, the dispersion relation
for g modes, gives solutions
99
1 ~ (~ - I) ) s = - y~ (~+ ~)k 2 -CT
P
2
+ i --~ (19) -'{~ ((~ - n)k2 CpT (v- i) 12 k zk~l gA} I/2
which for vanishing E reduces to the ordinary Raylelgh solution of
the B~nard problem
i + I s -- - y (n + ~)k 2 -{~ (~ 2 l/2
~) 2k4 kH + ~ gA} (20)
which yields back (18) if (k2/k2)gA>>~2k 4 and n2k 4
On the other hand, if we approach the marginal case
2 2 (st0), then (kH/k)gA must take the appropriate value to make the
square root exactly equal to the first term, i.e.
2
kH gA ~ nk 4 nk2 C~ (v-l) k 2
If e is negligible~this condition becomes
~Ad 4 = k6d 4
~n 2 k H
where the depth d of the layer has been introduced. If k z =~/d and
a ~ kHd , one gets finally
~Ad 4 = ( 2+ a2)3
<~ 2 a
= R
which is the usual value of the Rayleigh number. The energy genera-
tion could reduce somewhat the value of R. In this marginal case,
100
the tlme-scale- !/s Becomes infinite but a circulation sets in with
a finite turn-over time.
Thus convection~although originating essentially in a dy-
namical instability (conservative terms) with a short time scale,initi
ares fast motions to modify the medium itself in such a way as to
reduce gA fIBal~y to the same order as the dissipative (non conser-
vative) forces.
As Defouw (1970) pointed out, the energy generation term
has a destabilizing influence (cf.19) and if
CeT (v-l)> (n+~)k 2 P
it contributes directly to the instaDillty. If A>O, this effect
simply relnforc~ the Buoyancy.
On the other hand if A<O, and such that
2
k 2 p
the effect would correspond to a case of vibrational instability (or
overstability) of stable g modes. But if
2 kH gA<~ (< - -V 2 c T (v-l)
P
it seems that a growing non-oscillatory motion would arise which
might lead to some kind of convection, although A<O.
TASSOUL M. a~d TASSOUL J.L. 1968, Ann. Astrophys., 31, 251
THOMSON, W. 1863, Phil. Traus. Roy. Soc. London, 153, 612
THORNE, K.S. 1966, AsCrophys. J., 144, 201
OBSERVATIONS BEARING ON CONVECTION
K.H. BbHM
University of Washington~ Seattle, WA., U.S.A.
SUMMARY
I. Solar observations contain a considerable amount of information on the hydro-
dynamics of stellar convection. We emphasize and discuss especially
(a) the existence of two very different cell sizes,
(b) the unexpectedly high cross-correlation between vertical velocities and tempera-
ture fluctuations in the granulation,
(c) the fast "downdrafts" in intergranular regions,
(d) the existence of cells much larger than the scale height,
(e) thes~mnge behavior of the temperature fluctuations in the supergranulation, and
(f) the importance of convective overshoot.
2. The Li-Be problem and its possible relevance as an indicator of convective
overshoot is briefly summarized.
3. Convection may have a stronger influence on the observable properties of He-
rich ("non-DA") white dwarfs than of most other stars. We discuss especially
(a) the persistence of outer convection zones through a very wide range of effective
temperatures,
(b) the occurence of convection in high layers of the atmosphere,
(c) the relatively high efficiency of convection in white dwarf atmospheres, and
(d) the relevance of convection to the cooling problem.
! • INTRODUCTION
The main ~£iculty of the present topic is due to the fact that almost all
spectroscopic and color observations of cooler stars are somewhat related to the
convection problem but that there are so few observations which seem to be really
crucial in this context. Almost all of the really decisive observations seem to refer
to the sun. Though in the stellar case the conclusions concerning convection can be
extremely interesting, the way to reach them is often indirect and some doubt is
usually possible.
Consequently we fee~ chat the topics of our discussion are quite obvious as long as
we look at evidence from the Sun, but that our selection of topics in the stellar
cases will probably be somewhat subjective.
104
I shall first report on the solar evidence. After that I shall go on to discuss
some observational evidence for convective mixing processes in stars.
Finallywe shall consider a class of stellar atmospheres whose average temperature
structure is strong, changed by the presence of convection in rather high atmospheric
layers.
II. OBSERVATIONS OF SOLAR CONVECTION
A. GENERAL REMARKS
As we all know there are at least two groups of phenomena which are thought
to be direct indications of the presence of convective motion (including overshoot)
namely granulation and supergranulatlon. In the case of granulation it is more or
less generally accepted that we see the direct effects of convective motion, in the
case of supergranulation some slight doubts may be possible. Nevertheless most
astronomers believe that supergranulation is the manifestitation of the penetration
of large cell convection.
In addition to these direct convective effects there are indirect effects which give
us some information about the structure of the convection zone. There are firstly the
observed eigenmodes of the so-called five-minute oscillation. I refer to the work
of Ando and Osaki (1975), Deubner (1975), the review given last year by. J.P. Zahn
as well as the very interesting comment by Mclntyre (1975). Secondly, there are
observed mixing effects in the sun which may or may not be due to convective mixing.
I refer specifically to the very low abundance of Li 7 and the apparent absence of Li 6.
I shall first discuss the direct evidence for convection from the observations of
granulations and supergranulation.
One of the most interesting observed features of solar convection is that there are
two scales of motion present near the surface of the convection zone, namely the
granular scale of about 2000 km (maximum of the A I power spectrum) and the super-
granular scale of about 32000 km. Both numbers correspond only to the relatively
flat peaks of broad power spectra. This statement is of course not new. However,
it is surprising how few theoretical attempts to understand this fact have been made.
However, there are some notable exceptions including especially the paper by Simon
and weiss (1968).
Let me now try to summarize briefly our knowledge of granulation and then of the
supergranulation.
B. GRANULATION
How do we describe the observational information on granulation ? There
are essentially two possibilities. Either we use a description based on the autocor-
relation function and power spectrum or we look at single granules and, maybe, derive
from them the properties of an average (or typical) granule.
105
For some time it seemed that a description using correlation functions and power
spectra would be the only useful method. However, more recently studies of properties
of individual granules have found great interest. The reason for this is of course that
a description is needed which permits a separate study of the granules on the one
hand side and the intergranular lanes on the other side.
In studying granulation we should like (ideally) to obtain a three-dimensional picture
of the velocity and the temperature fluctuations. This requires observations of very
high spatial resolution, observations in a number of different lines (which are formed
at different depths) and center-to-limb observations. One of the fundamental diffi-
culties is, as we all know, the separation of the granular and the oscillatory velocity
fields.
One way to do this is to identify the region in the k-~ plane which corresponds to
granular motion. A simpler method is to assume that all horizontal scales smaller
than'b 4000 km correspond to granulation whereas all motions with a larger horizontal
scale are due to oscillations (cf. Mattig, Mehltretter and Nesis ]969, Beckers
and Canfield 1976).
In addition, one finds that the observed velocities first decrease with increasing
height and then increase again (cf. Canfield and Mehltretter 1973, Mattig and Nesis
1974). This fact can also be used for a separation of the two velocity fields.
The component which decreases outward is usually identified with convection and
convective overshoot whereas the increasing (or constant) component corresponds
to the oscillations.
Even today the problem of correcting the observations for the effects of the
contrast transmission function of the telescope and of the atmospheric seeing seems
to be a very difficult one. This is true for the determination of the intensity
and the corresponding temperature fluctuations but even more so for the velocity
determination. Consequently we find a rather large scatter of the observational
results which is certainly to a considerable part due to seeing differences. Another
difficulty is due to the fact that different authors use different spatial domains
and use lines which are formed at different depths in the atmosphere. Consequently
it does not make much sense to take averages of measurement by different authors.
Rather, we shall take the detailed high resolution observations by Canfield and
Mehltretter (1973) as a typical example of modern results. These authors find a definite
decrease of the r.m.s, velocity outward before it starts to rise again. (This
result has been confirmed by Mattig and Nesis 1974, Mattig and Schlebbe ]974,
Musman 1974, and others).
The largest r.m.s, velocity is found for the line formed at the relatively
largest depth. The relatively faint Fel line 5178 shows a formation depth of about
40 Pan above the zero level of HSRA. Canfield and Mehltretter find a r.m.s, velocity
of .54 km/s for this line if corrected only for instrumental effects and of .73 km/s
if a reasonable correction for seeing effects is applied. By extrapolation the
authors find a r.m.s, velocity of .8km/s at the height of continuum formation.
106
It seems that these velocities are very roughly compatible with predictions of the
simple mixing length theory.However, this type of theory (without convective overshoot)
predicts a much steeper variation of the r.m.s, velocity than is observed.(See ~ig.l.)
Any theory which includes overshooting (even linear mode calculations, el. B~hm 1963 a)
can reproduce the observational results in a qualitative way.
0 .8
.~ 0.6 E
> 0 . 4
0 . 2
~l I i "" I ' I ' I I - -MIXING LENGTH WITHOUT
_'~ O V E R S H O O T
~ Canfield, ~\ X,~ Mehltretter ~\ ( 1 9 7 3 ~
\ \ " . ,~ ,~o . . ~ - \ \
\ "- \ , ~ ~" -. ~X"3130 km
1 5 6 0 k i n " I , I " ,L I I
0 2 0 O 4 0 O h (kin)
6 0 0
F I G U R E l
Comparison of the observed (vertical) velocity stratification (corrected for finite telescope aperture and for finite slit width hut not for atmospheric seeing~ Canfield and Mehltretter |973) with some simple theoretical results. The filled circles represent the observations by Canfieldand Mehltretter (1973), the (long) broken line is the corresponding interpolation curve. The solid line shows the results for the standard (local) mixing length theory with I = H. The vertical velocity distribution for two linear modes with horizontal wavelength calculated for a detailed model of the solar convection zone (B~hm 1963)is given by the (short) broken curves~ Note that the linear modes contain an arbitrary amplitude factor.
107
.-2 ~.oo c
,4 t,=
0 . 7 5 n~ LU
0 a. 0 . 5 0 LU >
0 . 2 5 ..J W n,-
%
' I ' I ' I ' I ' -
-- ~\\ / , I X5171
I '~
/ / \ \ " --_rx5,64 -1 ", \ \ t xs,za /
, I i I ,- "" 1" J I ~
2 0 4 0 6 0 8 0 I 0 0 k x 10 - 4 km -I
FIGURE 2
Comparison of the velocity power spectra for the lines Fel ~ 5171 and Fel /% 5164, 5178, 5180 and the power spectrum of the intensity fluctuations in the continuum. The diagram is based on the data given by Canfield and Mehltretter (]973). The maxima of the contribution functions for the lines 5178, 5164 and 5180A occur at a height between - 20 km ands70 km in the Harward-Smlthsonian Reference Atmosphere, the maximum for the line ~ 5171 lies at h% + 470 km i~ the HSRA. Note that the velocity power spectrum for the lines formed near the depth of formation of the continuum (~ 5178, A 5164, ~ 5180) looks very similar to the intensity fluctuation power spectrum for the continuum.
108
The continuum r.m.s, intensity fluctuations are~ 8.2% with a reasonable seeing
correction (2% T-fluctuation) which (if naively interpreted) is lower than the
mixing length predictions (~20%).
However, Canfield and Mehltretter believe that the higher value of ~ I ~ 12%
derived earlier by Mehltretter (19711 from his observations may also be correct.
The spatial power spectra of the intensity fluctuations show (after correction
for seeing effects) a rather flat maximum at a horizontal wave number between
3 and 5 x |0 - 3km -I (~ between 2000 and 1250 km).
Itshould be noted that a number of investigators find the peak of the power
spectrum at considerably longer wavelength (cf. Mattig and Nesis 1974). We assume
that this is a consequence of seeing effects.
Power spectra ~f the velocity fluctuations for lines which are formed relatively
deep in the photosphere seem to approach a power spectrum similar in shape to
the A I power spectrum (Canfield and Mehltretter 19731 indicating that
T-fluctuations and velocity field are really related below T 5000 ,~ ,4.(Fig.2).
Earlier, Frazier (1968) had found a comparably high correlation between continuum
intensity and the velocity in the Sill 6371 llne which is formed very deep. (The
peak of the contribution function is at • ~a ,65,)
It is really astonishing how high the v - ~ I correlation is. One has to
remember two things to appreciate this fact. Firstly, the velocity and the
continuum intensity which are beeing cross-correlated do not refer to the same
layers. Secondly, there is a possibility that for turbulent convection even at
one given point the cross-correlation could be considerably lower than 1
(cf. Spiegel 1966 a).
It is also interesting to note that there seems to be roughly a 30 second phase lag
in the sense that the maximum granular continuum follows the velocity (Frazier 1968,
Edmonds and Webb 1972, Musman 1974, Beckers and Canfield 19761.
We shall now proceed to the description of individual granules which, as mentioned
earlier, turns out to be useful and very interesting from a hydrodynamic point
of view. A very fundamental study of this type has been carried out by Beckers and
Morrison (!9701. They observed a large number of granules at ~ = .84, .70, .60
and determined the velocity field of an average granule from these observations.
By making use of the different positions on the solar disk (different~)these
authors derive the vertical as well as the horizontal velocity field for the average
granulum. Since not all granula have the same size and shape we can not expect
to get a really quantitative picture of the hydrodynamics of a granulum by this
procedure. Since in the reduction process the granules are positioned such that their
centers fall on the same point the upward velocities at the center of the granule are
enhanced in the averaging process whereas horizontal and downward velocities (occu-
ring in the outer parts of the granulum) are reduced. Nevertheless the "average"
granulum shows very clearly the basic structure of the velocity field in a granulum.
(Fig. 3).
109
| i I ,
Z ~ k m I 0 ~
200 m / ~
I I I I iooo looounl
J
FIGURE 3
The vertical cross-section of the velocity field of an "average" granule. (See text for a brief description of the averaging procedure). From Beckers and Morrison (1970). By permission of Reidel Publi. Co., Dordrecht.
Of course, we have to keep in mind that all modes of smaller scale are smoothed out by
atmospheric effects and by the averaging process.
The importance of such investigations lies in the detection of an ordered horizontal
outflow from the granulum. This is, of course, not too surprising, but it is certainly
an observational fact which is not emphasized in the mixing length theory.
The investigation of individual granules has lead to the discovery of another interes-
ting phenomenon namely the "exploding granules". (Carlier, Chauveau, Gugan and
RDsch 1968, Musman 1972, Beckers and Canfield 1975).
The phenomenon is shown (in a rather schematic way) in Fig. 4. Musman (|972) has
interpreted this sequence of events as being due to the interaction Of a thermal
with the stable layers above the convection zone. Ha has carried out a laboratory
experiment in order to confirm this point of view.
110
4 MINUTES B MINUTES LATER LATER
APPARENT EXPANSION VELOC ITY
I.Skm/s
o 0 SMALL UNUSUALLY
BRIGHT GRANULES
HAS EXPANDED; SHOWS
SLIGHT DARKENING AT
CENTER
FURTHER EXPANSION
WITH DARK REGION AT
CENTER
GRANULE (RING) FRAGMENTS AND FADES
FIGURE 4
Schematic representation of the development of an "exploding" granule (after Musman 1972, see text). Note that the bright granular region is drawn dark here.
Another important group of investigation is concerned with phenomena occuring in the
intergranular regions. It turns out that observers find a number of unexpected pheno-
mena and that these observations change our idea about solar convection considerably.
According to Deubner (]975) observations show that very large downward velocities can
be observed fairly often in the dark intergranular lanes. He quotes that observations
which have been made under excellent conditions show often downward velocities (uncor-
rected for seeing effects) of 2 km/s. He argues that these observations lead to correc-
ted downward velocities of about 4 km/s or possibly to even somewhat higher values.
These results indicate a very large asymmetry between upward and downward motion and
should be of great relevance to an understanding of the hydrodynamics of solar convec-
tion. It is not surprising that this effect is not visible in the results of Beckers
and Morrison who take average over many granules of different sizes.
111
It should be emphasized that it is not clear how large a fraction of the intergranular
lanes really show these great downward velocities. Deubner discusses these problems
in connection with the occurence of magnetic flux ropes in the photospheric network
as observed by Stenflo (1973) but he definitely considers the possibility that the
downward velocity is very high in all intergranular regions.
B. SUPERGRANLILATION
We all know that the supergranular velocity field pattern is strongly correla-
ted with the chromospheric network. The average diameter of supergranulation cells is
about 32000 km. Cells with diameters in the whole range from about l0000 km to 60000 km
are present. The typical horizontal (outflow) velocities are 0.3 - 0.4 km/s (Simon and
Leighton 1964, Deubner 1971). Vertical downward motions specifically in the magnetic
regions at the borders of the cell are in the range 0.] - 0.2 km/s (cf. Simon and
Leigthon 1964, Frazier 1970, Musman and Rust 1970, Musman 1971, Deubner 1971, Worden
1975). Supergranular motions have been detected also deep in the photosphere as
indicated by Deubner's (1971) measurement of CI 5380 line (X-v 7.7 eV ).
It is generally agreed that the main downdrafts occur in the rather localized
magnetic regions at the cell boundary.
Upward motions at the center of supergranulum of the order of 50 m/s may be present
but have not yet been confirmed. (Worden and Simon 1976).
In order to judge whether supergranulation is really a convective motion a study of
the corresponding temperature fluctuations is of great importance. The results of
such studies are somewhat confusing though a clarification may now be in sight.
Somewhat surprisingly one finds normally a slight temperature increase at the cell
boundaries (~ 2.5°K) whereas according to the usual picture of a simple convective
cell a temperature decrease would be expected (Beckers 1968, Frazier ]970). It is
usually assumed that these very small temperature increases are due to the increase
of the magnetic field and are not directly related to the convective flow (Liu 1974).
One might hope to find the expected velocity -- ~ T correlation more easily in deeper
layers. Recently Worden (1975) has studied the supergranulation structure at 1.64
which represents the deepest observable layer in the solar photosphere. Using the
HSRA (Gingerlch et al. 1971) we find that at this wavelength we look down to about
T qu 1.8 where the temperature is ~ 7000°K. Worden finds an 0.7~intensity increase
near the cell boundaries. This has to be compared to an 0.4% intensity increase in the
higher photospheric layers. These results can be translatedinto a temperature diffe-
rence ofabout 50 ° in the deep photospheric layers. This rather small difference indicates
that the supergranulation (if it is convection) contributes only very little to the
total convective energy flux in the visible layers. Nevertheless, the fact that such
large convection cells do existis very important from a theoretical point of view.
I do not think that this conclusion is very much influenced if supergranulation is an
overshoot phenomenon or if the supergranulation elemen~ are counter cells (of. Spiegel
1966 b).
112
Some of the i~ediate theoretical implications of the existence of convection cells
of supergranular size have been discussed by Simon and Weiss(1968 a, 1968 b).
They point out among other things that we should expect that the depth of cell is
not very different from about one fourth of i~s diameter. This leads to a cell
depth of about ]0000km which is clearly much larger than a scale height. Simon and
Weiss are to some extent guided by the results known for polytropic atmospheres in
which the scale height is of the same order of magnitude as the distance to the
surface of the convection zone. The possibility that cells of supergranular size will
transport a large fraction of the convective energy flux in deep layers is of great
importance. Simon and Weiss also predict giant cells which are comparable in size to
the total thickness of the solar convection zone and which in the meantime have also
been detected observationally.
We close this chapter by listing briefly some of the interesting and hydrodynamically
relevant properties of solar convection in table I.
TABLE I
SOME INTERESTING PROPERTIES
OF SOLAR CONVECTION
| . GENERAL
A. Existence of two completely different cell sizes (2000 km and 32000 km)
2. GRANULATION
A. Very high correlation between ~J and A T (more than 80%)
B. Flat maximum of ATpower spectra between ~ = 1200 and ~ = 2000 km
C. Very fast down-drafts (~u 4 km/s) in intergranular regions
D. "Cell" structure
E. Convective overshoot
3. SUPERGRANULATION
A. Cell sizes much larger than scale height
B. Concentration of magnetic fields at cell boundaries
C. Positive ~ T in high layers, negative ~ T in deep photospheric layers
of cell boundaries, ~ T unexpectedly small.
113
I I I CONVECTIVE MIXING AND THE Li-Be PROBLEM
When we discuss convection in stars it is quite clear that the observational
evidence concerning convection has to be rather indirect. Our conclusions are usually
based on a mixture of observations and theoretical predictions (which often use a very
crude form of theory), Sometimes we even forget that theoretical assumptions are used
and claim that the result is purely observational though it is not.
Problems of convective mixing definltely belong to this category. In many cases
we certainly see effects of mixing but we can not be absolutely certain that this
mixing is really due to convection. Even if it is convective mixing it can have
happened rather recently or during an earlier phase of evolution. Probably~ the
best known example of such situation is the lithium-beryllium problem. As is well
known, the solar atmosphere contains no or almost no Li 6 and in contradistinction to
other old material in the solar system (chondrites) very little Li 7. In order to
explain this it is usually assumed that surface material has been mixed into layers
of 2.5 x 106K where the lithium is destroyed by (p, ~ ) reactions. A progressive
decrease of the lithium content from G 2 V to K O V stars (Wallerstein, Herhig and Conti
1965, Zappala 1972) has been attributed to the effects of pre-main sequence convection.
(Bodenheimer |965, |966). In addition amain sequence depletion time scale of about
1.5. x 109 years has been found (Herbig 1965, Zappala 1972). It is not yet completely
clear whether very slow convective overshoot (cf. B~hm 1963, 3966, Weymann and Sears
1965, Spiegel |968, Strau~, Blake and Schramm 1976) in combination with the relevant
nuclear rates can produce this time scale. However, Strauss,Blake and Schramm (1976)
argue very strongly that convective overshoot can explain the observed lithium abun-
dances. They emplasize the fact that the enersetically possible convective overshoot
theoretically decreases with decreasing mass of the star and that this is just the
feature which is required in order to explain the observations. If this result should
be confirmed the investigations of the Li problem would give us an excellent opportunity
to study convective overshoot below outer stella r convection zones. It seems that the
lithium problem is more useful in the study of stellar convection than many other mixing
problems since the time scale is known well and since it refers to main sequence stars
which are better understood than the advanced phases of stellar evolution in which most
other mixing processes seem to occur. The fact that it is probably related to convective
overshoot makes it especially interesting.
114
IV INFLUENCE OF CONVECTION ON ATMOSPHERE STRUCTURE AND COOLING TIMES IN COOL
WHITE DWARFS.
After these brief remarks about the Li problem I finally would llke to talk
about a group of convection problems which are also closely related to the above ques-
tions and in which I have been strongly interested in recent years. This is the field
of convection in white dwarf stars. I personally feel that there may be more drastic
effects of convection on observable properties in the ease of white dwarfs than in most
other stars for the following reasons (of. Bohm ]968, 1970, Van Horn 1970, Wiekramasinghe
and Strittmatter 1970, B~hm and Cassinelli 197], Wagner 1972, Grenfell ]974, Fontaine
et al. 1974, Fontaine and Van Horn 1976, Muchmore and B~hm 1976).
]. Atmospheric convection persists over a wider effective temperature range
in white dwarfs than in any other stars. This is especially true for the roughly 30 %
of white dwarfs which are called non-DA's and which have very He-rlch atmospheres. In
these objects outer convection zones are present in the effective temperature range from
about 25OO0°K down the lowest effective temperatures.
2. Convective instability sets in very high in cool. white dwarf atmospheres.
In a number of models this occurs at optical depths higher than O.O1. Consequently the
atmospheric structure is strongly influenced by convection. This is especially true for
white dwarfs with) say, Tel f < 8000 ° in which the high atmospheric density leads to
relatively high effectivity of convection and consequently to a temperature stratifica-
tion which differs considerably from a radiative equilibrium stratification (cf. Wagner
1972, Grenfell }972). In extreme cases (non-DAs with 3000°K < Tel f ~ 5000 °K) the convec-
tive flux reaches values of about 60 % of the total flux at ~ ~ ].0 (B~hm, Carson)
Fontaine and Van Hors |976). So, we should be able to detect the presence of convection
without difficulty by studying the line spectra Of cool non-DA white dwarfs like
Van Maanen 2 and Ross 640. (However, it should Be emphasized convection in cool white
dwarfs has to be studied through its influence on the mean stratification of the atmos-
phere. The velocities in these high density atmospheres are much too small to he detec-
ted directly.~
3. The existence of two different types of white dwarfs atmospheres, one consis-
ting of almost pure helium (non-DA), the other one of almost pure hydrogen (DA) has lead
to the suggestion that convective mixing processes in combination with gravitational
separation and/or accretion may be important in determining the chemical composition of
white dwarf atmospheres (cf. Strittmatter and Wickramasinghe }97}, Shipman 1972, Baglin
and Vauclalr ]973). However, the subject is controversial and simple convective
115
mixing is insufficient to explain the non-DAs (Koester 1976). Nevertheless, some
interesting convective mixing effects are to be expected since it is clear that
white dwarfs have a very ~hin envelope of hydrogen or helium surrounding the main
body of the star which consists of C and O (el. Weidemann |975).
6.0- I--
_.o
5.5-
!
0 -5 -10
I I
. . . oooo °
,,, ! ! . . . . . . . .
/ . _ - - BOO0 °
. . - - - - . . . - I o o o =
. . . . - 6500 °
5 . 0
6.5-
-20
FIGURE 5
Temperature stratification in the outer layers of cool He-rich (non-DA) white dwarfs. The diagram is based on calculations by Muchmore and B~hm (1976). We have plotted the logarithm of the temperature as a function of the local degeneracy parameter = = - ~ /kT (with ~ = chemical potential). Tel f is the parameter. The solid parts of the curves correspond to convection zones, the broken lines to the radiatlve-conductive zones below. The diagram shows how the temperature in cool He white dwarfs approaches its asymptotic core value T c and how large a fraction of the temperature rise towards T c occurs in the (geometrically thin) convection zone. This gives some indication of the importance of convection for the determination of the T e and Tel f relation (see text).
116
4. In the case of white dwarfs the evolutionary time scale is strongly
influenced by the time it takes for energy flux to go through the outer non-
degenerate and partially degenerate layers. When convection breaks throngh this
"insulating" layer the cooling becomes considerably faster. Consequently, the
cooling time and the luminosity function of white dwarfs is influenced by the
presence of convection (cf. Bbhm 1968, Ostr~ker 1971, Lamb and Van Horn 1975).
Eventually we shall he able to derive information about convection from all these
different effects. Some indication of the importance of convection for the deter-
mination of the relation between core temperature and Tel f is given in Fig. 5.
Finally it should be emphasized that the comparison of theoretical predictions of
white dwarf convection with observations involves one complication which
is not present in most other stars. The calculation of the equation of state, the
adiabatic gradient and the specific heat is made rather difficult by the presence
of partial degeneracy, pressure ionization and electrostatic interactions between the
particles. (In non-DAs of Teff < 3800 ~ we face some fascinating problems because
drastic complications of the equation of state, including partial degeneracy, occur
already within the atmosphere, see B~hm, Carson, Fontaine and Van Horn ]g76).
However, we hope that the problems related to the equation of state will be overcome in
the foreseeable future.
ACKNOWLEDGEMENT S
This work has been in part supported by NSF grant AST 74 - 24343 A 0].
A part of this paper was written while the author held the Gauss Professor-
ship of the Gbttinger Akademie der Wissensehaften. I am grateful to the
Akademieand to Professor H.H. Voigt and the other astronomers of the Gbttingen
University Observatory for their kind hospitality.
t17
REFERENCES
ALTROCK, R.C, and MUSMAN, S. 2976, Ap J. 203, 533
ANDO, H. and 0SAKI, Y. ]975, P.A.S.J. 27, 58]
BAGLIN, A. and VAUCLAIR, G. 1973, Astron. Astrophys. 27, 307
BECKERS, J.M. 1968, Solar Phys. 5, ]08
BECKERS, J.M. and CANFIELD, R.C. 1976, in Physique des Mouvements dans les Atmospheres Stellaires (ed. R. CAYREL and M. STEINBERG) Editions du C.N.R.S., p. 207
BECKERS, J.M. and MOKRISON, R.A. 2970, Solar Phys. 14, 280
BODENHEIMER, P. 2965, Ap J. 142, 451
BODENHEIMER, P. 1966, Ap J. 24_~4, 103
B~HM, K.H. 1963, Ap J. 138, 297.
B~HM, K.H. 2966, Z.f. Naturforschung 2l_~a, ]107
BOHM, K.H. 1968, Astrophys. Space Sci. 2, 375
BOHM, K.H. 2970, Ap J. 162, 919
B~HM, K.H. and CASSINELLI, J.C. 197], Astron. Astrophys. 12, 21
BSHM, K.H., CARSON, T.R., FONTAINE, G. and Van HORN, H.M., 1976, in preparation
CANFIELD, R.C. and MEHLTRETTER, J.p. 1973, Solar Phys. 33, 33
CARLIER, A., CHAUVEAU, F., HUGON, H. and R~SCH, J. ]968, Comptes Rendus 266, 199
DEUBNER, F.L. 2971, Solar Phys. 17, 6
DEUBNER, F.L. I975, Astron. Astrophys. 44, 371
DEUBNER, F,L, 2976, Astron. Astrophys. 47, 475
EDMONDS, F.N. and WEBB, C.J. 2972, Solar Phys. 25, 44
FONTAINE, G. and Van HORN, H.M. 1976, Ap J. Suppl. 3]_, 467
FONTAINE, G., Van HORN, H.M., BOHM, K.H. and GRENFELL, T.C. ]974, Ap J. 193. 205
FRAZIER, E.N. 1968, Ap J. 152, 557
FRAZIER, E.N. 1970, Solar Phys. 24, 89
GINGERICH, O., NOYES, R.W., KALKOFEN, W. and CIINY, Y. 1971, Solar Phys. 28, 347
SPIEGEL, E.A. 1968, Highlights in Astronomy (ed. L. PEREK) Dordrecht: Reidel Publ. Co.
STENFLO, J.O. 1973~ Solar Phys. 32, 41
STRAUSS, J.M., BLAKE, J.B. and SCHRAMM, D.N. 1976, Ap J. 204, 481
STRITTMATTER, P.A. and WICKRAMASINGHE, D.T. 197], M.N.R.A.S. 152, 47
VAN HORN, H.M. 1970, Ap J. (letters) ]60, L 53
WALLERSTEIN, G.W., HERBIG, G.H. and CONTI, P.S. ]965, Ap. J. 141, 610
WEGNER, G. 1972, Ap J. 172, 451
WEIDEMANN, V. 1975, in Problems in Stellar Atmospheres and Envelopes, (ed. B. BASCHEK, W.H. KEGEL and G. TRAVING) Heidelberg - New-York: Springer Verlag, p. 173
WEYMANN, R. and Sears, R.L. ]965, Ap J. ]42, ]74
WICKRAMASINGHE, D.T. and STRITTMATTER, P.A. 1970, M.N.R.A.S. ]50, 435
WORDEN, S.P. 1975, Solar Phys. 45, 52]
WORDEN, P.S. and SIMON, G.W. 1976, Solar Phys. (in press) abstract: Solar Phys. 45, 547 (1975)
ZAPPALA, R.R. 1972, Ap J. 172, 57
EVOLUTION PATTERN OF THE EXPLODING GRANULES
O. Namba and R. van Rijsbergen
Astronomical Institute, Utrecht
SUMMARY
The evolution pattern of the so-called exploding granules has been studied on
the basis of a time sequence from Princeton Stratoscope pictures of the solar granu-
lation. Some preliminary results are presented (section3). A new interpretation of
the phenomenon is suggested (section 4).
1. INTRODUCTION
During the morphological study of the solar granulation (Namba and Diemel,1969)
we were puzzled by some large granules which have a round darkening in their centre
often with several dark canals radiating from it toward the boundary. In 1967 we began
a study of this remarkable class of granules, which are now referred to as "exploding
granules" though the term is somewhat misleading. The exploding granule phenomenon was
discovered by Roseh and his co-workers (Carlier et al., 1968) in moving pictures ob-
tained at the Pic du Midi Observatory, an example of which was shown during the 1967
I.A.U. Meeting at Prague, Musman (1972) has proposed the first theoretical model of
the phenomenon on the basis of his laboratory experiment and observations obtained at
the Sacramento Peak Observatory. In this paper we present some observational features
of exploding granules, which may be of some importance for the theory of non-stationary
convection.
2. OBSERVATIONAL MATERIAL
The observational material is a time sequence of high-definition photographs of
the granulation obtained on the Stratoscope flight of August 17, 1959. The duplicate
negatives on 35-mm film were kindly lent us by the Princeton University Observatory.
The photographs were taken in the green-yellow region with an exposure time of 0.0015
sec at a rate of a frame every 0.929 see; the highest spatial resolution attained is
0.112" or 271 fun on the Sun.
The time sequence lasts about ]6min (Frame Nos. 2145 - 3200) and refers to a
quiet region near the disk centre; some of the frames are illustrated in Bahng and
Schwarzschild (196;), where the observational data are also given, The search for ex-
ploding granules and the study of their evolution have been done on positive enlarge-
120
a
2 1 5 3
Ore06 s
2413
4m07 s
i b 2 1 8 8
0 m 3 8 s
C
2 2 5 9
l m 4 4 s
g 2 5 5 0
6m14 s
h
2 7 0 6
8 m 3 9 s
J
|
d 2311
2m:
2 8 0 3
10 m 0 9 s
e "~- > 1 0 m 5 7 s
2 3 5 3
3 m l l s 0 5 10" I . . . . I , , , , I
Fig. I. Evolution of a typical exploding granule (centre) - a time sequence from Stratoscope granulation photographs obtained on August I?, 1959. Indicated are frane numbers, relative times, the lifetime, and the distance scale (I" = 725 km). Note also remarkable stability of surrounding granules. (Small specks and dots in some pictures are blemishes.)
121
mants of about 50 selected frames (the image factor: l"= 4 mm for all and 1 ''= 6 mm
for portions including the granules treated here). The work was troublesome, because
the focussing of the image was widely variable and high-resolution pictures are not
uniformly distributed in the sequence. Furthermore, besides small shifts of the granu-
lation field in the frame, there is a drift of the solar image that divided the time
sequence into two parts, one lasting about 10 min and the other 6 min, with some spa-
tial overlapping (cf. Fig.2 below).
In this study seven exploding granules were examined. Fig. l shows a series of pic-
tures for a typical example. Although the picture quality is somewhat variable, the
series illustrates nicely its evolution (cf. Carlier et al.,196g and Musman, 1972).
On the positive prints we measured the area of granules as a whole and expressed them
in terms of the average diameter (D). For the pictures shown in Fig. l microphotometry
has been carried out with a Joyce Isodensitracer.
3. RESULTS
Although individual exploding granules behave rather differently, their evolution
pattern may be summarized schematically as follows. A granule appears as a bright spot,
which grows quickly. As the size increases, a vague shade forms over the central part
of the bright granule; it soon becomes a round dark hole - see Fig. l. As the central
darkening develops a few dark canals radiate from it toward the boundary. This may be
regarded as the primary splitting of the granule, usually into two or three smaller
granules. Around the time where the granule reaches its maximum extension the split
granules break further into several parts (the secondary splitting?), provided the gra-
nule is large. So, one can count from several to more than ten granules, split from a
single granule. The evolution after the fragmentation depends upon the granule. One of
the split granules may become a new exploding granule while others fade away. Carlier
et al. (1968) showed an example that developed three "generations" of exploding gra-
nule from a single entity. We should mention that the splitting is a very common pheno-
menon of granules: even for small granules (D < I") it is rather difficult to find such
granules that do not split.
From the study of seven exploding granules the following characteristic features
have been derived.
(a) Measured diameters are plotted in Fig.2 as a function of time. The maximum size
reached during the evolution was from 3.3" (2200 km) to 5.4" (4000 km) in diameter. The
expansion rate AD/At is more or less linear with a speed of ].3 to 3.3km/s. Hence, the
radial expansion rate AR/At ranges from 0.7 to 1.7 km/s, in ~greement with the earlier
estimates of 1.5 to 2 km/s (Carlier et ai.,1968) and of 1.8 km/s (Musman, ]972). These
values are only a fraction of the sound speed in the photosphere (~8 km/s), but they
exceed the horizontal flow velocity of 0.34 km/s (maximum) at the top of granules,
measured by Beckers and Morrison (1970).
Furthermore,there is a tendency that the expansion speed is proportional to the ma-
122
6'
5
t n,,
LLJ
F,3
I I I I I 2200 2400 2600 2800 3000 3200
TIME (Frame N o) ,,
Fig. 2. Growth of exploding granules: average diameter (I" = 725 km) vs, time (a fr~ne every 0.929 eec), The granulej illustrated in Fig. 1, is the third from the top.
x 1.0
E
",~ .s i .4
,2
I ! . . . . . I I I I I I I
Dmax I
1 i i i i i i i i i O. 0 -8 -6 -4 -2 0 +2 +4 .6 *8 +I0
TIME (minutes)
Fig. 3. Relative growth of the exploding granules, derived from Fig. 2. The ~verage rate of growth &In D/At ~ O.045/min.
I23
ximumdiameter reached. In Fig.3, where the mean diameter relative to the maximum dia-
meter is plotted versus time, all the growing branches cluster around a line (not drawn)
with a slope
A in D O.045/min. (I)
At
(b) The above relation suggests that in the sub-photospheric layer the upper part of
the granules may look like huge cones, as sketched in Fig.4.
(c) The central darkening occurs only when the granule is roundish, uniformly bright
and exceeds a certain critical diameter D c ~2.3" (1600 km). It does not show up in
large but elongated granules with a width smaller than this value.
The isophotometry of the pictures shown in Fig.] yielded interesting data about the
development of the central darkening for this particular granule. In the interval from
picture b to picture e, the central darkness increases linearly with time from 13 to
25% of the surrounding brightness and the diameter at half the central depression grows
from 700 to II00 km at a linear rate of 3.0 km/s while the granule expands at a rate of
3.2 km/s in the same period.
(d) The lifetime for the individual granules has been found to be much longer than
the"correlation"lifetime of 8.6 min measured by Bahng and Schwarzschild (1961). The
time sequence allowed us to determine only a lower limit of 10 - 11 min. But Fig.3
suggests'a lifetime of, say, half an hour. In Fig. I also the remarkable stability of
normal granules surrounding the exploding granule is apparent: many persist through
the |0-min time. This is the case also for small granules in the range of 0.8" ~ D
< ].7" (Namba and P. Provoost, ]973, unpublished).
(e) In the time sequence we counted about 20 certain and ]0 suspected exploding gra-
nules with the characteristic features (central darkening with or without radial canals)
over an area of roughly 70" x 90" at any given moment; they cover 2 to 3% of the total
area. In addition, thanks to the time sequence, we found about 10 granules, which be-
came exploding ones, and nearly twice as many already "exploded" granules.
The spatial distribution of exploding granules does not seem to be random, but
we lack further observational material to investigate whether there is any correlation
between the locations of exploding granules and the supergranulatlon pattern. Allen and
Musmen (1973) found no such correlation in their observations.
4. A POSSIBLE INTERPRETATION OF THE EXPLODING GRANULE PHENOMENON
Musman (]972) interpreted the phenomenon as follows. When a rising granule pene-
trates into the overlying stable region, the internal g=anular motions and the conser-
vation of angular momentum act to change the form of the granule into a vortex ring,
which is stretched out horizontally and breaks. This"smoke rlng model, however, meets
difficulties in explaining some observed facts, for example, why some granules are of
the exploding type while man~ others are not, and why a part of the broken ring may
124
a photospheric
level-h . . . . - - . . . ~
b = 8 0 0 km
C "iO Fig. 4. A possible interpretation of the exploding granule phenomenon -
development of the central darkening (vertical cross section). Cf. Fig. I. a: as the granule rises the top layer is cooled instantly; the cold-matter flows outj b: since the granule grows fast and large (D >~ 1800 kin) the cold matt'er is left behind the expanding granule boundaryj and the central darkening begins to form. It develops parallel to the growth of the granule~ c: when the mass of the cold matter in the centre becomes too heavy,-~t sinks, breaking through the granule. The development is accompanied with splitting of the granule. Borne of the split granules may grow further while others fade away.
125
become a new exploding granule.
The evolution pattern reported here suggests an alternative aualitative interpre-
tation which is illustrated in Pig. 4. As soon as a hot granule reaches the photospheric
level its top is cooled. At first the cooled matter flows horizontally (with a velocity
of ~ 0.5 km/s) and then downward to join the surrounding intergranular region (Fig.4a).
The upper layer of the steadily growing granule is removed continuously in this way.
This process goes smoothly as long as the granule size does not reach the critical dia-
meter mentioned above.
However, when the granule expands with a speed faster than the horizontal flow velo-
city, the cold matter can never reach the granule boundary and is left behind, and con-
sequently the central darkening begins to form (Fig.4b). By this time the granule dia-
meter may have reached the critical value D c. The central darkening develops parallel
to the growth of the granule (paragraph 3c above).
The mass of the cold matter accumulated in the centre of the granule is supported
by the buoyancy and the upward motion in the granule, until its total weight becomes
too large. Then the balance is lost and the cold mass sinks, breaking through the gra-
nule (Fig.4c).
The development of the central darkening may be accompanied with the fragmentation
of the granule. The onset of the downward streaming at the centre might induce the
secondary splitting.
Our interpretation predicts a downward motion in the central darkening after some
moment. It is of particular interest to determine the Doppler velocity in the central
darkening and its change during the evolution of exploding granules.
ACKNOWLEDGEMENTS
Our thanks are due to the late Dr R.E. Danielson of the Princeton University Ob-
servatory, who kindly supplied us with the valuable Stratoscope material. The Project
Stratoscope I was sponsored by NSF, ONR, and NASA, U.S.A. We thank Mr J.L.A. van
Hensbergen for his assistance in photographic work. We acknowledge Drs A.G. Hearn,
G.D. Nelson and C. Zwaan for stimulating discussions.
REFERENCES
Allen, M.S. and Musman, S., 1973, Solar Phys., 32, 3]]
Bahng, J. and Schwarzschild, M., 1961, Astrophys. J,, 134, 3]2
Beckers, J.M. and Morrison, R.A., 1970, Solar Phys., ]4, 280
Carlier, A., Chauveau, F., Hugon, M. and Rosch, J., 1968, C.R.Acad.Se.Paris,226,199
Musman, S., 1972~ Solar Physo, 26, 290
Namba, O. and Diemel, W.E., ]969, Solar Phys., !, 167
GRANULATION OBSERVATIONS
A. Nesis
Fraunhofer-lnstitut
F r e i b u r g i~Br., FaG
The a n a l y s i s o f h i g h r e s o l u t i o n s p e c t r o g r a m s t h a t were o b t a i n e d in
Capr i w i t h t h e Coud6 r e f r a c t o r gave t h e f o l l o w i n g r e s u l t s r a g a r d i n g
the g r a n u l a r f i e l d s ,
a) The rms v e l o c i t y of t he g r a n u l a t i o n was 300 m / s e c . Th i s v a l u e r e -
p r e s e n t s t h e raw d a t a and s h o u l d be s u b s t a n t i a l l y i n c r e a s e d to a c c o u n t
f o r i n s t r u m e n t a l and s e e i n g e f f e c t s (W. M a t t i g e t a l . , 1969) . Note
t h a t t h e o s c i l l a t o r y component of t he t o t a l v e l o c i t y f i e l d has been
e l i m i n a t e d by a n u m e r i c a l f i l t e r °
b) In the photosphere the intensity and velocity fluctuations show a
similar structure. The size distributions of the observed Doppler
shifts and intensity variations have been studied by means of power
spectrum analysis (Fig.l, ~. Mattig and A. Nesis, 197~)o These spectra
are not corrected for finite instrumental resolution or for atmospheric
seeing.
The question that we put to the theorists is which of the current
models of convection is able to interpret these observations?
Should one resort to the anelastic approximation for convection to
study our data, or must one work with the thermals picture, t o limit
oneself to just two examples?
Since we can arrive at an understanding of the granulation only by
careful and systematic observations, I believe that a practically
orientated dialogue between theory and observations is indispensable.
References
Mattig, W., Mehltre%ter, J.P., and Nesis, A.: 1969, Solar Phys. I__0, 25~
Mattig, W., and Nesis, A.: 1974, Solar Phys. 36, 3
127
Io 9 P{K}
3-
LM -\ \ \
k , , r - " - ~
13."8 3."8 1:9 1"2 0~85 0."75 i i i I I ' i
23 46 k 69 92 115 x lO'"km "~
Fig. J. The power s p e c t r u m of t h e i n t e n s i t y I ( x ) and o f t h e v e l o c i t y
v ( x )
LoK.: c o n t i n u u m l e v e l s ~ L, Mo: Line c e n t e r
SOME ASPECTS OF CONVECTION IN METEOROLOGY
R. S. Lindzen
Center for Earth and Planetary Physics
Harvard University
Cambridge, Massachusetts 02138
ABSTRACT
Various aspects of convection in meteorology which may have some relevance for astro-
physics are discussed. In particular the role of convection in determining the gross
thermal structure of the atmosphere, the treatment of convective turbulence in the boun-
dary layer, and the larger scale organization of convection are dealt with. ~
I. INTRODUCTION
As noted by Prof. Biermann during this conference, a number of seminal approaches
to convection in astrophysics such as convective adjustment and mixing-length theory
originated in meteorology. The purpose of this paper is to briefly review the current
status of such notions in meteorology as well as to report some relatively recent approa-
ches to meteorological convection which may prove useful in astrophysics.
Since most of what I discussed at the conference has appeared in the meteorological
literature 9 1 will tend to use this written version of my lecture as a selective anno-
tated directory to this literature rather than a complete version of the lecture. No-
thing remotely approximating a complete review of convection in the meteorological li-
terature will be attempted.
In section 2, I will describe the observed thermal structure of earth's atmosphere
and explain why simple radiative models with convective adjustment prove inadequate --
qualitatively and quantitatively. I will also outline our current understanding of
the observed structure -- though this understanding is by no means complete.
In section 3, I will introduce a current phenomenological approach to penetrative
convection in the atmosphere which may prove a more consistent alternative to convective
adjustment in astrophysics. In both sections 2 and 3~allusions will be made to the
fact tha~in the atmosphere, convection often occurs in relatively narrow plumes, and
that such convection is generally associated with local static stability. The point
which may be relevant to astrophysics is that convection is not necessarily related
to local temperature gradients in a simple way.
129
Section 4 deals with a particular observed feature of atmospheric convection :
namely, when broad regions (for example the maritime tropics equatorwards of 20-30 =)
are uniformly unstable (or conditionally unstable) convection does not occur in a uni-
formly distributed manner. Rather, the convecting system itself appears to be unstable
to so-called mesoscale systems (physically akin to internal gravity waves) which in turn
organize the convection on scales much larger than the scale of the convective elements.
The mechanism for this organization appears to be different when the convection is deep,
extending almost from the ground to the tropopause (cumulonimbus convection), and when
convection occurs within the middle levels of the troposphere; both are discussed in
section 4. The point, however, is that in the earth's atmosphere convection rarely if
ever manifests itself in the appearance of a pattern characterized by the scale of the
convective elements alone; larger scales of organization and motion almost invariably
appear. This may offer some insight into such phenomena as supergranules on the sun.
2. THE LARGE SCALE THERMAL STRUCTURE OF THE EARTH'S ATMOSPHERE
In Figure ! a somewhat smoothed picture of the height and latitude structure of
longitudinally averaged temperature is presented (only heights up to |6 km are shown).
In Figure 2 several height profiles of temperature appropriate to various latitudes are
shown. From Figure 2 we may note a certain similarity of profiles from various latitudes.
In most cases temperature decreases with height above the ground with a gradient of
about 6°/km until some level (known as the tropopause) where the temperature gradient ~T
goes to zero or becomes positive. The relatively uniform lapse rate (- ~-~ ) of the
lower atmosphere as well as the observation that radiative equilibrium profiles for the
lawer atmosphere are statically unstable led early on to the use of a convective adjust-
ment model to explain the thermal structure of the lower atmosphere (Gold, 1909; Emden,
1913; Goody, 1949). Several important difficulties have, however, arisen in connection
with such models (some were recognized almost immediately) :
i) Convective adjustment would lead to the adiabatic lapse rate in the lower atmos-
phere. This is 9.g°/km not 6=/km. It is sometimes suggested that the atmosphere adjusts
instead to the moist saturated adiabatic lapse rate. While this lapse rate is on the
order of 6°/km near the ground, by 4km the atmosphere, because of its low temperature,
saturates with very small amounts of water vapor and the saturated lapse rate differs
little from the dry value. Thus, the use of the saturated lapse rate is no solution to
this problem -- even ignoring the fact that the atmosphere is rarely saturated. It is
interesting to note that the application of convective adjustment to the earth's atmos-
phere leads to errors on the order of 30=K or 10% in absolute temperature.
ii) Convective adjustment in no way can account for the abrupt change in tropopause
height near 30 = latitude: equatorwards it is near |6 km, polewards it is near 12 km.
1 3 0
iii) Radiative-convective equilibrium does not account for the fact that in the
nelghbourhood of the tropopause, a temperature minimum exists at the equator.
iv) Finally and revealingly, close scrutiny of Figures ] and 2 indicates that the
lapse rate of the tropopause is not really uniform,especially at high latitudes.
Figure ]. Zonally (longitudinally) averaged temperature as a function of height
and latitude. Contours are lines of constant temperature (°C). After Palmen and
Newton (1969).
131
4C . . . . . . . .
~ 5 C - - - -
E
b.,,I C~ :::) 20 l - - i I-- .-I
1 0
180 5 ~
I i 45t /~1~..,,,,P I JANUARY AND
60£~.. ~/~./~/i/. 2" ~ SPRING/FALL,
,~/j -~---~ . . . . . . . . . ~- T
I / o |
\ h I
i -
"
200 220 2 4 0 260 2 8 0 TEMPERATURE (OK)
Figure 2. Zonally averaged temperature as a function of height for various lati-
tudes. After U.S. Standard Atmosphere Supplements (1966).
Our current understanding of the atmosphere's structure suggests no uniform expla-
nation for the whole globe. Recent work (Schneider and Lindzen, 1976; Schneider, 1976)
shows that within a certain neighbourhood of the equator (extending to about 30 ° lati-
tude) the atmosphere cannot sustain significant horizontal temperature gradients (in
many respects this region is similar to a spherically symmetric atmosphere where rota-
tion is not of great importance). Large scale dynamic effects in this region serve
primarily to homogenize (horizontally) the temperature in this region, and as a result
the vertical temperature structure of this region is indeed describable in terms of
radiative-convective equilibrium. However, because the convection occurs in relatively
narrow cumulonimbus towers, it leads to finite stability rather than neutral lapse rates.
How this occurs is outlined in Appendix ]. From about 30-70 ° latitude, horizontal tem-
perature gradients are significant and rotation is of basic importance. It is generally
believed that convection in this region is due to baroclinic eddies whose energy is
drawn from horizontal temperature gradients. These eddies tend to carry heat upwards,
132
and the rate at which these eddies stabilize the atmosphere is much greater than the
rate at which radiation acts to destabilize the atmosphere~so the question of convec-
tive adjustment does not arise. The stability achieved in this region is primarily
r~lated to the north-south temperature difference, and at the moment there does not
appear to be any basic reason why temperature lapse rates at middle latitudes should
be the same as they are in the tropics. A discussion of how baroelinio eddies act to
establish the lapse rate in middle latitudes may be found in Stone (1972, 1973). The
relevance of this process to astrophysics is not at all clear. Finally, the arctic-
antarctic ice and snow cover lead to high surface albedos and radiation tends to stabi-
lize rather than destabilize the atmosphere. This, in turn, tends to suppress baro-
clinic eddies. A comprehensive discussion of terrestrial atmospheric stability based
on numerical simulation may be found in Reid (1976).
3. PENETRATIVE CONVECTION AND MIXED LAYERS
One may reasonably ask, at this stage, whether convection in the earth's atmosphere
ever leads to a neutral lapse rate. The answer is almost certainly yes, but it is not
clear that even in these instances, convective adjustment is the correct approach.
We shall, in this section, look at one of the more extensively studied examples of
convective mixing: namely the convective mixing of the air near the ground where the
convection is forced by solar heating of the surface. A substantial number of pheno-
menological theories exist for this process and there is still a measure of controversy
surrounding them. I will sketch one typical example of such theories due to Tennekes
(1973). The geometry of the situation is shown in Figure 3 where profiles of both po-
tential temperature and convective heat flux are presented. At the bottom of the mixed
layer there is a thin superadiabatic layer dominated by mechanical turbulence. The
nature of this layer is ignored except insofar as it delivers a heat flux (O-W)o to the
interior; this heating forces the convective mixing which proceeds over a finite layer
of thickness, h, topped by an inversion layer with temperature jump, A. The region dO
above this jump is stably stratified with~-~z= ~. As heating continues, h increases
with time -- whence the name "penetrative convection". The picture thus far is reaso-
nably well observed over land in middle latitudes. As the mixed layer rises into a
warmer environment, the cooling of the entrained warmer air must give rise to a nega-
tive flux (~)i beneath the inversion. This is mathematically expressed as follows :
dh (3. I) (Ow)i = ~ d--f •
An equation may be written for the time evolution of A, on noting that the penetration of
the mixed layer into the stable interior tends to increase A, while the heating of the
mixed layer tends to decrease A :
dA _ dh ~O d-{- Y ~ " - ('~t') b . l . (3.2)
133
we ignore radiative processes, Ob.l. satisfies a simple budget :
__ Cp0( )b.]. = - 3--~ ( Cp 08w),
orjintegratlng over the mixed layer, we find
(3.3)
(~w) o - (e-~) i 38 (TC)b. i . : h
(3.4)
Substitution of (3,4) into (3.2) yields
dA dh (E~;)o (@w)i dt ? dt h + h
• (3.5)
and (3.]) together with (3.5) are generally taken as the basic equations for the sys-
tem. (~)o is given, and (3.1) and (3.5) then form 2 equations in 3 unknowns : A, h
and (ew) i . Clearly another relation is needed (and it is at this point that the bulk
of the controversy is engendered). Tennekes (1974) first considers the turbulent
energy budget near the inversion:
# - q2w) ÷ * , (36) o o
where q is the magnitude of turbulent velocity fluctuations and ~ is a dissipation
rate which is empirically found to be negligible near the inversion. (T o is a mean
temperature.) Thus (3.6) suggests that the kinetic energy generated by buoyancy is
consumed in bringing heat down through the inversion. Since buoyancy tends
to generate vertical velocity, and buoyancy acts th~Dughout the mixed layer,
(-~ q2w) ought to scale as follows:
3 ~z (2 q2w) ~ - 0 (-~), (3.7)
where o W
is the vertical velocity variance, and
T 3 -(~)i ~ o ~ (3.8)
g h
In addition since o~ is generated by (~)o
G 2
O,.v * gh 0
we have
and ow 3 ,x, gh ( ~ ) o (3 .9 ) T
0
134
Combining (3.8) and (3.9) we have
- (~)w)i = k (SW) o : (3.10)
k is a constant which is empirically found to be about 0.2. Equation (3,10) closes the
system described by (3.]) and (3.5). The resulting equations have been used (with
moderate success) to describe a variety of convective boundary layers, For the diur-
nal boundary layer, surface heating during the day causes h and Qb.|. to increase;
the heat thus deposited is carried away by radiation during the night when (~)o is
zero. This incidentally explains how there can be a turbulent heat flux into the atmos-
phere in the mean even though the mean stability may be positive.
d8
d--fiY Z Z
t+dt
~e 8W
Figure 3. The vertical distributions of potential temperature and turbulent heat
flux in and above a convective boundary layerl after Tennekes (]973),
To be sure, the concept of a diurnal boundary layer is hardly applicable in astro-
physics. However, the above approach has also successfully accounted for the semi-
permanent mixed layer of the tropical maritime atmosphere (Sarachik, 1974). In that dh particular case there exists a between-cloud subsidence which causes ~ in equ, (3.])
dh and (3.5) to be replaced by (~ - w) and an equilibrium solution exists wherein
135
dh d-~ = O. More germain to astrophysics would be the inclusion of radiation in the above
picture. Equ. (3.]), (3.3), (3.6) and (3.7) would all need modification since radia-
tion would not only alter the gross budgets but would also act to dissipate buoyancy.
1% is also conceivable that convection, if it were to occur in plumes~would not lead
to an adiabatic lapse (well mixed potential temperature) (see Appeudix ]). This might
affect the validity of (3.8) and (3.9) since the mean stability would inhibit buoyant
acceleration. The above, of course, all remains to be done, but it might conceivably
form a more satisfactory alternative to convective adjustment. The possibility of con-
vection leading to inversion "discontinuities" etc. might have significant implications
as well.
4. MESOSCALE ORGANIZATION OF COI'~ECTION
We turn now to a last and somewhat different aspect of atmospheric convection.
Even when rather broad regions of the atmosphere are relatively uniformly unstable
(or more typically conditionally unstable with respect to moist processes), convec-
tion (in the form of cumulus clouds) rarely if ever occurs in a uniformly distribu-
ted manner. Instead, the convection is almost always organized into systems whose
scale is typically I-2 orders of magnitude larger than the scale of the cumulus
clouds themselves. The larger scale (100-4001~) is referred to in meteorology as
the mesoscale. Cloud clusters and squall systems are examples of mesoscale systems.
Mesoscale organization appears to be an intrinsic feature of atmospheric convection.
For certain types of atmospheric convection the relation to mesoscale organization
seems reasonably clear. In these cases moisture is concentrated near the surface
(in the first 2 kilometers of the atmosphere typically) and virtually the entire
depth of the troposphere is conditionally unstable. Such situations tend to be
characterized by intense cumulonimbus convection. The rainfall in such situations
tends to satisfy a simple moisture budget where the rainfall (and hence latent heat
release) is proportional to the convergence of moisture (plus evaporation where this
is relevant). Moreover~ since the moisture tends to he confined to Z < Z T (where Z T
is typically 2 km), the convergence of moisture tends to be proportional to the ver-
tical velocity at Z T. Finally the latent heat release is significant for the larger
scale motions. In the presence of an internal wave perturbation (which produces
convergence) one can imagine an interaction of the sort indicated below:
Latent Heating
J \ Surface convergence ~-" -- Internal waves
136
If the internal waves produced by latent heating produce more surface convergence
(in the proper phase) than is needed to maintain the wave, the system will be un-
stable. This mechanism is referred to as wave CISK (conditional instability of the
second kind), and is described in greater detail in Lindzen (1974). CISK is used
to describe any collective instability of cumulonimbus convection and larger scale
motions. The concept was introduced by Charney and Eliassen (1964) in connection
with hurricane generation. The mathematical problem in the present instance con-
sists simply in the solution of the equation for thermally forced internal gravity
waves which takes approximately the following form:
d2w+ ~2w = Q(z) (4.1)
dz 2
where are proportional to elk(x-ct).'" w is the vertical velocity, Q is all fields
proportional to heating, x is a harizontal coordinate, k is a horizontal wavenumber,
and c is a horizontal phase speed which may be complex (for unstable solutions).
For our purposes N2
~2 ~ C 2 (4.2)
where N is the Brunt-Vaisala frequency. Now it is an easy matter to write the solu-
tion for w (satisfying suitable boundary conditions) as functional of Q(z):
w ffi F c [Q] ( 4 . 3 )
where w depends on c (and z) as well as Q, But Q is proportional to W(ZT), and (4.3)
becomes
w(Z) = F c [q(Z')w(ZT)] (4.4)
where q(Z) is a specified function. At Z = Z T (4.4) becomes
w(Z T) = F c [q(Z')W(ZT)] (4.5)
which proves to be possible only for certain values of c--one of which is typically
associated with the greatest degree of instability. Current calculations indicate
that the imaginary part of c is much smaller than the real part and that for common
terrestrial situations Re(c) ~ 15m/s. Since solutions are of the form e Ik(x-ct),
growth rates are equml to k x Im(c) and one might infer that maximum growth rates
are achieved as k ÷ ~(and as the frequency k Rec ÷ ~ also). This, however, is in-
consistent with the fundamental premise of C!SK: namely, that convection is organ-
ized by large scale convergence. Clearly such organizatiun cannot be achieved on
time scales shorter than characteristic development times for the clouds. For ex-
ample in the tropics cumulonimbus clouds have a characterlstic time scale of about
i hour, which suggests a maximum frequency, ~, of about (I hour) -I. Now
1 ~ k x 15 mls. ~ kc ~ 3600 see
137
1 Hence k
3600 x 15 m 2~
and horizontal wavelength ~-~ 2~ x 3600 x 15 m ~ 339~. (4.6;
In fact, both this wavelength, and the predicted phase speed are characteristic of
tropical mesoscale disturbances, implying that the maximum frequency suggested above
is, in fact, what is realized. A similar approach has been used by Raymond (1975)
to account for the structure and evolution of intense co=vective storms in the mid-
western United States.
The relevance of wave-CISK for astrophysics is questlo=able since there appears to
be no astrophysical counterpart to rainfall. However, it is also observed in the
earth's atmosphere that cumulus convection which is restmicted to relatively shallow
layers within the middle of the troposphere and which is associated with little
(and sometimes no) rainfall is also organized into mesoscale patterns. Latent heat
does not appear at first sight, to play a major role in forcing these mesoscale sys-
tems. In a recent paper, Lindzen and Tung (1976) have shown that the near neutral
Stability created by mid-level cumulus activity helps trap internal gravity waves in
the stable region below the clouds~ creating a duct wherein wave modes may exist
without significant forcing. The phase speeds of these ducted modes (determined pri-
marily by the thickness of the stable region below the clouds) are in good agreement
with observations. Furthermore, observed periods appear to satisfy the relation
2w (4.7) T~ave rcloud
JUSt as in the case of wave CISK disturbances. Given a duct phase speed, c, and a
characteristic cloud time scale Tclo~d, the mesoscale wavelength is again
wavelength ~ 2~ c Tclou d (4.8)
The means for interaction between the waves and the cloud field are not entirely
dear in this case. However, the period given by (4.7) is still the shortest period
on which any interaction could take place. Moreover, the well known degeneracy of
such features of convection as its plan form suggests that the organization of con-
vection might be responsive to relatively weak perturbations. Similary, the waves,
being dueted, call for only small forcing.
At this point it is worth noting that the earth's atmosphere can sustain a class
of free oscillations (Lamb waves) which do not require explicit ducting. These
By the above arguments we ought to expect organization of convection with wavelengths
given by (4.8) based on the speed of sound and Tclou d. There is no clear cut ohser-
vatlonal evidence available for this suggestion. However, the wavelengths obtained
are on the order of several thousand kilometers, and on the earth, regions on this
scale with relatively uniform conditional instability are rare. The situation ap-
pears somewhat more congenial on the sun where a convective layer exists over the
entire star. Identifying the convective elements with granules for which T ~ 5 min-
138
utes and taking c ~ I0 km/sec one obtains from e~. (4.8) that the dominant wave-
length ought to be 40,000 km. Whether it is purely an accident that this is also
the scale of supergranules remains to be seen. Less arguably, the above discussion
demonstrates rather clearly that the appearance of structures of a given horizontal
scale need not imply vertical scales of the same order. Similarly, terrestrial ex-
perience suggests that convectio~ rarely involves merely a single horizontal scale.
ACKNOWLEDGEMENTS
The author wishes to thank E, Spiegel for encouraging the preparation of this
manuscript 9 and the National Science Foundation for its support under Grant ATM-75-
20156.
REFERENCES
Arakawa, A. and W.H. Schubert, 1974 : Interaction of a cumulus cloud ensemble with the large scale environment. J. Atmos, Sci., 3_[], 674.
Charney) J. and A. Eliassen, ]964: on the growth of the hurricane depression. J. Atmos. Sci., 2_[I, 68
Emden, R., 1913: Uber Strahlungsgleichgewicht und atmosph~rische Strahlung. Sitz. d. Bayerische Akad. d. Wiss., Math. Phys. KlasRe, p. 55.
Gold, E., 1909: The isothermal layer of the atmosphere and atmospheric radiation. Proc. Roy. Soc. A, 82, 43.
Goody, R.M., 1949 : The thermal equilibrium at the tropopause and the temperature of the lower stratosphere~ Proc. Rcy. Soc. A, ]97, 487.
Held, I.M., 1976 : The Tropospheric Lapse Rate and Climate Sensitivity, Ph.D. The- sis, Princeton University, 2]7 pp.
Herman, G., and R.M. Goody, ]976: formation and persistence of summertime arctic stratus clouds. J. Atmos. Sci., 33, ]537-1553.
Lindzen, RoS., ]974: Wave-CISK in the tropics. J. Atmos. Sci., 3~], ]56.
Lindzen, R.S., and K.-K. Tung, 1976: Banded convective activity and ducted gravity waves. Mon. Wea. Rev., 104, in press.
Palmen, E. and C. W. Newton, ]969: Atmospheric Circulation Systems, Academic Press, New-York, 603 pp.
Raymond, D. J., ]975: A model for predicting the movement of continuously propaga- ting convective storms, J~ Atmos. Sci.~ 32, 1308.
Sarachik, E.S., ]974: the tropical mixed layer and cumulus parameterization. J. Atmos. Sci,, 31 , 2225.
139
Schneider, E.K., 1976: Axially sy~netrie steady state models of the basic state for instability and climate studies. Part II: Nonlinear calculations. J. Atmos. Sci., 3_33, in press.
Schneider E.K., and R.S. Lindzen, ]976: Axially symmetric steady state models of the basic state for instability and climate studies. Part. I; linear calculations. J. Atmos. Sci., 33, in press.
Stone, P.H., ]972: A simplified radiative-dynamical model for the state stability of rotating atmospheres. J. Atmos. Sci., 29, 405.
Stone, P.H., ]973: The effect of large scale eddies on climatic change. J. Atmos. Sci., 3__00, 521.
Tennekes, H., ]973: A model of the dynamics of the inversion above a convective boundary layer. J. Atmos. Sci., 30, 558.
U.S. Standard atmosphere supplements, ]966: available Superintendant of Documents U.S. Government Printing office, Washington, D.C. 20402.
APPENDIX ]. HEAT TRANSFER BY THIN PLUMES
The following discussion is based on work by Arakawa and Schubert (]974) concerning
cumulonimbus clouds. The present discussion, however, ignores moisture (both for sim-
plicity and because of its irrelevance to astrophysical problems). We shall consider
convection which occurs in plumes which occupy a small fraction of the total horizon-
tal area and which despite their small area contribute significantly to the mean ver-
tical mass flow. By "mean" we shall always refer to an average over an area large
compared to the cross-sectional area of plumes, but small compared to any large scale
flow. Our aim will be to parameterize the effect of plumes on this large scale flow.
Means will be indicated by overbars. The approach will be analogous to the use of
Reynold's averaging where the eddies will be convective plumes.
We will first partition the mean vertical mass flux into plume and environmental
(non plume) contributions: %
p-'w = Mp + M, (A.])
where p = density, Mp = plume mass flux and M = environmental mass flux.
poses the following quasi-Boussinesq continuity equation will suffice:
V o (pl) + ~ (~-6) : 0
For our pur-
(A.2)
(V • ~) will here refer to horizontal divergence of q.
consider an ensemble of plumes where
It will also prove useful to
Mp = EMi0 (A.3) i
140
Each plume may either be entraining mass from its environment in which case
,~Mi ~oi, ~Mi 8oi Ei = i--~z + P =~t )' -~z + ~ > 0 (A.4a)
or detraining into the environment in which case
,~Mi "-~)~°i'' ~Mi i D i = - t--~ + --~ +--~ < O; (A.4h)
o. is the fractional area occupied by the i th 1
p ied by a l l plumes i s
plume and the fractional area occu-
= Z o.. (A.5) p .l i
Mp satisfies the following budget :
~Mp Bop E - D = BT + P ' - - ~ , (A.6)
where
The static energy:
= E Ei entraining
plumes
= S Di
detraining plumes
s = c T + gZ (A.7) P
is conserved during adiabatic processes. The budget for s in the environment is given
by
op)psJ " ~ (M s) + QR (A.8) ~--- [(] - ~ = - V ~ (PVs) - Es + S DiSDi - --Bz , ~t d,p,
where l refers to a sum over detraining plumes and SDi is the static energy of
the d.p, i th detraining plume; QR represents radiative heating in the environment.
Using equ. (A.]), (A.2) and (A. 6) we may easily transform (A,8) to the following:
+d~p. Di (SDi - ~) + Mp ~- + Qr . (A.9)
We will now assume the following to be adequate approximations:
Let us finally assume each plume detrains at precisely that level where its static
energy equals that of the environment (i.e., where it looses buoyancy: this is con-
sistent with the known instability of decelerating jets). Then
Z Di(Ssi - ~) = 0 d.p.
and (A. II) becomes
~-~ + p ~ ~ . V ~ + pw ~ z = Mp ~ + ~. (A.12)
We see from (A.]2) that the primary effect of convective plumes on the environment 37
is to introduce a heating term Mp ~z " This term is easily interpreted: a portion
of pw (i.e., Mp) which rises in plumes does not give rise to adiabatic cooling in
the environment - and appears, therefore, as a heating term. If we ignore p-w, Mp must
be compensated by equal subsidence which does, in fact, lead to eompressional heating
in the environment. A crucial point which may be made from (A.12) is that if convec-
tive plumes are to supply heat which is then carried off by radiation, ~z must be
positive!
NUMERICAL METHODS IN CONVECTION THEORY
N. O. Weiss
Department of Applied Mathematics and Theoretical Physics
University of Cambridge
SUMMARY
Two and three-dimenslonal computations have enlarged our understanding of non-
linear convection, particularly in Boussinesq fluids. However, we cannot
adequately predict the relationship between convective heat transport and the super-
adiabatic temperature gradient. Nor is there any indication of a preferred length
scale, other than the depth of the convecting layer, in a compressible fluid.
I. INTRODUCTION
The standard procedure for calculating the structure of stellar convection
zones is to use mixing length theory, calibrated to fit the sun and neighbouring
stars on the main sequence. Mixing length theory is based on plausible physical
assumptions and seems to provide qualitatively acceptable results but, as Dr. Gough
has emphasized, it lacks any firm theoretical basis. The principal need in astro-
physical convection is for a soundly based theory that can confirm, or replace , the
procedure now adopted. In particular, we would like to establish the functional
relationship between the convective heat transport and the superadiabatic temperature
gradient, and to determine the preferred scale of convective motion. Spiegel's
(1971b, 1972) excellent review of astrophysical convection contains a thorough dis-
cussion of the basic fluid dynamical problem and includes a full list of references,
which has been brought up to date by Gough (1976). go I shall limit myself to
describing recent progress towards understanding nonlinear convection by solving
model problems numerically on a computer.
Direct observation of solar convection reveals cellular patteraa. Hot gas
rises, cold gas sinks and the lifetime of an individual cell is of the same order as
the time taken for fluid to turn over in it. The photospheric granulation has a
horizontal scale similar to the local density or pressure scale height, and comparable
with the thickness of the strongly superadiabatic layer at the top of the convective
zone. Supergranulea have diameters about 15 times larger; their relationship to
features with strong magnetic fields implies that they correspond to more deep-
seated convection. There are also suggestions of motion on a scale comparable to
the depth of the convection zone, while speckle photometry indlcates that there may be
large scale convective cells in the outer layers of red giants.
Observations provide few constraints on the relationship between heat flux and
temperature gradient. Nor can the parameters appropriate for astrophysical convection
be modelled in laboratory experiments. Hence we must attempt to solve the governing
equations which, since they are nonlinear, have to be tackled on a computer. The
143
full problem is still too difficult. So it is necessary to make various simplifying
geometrical and fluid dynamical approximations. We hope that a better description
of astrophysical convection will eventually emerge from the results of a sequence
of idealized numerical experiments.
2. THE IDEALIZED PROBLEM
Let us consider convection in a horizontal layer, heated uniformly from below
and confined between the planes z = O, d, where the z-axis points vertically
upwards. In the absence of motion the superadiabatic temperature gradient
~ ]
where T is the temperature and the adiabatic gradient
Cp Here g is the gravitational acceleration, Cp the specific heat at constant pressure
and = the coefficient of thermal'expansion (for a perfect gas = = l/T). In the
Boussinesq approximation we assume that the layer depth d is much smaller than the
temperature scale height Cp/g~ and that the Mach number U/e s <<I (where U is a
typical velocity and c s is the velocity of sound): then the velocity ~ and density
satisfy the equations
where %, T O are constant, and the configuration is described by two dimensionless
parameters, the Rayleigh number
R =
and the Prandtl number ~ = v/X , where ~, ~ are the thermal and viscous diffusivl-
ties. The heat flux can be expressed in terms of the dimensionless number
where the total heat flux is Cp~ F. For an infinite layer N is a function of R
and ~ only.
In most laboratory experiments the convecting fluid is confined between rigid
boundaries at which ~ vanishes, and the resulting flow is dominated by viscous
boundary layers. These boundary conditions are inappropriate for stars and it
usual to assume that the tangential stress and normal velocity vanish at the
surfaces z = O, d, which are held at fixed temperatures T O + ~d, T O . These "free"
boundary conditions are dynamically fairly passive and mathematically convenient.
Nevertheless, any technique or theory should be capable of describing experimental
results correctly before it is applied to astrophysical convection.
A bewildering array of power laws has been put forward for the function
N(R, ~). For R >> 1 an asymptotic upper bound with N ~ R ½ has been established
(Howard 1963, Busse 1969). At high Prandtl numbers NNR I/3 for free boundaries
but the radiative conductivity is high in stars and the Prandtl number is therefore
small~ For ~ << I we might expect that the energy flux should not depend explicitly
144
on the viscosity W , so that N = N(S), whore S = ~R. In the sun, ~m 10 -9 but
S is typically of order 1012 . Arguments can be found for suggesting asymptotic
power laws of the form N ~ S r with r = 2, ½, 1/3, 1/4, 1/5 (Spiegel 1971 a,b;
Gough and Weiss 1976; Jones et al. 1976; Gough et al. 1975) but it is not obvious
which, if any, of these exponents is correct.
3. BOUSSINESQ CONVECTION
In the Boussinesq approximation the pressure can be eliminated by taking the
curl of the equation of motion. The time-dependent equations then become
and ~' : V^(~^~ - ~T^ 9 * ~V~
together with V.H = O, where the vorticlty ~ : V^~ and e, is a unit vector in the
z-direction. In two dimensions, with motion confined to the xz plane and indepen-
dent of the y co-ordlnate, the vnrticity has only a y-component and the velocity
can be expressed in terms of a stream function ~ such that
where @~ is a unit vector in the y-direction.
The vorticity equation then reduces to
~ _ ~ ~ ~ V~ .
3.1 Rigid boundaries
Convection sets in at the critical Rayleigh number R c and two-dimensional
solutions for R ~ I000 R have been available for some time (e.g. Fromm 1965, c
Schneck and Veronis 1967, Plows 1968). Busso (1967) first showed that two-dimenslonal
solutions at infinite Prandtl number may be unstable to three-dimensional pertur-
bations and the development of rolls into three-dimensional and time-dependent regimes
has been studied experimentally (e.g. Busse and Whitehead 1974) for fluids with high
Prandtl numbers. The stability of two-dimensional rolls was systematically inves-
tigated by Clever and Busse (1974): for Prandtl numbers of order unity, the rolls
develop a wavelike oscillatory instability when R ~ 3.5 R c. The most thoroughly
investigated case is convection in air ( ~ = 0.7) for which Veltishchev and Zelnin
(1975) and Lipps (1976) have computed three-dlmensional solutions with R 4 15 R c.
At low Rayleigh numbers Lipps' numerical experiments show the development of rolls
whose preferred width differs from that which maximizes the heat transport. As R
is increased, the oscillatory instability appears and solutions become time-
dependent. For R ~ 15 Rc, motion is three-dimensional and aperiodic. However the
change from two to three dimensional convection does not greatly affect the time-
averaged Nusselt number. These numerical results are all supported by experiments
(Willis and Deardorff 1967, 1970; Krishnamurti 1970a,b, 1973; Brown 1973).
Unfortunately, apart from the experiments by Rossby (1969), few results are available
for low Prandtl number convection.
145
3.2 Free boundaries
Convection in two-dimensional rolls has been studied in numerical experiments
with R 6 IO00 Rc (Fromm 1965, Veronis 1966, Moore and Weiss 1973). For high Prandtl
number ( V >> R ~) there are steady solutions with N~R I/s, which ere apparently
stable (Straus 1972). For V << i, the Nusselt number depends only on R and for R>>]
N ~ R 0"36 (Moore and Weiss 1973). In these laminar solutions the vorticity ~ is
nearly constant on the streamlines. The nonlinear term in the vorticity equation
remains small even when the Reynolds number is large: the rolls behave like flywheels
and are slowly accelerated until, after they have turned over many times, the buoyancy
torque is eventually balanced by friction. However, the oscillatory instability sets
in near the critical Raylelgh number for v << I (Busse 1972) and the rolls should
break down into three-dimensional cells.
In the two-dlmensional solutions, rising and falling plumes are exactly symmetrical
but this syrmmetry is no longer present in, say, a hexagonal cell where fluid can rise
in a central column and sink around the perimeter of the cell. It was conjectured that
this geometrical change might affect the physics so that N could depend on S for
<< i. So we investigated axisymmetric convection in a cylindrical cell (Jones et al.
1976). This idealized model is mathematically two-dlmensional but geometrically three-
dimensionalj though the cells cannot be packed together to fill a plane. Referred to
%ylindrical polar co-ordinates (r,~,z) the velocity is given by a Stokes stream
function ~ (r,z) such that
where e~ is a unit vector in the ~-dlrectlon, and the vortlcity
= r~l _e e ,
where
~'-~ - r ~ '7
We found, however, that the convective flux was similar to that for two-dimensional
rolls. For high Prandtl numbers N~R I/3 again, while for f<< i N ~R 0"4 approximately.
So the Nusselt number is still independent of • and approaches closer to the upper
bound. The form of the solutions is displayed in Fig. i, which shows streamlines,
isotherms and profiles of the modified vorticity~ for a steady solution with
= O.O1, R = IOO R . Vortex tubes are stretched as they move away from the axis with c
the fluid and~l is nearly constant along streamlines. So flywheel solutions exist in
a cylinder and would also, presumably, appear in hexagons.
Are these solutions stable? Jones and Moore (1977) have recently shown that the
axis~mmetric flow is Unstable to non-axisymmetric perturbations. Thus a cylindrical
cell can fragment into sectors, like unstable vortex rings in laboratory experiments
(Widnall and Sullivan 1973, Widnall, 1975). As the Reynolds number increases, three-
dimensional convection cells should therefore become unstable and split up. Such e
phenomenon is observed in the sun: large granules explode and break up into smaller
146
TZ
F (a)
F
(b)
Figure i. Axisymmetric convection in a cylindrical cell. Results for R = i00 Rc, p = 0.O1. (a) Isotherms and streamlines: equally spaced contours of T(left) and ~ . (b) Profiles of the modified vorticlty~, which is nearly constant along the stream- lines.
147
cells (Musman 1972). In a tesselated convection pattern there may also be collective
instabilities which allow vortex rings to reconnect, so that cells are swallowed up
and disappear.
3.3 The modal approximation
An alternative approach to three-dimensional convection has been to adopt a
truncated modal expansion. For example, the vertical velocity w can be expanded in
eigenfunctions of the two-dimensional Laplacian operator:
,
The single mode expansion, normalized so that ~= O, f2 = i, 7 ~ 2C (where the bars
denote horizontal averages) has been studied in great detail (Gough et el. 1975;
Toomre et ai. 1977) and numerical solutions have been obtained for Rayleigh numbers
up to 1025. In this approximation the plan form of a convection cell is prescribed
by the linear eigenfunction, and enters the equations through the parameter C. For
two-dimensional rolls C = 0 and the equations reduce to the mean field approximation;
for cylinders C = 0.18 and for hexagons C = 0.41.
With rigid boundaries the results for a single mode agree quite well with experi-
ments; with free boundaries N~R I/3 when ~ >> I but N N (S in S) }/5 for R -| << ~ << |.
The imposed plan form generates a large nonlinear term in the vorticity equation. If
the flow is constrained only to be laminar and steady then it can adjust its plan
form to make ~^(~^~) very small and the effective dissipation can therefore be
reduced.
The modal expansion and the flywheel solutions are two extremes. We might expect
that instabilities would limit the lifetimes of three dimensional convection cells,
so that they are comparable with the turnover time and laminar flywheel solutions
cannot be attained. Then N should depend on S, though it is not clear what power law
would hold. This problem will not be resolved until the results of fully three-
dimensional computations have become available. Meanwhile, the power law derived
from mixing length theory (NNS ½ for S >> i) remains as good as any other.
4. COMPRESSIBLE CONVECTION
The Boussinesq approximation is manifestly inadequate for stellar atmospheres
that extend over many scale heights (the density increases by a factor of 106 in the
solar convection zone). It i8 commonly supposed that the dimensions of convection
cells should be of the same order as the local density or pressure scale height.
This assumption fits the photospheric granulation and some physical arguments can be
adduced to support it (Schwarzschild 1961; Weiss 1976). In mixing length theory
(which is essentially Boussinesq) the mixing length is generally set equal to some
multiple of the local pressure scale height. It would be comforting to have some
theoretical justification for this choice of length scale.
Linear theory gives no help: in a polytropic atmosphere convection sets in with
a horizontal scale that is comparable with the layer depth, even for the complete
148
atmosphere where the scale height shrinks to zero at the upper boundary (Spiegel
1965; Gough et al. 1976; Graham and Moore 1977). At supercritical Rayleigh numbers
modes with smaller horizontal scales have higher growth rates, at least when dis-
sipation is ignored (see Spiegel 1972). B~hm (1967) discussed the growth rates of
linear modes in a model of the solar convection zone, neglecting turbulent viscosity,
and found that the growth rate increased monotonically with the horizontal wavenumber.
Vandakurov (1975a,b) has included the effects of an eddy viscosity and found a
maximum growth rate for cells with a horizontal scale intermediate between those of
granules and supergranules. In these gravest modes there is no reversal
of the velocity, though the amplitude is strongly peaked near the surface. A
preliminary study of the marginal stability problem (Bbq~m 1975) indicates that there
may be internal nodes but their interpretation is obscure. (The reversal in the
temperature perturbation reported by Vickers (1971) is apparently due to
numerical error.) Smaller length scales seem to be produced not by the density
variation but by the strongly superadiabatic gradient, coupled with ionization, near
the top of the convective zone.
In nonlinear studies sound waves can be filtered out by using the anelastie
approximation (Gough 1969) which is valid provided the Mach number remains small.
This has been applied, using the modal approximations to study (inefficient) con-
vection in A-type stars (Latour et al. 1976; Toomre et al. 1976). However, no
careful study of the transition from Bousslnesq to compressible convection has yet
been carried out.
Dr. Graham will describe his numerical experiments on fully compressible non-
linear convection in two and three dimensions. For steady convection in two-
dimensional rolls the eye of an eddy is no longer at the centre of the cell but is
displaced downwards and towards the sinking plume (Graham 1975). This asymmetry is
observed in solar granules, which show a broad column of hot gas, rising at their
centres, surrounded by narrower, more rapidly sinking ring of cold material (Kirk
and Livingston 1968; Deubner 1976). Graham finds no evidence for small scale motion;
convective cells extend across the entire layer, even when the density varies by a
factor of 50. In studying compressible convection it is most straightforward to
assume that the dynamic viscosity ?g is uniform. Then the viscous term dominates
the equation of motion near the upper boundary, where the density is small (Gough
et al. 1976). If the aim is to represent turbulent dissipation by an eddy viscosity,
then the kinematic viscosity ~ can be obtained from a model of the convection zone.
For the sun~ ~ is roughly constant (Cocke 1967, B~hm 1975). However, Graham finds
that the cell size is not altered by setting ~ constant across the convecting layer.
So fare the only suggestion of small scale motion has come from some nonlinear
calculations by Deupree (1976), whose resolution is too coarse for the results to
be credible. Unless further computations on compressible convection reveal some
new pattern of behaviour, we shall have to suppose that the observed scales of con-
vection in granules and supergranules are caused by boundary layers near the surface
149
of the sun, rather than by the changing density scale height. If so, the mixing
length cannot be locally determined and mixing length theory is, at best, reliable
only near the surfaces of main sequence stars.
REFERENCES
B~hm, K.-H., 1967. Aerodynamic phenpmena in stellar atmospheres (IAU Symp. No. 28) ed. R. N. Thomas, p. 366j Academic Press, London.
g~hm, K.-H., 1975. Physique des mouyements dans les atmosph_~r@s st ellaires , ed. R. Cayrel and M' Steinberg, p. 57, CNRS, Paris.
Brown, W., 1973. J. Fluid Mech. 60, 539.
Busse, F. H., 1967. J. Math. Phzs. 46, 140.
Busse, F, H., 1969. J. Fluid Mech. 37, 457.
Busse, F. H., 1972. J. Fluid Mech. 52, 97.
Busse, F. H. and Whitehead, J. A., 1974. J. Fluid Mech. 66, 67.
Clever, R. M. and Busse, F. H., 1974. J. Fluid Mech. 65, 625.
Cocke, W. J., 1967. A strophys. J. 150, 1041.
Deubner, F. L., 1976. Astr. Ast[ophys. 47, 475.
Frorm, J. E., 1965. Phys. Fluid s 8, 1757.
Gough, D. 0., 1969. J. Atmos. Sci. 26, 448.
Gough, D. O., 1976. Trans. IAU 16A Part 2, 169.
Gough, D. O., Moore, D. R., Spiegel, E. A. and Weiss, N. 0., 1976. Astrophys. J. 206, 536.
Gongh, D. O., Spiegel, E. A. and Toomre, J., 1975. J. Fluid Mech. 68, 695.
Gough, D. O. and Weiss, N. 0., 1976. Men. Not. R. Astr. Soc. ll~76l, 589.
Graham, E., 1975. J. Fluid Mech. 7qO, 689.
Graham, E. and Moore, D. R. 1977. In preparation.
Howard, L. N., 1963. J. Fluid Mech. 17, 405.
Jones, C. A., Moore, D. R. and Weiss, N. O., 1976. J. Fluid Mech. 73, 353.
Jones, C. A. and Moore, D. R., 1977. In preparation.
Kirk, J. G. and Livingston, W., 1968. Solar Phys. 3, 510.
Krishnamurtl, R., 1970a. J. Fluid Mech. 42, 295.
Krishnamurti, R., 19705. J. Fluid Mech. 42, 309.
Krishnamurti, R., 1973. J. Fluid Mech. 60, 285.
Latour, J., Spiegel, E. A., Toomre, J. and Zahn, J.-P., 1976. As troph~s. J. 207, 233.
150
Lipps, F. B., 1976. J. Fluid Mech. 75, 113.
Moore, D. R. and Weiss, N. 0., 1973. J. Fluid Mech. 58, 289.
Musman, S., 1972. Solar Phys. 26, 290.
Plows, W., 1968. Phys. Fluids II, 1593.
Rosshy, H. T., 1969. J. Fluid Mech. 36, 309.
Schneck, P. and Veronis, G., 1967. Phys. Fluids iO, 927.
Schwarzschild, M., 1961. Astrophys. ~.t 134, I.
Spiegel, E. A., 1965. Astroph~s. J. 141, 1068.
Spiegel, E. A., 1971a. Comm. Astrophys. Space Phys. 3, 53.
Spiegel, E. A., 1971b. Ann. Rev. Astr. Astrophxs. ~, 323.
Spiegel, E. A., 1972. Ann. Rev. Astr. Astrophys. iO, 261.
Straus, J., 1972. J. Fluid Mech. 56, 353.
Toomre, J., Gough, D. O. and Spiegel, E. A., 1977. J. Fluid Mech. 79, i.
Toomre, J., Zahn, J.-P., Latour, J. and Spiegel, E. A., 1976, Astrophys. J. 207, 545.
Vandakurov, Yu. V., 1975a. Solar Phys. 40, 3.
Vandakurov, Yu. V., 1975b. Solar Phy.~. 45, 501.
Veltishohev, N. F. and ~elnin, A. A., 1975. J. Fluid Mech. 68, 353.
Veronis, G., 1966. J. Fluid Mech. 26, 49.
Vickers, G. T., 1971. Astrophy s. J. 163, 363.
Weiss, N. 0., 1976. Basic mechanisms of solar activity (IAU Symp. No. 71), ed. V. Bumba and J. Kleczek, p.ZZ9, Reidel, Dordrecht.
Widnall, S., 1975. Ann. Rev. Fluid Mech. ~, 141.
Widnall, S. and Sullivan, J., 1973. Proc. ROy. So~. A 332, 335.
Willis~ G. E. and Deardorff, J. W., 1967. phys. Fluids IO, 931.
Willis, G. E. and Deardorff~ J. W., 1970. J. Fluid Mech. 44, 661.
COMPRESSIBLE CONVECTION
Eric Graham
Department of Applied Mathematics and Theoretical Physics
University of Cambridge, England*
I . INTRODUCTION
Stellar convection zones often extend over several pressure scale heights and
convective velocities can be comparable to the local sound speed. Neither laboratory
convection experiments~or analytic solution of the non-linear equations are feasible
in such regimes. In order to gain insight into the details of s te l lar convection we
are obliged to use numerical simulations. At the present time, even this approach
cannot be applied to the parameter range typical of s te l lar interiors~ however sol-
utions can be obtained which extend over many scale heights and have non-negligible
Mach numbers. Under these conditions i t is necessary to employ the fu l l compressible
equations rather than the anelastic approximation (Gough [1] ) or the Boussinesq app-
roximation (Spiegel & Veronis [~ ).
2. THE PROBLEM
Rather than attempting to model a complete star, we w i l l employ a simplified
geometry. In this way we can fac i l i ta te the numerical calculation, while avoiding
the complexities of treating the transition between the convective zone and the opt-
ica l ly thin region. As a standard problem, we consider a gas confined in a rectangular box with slipp-
ery walls. The upper and lower faces are maintained at fixed but different temperat-
ures, T u and T I . The side walls are thermally insulating, A constant gravitational
f ie ld is imposed which has suff ic ient magnitude to produce a signif icant density var-
iation with height. The equation governing the problem are
and
Present address: National Center for Atmospheric Research, High Alt i tude Observatory,
P.O. Box 3000, Boulder, C'olorado 80303, USA.
t52
where ~ j = ~ ( ~ H- ~)X; -- ~- ~Xs" ,
S is the specific entropy,~} is the coefficient of viscosity, K is the thermal conduct-
i v i t y and al l the other symbols have their usual meaning.
Even i f we prescribe the equation of state, the functional forms of the conduct-
i v i t y and the viscosity and the aspect ratios of the box, we s t i l l have f ive degrees
of freedom in setting up the problem (see Graham E~). In addition we can choose our
i n i t i a l velocity, density and temperature distribution.
Numerical solutions for the two-dimensional problem have been presented by Graham
[3]. The most important parameters are found to be the Rayleigh number, R, the Prandtl
number,Or , and the layer depth parameter, Z, given by
= o p t / K , .
I t is sometimes useful to have solutions of the linear equations for the onset of
convection. These equations have been treated by Spiegel[4], Gough et al[5] and by
Graham and Moore[61. A relative Rayleigh number,~ , can be defined by scaling R by
the cr i t ical Rayleigh number of the linear problem.
3. NUMERICAL METHODS
A variety of numerical methods have been used for compressible convection. The
f i r s t solutions were obtained for two-dimensional motions using a modified Lax-Wendroff
f in i te difference scheme. This method is described by Graham [31. The scheme has
been generalised to three-dimensional flows. An alternative to the f in i te difference
method is the pseudo-spectral or collocation method. This has been successfully used for compressible convection equatiomby Graham (unpublished). Each dependent variable
is approximated by a truncated Chebyshev series in two or three space dimensions. The
series are substituted into the dif ferential equations. The time derivatives of the
coefficients of the series are determined by requiringthat the differential equations
be satisfied at selected collocation points. Because Chebyshev transforms can be cal-
culated using fast Fourier techniques, the method is economical. Both the Chebyshev
scheme and the Lax-Wendroff scheme suffer from numerical s tab i l i ty problems for low
Prandti numbers and large values of the Rayleigh number. Solutions have been obtained
withX =100,c=0.I and Z=IO. Current work is directed at developing an alternating
direction implici t f in i te difference scheme.
153
4. THE RESULTS
Because of the computational labour involved in obtaining three-dimensional sol-
utions, most of the calculations are restricted to two dimensional flows. The calcul- ations reported by Graham [3] relate to a perfect gas law and constant K and n~
A number of general results were found. 1. Two-dimensional solutions evolve to steady state flows. The time taken to reach a
steady state increases with increasing horizontal box dimension and decreasing or. This suggests that for more extreme configurations, there may be no steady solution.
2 '
Uu PP er/u lower
_ ~ t a n t V
constant
I I0 I00
Udownwa rd/uu pwa rd
, , J
1 I0 I00
Figure la Figure lb
0
0. :
Uupper/Ulower
const
c o n s t a n t "r I
1 10
Udownward/Uupward
constant
constant -~ " -
0 ,,,, ,, , I
0.I I
O-
10
Figure Ic Figure ld
154
2. There is an asymmetry between upward and downward velocit ies, downward velocities
usually being larger. Horizontal velocities are similar at the upper and lower surfaces,
with the lower velocity often being s l ight ly larger. This is a surprising result, par-
t icu lar ly for large values of Z, because continuity arguments have been proposed to su-
ggest that convective velocities are larger in low density regions.
3. When the horizontal box dimension is large enough to permit several convective ro l ls ,
the horizontal wavelength differs signi f icant ly from that which would maximise heat
f lux.
4. Convective cells extend over several pressure scale heights in the vertical direction.
No cases were found where the flow breaks up into several rol ls in the vert ical.
Further calculations have been performed with a constant kinematic v iscosi ty , ] / ,
rather than a constant dynamic v iscosi ty ,~. I t had been conjectured that the increase
of l / near the surface reduced the upper horizontal velocity. Figure la shows the ratio
of upper to lower velocity as a function of ~ with Z=10. We see that i t is only for
small values of ~ that the upper velocity is enhanced. Figure lb shows the correspond-
ing behaviour of the ratio of downward to upward velocit ies. The variations with Pran-
dtl number is shown in figures lc and Id. The general conclusion is that the solutions
are insensitive to the form of the viscosity law in the cases of large R and small G ,
which is the regime found in ste l lar convection zones.
1 \(
I I
j7
_ ::):
i
; J ,/
J /
Figure 2
155
Relatively few three-dimensional calculations have been performed. I f the horizontal
box size is comparable to the vertical size, two-dimensional flow patterns are found.
As the horizontal size is increased, the flow pattern becomes time dependent even for
modest values of ~ . Figure 2 shows a velocity f ie ld for Z=I and~ lO . In this per-
spective picture, a rectangular bite has been removed from one corner of the box to
reveal the interior. The arrows represent velocity components parallel to the faces.
The arrows are distributed at random with a probability proportional to the density.
The cut away portion shows that the f lu id has significant vertical vor t ic i ty . Such
regions are observed to be short lived, being dissipated and then reforming in a new
position.
5. CONCLUSIONS
Numerical simulat ion of compressible convection provides a way of obtaining a
detai led picture of s t e l l a r convection. At the present time, solut ions are s t i l l far
from the parameter range found in s t e l l a r i n te r io rs . However the solut ions are well
removed from the Boussinesq l i m i t of laboratory convection experiments. I t is to be
hoped that future developments in the form of more e f f i c i e n t algorithms for computat-
ional f l u i d dynamics, turbulence theories for handling the f ine scale features of the
flow and increases in the available computing resources wi l l al l help in attempts to
construct more real ist ic models of ste l lar convection zones.
REFERENCES
1 Gough, D.O., The anelastic approximation for thermal convection, J. Atmospheric
Sciences 26 (1969) pp. 448-456 2 Spiegel, E.A. & Veronis, G., On the Boussinesq approximation for a compressible
f luid, Astroph~s. J. 131 (1960) pp.442-447 3 Graham, E., Numerical simulation of two-dimensional compressible convection, J .
Fluid Mech. 70 (1975) pp. 689-703 4 Spiegel, E.A., Convective instabi l i ty in a compressible atmosphere I, As trophys.
J. 141 (1965) pp. 1068-1090 5 Gough, D.O., Moore, D.R., Spiegel, E.A. and Weiss, N.O. Convective instabi l i ty in
a compressible atmosphere I I , As__trophys. J. 206 (1976) pp. 536-542 6 Graham, E. & Moore, D.R., The onset of compressible convection, To appear
CONVECTION IN ROTATING STARS
F.H. BUSSE University of California, Los Angeles
SUMMARY
It is shown that many features of convection in rotating spheres and spherical
shells can be understood on the basis of plane layer models. The phenomenon of
differential rotation generated by convection is emphasized. The potential applications
and limitations of analytical and numerical models for problems of astrophysical
interest are briefly discussed.
I INTRODUCTION
In thinking about the effects of rotation in stars a variety of thoughts comes
to mind. Some are more negative: Rotation is a breaker of symmetry. It spoils our
notion of a star as an ideal spherically symmetric body which is in static equilibrium
except for the convection zones. Even the latter can be regarded as spherically symme-
tric with respect to their gross properties in the absence of rotation. Although the
deviations from spherical symmetry are small in most rotating stars, the effects of
rotation are only partially known and continue to irritate the theoretician involved
in computations of stellar evolution.
On the other hand stellar rotation is an exciting subject because of the variety
of interesting phenomena associated with it. The generation of magnetic fields is in
general connected with rotation. The shape of surfaces of equal potential in a
rotating star may become unstable in phases of contraction. Rotation can cause
meridional circulations and mixing processes and, in addition, there is a variety of
phenomena connected with differential rotation.
For the theoretical fluid dynamicist rotation brings to mind still other thoughts.
Whenever the Coriolis force becomes dominant the dynamics of fluids are profoundly
altered. The intuition developed from experience with hydrodynamics in non-rotating
systems is no longer valid. Intuitive concepts like mixing length theory appear to be
even less applicable in the case of low Rossby number convection, i.e. when the vor-
ticity of motion relative to the rotating system is small compared to the rotation
rate. On the other hand, theories developed for small amplitude convection appear to have
a much larger range of validity than in a non-rotating system. The two-dimensionality
enforced by a dominating Coriolis force tends to suppress instabilities and restricts
the degree of freedom for turbulent motion.
In the following theoretical and experimental results for the small Rossby number
N
o)
O)
a) Convection columns aligned with the axis of rotation.
b) Convection rolls in a
layer rotating about a
vertical axis.
FIGURE
]
158
case will be presented. Among the nonlinear phenomena caused by convection in rotating
systems we shall emphasize the generation of differential rotation. In discussing the
application to rotating stars we shall restrict our attention to the Sun and Jupiter.
The detailed surface observations available in both cases offer the best hope for
eventual quantitative tests of theoretical concepts.
2 BASIC EFFECTS OF ROTATION ON CONVECTION
The dynamics of nearly stationary motions in a rotating system are
governed by the Proudman-Taylor theorem which states that a small amplitude stationary
velocity field of an inviscld incompressible fluid must be independent of the
coordinate in the direction of the axis of rotation. It is of interest for astrophy-
sical applications that the theorem holds for barotropic fluids as well if the velocity
vector ~ is replaced by the momentum vector p~: By taking the curl of the equation
of motion
2 ~ x pv = - Vp- pV~
and using the equation of continuity
V- p~ = 0
the relationship
2~" V~ = 0 (I)
is obtained. In the following we shall restrict our attention, however, to the
case of incompressible fluids, or, more exactly, Boussinesq fluids for which the
temperature dependence of the density is taken into account in the gravity term
only.
We start the discussion of convection in rotating systems by considering two
simple cases as shown in Figure 1. In case (a) the vectors of gravity and rotation
are at a right angle and convection solutions satisfying the Proudman-Taylor theorem
are possible. The Coriolis force is entirely balanced by the pressure gradient in
that case and the critical value of the Rayleigh number for the onset of convection
becomes the same as in a nonrotating system. Since the Coriolis force always increases
the critical Rayleigh number unless it is balanced by the pressure, the solution
corresponding to convection rolls aligned with the axis of rotation is physically
preferred. It can be easily realized in the laboratory by heating a cylindrical
rotating annulus from the outside and cooling it from the inside and using the
centrifugal force as gravity (Busse and Carrigan, 1974).
While the stabilizing effect of the Coriolis force vanishes in case (a) it
reaches its maximum in case (b) when the vectors of gravity and rotation are parallel.
This is realized when a fluid layer heated from below is rotating about a vertical
axis. Release of potential energy by convection requiresa vertical component of
I
I I .~
..4 .
...
i -
s
(o)
g---
~
Convection columns in an annulus with
inclined top and bottom boundaries.
Fig
ure
2
: Convection layer inclined with
respect to axis of rotation.
f
/
(C)
Convection layer with
changing light.
160
motion which cannot occur without violating the Proudman-Taylor theorem. In order to
overcome the constraint of the Proudman-Taylor theorem viscous friction must become
sufficiently strong, thus playing a destabilizing role in this case. The non-dimen-
sional number describing the ratio between viscous friction and Coriolis force is
the Ekman number
E = ~ / ~ d 2 (2)
where d is the thickness of the layer and ~ is the kinematic viscosity. Since E is
very small in most applications, the horizontal scale of convection must become much
smaller than the vertical in order to increase friction. More detailed analysis (we
refer to Chandrasekhar's (|961) book) shows that the horizontal scale decreases like
E I/3 and the Rayleigh number for onset of convection increases like E -4/3 for small
E. Besides the Ekman number and the Rayleigh number, which is a measure of the buoyan-
cy, the Prandtl number is the third dimensionless parameter of the problem. It des-
cribes the ratio between thermal and viscous time scales of convection. For Prandtl
numbers less than a value of about I, oscillatory convection offers an alternate way
to overcome the constraint of the Proudman-Taylor theorem wihtout changing, however,
the power laws in the dependences on E.
3 EFFECTS OF INCLINED BOUNDARIES
The two extreme cases (a) and (b) of Figure I obviously correspond to equatorial
and polar regions, respectively, of rotating spherical fluid shells heated from within
and subjected to spherically symmetric gravity. There are, however, some important
deviations because of the finite dimensions of the spherical shells. To discuss these
effects let us consider the influence of inclined boundaries in (a) and (b). If top
and bottom boundaries are added in case (a) convective motions satisfying the Proud-
man-Taylor theorem are still possible as long as the boundaries are parallel and vis-
cous friction is negligible. Boundaries inclined with respect to each other, however,
require a dependence of the velocity field on the coordinate in the direction of the
axis of rotation, which we shall cal z-coordinate. A typical example is shown in
Figure 2 (a). The deviation from the Proudman-Taylor condition is accomplished by a
combination of time dependence and viscous friction in this case: Convection still
has the form of columns aligned with the z-axls, but the columns are travelling like
Rossby waves in the prograde or retrograde azimuthal direction depending on whether
the distance between top and bottom boundaries decreases or increases with distance
from the axis. In addition the azimuthal wave number ~ becomes large in order to
increase frictional effects. In the limit of small values of E we find
R = ( ~ ~ )4/3 , ~ = ( ; )I/3 , -I
(3)
/-
0 Figure 3:
Buoyancy force A and inhibition C
caused by inclined boundaries in
a sphere as a function of distance
S from the axis.
Figure 4:
Sketch of motion at the onset of
convection in a rotating sphere.
162
rotation rate, and n is the tangent of half of the angle between the inclined
boundaries. A detailed theory and an experimental study of the stabilizing effect
of inclined boundaries can be found in the papers by Busse (1970b) and Busse and
Carrigan (1974). In section 4 we show how the theory can be applied more or less
directly to the case of convection in a sphere.
Since convection at the mid-latitudes of a rotating spherical shell corresponds
to intermediate angles between gravity and the rotation vector, it may be anticipated
that it shows properties intermediate to those of the extreme cases (a) and (b) of
F~gure I. Indeed, it is easily shown (Chandrasekhar, 1961) that both the Rayleigh
number and the wave number at the onset of convection are governed by the expressions
derived for case (a) if the rotation rate ~ is replaced by its vertical component
cos ~ . Convection occurs in the form of rolls aligned with the horizontal component
of ~ , as indicated in Figure 2(b). Accordingly, the component of the Coriolis force
proportional to ~ sin ~ is balanced by the pressure and drops out of the dynamical
considerations.
When applying the theory of plane parallel convection layers to spherical shells
the strong dynamical coherence of the fluid along any line parallel to the axis of
rotation must be kept in mind. For this reason convection in a spherical shell
exhibits the effects of non-parallel boundaries even though the distance between the
boundaries is constant. Since the tangential surfaces to the spherical boundaries
of the shell are not parallel at the points intersected by the same line parallel to
the z-axis, the dynamics of convection exhibit the same effects as in the case of the
convective layer shown in Figure 2(c). The variation of "height" with distance from
the axis of rotation induces a wave propagation property of the convective motions
similar to that of the convection columns in Figure 2(a). Because of the particular
phase relationship between buoyancy force and motion the phase propagation velocity is
opposite that of Rossby waves, at least for Prandtl numbers of the order I/3 and
larger (Busse and Cuong, 1976).
4 CONVECTION IN ROTATING SPHERES AND SPHERICAL SHELLS
The problem of convection in a self-gravitating rotating fluid sphere has been
traditionally considered for the case of homogeneous internal heating. Both gravity
vector and temperature gradient vary linearly with distance r from the center in this
case. Roberts (1968) gave a detailed mathematical analysis of the problem. The physically
realized mode was determined by Busse (1970b).
An approximate solution of the problem can be obtained without any numerical
analysis by applying the concept of convection in a rotatin~ annulus, as shown in
Figure 2(a). Because of the coherence in the z-direction enforced by rotation and the
small length scale of the convection columns in the perpendicular direction,
convection in any cylindrical section of the sphere behaves as in the corresponding
163
Figure 5: Laboratory simulation of convection in a rapidly rotating sphere. The motions are made visible by small flaky particles which align with the shear.
164
annulus problem. In Figure 3 the stabilizing effect C of the Coriolis force owing
to the inclination of the boundary has been plotted as a function of the distance
s from the axis together with the buoyancy force A~ which is given by the product of the s-
components of gravity and temperature gradient, since the z-component of the buoyancy
force has little effect on the convection motion. The minimum of C/A at a distance s
of about half the radius indicates the cylindrical surface where the onset of
convection will occur as the critical value of the temperature gradient is reached.
Figure 4 gives a qualitative sketch of the solution of the problem.
The fact that only the component of gravity perpendicular to the axis of
rotation enters the dynamics in a first approximation is the basis for the laboratory
simulation of the convection process (Busse and Carrigan, 1976). By using centrifugal
force in place Of gravity and by cooling the sphere from the inside and heating it
from the outside the convection flow described above can be realized in a laboratory
experiment. The onset of convection occurs in the form of regularly spaced columns,
as shown in Figure 4. When the buoyancy force increases beyond the critical value,
the region of convection is extended until the entire sphere is filled by convection
columns. While amplitude fluctuations and the difference in the speed of propagation
cause deviations from the regular picture at low amplitudes the perfect alignment of
the columns persists, as shown in Figure 5.
The analysis of the spherical case applies directly to the equatorial region of
spherical shells outside the cylindrical surface touching the inner boundary at the
equator. In all cases the Rayleigh number for the onset of convection is lower in that
region than in the other parts of the fluid shell. Inside the cylindrical surface the
onset of convection can be described approximately by applying locally the theory of an
inclined convection layer if the effects discussed in connection with Figure32(b) and
2(c) are taken into account. Of particular interest is the prograde propagation of
convection modes everywhere except at the poles. An asymptotic analysis for different
radius ratios and for varying Prandtl number P is given by Busse and Cuong (]976).
Figure 6 shows the local Rayleigh number for onset of convection as a function of the
distance from the axis in a typical case. The corresponding wave number and frequency
of convection are also shown. The asymptotic results agree reasonably well with the
earlier numerical results obtained by Gilman (1975) at finite values of E in the case
P= I and for a radius ratio ~/~ = 0.8, which is appropriate for the solar convection
zone.
Figure 7 illustrates the most important feature of convection in a rapidly
rotating spherical shell: The change in the character of convection across the
cylindrical interface s = r i. While the vorticity of the motions is nearly z- independent
for s > r i the z- component of vorticity changes sign between lower and upper parts
of the convection cell for s < r.. This change in the symmetry of convection has z
important effects on the nature of the differential rotation generated by convection
and on the heat transport.
165
t00
RE-Z/3
50
Figure 6:
R a
\
\ ,
/
a i # / ,'~
,, . . - ; : q ~t ~
1 ; J
/
" ' ' " I I , i t I
.5 S
R
1 I I I
2
S ,,,"?'./wE-
Rayleigh number R for the onset of convection in a spherical shell with radius ratio r4/r~ = 0.6 as a function of distance S from t~e ~xis. Wave number a and frequency e of convection columns are shown by dashed lines.
I66
j/
I
/
Figure 7: Sketch of convection modes in a spherical shell.
iO s
RO
TATI
ON
AL
REG
IMES
D
UE
TO
CO
NVE
CTI
ON
IN
R
OTA
TIN
G
SP
HE
RIC
AL
SH
ELL
SO
LID
R
OT
AT
ION
R
~
T.8
T z
/3
I r,."
10 4
bJ
El
Z
I (3
Ld
.d
10 3
>
- S
OL
ID
RO
TA
TIO
N
R ~
0
,84
T z
/3
Pra
nd
tl
Num
ber
P=
1 S
tre
ss
fre
e
bo
un
da
rie
s
I0 z I0
z
IO 3
I0 4
I0 s
TA
YL
OR
N
UM
BE
R
T -"
" Figure 8:
Regimes of differential rotation from Gilman
(1976a).
IO s
168
5 NONLINEAR ASPECTS
The phenomenon of solar differential rotation has stimulated much of the
recent effort to understand convection in rotating spherical shells. It was first
shown by Busse (1970a) that convection in a spherical shell can generate a differen-
tial rotation of the same form as that observed on the Sun. While Busse used an
analytical perturbation method in the thin shell limit, Durney (1970) independently
developed a mean field approach for the solution of the problem from which he
obtained--after using the wave propagation property demonstrated by the analytical
theory--essentially the same results. The exciting aspect of the observed solar
phenomenon as well as of the theoretical results is that a prograde differential
rotation occurs at the equator. This eontradicts the earlier notion of angular mixing
by convection which would have led to a deceleration of the equatorial region.
That the hypothesis of angular momentum mixing by convection is incorrect
can easily be demonstrated in the case of convection in a cylindrical annulus
discussed earlier. Since the Coriolis force can be entirely balanced by the pressure
in this case, the influence of rotation disappears from the full nonlinear equation
for two-dimentional convection rolls. Differential rotation cannot be a part of the
solution since the basic equations are identical to those in a nonrotating region in
this case and since a preferred azimuthal direction cannot be distinguished. Generation
of differential rotation obviously depends on secondary features such as the curvature
of the boundaries, and cannot be predicted by simple physical arguments.
How complicated the phenomenon of differential rotation in a convecting
spherical shell can become at higher Rayleigh and Taylor numbers is evident from
the numerical computations of Gilman (]972, 1976a,b). Because both the Reynolds stresses
of the fluctuating convection velocity field and the meridional circulations caused
by the inhomogeneity of convection contribute to the generation of differential rotation,
small changes in the parameters of the problem may change the form of differential
rotation dramatically. Figure 8 from Gilman (1976a) shows how the equatorial maximum of
angular velocity changes into a relative minimum as the Rayleigh number is increased.
The influence of boundary conditions also appears to be important. The almost exclusively
used stress-free boundaries actually represent a singular case in the thin shell limit
(Busse, ]973) since an equilibration between Reynolds stresses and viscous stresses can
take place only in the latitudinal direction.
In order to investigate the generation of differential rotation in a conceptually
simple ease, the problem of convection in a rotating cylindrical annulus has recently
been studied both experimentally and theoretically. Since the measurements are still in
progress we restrict our attention to the qualitative picture, as shown in Figure 9. No
differential rotation is generated in the case of straight top and bottom boundaries
S
/
/
---~--~--Figure 9:
Differential~
rotation generated by convection
~-in
a rotating cylindrical annulus.
170
of the annulus. The experimental observations show an increase of the gradient of
angular momentum for convex boundaries and a decrease for concave boundaries, in
agreement with theoretical predictions.
Meridional circulation and latitudinal variation of the convective heat
transport are other important nonlinear properties of convection in spherical shells.
Both phenomena are closely linked since the variation of the mean temperature caused
by an inhomogeneous heat transport is the most important cause of meridional circulation.
The lack of observational evidence for either phenomenon on the solar surface has been
a source of controversy in the interpretation of theoretical models. We shall return
to this point in the next section.
6 APPLICATIONS TO THE SUN AND JUPITER
It is fortunate for the theory of convection in rotating stars that there
exist two quite different celestial bodies for which detailed surface observations are
available. In the case of the Sun the influence of rotation is relatively small: The
Rossby number is large compared to unity at least for the velocity field in the
upper part of the convection zone. Jupiter represents the opposite case of a rapidly
rotating system characterized by a small Rossby number. Although about half of the
energy emitted from the surface of Jupiter is received from the Sun, the convective
heat transport required for the other half is the dominating source of motions in the
Jovian interior. In this respect Jupiter does indeed represent a low Rossby number
example of a rotating convecting star.
The application of theoretical models which are valid at best for systems of
laboratory scales to systems of stellar dimensions faces obvious difficulties. It is
con~mon practice to take into account the effects of turbulence owing to motions of
smaller scale than those considered in the form of an eddy viscosity ~e which
replaces molecular viscosity in the equations of motion. The main justification for
this procedure is that it appears to work well in many cases.
If ~ is chosen sufficiently large that the Rayleigh number and Taylor number e
4E -2 are not too large the differential rotation observed on the Sun resembles that
predicted by the theoretical models fairly well. There is also evidence for the large-
scale convection cells, often called giant cells, girdling the equator like a cartridge
belt (Howard and Yoshimura, 1976). Figure I0 shows a laboratory simulation. The radius
ratio in the laboratory experiment is closer to unity than in the solar case and the
number of cells is correspondingly larger. Otherwise the cells show a surprising
resemblance to those observed on the Sun by Walter and Gilliam (1976). Because the
latter authors show magnetic regions a direct physical interpretation of the phenomeno-
logical resemblance is difficult, especially since the simultaneous occur~neeOf magne-
tic features which are syrmetric or antisymmetric with respect to the solar equator is
not well understood.
171
Figure 10: Laboratory simulation of convection in a rotating spherical fluid shell with inner radius r i = 4.45 cm and r = 4.77 Cm.
o
172
The measurement of the Coriolis deflection of the horizontal motion in
supergranules by Kubicela (2973) appears to be the only direct determination of the
effect of rotation on solar convection. Kubicela interprets the observed deflection
of the velocity as the Coriolis acceleration multiplied by the lifetime of a super-
granule. Using a lifetime of 20 h he finds reasonable agreement with the measurements.
Since the supergranular velocity field is defined as the mean over a field of highly
fluctuating granular motions, the eddy viscosity concept can be used as an alternative
possibility of interpretation. Using the linear solution for a convection cell in a
rotating layer with stress-free boundaries (Chandrasekhar, 196]) we find the expression
2e tgy
~ 2d2 e
for the angle 7 of deflection, where d is the depth of the supergranular layer.
For simplicity we have assumed that the horizontal wavelength of the cells is large
in comparison with d. Using ~ ]09 cm and ~ = 2.6 " 20 -6 sec -2 we derive from the
observed angle y ~ 20 ° an eddy viscosity of the order 2 • j0]2cm2sec -2 , which is in
reasonable agreement with values derived from other more heuristic considerations.
For the larger scale of giant cells a slightly higher value of ~e appears to be
appropriate yielding an Ekman number of approximately ]0 -2, which is of the same order
as the value used by Gilman (]976b) in his numerical simulation of the solar convec-
tion zone.
It should be mentioned that earlier theories of the solar differential rota-
tion by Kippenhahn (2963) and others used the concept of an anisotropic eddy viscosi-
ty proposed by Biermann (1958). This concept often mimics the anisotroplc dynamical
influence of large-scale eddies. If the deviations from rigid rotation are described
in terms of an anisotropic viscosity it would seem reasonable in view of the more
detailed theory described above to use a latitude-longitude anlsotropy rather than
a horizontal-vertical anisotropy as proposed by Biermann.
Raylelgh numbers for stellar convection zones are based on the superadiaba-
tic part of the temperature gradient, which amounts in general to only a small frac-
tion of the total temperature gradient. A small change in surface temperature causes
a disproportionately large change in the Rayleigh number and an even larger in the
convective heat transport. The convection zone reacts like a highgain amplifier to
any change of the temperature at the surface and it is not surprising that no subcri-
tical large-scale variations of the solar surface temperature are observed. Since the
temperature determines the energy emission, the convective heat flux must adjust itself
to a uniform value. Ingersoll (]976) has emphasized this point in the case of Jupiter,
where the convection heat transport adjusts itself in such a way that large-scale va-
riations of the surface temperature vanish.
For this reason the heat flux variations and associated meridional circulations
N
Ob
serv
ed
C
om
pu
ted
,, -
' ,
~ _
l\
~~
C~
:~
'
~ \ \
\ \ \
~ \\\
~\
\ ~\ \
\ ~ \
\ 1_
1 ~r~
I
1\
\ k i,
\\ \
\\\
\ \~
\\\:
\\\
\\\x
t\\'
\l
\'t\~
l\~lk
"
l \
II
I1
\
..... I,
i ..
..
.
i .
..
..
..
..
..
..
..
Figure ii:
Comparison between theoretical predictions and observations
of bands on Jupiter (from BUSSe, 1976).
174
of low Rayleigh number models do not have much meaning for high Rayleigh number
stellar convection zones. Even in laboratory experiments it is apparent that the
inhibiting influence of rotation on the convective heat transport reverses itself
with increasing Rayleigh number. Rossby's (1969) measurements even show a slight
increase in heat transport owing to rotation at high Rayleigh numbers. The
generation of differential rotation, on the other hand, depends on the alignment
effect rather than the inhibition effect of rotation. It seems intuitively reasonable
that the former effect, which does not have direct energetic consequences, persists
at high Rayleigh numbers, while the latter effect is diminished by nonlinear processes.
Because of its low Rossby number, convection in the planet Jupiter may be
more accessible than solar convection to theoretical analysis. A simple model has
recently been proposed (Busse, |976). It is generally believed that a transition
from molecular to metallic hydrogen occurs at a radius of about 5/7 of Jupiter's
outer radius and that the interface inhibits penetration by convection. Accordingly
we are faced with the problem of convection in a rotating shell as sketched in Figure 7,
which was actually drawn to apply to Jupiter. The fact that a relatively sharp transition
from the low latitude band structure to the polar region of random eddy motion is
observed on Jupiter at about 45 ° latitude appears to be the strongest argument for a
dynamical influence of rotation along the lines outlined in this paper. To obtain a
more detailed comparison as shown by Figure 11 the concept of an eddy viscosity must
be invoked again. The value of ~ required for a fivefold layer of convection columns e
is in good agreement, however, with the eddy viscosity deduced from convection models
for the heat transport. More elaborate models are clearly possible and Jupiter may
well become the testing ground for future theories of convection in rotating stars.
175
REFERENCES
BIERMANN, L. 1958 IAU Syrup. N ° 6, 248
BUSSE, F.H. 1970a Astrophys. J. 159, 629-639
BUSSE, F.R. 1970b J. Fluid Mech. 444, 442-460
BUSSE, F.H. ]973 Astron Astrophys. 27, 27-37
BUSSE, F.H. 1976 Icarus, in press
BUSSE,F.H. and CARRIGAN, C.R. 1974 J. Fluid Mech. 62, 579-592
BUSSE F.H. and CARRIGAN, C.R. 2976 Science 192, 81-83
BUSSE, F.H. and CUONG, P.G. 1976 Geephys. Fluid Dy., in press
CHANDRASEKHAR, S. 1961 Hydrodynamic and Hydromagnetic Stability Oxford Clarendon Press
DURNEY, B.R. ]970 Astrophys.J. 26]_, ]II5-]127
GILMAN, P.A. ]972 Solar Phys. 27, 3-26
GILMAN, P.A. 1975 J. Atmos. Sci. 32, 1332-2352
GILMAN, P.A. 1976a Proc. IAU Symp. 71, in press
GILMAN, P.A. 1976b J. Fluid Mech., submitted
HOWARD, R. and YOSHIMURA, H. 1976 Prec. IAU Syrup. N ° 71, in press
INGERSOLL, A.O. 1976 Icarus, in press
KIPPENHAHN, R. 2963 Astrophys.J. 137, 664
KUBICELA, A. ]973 P roe. 1st European Astr~ Mt~. Solar Activity and Related Interpla-
netary an d Terrestrial Phenomena, J. Xanthakis, ed. Springer
ROBERTS, P.B. |968 Ph~l. Trans. Roy. Soc0 London A 263, 93-I]7
ROSSBY, H.T. ]969 J. Fluid Mech. 36~ 309-335
WALTER, W.T. and GILLIAM, L.B. 1976 Solar Phys., in press
MAGNETIC FIELDS AND CONVECTION
N. O. Weiss
Department of Applied Mathematics and Theoretical Physics
University of Cambridge
SUMMARY
In a highly conducting plasma convection is hindered by the imposition of a
magnetic field. Convection may set in as direct or overstable modes and behaviour
near the onset of instability depends on the ratio of the magnetic to the thermal
diffusivity. Vigorous convection produces local flux concentrations with magnetic
fields that may be much greater than the equipartition value. The interaction
between magnetic fields and convection can be observed in detail on the sun and is
essential to any stellar dynamo.
I. INTRODU6TION
Magnetic fields - whether primeval or maintained by dynamo action - are
ubiquitous. Any rotating, convecting star seems able to generate a magnetic field,
though the interaction between convection, rotation and magnetic fields bristles with
problems for the theorist. We can usefully distinguish between the problem of
maintaining large scale fields by dynamo action, which will be discussed by Dr
Childress, and that of the interaction between small scale convection and an imposed
magnetic field. I shall assume that any convective timescale is short compared with
the lifetime of large scale magnetic fields and I shall not concern myself with their
origin.
The scale of ordinary laboratory experiments is too small for them to model
hydromagnetic behaviour in astrophysical plasmas. However, the sun provides a
marvellous laboratory where such phenomena can be observed. Sunspots are dark
because normal convection is suppressed by the strong magnetic fields; on a smaller
scale, it is now possible to resolve features a few hundred kilometres across and
to follow the interaction between weak fields and granular convection.
This increase in resolution has revealed more magnetic structures and strongsr
magnetic fields than had been expected.
The theoretical description of a convecting system is particularly rich when
stabilizing and destabilizing effects compete in it (Spiegel 1972). Dr Huppert has
reviewed thermohaline convection; the nonlinear Lorentz force makes magnetic con-
vection yet more complicated. I shall first summarize the results of linear theory
and then discuss various nonlinear problems: is motion steady or oscillatory? are
there subcritical instabilities? how is energy transport affected by the field?
what limits flux concentration between convection cells and how strong are the fields
177
produced? Not all these questions are yet answered but nonlinear magnetic con-
vection is gradually being understood. Finally, I shall try to relate this theory
tosolar magnetic fields and to some aspects of the dynamo problem.
2. LINEAR THEORY
In the absence of a magnetic field a stratified gas is stable to adiabatic
perturbations if Schwarzschild's criterion is locally satisfied. The imposition
of a unlform magnetic field inhibits the onset of oDnvection: a plane, perfectly
conducting layer is eonvectively stable if
(Gough and Tayler 1966), where B O is the vertical component of the magnetic field,
T is the temperature, p the pressure, ~ the ratio of specific heats, /x the
permeability and the adiabatic gradient (dlnT/dlnP)ad = (~-i)/~ for a perfect gas.
Strong magnetic fields can therefore hinder the onset of convection in a star,
though the difference between the adiabatic and the radiative gradient is usually
large enough for instability to occur before the latter is attained (Moss and
Tayler 1969, 1970; Tayler 1971).
When the conductivity ~ is finite, plasma can move across the lines of force
and the stabilizing effect of the magnetic field is relaxed. What happens depends
on the relative values of the magne~c diffusivlty ? = ~#~)" and of the thermal and viscous diffusivities ~ and ~ . In typical stellar conditions, ~ q ~ ~.
The onset of instability in a Boussinesq fluid has been studied in detail (Thompson
1951; Chandrasekhar 1952, 1961; Danielson 1961; Weiss 1964a; Gibson 1966). For a
plane layer of depth d the stabilizing effect of a uniform magnetic field is
measured by the dimensionless Chandrasekhar number
_ ~ a ~
which is the square of a Hartmann number and can be regarded as a "magnetic Rayleigh
number" (Spiegel 1972). A configuration is defined by Q, by the RayleiKh number
R = g~d4/~9 , where ~ is the coefficient of thermal expansion and ~ the super-
adiabatic temperature gradient, and by the Prandtl number ~= ~/~ and the
magnetic Schmidt (or Frandtl) number
=
If, for simplicity, we adopt "free" boundary conditions (Chandrasekhar 1961,
Gibson 1966) then the linear modes have the form
, , , , = Wc ) e ,
where W(z) = WosinrCz (0 4 z ~ d) and
178
with W and a constant, referred to cartesian co-ordinates with the z-axis vertical. o
If ~ q (~)T) linear instability sets in as in ordinary Rayleigh-B6nard
convection. The growth rate s is real and instability sets in as a direct mode~
corresponding to an exchange of stabilities, when R = R Ce) . (Semantics are
succinctly sunnnarized by Spiegel, 1972.) For large Q, R (e) is a minimum when the
dimensionless horizontal wavenumber
so convection first appears in vertically elongated cells at R = R (e~ ~ ~2Q. c
Standing hydromagnetic waves in an unstratified fluid produce oscillations
which are damped by ohmic and viscous dissipation. When ~ >~ these oscillations
may be destabilized by the thermal stratification (Cowling 1976a), so that con-
vection sets in as overstable oscillations when R = R t~ For sufficiently large
Q, overstability first occurs in elongated cells, when
When ~< ~ , therefore, R (°) ~ R (~ and instability first appears as overstable e c
oscillations. At R = R (~ there are two complex conjugate growth rates but as the c
Rayleigh number is raised IIm (~[ decreases until for some R = R ~O the growth
rates are purely real. Thus convective instability sets in with direct modes at
R=RC° AsQ+ ~ , for ~<<~<' (~?,>~) ~ ~CZ~Q, =(~)~ and the minimum value of R ~+) is ~++) ~ (~/=) =~ ~ (Danielson 1961; Weiss
1974a); thus R +~ <~ RIO << R+~ For R +O ~ R < R +~ there are two distinct c c c
positive real growth rates. One of these changes sign when R = R(e) but this
exchange of stabilities has no physical significance.
So far we have considered only free boundary conditions. Analogous results
hold also for other boundary conditions (Chandrasekhar 1961, Gibson 1966). In
particular, the effect of superposing a stable layer on top of the unstable region
has been investigated by Musman (1967) and Savage (1969). The treatment has also
been extended to include some effects of compressibility (Kato 1966; Syrovatsky and
Zhugzhda 1967, 1968; Saito and Kato 1968). If the Alfv6n speed is small compared
with the sound speed, slow magnetosonic oscillations become overstable; if the
Alfv~n speed is large, the fast magnetosonic mode can be destabilized (Cowling
1976b).
3. NONLINEAR cONVECTION
In a Boussinesq fluid the magnetic field satisfies the induction equation
~B
~t
179
while the vorticity _&) and the temperature T are governed by the equations
~ . g - - - - ~ - - - -
"~T
and
~.~ ~ 0 , V_B = O.
Here ~ = ~-J ~zA~ - is the electric current, ~ is the velocity and ~ the density.
For two-dimensional convection, with ~ and B confined to the xz-plane and independent
of y, B can be described by a flux function (the y-component of the vector
potential) A such that
and
while the vorticity equation reduces to
The most convenient boundary condition on the field is obtained by setting B x = 0
at z = O, d. This is somewhat artificial but corresponds to the free boundary
conditions adopted for linear theory.
Near the exchange of stabilities (R = R (e)) finite amplitude solutions can be
constructed using modified perturbation theory. Veronis (1959) observed in a
footnote that subcritical instabilities were possible when ~ . Busse (1975) has
considered a two-dimensional model in which the magnetic field affects the
amplitude, but not the form, of the motion. For R near Rc, the critical Rayleigh
number in the absence of a magnetic field, he combined a perturbation expansion for
the velocity with a computed solution for the distorted magnetic field. He showed
that when ~ >> ~ stationary convection is possible with R c ~ R ~ R :~ In
these solutions the magnetic Reynolds number
U~
(where Q is a typical velocity) is large and the magnetic field is confined to
narrow regions so that its overall stabilizing effect is correspondingly reduced; an
analogous argument applies to the thermohaline problem that Dr Huppert has described.
Since subcritical convection appears when ~4 T and R~)~< R ~) this technique
cannot rigorously establish whether steady finite amplitude solutions are possible
before the onset of overstability. However, Busse's results do suggest that such
subcritical instabilities may occur and Proctor (private communication) has developed
a simplified model of magnetic convection which shows steady motion when R is close
t o R . e
180
Perturbation methods are reliable only while the P~clet number (U d/w) is low
and the convective energy transport relatively small. The efficiency of convection
is measured by the Nusselt number N = (F -~ad)/~, where F is the total thermo-
metric flux and ~ad the adiabatic temperature gradient. The effect of a magnetic
field on the Nusselt number was investigated by van der Borght, Murphy and Spiegel
(1972)~ using the mean field approximation (Spiegel 1971). They considered only
steady convection which, in this approximation, is independent of both ~ and ~ .
For a fixed Rayleigh number, N decreased monotonically and smoothly with increasing
Q until the exchange of stabilities was reached~ thereafter convection was completely
suppressed. Van der Borght (1974) has also attempted to describe the time dependent
problem.
Fully nonlinear two-dimensional computations show a different range of
5ehaviour (Weiss 1975). For free boundary conditions with ~= I, ~ ~ i there is
a general tendency to generate nonlinear oscillations. When ~ = 5, convection first
appears as overstable oscillations in accordance with linear theory. If Q is then
decreased, while R is kept fixed, these oscillations are stabilized at some small
but finite amplitude and convection remains comparatively inefficient. As Q is
further reduced~ R ~) becomes less than R and linear theory predicts direct,
exponentially growing modes. These appear in the numerical solutions but eventually
develop into periodic nonlinear oscillations. The Lorentz force is quadratic in
and linear theory underestimates the restoring force. Hence nonlinear magnetic
convection differs from thermohaline convection, where the stabilizing force remains
linear. If Q is decreased yet further the oscillations develop into irregular
aperiodic motion and, eventually~ into steady convection. The tlme-averaged Nusselt
number rises monotonically as Q decreases but there are no noticeable discontinuities,
nor could any hysteresis be detected,
Dr. Galloway will describe his numerical study of axisymmetrlc convection in a
magnetic field. The results are qualitatively similar, though he found some
hysteresis~ indicating different solutions when Q was decreased from the critical
value and subsequently increased. So far, however, numerical experiments have
provided no evidence of any Jump in the Nusselt number or of any metastable conducting
state associated with suberitlcal convection. Computations on thermohaline convection
(~uppert and Moore 1977) demonstrate that such phenomena can occur. Busse's finite
amplitude results and Proctor's simple model both suggest that, with suitably
chosen parameters~ metastable magnetic configurations should exist. Further com-
putations, with a wider range of diffusivities, are needed to establish w~ether
subcritical convection can be found. It is obviously important to determine what
parameter ranges allow metastable states and whether linear stability theory has any
relevance to convection in a strong magnetic field in a star.
4. FLUX CONCENTRATION
In the limit when Q is su~dclently small the magnetic field is weak and the
Lorentz force has no dynamical effect. The velocity R can then, in principle,
181
be derived from some theory of ordinary Rayleigh-B~nard convection. If ~ is then
fed into the induction equation the kinematically distorted magnetic field can be
calculated. This has been done for various plausible velocity fields (Parker
1963; Clark 1965, 1966; Weiss 1966; Clark and Johnson 1967; Busse 1975). When the
magnetic Reynolds number is high, magnetic flux is rapidly swept to the edges of
convection cells to form ropes. Within a ceil, the field is wound up until the
lines of force eventually reconnect and magnetic flux is expelled. If R >> i the m
pattern of motion must persist for many turnover times before the expulsion process
is completed. However, ropes are formed between the cells by the time they have
turned over once. Within these ropes the field strength has an approximately
Gaussian profile.and the peak field
B* ~ R ½ B in two dimensions mo
R B in three dimensions~ mo
where B is the average initial field (or the field in the absence of convection). o
Astrophysical length scales are large and ~l is big enough for enormous
magnetic fields to be produced locally if concentrations were purely kinematic.
Eventually, the J^B force in the flux rope must become powerful enough to halt the
concentration: amplification of the field is then dynamically limited. But it is
not immediately obvious what limiting field strength can be produced. Partly on
dimensional grounds, it has been popularly supposed that the local field strength
cannot exceed the equipartition field Be, where
~ : , U ~ = ~
The principal argument for this limit depends on considering pressure fluctuations
associated with convection but in a Boussinesq fluid the pressure can be eliminated
from the equations and the equipartition limit should therefore be irrelevant.
Busse (1975) showed that for small amplitude two-dimensional convection
B*~ (~)i/4. Hence B*/B e could be made arbitrarily large by a suitable choice of
. The full two-dlmensional problem has been investigated numerically for con-
vection driven by imposed horizontal temperature gradients (Peckover and Weiss
1977) and by heating from below (Weiss 1975), and Dr, Galloway has computed solu-
tions for axisymmetric convection. The maximum value of the peak field B* can be
estimated by a simple argument. The two-dlmensional results show that kinematic
amplification is halted when ohmic dissipation in the flux rope becomes comparable
with viscous dissipation throughout the convection cell. It follows that the
maximum field B* ~.~/4,-- a result confirmed by the computatio~ In three dim- max
ensions a similar argument yields a maximum field B*max~'~'(galloway etal. 1977).
By choosing ~" sufficiently large, B* can be made much greater than B and solutions e
have been obtained with B*/Bem 5 (though the particular value has no significance).
Once the magnetic field becomes dynamically significant, vorticity is
generated in the flux ropes, where there is a local balance between the magnetic
and viscous terms in the vorticlty equation. The buoyancy force generates vorticity
182
with one sense, while the Lorentz force generates vorticity with the opposite sense
and viscosity maintains a balance. The resultant vorticlty distribution corresponds
to a velocity field with a reduction in the transverse flow that concentrates the
flux. A simple physical description confirms that two-dimensional amplification
is halted as ohmic dissipation reduces the overall flow. In three dimensions motion
can be excluded from the flux rope slightly earlier (Galloway, private communication).
As B is further increased, the flux ropes grow broader and develop a different o
structure. The field within a rope is more nearly uniform, dropping abruptly near
its boundary. The ensuing current sheath produces a Lorentz force which prevents
the motion from entering the flux tube. Ultimately the layer separates into con-
veering cells, from which magnetic flux has been expelled, and stagnant flux ropes
in the interstices between them.
5. SOLAR MAGNETIC FIELDS
Cowling (1953, 1976a), Sweet (1971) and Mullan (1974b) have reviewed magnetic
fields in the sun. The discussion of flux concentration relates most directly to
intense, small scale magnetic fields in the photosphere (Weiss 1977). Over the
last eight years ground-based observers have succeeded in resolving magnetic
structures with a scale smaller than that of the granulation and fields of up to
about 1500 G (Schr~ter 1971; Harvey 1971, 1977; Dunn and Zirker 1973; Mehltretter
1974; Stenflo 1976). These features are formed between granules and have lifetin~s
similar to those of individual granules. The fields are much larger than the local
equipartition field (Be~5OO G) and the magnetic pressure alone is almost sufficient
to balance the external gas pressure. Such high fields can only be contained by that
gas pressure (Parker 1976a). A full theory of convective transport in strong
magnetic fields is needed to explain the formation of these flux ropes but a crude
extrapolation from ~he Boussinesq results indicat~that the field can be amplified
to reach the strengths observed (Galloway et al. 1977).
On a larger scale, magnetic flux is swept aside by supergranules and concen-
trated at their boundaries to form a network in which most of the small scale
features are located. Irregular small scale fields have recently been detected
within the network (Harvey 1977) but the flux involved is relatively slight. As
more flux is brought together the magnetic field inter~res with convection so that
the gas is cooled. Dark pores or sunspots then appear between the supergranules.
The magnetic flux that emerges through a sunspot is presumably assembled into a
rope deep in the convective zone, though supergranules certainly play a part in the
formation of a spot. Conversely, though small flux ropes can be shifted to fit the
pattern of supergranular convection, large sunspots are anchored deeper down and
long-lived, stable convection cells may form around or near them (Harvey and Harvey
1973; Livingston and Orrall 1974; Meyer et al. 1974).
In the umbra of a sunspot the magnetic field is nearly vertical and strong
enough to suppress convective instability, Following a suggestion by Biermann
183
(1941), Cc~ling (1953, 1976 a,b) argued that motion across the magnetic field is
inhlbitedj and that convection is limited to predominantly vertical, oscillatory
motion, in slender elongated cells. Theoretical models of sunspots (Chitre 1963;
D~inzer 1965; Chltre and Shaviv 1967; Yun 1970) show that radiation alone cannot
supply the energy emitted from the umbra and microturbulsnt velocities (Backers
1976) provide some observational evidence for convection. (Umbral dots are too
sporadic to be an essential feature of the transport process.) Various attempts
have been made both to relate linear stability theory to umbral and penumbral
van der Borght, R., 1974. Men. Not. R. Astr. Soc. 166, 191.
van der Borght, R., and Murphy, J. 0., 1973. Austr. J. Phys. 2~6, 617.
van der Borght, R., Murphy, J. O. and Spiegel, E. A., 1972. Austr. J. Phys. 2~5, 703.
Busse~ F. H., 1975. J. Fluid Mech. 7~i, 193.
Chandrasekhar, S., 1952. Phil. Mag. (7) 4~3, 501.
185
Chandrasekhar, S., 1961. Hydrodynamic and hydromagnetic stability. Oxford.
Chitre, S. M., 1963. Mon. Not. R. Astr. Soc. 126, 431.
Chitre, S. M. and Shavlv, G., 1967. Solar Phys. ~, 150.
Clark, A., 1965. Phys. FIo ~, 644.
Clark, A., 1966. Phys. FI. ~, 485.
Clark, A. and Johnson, A. C.+ 1967. Solar Phys. ~, 433.
Cowling, T. G., 1953. The sun, ed. G. R. Kulper, p. 532. University of Chicago Press.
Cowling, T. G., 1976a. Magnetohydrodynamics. Hilger, Bristol.
Cowling, T. G., 1976b. Mon, Not. R. Astr. Soc. 177, 409.
Danielson, R. E., 1961. Astrophys. J. 134, 289.
Danielson, R. E., 1965. Solar and stellgr magnetic fields (IAU Symp. No. 22), ed. R. LUst, p. 314. North-Holland, Amsterdam.
Danielson, R. E., 1966. Atti de l c0+nve~no sulle macchie solari, ed. G. Righini, p. 120, Barbara, Florence.
Danielson, R. E. and Savage, B. D., 1968. Structure and development of solar active re~ions (IAU Symp. No. 35), ed. K. 0. Kiepenheuer, p. 112. Reidel, Dordrecht.
Defouw, R. J., 1977. Astrophys. J. (in press).
Deinzer, W., 1965 Astrophys. J. 141, 5~8,
Drobyshevsky, E. M. and Yuferev, V. S., 1974. J. Fluid Mech. 65, 33.
Dunn, R. B. and Zirker, J. B., 1973. Solar Phys. 33, 281.
Eltayeb, I. A., 1972. Proe. Roy. Soe. A 32__~6, 229.
Eltayeb, I. A.~ 1975. J. Fluid Mech. 71, 161.
Eltayeb, I. A. and Roberts, P. H., 1970. Astrophys. J. 162, 699.
Galloway, D. J., 1975. Solar Phys. 44, 409.
Galloway, D. J., Proctor, M. R. E. and Weiss, N. 0., 1977. Submitted to Nature.
Gibson, R. D., 1966. Prec. Camb. Phil. Soe. 6_22, 287.
Gilman, P. A., 1970. Astrophys. J. !62, 1019.
Gough, D. O. and Tayler, R. J., 1966. Mon. Not. R. Astr. Soc. 133, 85.
Gubbins, D., 1974. Key. Geophys. Space Phys. 12, 137.
Harvey, K. and Harvey, J°, 1973. So!at Phys. 28, 61.
Clarendon Press,
186
Huppert, H. and Moore, D. R., 1977. J. Fluid Mech. 78, 821.
Kato, S., 1966. P ubl. As tr. Soc. Japan, 18, 201.
Livingston, W. and Orrall, F. Q., 1974. Solar Phys. 39, 301.
Malkus, W. V. R., 1959. ~ . 130, 259.
Mehltretler, J. P°, 1974. Solar Phys. 38, 43.
Mestel~ L. and Weiss, N. 0., 1974. Masnetohydrodynamics. Swiss Soe. Astr. Astrophys., Geneva.
Meyer, F., Schmidt, H. U. and Weiss, N. 0., 1977. Mon. Not.R. Astr. Soc. (in press).
Meyer, F., Schmidt, H. U., Weiss, N. O. and Wilson, P. R., 1974. Mon. Not. R. Astr. Soc. 169, 35.
Moffatt, Ho K., 1977. Masnetic field seneration in electrically, conduetin$ fluids. Cambridge University Press.
Moore~ D. R. and Proctor, M. R. E., 1977. In preparation.
Moore, R. L., 1973. Solar Phy s . 30, 403.
Moss, D. L. and Tayler, R. J., 1969. Mon. Not. R. Astr. Soc. 145, 217.
Moss, D. L. and Tayler, R. J., 1970. Mon. Not. R. Astr. Soc. 147, 133.
Mullah, D. J., 1974a. Astrophys. J. 187, 621.
Mullan, D. J., 1974b. J~ Franklin Inst. 298, 341.
Mullah, D. J. and Yun, H. S., 1973. Solar Phys. 30, 83.
Musman, So, 1967. Astro~hys. J. 149, 201.
Parker, E. N., 1955. Astrop y .h s J. --121, 491.
Parker, E. N., 1963. Astrophys. J. 138, 552.
Parker, E. N., 1970. Ann. Rev. Astr. Astro~h~s. 8, i.
Parker, E. N., 1974a. Solar PhTs. 36, 249.
Parker, E. N., 1974b. Solar Phys. 37, 127.
Parker, E. N., 1975a. Solar Phys. 40, 275.
Parker, Eo N., 1975b. Astrophys. J. 198, 205,
Parker, E. N., 1976a. ~ . 204, 259.
Basic mechanisms of solar activity (IAU Symp. No. 71), ed. Parker, E. N., 1976b. V. Bumba and J. Kleczek, Reidel, Dordrecht.
Peckover, R. S. and Weiss, N. 0., 1977. In preparation.
Pikel'ner, S. B., 1961. Osnovy kosmieheskoy elektrodinamiki.
Roberts, Bo, 1976. ~ . 204, 268.
Nauka, Moscow.
Roberts, P. H., 1971. Mathematical prob!ems in the geophysical sciences, ed. W, H. Reid, p. 129, Am. Math. Soc., Providence.
187
Roberts, P. H. and Stewartson, K., 1974. Phil. Trans. A 277, 287.
Roberts, P. H. and Stewartson, Ko, 1975. J. Fluid Mech. 68, 447.
Saito, M. and Kato, S., 1968. Solar Phys. ~, 531.
Savage, B. D., 1969. Astroph~s. J. 156, 707.
Schr~ter, E. H., 1971. Solar magnetic fields (IAU Symp. No. 43) ed. R. Howard, p. 167, Reidel, Dordrecht.
Spiegel, E. A., 1971. Ann. Rev. Astr. Astrophys. 9, 323.
Spiegel, E. A., 1972. Ann. Rev. A str. Astroph~s. I__OO, 261.
Stenflo, J. O., 1976. Basic mechanisms of solar activity (IAU Symp No. 71), ed. V. Bumba and J. Kleczek, p. 69. Reidel, Dordrecht.
Sweet, P. A., 1971. Solar magnetic fields (IAU Symp. No. 43), ed. R. Howard, p. 457, Reidel, Dordrecht.
Syrovatsky, S. I. and Zhugzhda, Y. D., 1967. Astr. Zh. 44, 1180 (Soy. Astr. II, 945, 1968).
Syrovatsky, S. I0 and Zhugzhda, Y. D., 1968. Structure and development of solar active re$ions (IAU Symp. No. 35), ed. K. 0. Kiepenheuer, p. 127. Reidel, Dordrecht.
Tayler, R. J., 1971. 9" J" R. Astr. Soc. 12, 352.
Thompson, W. B., 1951. Phil. Ma~. (7) 4-2, 1417.
Unno, W. and Ribes, E., 1976. Astrophys. J. 208, 222.
Vainshtein, S. I. and Zel'dovich, Y. B., 1972. Usp. Fiz. Nauk 106, 431 (Sov. Phys. Usp. 15, 159).
Veronis, G., 1959. J. Fluid Mech. 5, 401.
Weiss, N. 0., 1964a. Phil. Trans. A 256, 99.
Weiss, N. 0., 1964b. Mono Not. R. Astr. Soc. 128, 225.
Weiss, N. 0o, 1966. Proc. Roy. Soc. A 293, 310.
Weiss, N. O., 1969. Plasma instabilities in astrophysics, ed. D. G. D. E° Tidman, p. 153. Gordon and Breach, New York.
Wentzel and
Weiss, N. O., 1975. Adv. Chem. Phys. 32, IO1.
Weiss, N. O., 1977. Highlights of astr0nom[, ed. E. MUller, Reidel, Dordrecht.
Wilson, O. C., 1976. Basic mechanisms of solar activity (IAU Symp. No. 71), ed. V. Bumba and Jo Kleczek?Reidel, Dordrecht.
Yun, H. S., 1970. Astrophys. J. 162, 975.
AXISYMMETRIC CONVECTION WITH A MA~ETIC FIELD
D. J. Galloway Astronomy Centre
University of Sussex Brighton, BNI 9QH
England
Summar~
The n o n - l i n e a r B o u s s i n e s q e q u a t i o n s d e s c r i b i n g a x t s y m m e t r i c c o n v e c t i o n i n a c y l i n d e r
w i t h an i n i t i a l l y u n i f o r m m a g n e t i c f i e l d have been i n t e g r a t e d f o r w a r d i n t i m e
n u m e r i c a l l y . When t h e f i e l d i s weak a s t r o n g c e n t r a l f l u x r o p e i s formed a t t h e
axis. In this case the maximum field strength can be limited either klnematlcally
or by dynamical effects, and the equipartitlon prediction B 2 ~ 4~pu ~ is eg~ily max
exceeded. If the field is strong oscillations can occur and hysteresis is posslble
as the field is increased and decreased.
i. Introduction
The interaction between convection and a magnetic, fleld determines many features
observed in the solar photosphere. Sunspots and smaller scale magnetic fleld
elements are symptoms of the ability of convection to concentrate a weak average
field into strong fluxropes. Oscillatory phenomena such as running penumbral waves
can occur in the presence of a strong field. To study such effects it is necessary
to solve non-linear problems, and clearly to do So in three dimensions if at all
possible. The recent work of Jones, Moore and Weiss (1976) on axlsymmetric con-
vection is easily extended to include the presence of a ma~etlc field with average
strength B . This problem is geometrically three-dlmenslonal but depends mathe- o
matlcally on only two variables, thereby rendering Itself tractable to nnmsrlcal
computation. The normal equations of Bousslnssq convection are modified to include
the effect of the Lorentz force in the vortlclty equation, and in addition the
electromagnetlc induction equation is solved to update the magnetic fleld as the
system is integrated forward in time.
S. The Problem
To solve the equations it is convenient to set up stream functions for the
velocity and magnetic fields thus:-
u= Va(o, , o) = (- r ~z o, r ar ,
B= V^(O, ~r' O) = (- r ~z O, r az "
We use cylindrical polar coordinates (r, 8, z). The geometry of the problem is
shown in figure 1.
189
Bo- In~t~a ll~.,, =,q~co,',~.
F=O
_
ff"e:rn p. T : T,,+AT
Temp.T--To
ease r'= a.
Fig. 1
Geometry for the axisymmetrie problem, showing basic cylinder on left and
axis-edge cross-section on right.
The equations V.~ = 0 and V.~ = 0 are automatically satisfied by Zhe above
and B flelds. Those remaining can be non-dimenslonalized and put in the following
form:
~-~ = -- p3 R -- - r - ~ r + ( V . ( r V ( r 2 ~ ) ) , ( 2 . 1 )
~_TT = -V. (Tu) + 1 V. (VT) Bt -- (pR)½
( 2 . 2 7
~t p3(pR)~ *
(2.3)
w h e r e ~ = (VA--u)o = - r ~ r2 - r ~ r ~z 2
J = (V^S--)e = - - r ~r ~z
There are five dimensionless parameters specifying each solution to the problem;
these are
2 2 __ 3 B d
R = g~wdz o K K~) ' Q = 4W~O~ ' p = ~ ' P 3 = ~ , a n d
a , t h e r a t i o o f c e l l w i d t h tO c e l l h e i g h t . The n o n - d i m e n s i o n a l i z a t i o n h a s b e e n t
c o n d u c t e d w i t h l e n g t h s c a l e d ( l a y e r d e p t h ) , t i m e s c a l e ( d / g o , AT) ~, f i e l d s t r e n g t h
scale B and temperature scale AT. The density and coefficient of volume expansion o
of the fluid are Q and ~ ;~ ,<, and i'] are its viscous, thermal and magnetic
190
diffusivities respectively, and g is the acceleration due to gravity.
The following boundary conditions are used:
T = T O + AT, B r = O, @ = O, ~= 0 (z = O)
T = T , B = 0, @ = 0, ~= 0 (z = 1) o r
~T - - = = ~r O, X const,, ~ = O, ~ = 0 (r = a)
~T ~--~= O, × = O, ~ = O, ~ = 0 ( r = O)
The conditions on the fluid are those commonly known as stress-free. The con-
straint X = const, at r = a fixes the total flux in tho cylinder and circumvents
Cowllng's theorem, so that steady solutions are possible.
The above equations and boundary conditions have been solved by finite-difference
methods similar to those described in Moore~ Peckover and Weiss (1973). The
equations were integrated forward in time until the solutions converged to a steady
state or a repeating oscillation. In many cases one solution was started from
another, and in this way the effects of continuously varying one parameter could be
investigated.
3. Discussion of Results
To correspond most closely with highly conducting astrophysical plasmas the
program WaS run with values of </n ranging from 10 to 50. For fixed and moderately
non-linear values of the Rayleigh number the followlng types of solution are found
as Q is increased.
i) For very weak fields the convection is unaffected and concentrates all the flux
kinematically into a central rope. The structure of this rope is fixed by the
balance between diffusion and ndvection in the induction equation. The maxi-
mum field strength B m is higher than the input field B ° by a factor of the order
of the magnetic Reynolds number, and the profile of the rope is Gaussian. Such
solutions have been described by Weiss (1966) and Clark and Johnson (1967).
ii) As Q is increased a regime ensues where the imposed field remains compressed in
a fluxrope hut can exert a dynamical influence on the convective flow. Within
the rope motion is minimal: at its edge, typically a few mesh points from the
axis, there is a shear layer and the velocity reaches a value comparable with
that in the absence of the field. The dynamics are dominated by a balance
between the total thermal and magnetic torques; dissipation can be ohmic or
viscous, and the maximum field can be Successfully predicted by s power law of
the form
191
Bo ~ The numerical experiments yield average ~ = ~= 0.63 and ~ = 0.77; the power-law
behaviour extends over typically two orders of magnitude. The greater ability of a
three-dimensional geometry to concentrate flux means it is far easier to chart this
regime in the axlsymmetric case than for two-dimenslonal rolls. It is also possible
to advance physical arguments based on a boundary-layer structure (Galloway, 1976),
and predict a law similar to (3.1). The exact values of ~,8 and 7depend on whether
viscous or ohmic dissipation is dominant, and the formulae involve weakly varying
logarithms, but agreement with the numerical experiments is generally very good.
An example of one of these dynamically limited solutions is shown in figure 3,
which is the case Q = i00, R = 20Rc, p = 1, P3 = I0, and a = 4/3. (Here R c =
(27/4)~4). The fluxrope is almost stagnant; st its edge there is a current
sheet which generates a large localized amount of negative vortlcity. Within the
fluxrope horizontal temperature gradients cause a very weak countercell to develop;
this has an advective effect on the field and causes the fluxrope to develop a
maximum some distance away from the axis. The run of the fluxrope profile as Q
increases is shown in figure 2.
Q = 1 q = 5 Q = 20 Q = I00 Q = 1000
Fig. 2. Fluxrope profiles for R = 20R , p = I, </n = I0, a = 4/3 e
i i i ) The c e n t r a l f l u x r o p e b r o a d e n s a s Q i s f u r t h e r i n c r e a s e d a n d e v e n t u a l l y i t
o c c u p i e s a b o u t a h a l f o f t h e r a d i u s o f t h e c e l l . At t h i s s t a g e t h e r o p e b e g i n s t o
o s c i l l a t e w h i l s t t h e o u t s i d e c i r c u l a t i o n r e m a i n s s t e a d y , a n d t h e r e i s a c o r r e s -
p o n d i n g v a r i a t i o n i n t h e h e a t t r a n s p o r t , T h i s f o r m o f s o l u t i o n h a s a n a t u r a l
e x p l a n a t i o n . T h e i n i t i a l l y i m p o s e d f i e l d B ° i s w e a k e n o u g h t o a l l o w s t e a d y c o n -
v e c t i o n ~ w h i c h c o n c e n t r a t e s t h e f l u x i n t o a r o p e o f s t r e n g t h B a n d r a d i u s ~ a . F l u x m
conservation suggests Bm % 4]30, and this means that, considered in isolation, the
central rope is overstable to linear theory. The frequency of the computed solution
agrees moderately well with such a linear prediction. When the senses of the two
192
a) magnetic lines of force
f
b) streamlines
c) isotherms
I///~ "///.////~////~/ d) vortlcity ~ (axis on left)
193
circulations are opposite, upward moving plumes are adjacent and the heat transport
is a maximum, When the senses are the same, the cold downdraught of the countereell
is next to the hot updraught of the main flow and lateral diffusion reduces the heat
transport to a minimum.
iv) Finally Q is so strong that only finite-amplltude oscillations are posslble.
These are confined mainly to the outer half of the radius, so that the solutions
are quite different to the elgenfunctions of linear theory. Periods are typically
10% - 20% faster than the linear values, presumably because the essentially quad-
ratic Lorentz t e rm in (R.l) is badly underestimated in the linear approximation.
The nature of the solutions depends on the five dimensionless parameters defined
earlier. However there are also occasians when the system adopts a configuration
dependent on the initial conditions, so that hysteresis occurs. This effect is
encountered when the field is fairly strong. A solution with given (Q,R,p,~/~,a)
can then be steady if it is part of a branch with Q inereasin~and oscillatory if
part of a branch with Q decreasing - the system remembers what it was doing for
earlier values of Q. This effect can be quantified by using the Nusselt number N,
averaged in time if necessary, as a measure of the amplitude. A graph showing the
variation of N with Q as the latter is increased and decreased is shown in figure
4. For this example, Q = 10,300 marks the onset of overstability and Q = 2535 the
transition from steady to oscillatory modes according to linear theory. The slight
increase in N as Q increases in the lower branch at Q = 4000 appears real.
5
3
!
N
"\
i 0 | 0 0 i O 0 0 LO)O00
Fig. 4. Variation of N against log Q for R = 20Rc~p = I, </q = i0, a = 4/3.
ible with boundary conditions, and from these determine the one
with maximum E B (now dimensionless). By (11) this value cannot
exceed R/5~ 2. Such an operating state, where the mean magnetic energy
is as large as possible for a given heating rate, may be taken as
"optimal", since it is presumably stable locally and can only lead
to smaller energy under a finite perturbation. Of course it is not
clear that the system admits uny nontrlvial solutions (5 # 0).
Note that if such a solution existed for some R, and if it were
known that rotation was essential for a dynamo effect, then it would
be necessary that solutions terminate for sufficiently small Gnd
sufflciently large values of R.
The existing theory of convective dynamos has concentrated on
cases whlchare probably far from optimal in the above sense. Viscous
effects are freely admitted, parameters and geometry chosen to a11ow
the convective modes to be determined by considerations at marginal
stability, and various devices are used to simplify the analysis of
electromagnetic induction. In the following sections we study various
aspects of these highly idealized models, but return to some of the
questions raised above at the end of the paper.
we can seek solutions of (13)-(15) which are compat-
201
3. KINEMATIC INDUCTION
This aspect of the problem has a large recent literature (see e.g.
the reviews of Roberts 1971, Weiss 1971, and Gubbins 1974). One of
the more direct evaluations of the regenerative effect is possible if
the fields are taken to be periodic in space and time. (The spatially-
periodic case was treated by Childress 1967, 1969b, 19701 the theory
in its most general setting was developed by G. O. Roberts 1969, 1970,
1972.) This situation arises naturally in planar or almost-planar
models involving simple boundary conditions. The analysls is facili-
tated (and can be made explicit) if the two dimensionless numbers
r = ~ /tlk~ r k = U/~k, (16)
where U, k , and ~ are the speed, wavenumber, and frequency charac-
teristic of the velocity field, satisfy
r k = o(1), r - o(1). (17)
That is, the magnetic Reynolds number of a fluid eddy must be small,
and the time scale of the motion should be of the order of the decay
time of a magnetic field structure of the same size. With (17) it
becomes rather easy to demonstrate self-excltatlon of a magnetic field
which is slowly-varying relative to the scales k, ~. (I¢ is unlikely
that the first of (17) is satisfied in the Earth's core, but the basic
inductive mechanism, which goes back to the pioneering paper of
Parker (1955), can in fact he deduced without such a restriction
(G. O. Roberts 1970).)
We return to the dimensional induction equations, which are
V-~= 0
(is)
(19)
Consider the solenoidal velocity field
~(a) = U(O, sin o, siu(~+~)), a = kx + ~t, (20)
and suppose t h a t ~ has t he d e c o m p o s i t i o n
= ~ + g, g = o(i), ~ slowly-varylng. (21)
202
Uslng (20) and (21) in (18), (19), one sees that the part ~ will
approximately satisfy
- f-Vu ,
so that fl k d~
g = (nk 2 ~+ ~u) n2k4+~ 2
The slowly-varylng component will then satisfy
-- - = V x ( u x g ) = ~ t
422)
( 2 ~ )
V"~ = 0, (24)
where the overbar denotes the o-average and A is a constant pseudo-
tensor. For (20) the only non-zero component of A is
All = . (qk3U2(sln ¢)/(~2k4+~2)) ~ (25)
Thls additional contribution to mean electromotive force is usually
referred to as the "a-effect". The most general Q-effect, involving
arbitrary symmetric A, can be created by suitably combining linearly
independent modes of the form (20). It is easily seen that, by exam-
ining the case of diagonal A, that (24) can be made to admit exponen-
tlally-growing spatially periodic solutions (and note that (17)
insures that they will be slowly-varying).
Let us look more closely at the underlying inductive mechanism
when ¢ - ~/2. From (22) it is clear that the source of small-scale ÷
magnetic structure is proportional to the x-derivative of u, l.e. to
the sheur of the flow. Now trigonometric spatial modes of the diffus-
ion equation decay without change of shape, but there is a phase shift
between the solution and moving sources. Combining this shift with
that introduced by differentiation, we see that ~ is proportional to
~(O + $)where ~k2/~ - tan ~. As O varies, ~ and ~ rotate in the yz
plane while maintaining thls phase difference, so the induced current,
obtained as a cross product, is independent of o and proportional to
sln ~.
For a given mode 420) the corresponding entry in A is maximized
when ¢ = ~ TM W/2 s l.e. the motion is both quasl-steady and Beltraml
(vortlcity and velocity everywhere parallel). Thls maximizes
the mean heZ~u~ty (Moffatt 1968), defined as the volume average of
u. Vxu, for a given mean kinetic energy. Note that the mean hellclty
is opposite in slgn to ~ for these elementary Beltrami modes.
To summarizer time-lndependent velocity modes having the property
203
that the velocity is orthogonal to the wavenumber vector and the
two orthogonal components are 90 ° out of phase, provides a basic
element of a particularly efficient kinematic dynamo process, charac-
terized by a constant mean heliclty. A variety of other, less effici-
ent dynamo mechanisms (involving an A which either vanishes or has
rank 1~ see e.g. case IV in G. O. Roberts 1972) can he studied by a
refinement of these procedures~ but in the present context it is
rather a slightly different point of view which is needed, since the
relevant convective modes cannot be regarded as exclusively small-
scale. We accordingly consider next the dynamo mechanisms which are
compatible with the dynamics of convection.
4. DYNAMICS
The efficient kinematic dynamos considered above are very special
in that the hellclty of the flow may be averaged over space and time
to obtain a non-zero pseudo-scalar H. It appears to be difficult,
however, to find physical systems which will exhibit this property.
For example, owing to dissipative processes we expect a rotating
sphere of heated fluid to settle down so that H can be defined Indepen-
dently of the initial conditions. Now restart the system but with
initial conditions T(-~,0), -~(-~,0). If the magnetic field is zero
the Bousslnesq equations are invarlant u~der this reflection (recall
g - g0r), so the system will evolve toward a mean hellclty -H = H,
implying H = 0. To argue this in a different way, the mean state of
the system should depend only upon the mean heating rate and various
material parameters (all scalars) and the pseudo-vector ~, from which
it is impossible to construct a pseudo-scalar. If the system is +
endowed with a magnetic field having mean dipole moment m (a vector) ~
H could be expressed as an odd function of m.~, but the record of
magnetic reversals suggests that for the Earth there is no preferred
polarity and therefore m = O!
Ifj nevertheless, rotation is to be regarded as essential to the
convective dynamo, its action must be not to create mean hellclty,
But rather to "polarize" heliclty in space (or time) in such a way
that the resulting pattern of induced currents can be self-exclted.
This self-excltatlon is difficult to visualize and compute when the
length and time scales are unique (as in the system (13)-(15)) since
the induced currents resulting from the polarlzatlon are bound up
closely with the "eddy" currents which dissipate the field. For the
purpose of analysis of the effect it is therefore fortunate that
sufficiently rapid rotation of the fluid introduces two spatial scales
204
into the marginal stability problem, at least for sufficiently weak
magnetic fields (Chandrasekhar 1961), so that large-scale induction
and small-scale dissipation can be clearly distinguished.
To take a concrete example of a zero mean hellcity dynamo, con-
sider the solenoidal fleld
u = (- sin kx cos az, sin kx cos az, cos kx sin az). (26)
If k >> a the induction problem can be solved approximately as in
Section 3, with similar results except that now there is an additional 1 factor ~ sin 2z in A. But note that (26) can also be written
g . sln(k +az) ÷ sln(k -a )+ , ( 2 7 )
t h a t i s , a s a sum o f two O ( l ) r a p i d l y - v a r y i n g f i e l d s , e a c h h a v i n g c o m -
p o n e n t s in p~ase, so that each fails as an "efficient" dynamo of the
kind considered above. An analogous calculation can be carried out
for the standing wave
= (0, sin kx cos at, cos kx sin at), a << nk 2 (28)
to obtain helicity varying as sin 2at, and (28) can be expressed as
a sum of two progressive waves, moving in opposite directions with
phase speed a/k.
More generally, let
= ~ s i n ~ + ~ ' s i n ~ ' ~ - + ~ t , ( 2 9 )
and aseume Lk - k'I << k, I= - ='l <<~k 2" One them finds, using the
n o t a t i o n of Section 3,
A - ~k2 ,In~-a') (~,x~)o~ (30) n2k4 + 2
The operation k + k' can be thought of as a reflectlon across a
plane normal to ~ - ~', and if = = =' the dispersion relation for
the modes must be invarlant under this reflection; in addition, from
(30) it is clear that the two corresponding amplitudes must not be
parallel. The special case ~ = ~', ~' = -~, which could arise in
a conservative system, gives the tlme-periodlc induction.
These properties pertain to infinite fields having the proper
structure. In oan#u~ned rotating fluids hellclty can also be polarl-
205
ized as a result of "Ekman pumping" into a quasi-geostrophic flow. If
the latter has wavenumber k, the secondary flow set up by the Ekman
layer is of magnitude ~ Ta-i/dkL, L being the length scale for the
container, and the resulting helicity may or may not he comparable to
that introduced by other processes, depending upon the magnitudes of
k and Ta. Under certain conditions it can be demonstrated that the
Ekman layers are essential to a convective dynamo effect (see Roberts
and Soward 1972), but in certain idealized models (Section 5) they
definitely are not.
The polarity of the resulting hellcity is fixed by the direction
of rotation and is easily c~puted. At a point on the boundary with
outer normal ~, the nearby heliclty has the sign of -~-~. For the
rapidly rotating Benard layer (case I below), the polarity is the same
as that introduced globally by the convection mode for free boundar-
ies, but the distribution is different and (as Just noted) it is
smaller, by a factor Ta -I/12.
We turn now to the situation in rotating convection. We have in
mind, of course, convection in a heated sphere or spherical annulus,
but to get a qualitative picture it is helpful to consider several
planar "approximations" to parts of a spherical annulus, which we
indicate in Figure 2.
Figure 2. Planar "approximations". The heavy lines indicate isother- mal boundaries to be represented by tangent planes. In IV the included angle is made small. The polarization of helicity is indi- cated for rotation in the direction shown at the top. In a homogene- ous sphere~region ~ can be regarded as extending from top to bottom.
206
We consider these regions in turn. Letting g and ~ be parallel and
oonstant~ and choosing units of length, time, and speed to be L, L2/~
and K/L, we find the dimensionless linearized equations to be
8~ + Vp + Ta I/2 ÷ ÷ ffi T' ~--[ i~u - v2~ - R l g , v.~ = o (31)
!
@T' - V2T = 0 (32) ~ T i - ÷ ~ - I g
where ~g and ~ are unlt vectors, T' is the temperature perturba-
tion, and we take Ta >> i.
Case I. This classical problem is treated by Chandrasekhar
(1961). In the limit Ta ÷ ~, provided P exceeds about 0.68, convec-
tion ensues as a small-scale pattern. The critical wavenumber vectors
(L is now the layer thickness) are almost perpendicular to ~G, the
motion is quasl-geostrophic, and under reflection across the plane of
the layer the vertical velocity component changes sign. There is
accordingly polarization of the helicity along the axis of rotation,
in a manner similar to that obtained for the motion (26). The criti-
cal parameters are
R = 3(~2/2)2/3Ta 2/3, k ~ (2Ta/2)l / 6, (33) c e
Using the term "roll" to denote the convection field corresponding to
a given wavenumber vector ~ in the plane of the layer, and setting
~G = (0,0,1), a single roll has, in the case of free boundaries, the
form
~ sin ~z cos • ~3 + (~2Tal/2/k4) ÷ ÷ cos
E a c h s u c h r o l l c o n t r i b u t e s a n e n t r y i n t h e u p p e r l e f t 2 x 2 s u b m a t r i x
o f A, w h i c h i s a n e g a t i v e m u l t i p l e o f s i n 2 ~ z . The c o r r e s p o n d i n g
h e l i c i t y h a s t h e p o l a r i t y s h o w n i n F i g u r e 2 . F o r r i g i d b o u n d a r i e s t h e
results are similar over the interior, the secondary Ekman flow being
smaller by a factor k Ta -1/4 ~ Ta -1/12 according to (33). c
From the point of view of dynamo action, the essential feature of
this ease is the high degeneracy of the geostrophic flow, allowing
rolls of arbitrary direction. The fact that A can be made to have
rank 2 implies a relatively efficient dynamo process based exclusively
on small scale motions (see Section 5). This realizes in a simple
planar geometry the so-called ,, 2,, kinematic dynamo (Roberts 1971).
On the other hand region I can hardly be regarded as typical of the
sphere as a whole, especially sinoep strictly speaking, the geostro-
207
phic contours have this degeneracy only at the poles!
Case II. Here gravity and i~ = (0,0,1) are orthogonal, and the
choice of geostrophic velocity ~ = Y×$(x~y)~ 3 reduces the problem
to classical B~nard convection without rotation. There is no dynamo
effect from these rolls since particle paths lie in planes. (Indeed
from (18) it follows that B 3 decays, and the resulting two-dimenslon-
al non-dynamo can be regarded as a special instance of Cowling's
theorem, of. Roberts 1967.)
However~ these special solutions completely neglect the presence
of "sidewalls" which might represent the effect of the sloping
spherical boundary. As Busse (1970) has emphasized, the sidewall
constraint drastically upsets the geostrophic balance and the near-
equatorial region is best approached through case IV below.
Case Ill. We let ~g = (0,0,-i), ~ = (sin ~,O,cos ~). For
steady convection the Rayle~gh number is g~ven by
R = [k 6 + Ta(klSln ~ + k3cos ~)2]/-(k 12+. 2K2 )
where ~ is now an arbitrary vector. To make this expression an even
function of k 3 and therefore obtain modes of the kind needed to
satisfy conditions at the plane boundaries, it is seen that k I must
vanish, in which case the problem reduces to case I above but with Ta
replaced by Ta cos2~. The effect of the obliqueness is therefore to
reduce the critical Rayleigh number somewhat, and to restrict the
locally horizontal wavenumber vector to be nezrly perpendicular to
the plane of ~ and ;. Thus the 2 dynamo of Case I is reduced to an
incomplete or near incomplete s-effect, strongly biased toward induc-
ing i 2 current from i 2 fleld. In kinematic dynamo theory this induc-
tion is nevertheless essential to the success of the "~" dynamo
(Roberts 1971)~ as orlglnally envisaged by Parker (1955). In the
a~ mechanism the a-effect is supplemented by large-scale shear~ which
here replenishes ~2 field. Thus case III, which might be taken to the
be typical of a large fraction of spherlcal annulus~ suggests a
natural mechanism for obtaining "one-half" of the dynamo effect from
convection.
Case IV. Here one seeks to represent the effect of the sloping
sidewalls. A class of such models was studied by Busse (1970) and
subsequently used in a convective cycle (Busse 1975p see Section 6
below). The model has the advantage of describing rather closely, in
a simple geometry~ the essential physics of the convective instability
in a rapidly rotating, heated homogeneous sphere (Busse 1970).
The sidewalls upset geostrophy through Ekman pumping as well
208
as by their inclination to the axis of rotation~ but the latter
effect can be made to dominate. In this case Busse's results can be
obtained rather easily by inverting a device of the oceanographer and
replacing a slow decrease of the depth by a slow increase of ~. Let
= (-l,0,0) ~ - (0,0,i) and replace Ta I/2 by Tal/2(l" + Xx) in g , 3
(31). L e t t i n g
ffi V x ~(x,y,t)~ 3 , p - -(I +Xx)Ta I12 ~ + p'
and eliminating pt by cross-dlfferentlation we obtain
2 2 (-~ - V2)V2, = -~y (XTal/2* + RT') •
~T' V~T' ~-~ 2 ~2 ~2 _ = = +
P ~t ~y • V2 ~x 2 ~y2 "
If we look for modes proportional to exp(i~t + lax + iby)
( P i e + c 2 ) ( i ~ c 2 + c 4 + i a A T a 1 / 2 ) = Ra 2 ,
(35)
( 3 6 )
we have
( 3 7 )
where c 2 ffi a 2 + b 2. Thus
ffi - ( a A T a l / 2 ) / ( p + 1 ) c 2 • a2R ffi c 6 + (P/(P+I))2a2ATa/c 2
a n d t h e c o n d i t i o n s f o r m a r g i n a l s t a b i l i t y b e c o m e
= 2 - 1 1 6 ( ~ P l ( t + p ) ) l l 3 T a 116 , b = O, R = 3 ( X P I ( l + p ) ) 4 1 3 ( T a 1 2 ) 2 1 ~ ac c c
(38)
If X is regarded as small• the perturbation is on the strict geostro-
phic equilibrium of case If, but it is a singular perturbation with
strong roll selection. If A is regarded as % i, the ordering (38) is
in accord with that of cases I and IIl~ although the mode in the
present case has some features of a Rossby wave. In Busse's formula-
tion A is a typical sidewall slope and should be taken as positive,
so the phase speed given by (38) is "eastward". (If one regards the
observed "westward drift" of the non-dlpole geomagnetic field as a
phase speed of a flew field associated with the ~-effect, this result
is disappointing. However• as Busse notes the g~oup velocity ~s west-
ward, and this raises the question of whether sufficiently super-
critical convection in this system might not take the form of inter-
mittent wave packets.)
As we have noted the model can be interpreted as an appropriate
local section of a sphere, parametrized by the latitude of the inter-
209
section with the boundary andjif this is done~reasonable agreement is
obtained with the stability analysis of a rapidly rotating heated
sphere (Roberts 1968, Busse 1970). Roberts found that convection
begins on a cylindrical annulus oriented parallel to the axis of rota-
tlon~ with radius about half that of the sphere, and consists of
vertical rolls of dimension % Ta -I/6 along the aximuth, the radial
thickness of ~he convection zone being % Ta -I/9 both relative to the
radius of the sphere. It would appear that in the planar model the
radial structure is lost, or rather reflected only in the vanishing
of b c in (38). Certainly the spherical case should have a local
approximatlon with radial structure, which could be incorporated into
a dynamo model. Soward's remarks at this meeting concerning his
current calculations of the localized D-effect point toward such a
possibility, which would in effect open the way to an asymptotic
analysis of the spherical dynamo.
5. B~NARD-TYPE MODELS
We c o n s i d e r i n t h i s s e c t i o n t w o e x a m p l e s o f a e o n v e c t l v e d y n a m o
cycle based upon a classical B~nard layer. Busse (1973) considers
B~nard convection without rotation. It is assumed that only one set
of rolls is present, so hellclty is created by adding a shear flow
alone the axis of the rolls. Such a flow is somewhat artificial for
a B~nard layer, but it can in principle be driven by a modification
of the mean temperature profile. Moreover, such distortions might
well arise in a different geometry through convective heat transport
obllque to the direction of gravity. To effect a scale separation it
is assumed that the (spatial) mean magnetic field is dominated by a
component orthogonal to the roll axis and slowly-varylng along it.
In this model the mean hellclty vanishes, since the unidirectional
shear flow passes down rolls of alternating sign. The kinematic
dynamo effect is therefore not the first order D-effect of Section 3,
but rather a hlgher-order mechanism involving the spatial derivatives
of the mean magnetic field. (In the terminology of Roberts 1971 the
dynamo is of "B~" type.) Busse uses a numerical method to study the
equilibration of the system and the partitioning of internal energy.
While his approach, being essentially quasi-steady, does not deal with
the dynamics of equilibration, it does reveal an interesting balance,
similar to that suggested by (ii*), which is perhaps typical of near-
critical convective dynamos. Namely, if R - R > 0 is sufficiently c
+ bE B = constant, where a and be are posi- small, one finds that aE u *
tive constants, but that E must exceed a critical value E in order U U
210
to have a dynamo effect. The conclusion is that for E > E* the u u field energy will increase as the kinetic energy falls, while for
E < E the kinetic energy will rise as the field decays. Ultimately u u the magnetic field will be sustained at the maximum energy compatible
with the above dynamic constraint as well as stationary dynamo action.
The second model, put forward by Childress and Soward (1973) and
worked out by Soward (1974) for equilibrium at minimum field energy,
is based upon the rotating B~nard layer considered as case I in Sec-
tion 4. This model utilizes the rotation of the fluld to effect the
scale separation, in a manner which permits the dynamics of equilibra-
tion t o be followed in detail. We take P~ ~ i.
The "weak-field" case studied by Soward (1974) assumes that the
Hartmann number of the induced field is ~ I. This a pr~or~ hypothesis
on field intensity is then Justified by exhibiting consistent, appar-
ently stablejoperating states. The rolls have the form (34) with ~ an
arbitrary vector in the plane of the layer. The length L is here the
thickness of the layer, so that for regeneration of the field we must
have (of. (25) with ~ = 0)
R 2 ~ k ~ Ta I/6 , m c
where Rm = UL/~ is a magnetic Reynolds number based on roll ampli-
tude U. Thus Rm % Ta 1/12 and the small-scale field satisfies
g ~ Ta-I/12f. This ordering generates a series solution in powers
of Ta -1/12 .
The mean field equations are easily obtained for a discrete or
continuous distribution of rolls, and in the former case, with
= [Bl(Z,t),B2(z,t),O), take t h e form
~B i ~2B i + 2~A ~ [sin 2Wz Mij(t)B j] ...... 0 (39)
~z 2
where, in terms of the A in (24),
M = [ -A21A11 -A22 IA12 (40)
Here the unit of time is L2/n. Note that, since we have in mind that
roll structure is to be determined by auxiliary equations of evolution,
the discreteness or continuity of the roll pattern is determined by
the initial conditions. In (39) and (40) the normalization is such
that All + A22 = 2 so that A(t) i s a parameter proportional to
the kinetic energy of the flow.
For the weak-field solutions, it turns out that near marginal
211
instability A is fixed by the quantity (R - Rc)/R c. Thus the dynamics
of the model reduces to the study of how the magnetic field detez-
mines the partitioning of a fixed constant kinetic energy among the
various rolls. Soward was able to show that the roll structure indeed
evolves on the same time scale as the field. If q(t,~) denotes the
kinetic energy in a given r011, where ~ = ~/Ta I/6, the equation for
Penetrative convection occurs in a fluid whenever a conyectively unstable region
is bounded by a stable domain. This situation is encountered in many stars, and it is
also a very com~non circumstance on Earth: in the oceans and in the atmosphere. One
would therefore expect that the astrophysicists may largely benefit from the experience
accumulated on this subject by the geophysicists.
However, this is only partly the case. In the ocean, salinity plays a very
important role and especially so at the interface between a stable and an unstable
(mixed) region. In the atmosphere, the behavior of the convective planetary boundary
layer is dominated by the 24 hour thermal cycle, so that a steady state is never achie-
ved ~ as it is in a star (at least in one that is not pulsating). Furthermore, the ratio
between viscosity and conductivity, as measured by the Prandtl number, is of order unity
for water and air, but it drops to 10 -~ and less in a star. Finally, the effec~of stra-
tification are much stronger in stars where convective regions often span several density
scale heights.
For all these reasons, the astrophysicists have developed methods of their own
to describe stellar convection, even though some are widely inspired by those used by
the geophysicists. The same is true for convective penetration, whose study cannot he
separated from that of convection itself. The purpose of this review will he to recall
those methods, and to verify if they are suited to describe the penetration of convective
motions into stable surroundings.
~I. Phenomenological approaches
In those approaches, one hypothesizes a flow which is plausible in that it does
not seemingly contradict the laws of fluid dynamics and that it conserves heat and
kinetic energy. One then calculates the gross parameters that characterize this flow:
convective flux, mean temperature gradient. The most commonly used of such procedures
are based on the concept of mixing length, and have already been discussed in this
colloquium by D.O. Gough.
226
I. Non-local mixing-length treatment@
All mixing-length procedures applied to stellar convection are in fact based
on the two differential equations describing:
i) the variation with height z of the density excess 60 between a convective element
and the surrounding medium, in which the densities are respectively p* and P
~zz(~p ) d__e* - d_@_p ( ] ) dz dz ,
ii) the variation of the kinetic energy of that convective element
d l (2) d--z (5 P v2) = - 6P g ,
where g is the gravity.
The standard prescription (Vitense ]953) is to replace these equationsby
dO l ~ (3) ~P :k~ ~ ~ ~ ,
] v 2 ~Sp ~, (4) ~- = - C g P 2 ,
being the mixing length and C an efficiency factor which allows for the production
of turbulent energy. In this treatment, both the density excess and the convective
velocity are functions of local quantities only (the mixing length and the density
gradients); by construction the convective motions cannot penetrate into the stable
adjacent regions.
That constraint may however be relaxed by treating the original differential
equations in a less crude way. This was done by Shaviv and Salpeter (1973), Maeder (1975a)
and Cogan (1975), to be specifically applied to the overshooting from a convective stellar
core. The differential equations are integrated over one mixing length (or up to the
point where the velocity vanishes, whichever happens first):
d0 1 (" 1
1 v 2 6 p d z ( 6 ) y =-c g T ' • z-z.~£ l 1
227
(To formally recover certain results of the standard scheme, Maeder identifies the
integration distance with half the mixing length). The density stratification dp/dz
of the ambient medium is adjusted until the constancy of the total energy flux
(convective plus radiative) is realized.
This non-local mixing-length treatment permits the description of many ~eatures
of penetrative convection in the laboratory or in the Earth~atmosphere. A convective
element ceases to be buoyant at some distance from the unstable region, where also the
convective flux vanishes; from there on its momentum carries it still further into the
stable region, and since it is cooler than the surrounding medium, the convective flux
is of opposite sign. In a stellar core, the P~clet number is very high and thus the con-
vection is extremely efficient; it follows that the whole domain where the motions occur
is kept nearly adiabatic.
The main weakness of this approach, as one may expect, is that all quantitative
predictions depend on the assumption made for the mixing length. Another parameter plays
here also Some role, and it too can only be guessed: it serves to measure the fraction
of space filled by the convective elements. In the bulk of the unstable domain this
parameter is probably close to unity, but in the overshooting region, it drops to one
half and possibly much less, because it is unlikely that many downwards moving elements
are present there.
In a generalization of the mixing-length procedure proposed by Spiegel (]963),
the number of convective elements is not fixed a priori, but is governed by an equation
of conservation similar to the radiative transfer equation. Travis and Matsushima (]973)
have applied this non-local theory to the solar atmosphere, and they obtain an apprecia-
ble overshooting into the photosphere. In order to match the solar limb-darkening obser-
vations, they must choose a ratio of mixing length to pressure scale height of 0.35 or
less. Unfortunately, this value is too small to yield the correct solar radius, within
the assumptions that can be made for the chemical composition. Travis and Matsushima
suggest that this discrepancy be removed by allowing the above mentioned ratio, between
mixing length and scale height~ to vary with depth.
228
2. Other procedures
A different approach has been used by the meteorologists to model cloud
dynamics (Stommel 1947). It is based on the concept of thermals, and has since been
applied to a variety of other problems; it was Moore (1967) who brought it to the
attention of the astronomical community. A thermal is an organized cell which, llke the
eddy of the mixing-length treatment, exchanges heat and momentum with the surrounding
medium, but has also the property of gaining or loosing matter through entrainment or
turbulent surface erosion.
The only serious attempt to apply this concept to an astrophysical case was
made by Ulrich (]970 a, b), who used it to build a model of the solar atmosphere. He
had to overcome such difficulties as the absence of any ground level (from where the
thermals start on Earth), fragmentation (since the thermals are bound to move over
several scale heights) and radiative exchanges (the P~clet number becomes rather small
above a certain level). His model displays substantial overshooting well into the
photosphere, but one may wonder whether this is not due mainly to a simplifying assum-
tion he made for the correlation between the velocity of a thermal and its temperature
excess. Another consequence of this is that there is no sign change of the convective
flux in the stable region.
A similar treatment has been proposed recently by Nord!und (]976), in which the
medium is organized in two streams of rising and falling fluid. Those behave like the
thermals in the sense that they too exchange matter, heat and momentum, but here there
is no ambient medium. Dimensional arguments are invoked to write down the equations
governing the exchanges between the two streams. Solar models constructed with this
procedure are characterized by an appreciable penetration up to an optical depth of
T = 0.]; the quantitative predictior~of course depend on the choice of the dimen-
sionless parameters that occur in the equations.
229
III~ Direct approache s
In the past ten years a new approach has been explored thanks to the fast
computers with large memory storage that are now available: one can start directly
from the fluid dynamics equations, instead of replacing them by simpler ones that are
more tractable. Of course, it is not feasible yet to treat the most general problem:
as we will see, the solutions obtained to date all suffer from some kind of restriction.
But at least they help to build up an intuition which has been lacking so far. We
shall consider here only the nonlinear investigations; the main interest of the linear
studies has been to determine the critical conditions (Gribov and Gurevich ]957,
Stix |970, Whitehead 197]), but they cannot be used to predict the extent of penetration,
which is strongly related to the amplitude of the solution.
]. Bo_____ussinesq convection
The prototype of penetrative convection in the laboratory is the ice-water
experiment suggested by Malkus (]960) and performed among ethers by Townsend (1964) and
Myrup e~t ~. (|970). Water has the peculiar property of presenting a density maximum
at 4°C, so that a tank of water whose bottom is kept at 0°C will he conveetively
unstable up to the level of maximum density, and stable above. Veronis (1963) gave the
criterion for the onset of the instahility, which is of the finite amplitude type. There-
after Musman (]968) made the first quantitative predictions for the extent of penetra-
tion, using the so-called mean-field approximation (Herring 1963). The next improvemer
came from Moore and Weiss (]973), who solved the two-dimensional problem without furthe~
simplification.
A slightly different experiment is that of a fluid heated in its bulk by Joule
effect, in which the parabolic temperature profile creates two superposed domains of
respectively unstable and stable stratifications (Tritton and Zarraga 1967). This
experiment has been modelled by Strauss (|976)~ again with a two-dimensional code; his
results are similar to those of Moore and ~eiss (1973).
These two-dimensional studies are fairly ~ueeessful in predicting, at moderate
Rayleigh numbers, the mean temperature profile and thus the extent of penetration. But
it is doubtful that they can be extrapolated to the parameter range which is of astro-
physical interest (high Rayleigh numbers and low Prandtl numbers). Moreover, these
two-dimensional studies are unable to describe the time-dependent temperature fluctuations
which are observed at the boundary of the well-mixed region. These seem to be excited
randomly, and are essentially three-dimensional in their nature. The astrophysical
importance of these oscillations must not be underestimated: in the Sun, they would
occur just at the base of the photosphere and would generate gravity waves.
230
Another suggestion that the two-dimenslonal studies may be somewhat misleading
comes from the results obtained hy Latour ~t ~. (1977). They analyze the penetration
of convective motions from an unstable slab into the stable adjacent regions. The
solutions are expanded into orthogonal modes in the horizontal, and a finite differences
scheme is used in the vertical. In the special case of a single mode with a two-dimen-
sional planform, this procedure reduces to the mean-field approximation of Herring used
by Musman (]968). But one can also choose a three-dimensional planform representing, for
instance, prismatic cells of hexagonal base. The comparison of solutions derived with
the two types of planforms reveals that penetration is much stronger when the conveetlve
motions are allowed to be three-dimenslonal (Figure ]). In the simplest three-dimen-
sional case, where only a single planform is retained, the solutions are asyr~netrieal:
the overshooting occurs mainly on the side to which the centerline flow is directed in
the hexagonal cells. The mean temperature profile becomes symmetrical again when one
superposes two patterns of hexagonal cells with opposite centerline velocities;
remarkably enough, the total kinetic energy of the flow does not vary as one switches
from the one-mode solution to this two-mode solution. And the total extent of penetra-
tion too remains unchanged, if it is defined as the sum of the penetration depths at
either side of the unstable layer.
2. Convection in a stratified medium
In the laboratory (or Boussinesq) case, the extent of penetration is related to
the only natural length that characterizes the problem, namely the thickness of the
unstable layer. But what should one expect in a stratified medium, such as the solar
convection zone, where the unstable domain spans several density or pressure scale-
heights?
This question has not been answered yet. Toomre ~t ~. (]976) have studied the
penetration from the deeper convection zone of an A-type star; this zone is due to the
second ionization of helium, and it measures about one pressure scale height. Using the
technique mentioned above of truncated modal expansion, and retaining only one single
three-dimensional mode, they find that the motions penetrate up to one scale height
into the stable region below. More recently, they have established that the convective
motions penetrate also above, as far as to build a link between the deeper convection
zone and the upper one, which is caused by the ionization of hydrogen. But the situation
considered is admittedly not one of severe stratification, and these results cannot
be extrapolated to the Sun, for instance, Moreover, the solutions obtained so far are
all stationary, missing thereby the time-dependent character of penetrative convection
which may be of primordial importance.
Another difficulty with these drastically truncated modal calculations is that
they depend on the choice made for the horizontal wavelength of their single planform.
Fortunately, the results are not too sensitive to this parameter, which is felt mainly
in the horizontal heat exchanges; it does not play the dominant role of the mixing
length in the phenomenological approaches.
231
I
a
i I "~I # l
| I
,,f,, 'ii
-"t!
t - -
o i"'%
l ! 1 1 ! 1
V
b
: ' l . \ / - I / ! I l ~'.
. . . . . . . . '~ I / ! I t " - ~ - -
\ 1 ] \ II
, J ~f
c / ' ' k ~ W ; % 1
i*x e l
. . . . . . 4 . , I \ i l l ' , . . . . . . o . . . . : ' * / \ ~I
; I
0 ~ • " ° " "
i *
o 1 z
Figure I . Modal solutions for penetrative Boussinesq convection.
The unstable layer, which extends in depth from z = O to z = ], is imbedded in an infinite stable domain from which only a fraction of thickness ~z ~ 2 on each side is shown here. The same Raylelgh number R ~ |0 s characterizes the stability and the instability of the three superposed layers (it corresponds to about thousand times critical). The amplitudesof the vertical velocity, W, and of the temperature fluctuations,G, are displayed as functions of z. Figure la shows a single two-dimensional mode (which may be visualized as a horizontal roll), figure Ib a single three-dlmensional mode of hexagonal horizontal planform, and figure |c two non-interacting three-dimensional modes of that same geometry. In all cases, the value of the horizontal wavenumber is 2, and the Prandtl number is I. Notice that the overshooting into the stable surroundings is much more pronounced with the three- dimensional motions.
232
The only way to avoid any extra assumption would of course be to directly
integrate the basic equations in three-dimensional space. This has been done hy
Graham (]975), whose latest results are presented in this colloquium. But even the
most powerful computers which are presently available set a rather low limit on the
number of grldpoints that can be used. This in turn fixes the highest Rayleigh or
Reynolds numbers that can be reached: typically one hundred times critical. There is
thus still a very long road to go before meeting the numbers characterizing a stellar
convection zone, but in the meanwhile these numerical experiments are very useful as
a workbench to test the various approximations that have been proposed.
IV. Observational tests
It is relatively easy to confront theoretical predictions of Boussinesq
penetrative convection with laboratory experiments. But, as we were already reminded
by K.H. Bbhm, the comparison of astrophysical models with stellar or solar obser-
vations is more delicate, for the physical parameters that can he determined often
depend on other factors than just the properties of convection.
For the stars, one is forced to rely on the few gross parameters which can
be observed. In principle the classical tests for probing the internal structure
of a star may be used to determine the extent of the regions which are in nearly
adiabatic stratification, at least once their location is roughly known. These tests
can complement each other: the apsidal motion test (see Sehw arzschild ]958) is more
sensitive to the overall mass concentration in a star, whereas the pulsational period
of a variable star (see Ledoux and Walraven ]958) depends more on the stratification
of its envelope. There is even a s]ight hope to interpret the properties of the dynamical
tide in a close binary system, which are closely related to the size of the quasi-
adiabatic core of the two components (Zahn ]977).
But the most promising tests are probably those which sense the inhomogeneitles
in chemical composition. Prather and Demarque (1974) and Maeder (]975b, ]976) have
included some amount of overshooting in their calculations of evolutionary stellar
models. They find that the evolutionary tracks, lifetimes and cluster isochrones all
are appreciably modified by an increase of the convective core. Prather and Demarque
obtain the best fit between their theoretical isoehrones and the cluster diagram of M 67
for a penetration depth of about ]0% of the pressure scale height; Maeder's value is
slightly less and he uses it to calibrate his non-local mlxing-length procedure.
The thickness of a convective envelope (together with its penetrative extension)
may be inferred from the abundance of elements which undergo nuclear destruction at
moderate remperatures, such as llthium~ beryllium and boron. In the case of the Sun,
additional information can be gathered from the composition of the solar wind (Boehsler
and Geiss ]973). But when interpreting such observations, one must keep in mind that
other instabilities than convection may also lead to a thorough mixing of the stellar
material.
233
It looks at first sight as if the Sun should be the ideal object on which to
check the theories of penetrative convection. The solar atmosphere becomes eonvectively
unstable below optical depth T = 2, which means that the overshooting motions should
occur in the photosphere and thus be visible. The difficulty however is to distinguish
in the observations of Doppler-shifted lines what is due to waves or oscillations, and
what is due to genuine penetrative convection. The accuracy of correlation measurements
between velocities and temperature fluctuations is still not sufficient to permit the
separation of both types of motions (for a recent and complete review on such measure-
ments, see Beekers and Canfield ]976). And one encounters the same problem when it
comes to the interpretation of the non-thermal energy flux: the convective (enthalpy)
flux is blended with the flux of kinetic energy, which is carried by both convection
and waves. But the solar observations are rapidly progressing toward better precision
and spatial resolution, and one may hope that these questions will he settled in the
not too distant future.
284
Biblio~"~aphy
Beckers,J.M., CsLufield,R.C. 1976, Physique des Mouvements dans les Atmos- pheres Stellaires, R.Cayrel and M.Steinberg eds., CNRS, p.207
The solar granulation, with its horizontal temperature fluctuations,
and its associated velocity field, is a consequence of overshooting
convective motions. Theoretical estimates of the magnitude of the
temperature fluctuations and mass fluxes involved were obtained in a
recent paper (Nordlund, 1976, Astronomy & Astrophys. 50, 23). Here, a
simple model of the instantaneous granular velocity field is presented,
and the effects of this velocity field on photospheric spectral lines
are described.
The vertical velocity component is modelled by a simple, parameterized
expression:
pv z (x,y,z) = ~o(~2/2) sin (~x/d) sin (~y/d)e-Z/Zo/(l+e-Z/Zo).
The three parameters specify the amplitude (4o) of the vertical mass
flow, the horizontal size (d) of a model granule element, and a typical
scale (Zo) for the vertical variation of the mass flux.
The horizontal velocities are determfned from the vertical velocity by
the condition of continuity. Since the granular motions are slow and
approximately anelastic, the condition of continuity can be well approxi-
mated by
div (~v) = O.
This simple, quadratic pattern'of alternating vertical velocities, with
the corresponding horizontal velocities determined by the condition of
continuity, represents a crude model of the instantaneous granular
velocity field. Some important conclusions are possible, however, using
this model to fit the non-thermal broadening of photospheric spectral
lines:
The three parameters (~o' Zo" and d) can be used to fit the observed
238
half widths of a set of photospheric spectral lines, at two different
center to limb distances. The values 4o=0.35 kgm-2s -1, Zo=100 km,
d=1500 km produce a reasonably good fit. The velocity along a line of
sight varies along the llne of sight in the model granular velocity field.
When the line of sight velocity varies, the line absorption coefficient
is shifted in and out of the intensity profile of the line, and an
increased absorption results. With the given parameter values, this
effect is negligible for vertical sight lines. However, because of the
increase of typical optical scales for increasing angles of inclination,
the variation of the velocity along a line of sight becomes important
with growing angle of inclination.
Due to this effect, the observed center to limb behavior of the equiv-
alent widths of the spectral lines is also reproduced, without the need
for classical microturbulence. With a classical macro/microturbulence
model, the same behavior could have been achieved only by assuming a
depth-dependent and anisotropic macro- and micro-turbulence.
Turbulent motions on scales smaller than granular are certainly gener-
ated by the larger scale, granular motions. However, the (apparently
anisotropic) center to limb effects on the equivalent widths, which
correspond to classical microturbulent velocities of the order 1 - 2
kms -1, can be explained entirely as a consequence of the granular scale
velocity field.
In conclusion, this study shows that the observations of llne broadening
and line strength are consistent with a situation where the amplitude
of the velocity field is at maximum on granular scales, where the motion
is being driven by convection, and with the amplitudes of smaller scale
motion being progressively smaller and smaller.
THERMOSOLUTAL CONVECTION
Herbert E. Huppert
Department of Applied Mathematics and Theoretical Physics, Silver Street,
Cambridge CB3 9EW England
I. Introduction
The aim of this contribution is to survey a relatively new form of convection,
which is very easy to investigate in the laboratory, plays an important role in
the oceans and many chemical engineering situations and is likely to prove
essential in the understanding of some areas of stellar convection. Thermosolutal
convection (or double-diffusive convection as it is often called) owes its exis-
tence to the presence of two components of different molecular diffusivities which
contribute in an opposing sense to the locally vertical density gradient. The
different sets of components studied have covered a wide range including
a. heat and salt - two components relevant to the oceans and a number of
laboratory experiments;
b. heat and helium - two components relevant to certain stellar situations;
c. salt and sugar or two different solutes - components useful for laboratory
investigations; and
d. heat and angular momentum - components which are likely to be relevant to
some stellar situations. In each case, the most rapidly diffusing component has
been listed first. Thus, while in the paper the terminology of heat and salt
will be used, different components can be envisaged by reference to the above
examples.*
Aside from its many applications, thermosolutal convection has received
considerable attention because it can induce motions very different from those
predicted on the basis of purely thermal convection, that is, convection with
only one component. In particular, diffusion, which is known to have a stabil-
izing influence in thermal convection, acts in a destabilizing manner in thermo-
solutal convection. By the action of diffusion, instabilities can arise and
vigorous motion take place in situations where everywhere throughout the fluid
heavy fluid underlies relatively lighter fluid.
*Ed. Spiegel paraphrases this by the maxim: for salt, think helium. Is this his secret of gourmet cuisine?
240
An example of naturally occurring thermosolutal convection which highlights
its counter-lntuitive nature is afforded by Lake Vanda. Situated in Antarctica,
approximately 5 km long, 11 kmwlde and 65 m deep; Lake Vands has a permanent ice
cover of 3 - 4 m. Just below the ice the water temperature is 4.7°C and t~e
temperature increases with depth, often in a step-like fashion, until at the
bottom the temperature is 24.8°C (Figure l). There is a corresponding increase in
density, from 1.004 Em cm -3 just beneath the ice to a maximum of I.I0 Em cm -3
- - J
52
54~
-------~S-- --B 62--
TEMPERATURE °C
Figure 1. The temperature profile in Lake Vanda as a function of depth indicated in meters (taken from Huppert and Turner, 1972). Note the existence of a layer of uniform temperature (7.6°C) between 14.2 and 37.9 m which has been partially omitted from this figure.
at the bottom, due to the presence of salt. Vigorous convective motions take
place in the upper portions of the lake, maintaining the regions of uniform
properties, which are the hallmark of thermosolutal convection. Any model of the
lake based solely on considerations of temperature, or density, is doomed to
failure. Only by incorporating thermosolutal effects can a successful model be
derived (Huppert and Turner, 1972).
The plan of this survey is as follows. The two fundamental mechanisms of
thermosolutal convection are described physically in §2. These form the foundation
of the quantitative analysis of a suitable Rayleigh - Benard convection problem,
whose linear and nonlinear aspects are discussed in §3. The mechanism by which a
series of layers and interfaces can be maintained, as in Lake Vanda, is considered
in §4. In §5 a few ways in which a series of layers and interfaces can originate
are described. The structural stability of such a series is investigated in §6.
Conclusions are presented in §7.
241
2. The Fundamental Mechanisms
The first of the two fundamental mechanisms of thermosolutal convection occurs
in a fluid for which the temperature and salinity both decrease with depth, while
the (overall) density increases with depth, as indicated in figure 2a. In this
(a)
v
T S 0
?
(:b) T S p
Figure 2. Typical temperature, salinity and density profiles for: (a) the finger situation and (b) the diffusive situation, including a sketch of the motion of a disturbed parcel of fluid.
statically stable situation, the dynamic instability that arises can be examined
by considering a parcel of fluid displaced vertically downward. Initially warmer
and saltier than its surroundings, the parcel comes to thermal equilibriumbefore
its excess salinity can be diffused. It is thus heavier than its surroundings and
continues to descend. The ensuing motion consists of adjacently rising and
failing cells, interchanging their heat, and to a much smaller extent their salt,
much llke a heat exchanger. The kinetic energy of the motion is extracted from the
potential energy stored in the salt field. Experiments indicate that in typical
conditions, the plan form of the cells, called salt-fingers, is squarish with a
horizontal length scale of {(~g/KTV) (d~/dz)} -1/4, where ~ is the coefficient of
thermal expansion, g is the acceleration due to gravity, <T is the coefficient of
thermal diffusivity, v is the kinematic viscosity and ~d~/dz) is the mean
(positive) vertical temperature gradient. This length scale, discussed further
in the next section, represents a balance between dissipative effects acting pre-
ferentially on small scale motions and the increasing inefficiency of diffusing
heat over ever larger horizontal distances.
The second fundamental mechanism occurs in a fluid whose temperature,
salinity and (as before) overall density increases with depth, as indicated in
figure 2b. Displacement of the typical fluid particle vertically downwards now
places it in a warmer, saltier and more dense environment. As before, the thermal
field of the parcel begins to equilibrate with its surroundings more rapidly than
does the salt field. The parcel is then lighter than its surroundings and rises.
But due to the finite value of the thermal diffusion coefficient, the temperature
field of the parcel lags the displacement field and the parcel returns to its
242
original position lighter than it was at the outset. It thus rises through a
distance greater than the original displacement, whereupon the above process
continues and leads to a series of growing oscillations, or overstability, which is
resisted only by the effects of viscosity. This oscillatory form of motion has
been experimentally documented (Shlrtcllffe, 1969) and some of its characteristics
explored by Moore and Spiegel (1966) in an imaginative paper which develops an
analogy between this form of thermosolutal convection and the motion of a flaccid
balloon in a thermally stratified fluid. For sufficiently large temperature
gradients, steady motion can occur because a large temperature field can overcome
the restoring tendency of the salinity field. The criteria at which this first
occurs are discussed in the next section.
3. The RaTleish-B~nard problem
The fundamental mechanisms of the previous section form the basis of all
quantitative calculations. The most straightforward and hence frequently considered
calculation relates to the extension of the classical Rayleigh-B6nard problem:
what is the motion of a fluid confined between two horizontal planes across which
there is a temperature difference AT and a salinity difference AS? A major
motivation behind such studies is the expectation that just as the purely thermal
problem has successfully explained a variety of phenomena, as summarised by
Spiegel (1971), so also will the thermosolutal extension. And indeed this expect-
ation has already been partially fulfilled.
All calculations so far performed have essentially assumed two-dimensional
motion, dependent on one horizontal co-ordinate, x, and the vertical co-ordinate z.
Considering this restriction and non-dimensionalising all lengths with respect to
D, the separation between the planes, time with D2/KT and expressing the v e l o c i t y
q* in terms of a s t r e a ~ u n c t i o n ~ by
q* ~ (KT/D) (Bz~ , - Bx~), (3.1)
the temperature T* by
and the salinity S* by
T* m T O + AT (i - z + T) (3.2)
S* = S + AS (I - z + T) (3.3) o
where T and S are constant reference values, we can write the governing o o
Boussinesq equations of motion as
o-Iv2~t~-o-Ij(~,V2~) = -~3 x T+R s ~xS+V4~, (3.~)
~t T + ~x ~2-J(~,T) = V2T (3.5)
~t S + ~X~ -J(~,S) =TV2S (3.6)
where the Jacobian, J, is defined by
243
J(f,g) = ~xf~z g - ~zf~x g. (3.7)
We have also assumed the linear equation of state
P* = Po (I - aT* + 8S*), (3.8)
where ~ and 8 are taken to be constant, in the expression for the body-force term
in (3.4).
Four non-dimensional parameters appear in (3.4)-(3.6): the Prandtl number
~=V/KT; the ratio of the dlffuslvlties T=KS/K T, where K S is the saline dlffusivity,
which is less than KT; the thermal Rayleigh number ~ = ~gATD3/(KTV); and the
saline Rayleigh number R S = 8gASD3/(KTV).
To these equations must be added a series of boundary conditions. Mainly
because of their mathematical simplicity, the most frequently used conditions are
those obtained by assuming that both horizontal planes are stress free and per-
fectly conducting to both heat and salt. That such an assumption is a reasonable
one for a model which is to apply in the interior region of a star can be fairly
well defended (and often has been). One aspect of the defense incorporates the
belief that the use of other, possibly more realistic, conditions is likely to
lead to only slight quantitative differences. Indeed, Huppert and Matins (1973),
in a series of experiments described below, give an example of this. Mathematically,
free-free boundary conditions, as the above are often loosely called, are expressed
by ~=~2zz $ = T - S = 0 (z = O,I). (3.9)
a) Linear Disturbances
The equations governing infinitesimal motions are obtained by deleting the
nonlinear Jacobian terms of (3.4)-(3.6). The resulting differential system has
constant coefficients and a solution in terms of the lowest normal modes
~(x, z, t) = 9 ° slnwax ) (3.10a)
T(x, z, t) = T O cos~ax I ePtsin~z (3.10b)
S(x, z, t) = S o cos~ax (3.10c)
leads to the dispersion relationship
where
p3+(c+T+l)k2p2+ {(O+T+l)k 4 - ~2c~2k-2(RT-Rs)}p
+oTk6+~2~a2(Rs-T~) = O,
(3.11)
k 2 = ~ 2 ( 1 + a 2 ) . ( 3 . 1 2 )
S i n c e ( 3 . 1 I ) i s a c u b i c w i t h r e a l c o e f f i c i e n t s i t s z e r o s a r e e i t h e r a l l r e a l o r
c o n s i s t o f one r e a l r o o t and two compl ex c o n j u g a t e r o o t s . E x c h a n g e o f s t a b i l i t i e s ,
w h i c h a r i s e s when one o f t h e r o o t s e q u a l s z e r o , i . e . p~ O, o r e q u i v a l e n t l y
~t~0, occurs first for ~= 2 -1/2 and
244
- RS/T + 27~4]4 (3.13)
O v e r s t a b i l l t y , wh£ch a r i s e s w h e n the pa r r of complex-conjugate roo t s c rosses the
imaginary axis, that ~s Pr = O, occurs first for the same wavenumber, ~ ffi 2 -1/2
and ~ ffi (O+T)Rs/(~+I) + 27~4(l+r)(l+ro-l)/4. (3.14)
I0000
R S
5000
~" ~I~ (3.13)
(3.74)
I I | I I -I0000 -5000 ~ 5000 IOO00
-5000
-I0000
Figure 3. The linear stability results for 0 = i0 -I, T = I0 -I. Along (3.13) one of the temporal elgenvalues, p, is identically zero; along (3.14) two of the (complex conjugate) p are pure imaginary; and along C two complex conjugate elgenvalues coalesce on the real axis.
In the RS, ~plane the linear stability boundary is a combination of (3.13)
and (3.14), as depicted in figure 3, which presents a complete smmnary of the
linear results for ~ = 10 -I, T ffi I0 -I.
An investigation of the fastest growing mode evaluated by linear theory has
been presented by Balnes and Gill (1969), although owing to the linear constralnt~
the results are of at most academic interest. ~n agreement with the result
245
originally calculated by Stern (1960), Balnes & Gill find that in the salt finger
region, the unstable portion of the third quadrant in figure 3, the wavelength of
the disturbance of most rapid growth is much smaller than the marginal value,
23/2~, except very close to the marginal stability llne (3.13). This is in accord
with the physical description of §2, which indicates that a thinner mode acts as a
be~ter heat exchanger.
The following experiment is an example to which results based on linear theory
can be profitably applied. A uniform layer of hot, salty water is carefully placed
over a unlform layer of relatively colder, fresher water. The temperature and
salinity distributions across the initially paper-thln horizontal interface evolve
by diffusion, leading to a situation similar to that considered at the beginning
of this section. Equating the central gradient of the diffusing distribution to
the temperature and salinity gradients which appear in the marginal stability
criterion (3.12), Huppert & Martins (1973) calculate that under typical laboratory
conditions, specifically ~, T-IRs >> 27~4/4, salt-fingering should occur if
~S/C~T > T 3/2, (3.15)
where AT, AS are the initial temperature and salinity differences across the inter-
face. The results of a series of experiments, conducted with a variety of pairs of
solutes with different values of T, are in very good agreement with (3.15).
b) Nonlinear Disturbances
Fully nonlinear, but two-dimensional, investigations have been conducted by
Straus (1972) for ~, R S < 0 and by Huppert & Moore (1976) for ~, R S > 0.
The former reduces the complexity of the governing equations by assuming that
T ÷ O, R S ÷ 0 with Rs/T fixed.
In this limit, the inertial terms in the momentum equation and the adveetlon of
the disturbance temperature, but not the disturbance salinity are negligible.
Straus calculates the solutions for a variety of different values of u as
increases from the marginal stability value. + He also tests the linear stability
of these solutions. His principal conclusions are that as ~ increases, both
extremes of the range of stable wavenumbers increase significantly. Beyond a
specific ~, dependent upon Rs/T, the range of stable wavenumbers no longer
includes the wavenumber at marginal stability. The form of motion with the most
stable wavenumber corresponds to the long thin cells of the (somewhat different)
experiments and is close to that wavenumberwhieh leads to a maximum salt flux.
These results are interesting and suggestive, hut the two-dlmensional and small T
assumptions might limit the generality of some of the specific conclusions.
The calculations for ~, R S > 0 of Huppert &Moore aim to follow the form of
solutions as ~ increases for fixed RS, ~ and T. Drawing on the results of a
+The marginal stability point is supercrltical, that is, there is only the con- ductive solution for ~ less than the marglnal value.
246
number of numerical experiments, they put forward the following general con-
clusions. There are two rather different branches of solutions. Along one branch,
which may be initiated either subcritically or supercritically from the linear
oscillatory critical point given by (3.14), the solutlons are oscillatory. In
R~
4 ~
i
3(I0(I
10000
RT
90OO
8000
2"0 4.0 6 0 X 0
.Ms
, t i _ _ 1 21o ~ ' 4!o 6.0
M e
Monotonic
...! |0~0
Figure 4. The stabl~ solutlon branches in a thermalxRayleigh number, maximum Nusselt number at z 0 plane for (a) ~ = I, T = i0 -~, R s = I04 and (b) 0 = I, T = I0-I, R s = I07/2. Where relevant both local maxima are shown and the rapidly oscillating curve indicates that no definite maximum can be assigned to the aperiodic motion in this range. The dots indicate the transitions that can take place between the oscillatory and monotonic branches.
general, as R T increases, a transition point is attained at which the solution
changes from being relatlvely simple to being fundamentally more complicated, yet
247
still periodic. At a yet larger value of ~ another transition takes place, a
transition to aperiodic motion. Finally, beyond a still larger value, stable
aperiodic solutions cease to exist and only solutions which are ultimately steady,
and make up the second branch of solutions, can he found. The two branches for
particular values of o, T and R S are graphed in Figure 4. Steady motion exists
in a thermosolutal fluid because of the tendency of the temperature fleld to cause
an almost isosolutal core to be produced, confining all solute gradients to thin
boundary layers. The temperature, salinity and density fields for a typical steady
solution are presented in figure 5. For given o, T and RS, there is a minimum
T S p
Figure 5. The temperature, salinity and density fields for R T = 10 4 , O = I and T = 10-2.
= 10700,
value of ~ for which steady motion exists and one of the aims of the investigation
by Huppert & Moore is to calculate this minimum. For details of this result and
others the reader is referred to the original paper. The major finding is that
for sufficiently small T, steady convection can occur for values of ~ less than
that obtained from the llnear stability boundary (3.14) (and thus much less than
the value at which linear theory suggests non-oscillatory convection occurs).
The specific results obtained by Huppert & Moore, primarily by numerical
computation, were limited to 14 different values of o, T and ~. Guided by these
calculations, M. R. E. Proctor and independently Huppert & Gough are currently
attempting to obtain analytic expressions for various limiting cases, in
particular, the astrophyslcally relevant situation T ÷ O.
4. LaTers and Interfaces
As the experiment of Huppert and Manins, described in the previous section,
progresses, the fingers and the interface between the two layers extend in length.
Within the interface there is a strong background gradient of density~ and the
interface is hence an ideal site for internal waves, which are generated by dis-
turbances induced by the salt-finger motion. These internal waves cause the
fingers to sway back and forth, like a banner fluttering in the breeze. If the
fingers become too long, this motion causes them to loose their vertical coherence,
248
or break, much like a long strut subjected to an oscillatory transverse load.
Guided by the Navier-Stokes equations of motion rather than the analogies used
above, Stern (1969) argues that for very small T an established field of salt
fingers is limited in length by the requirement that
8FsI ( ~ j < C, (4.1)
where F S is the salt flux through the interface, to be discussed below, d~/dz is
the mean temperature gradient and C is a constant of order unity. Thus for a
fixed salt flux, the length of the salt fingers and the thickness of the inter-
face increase until equality in the constraint (4.1) is reached.
At the two edges of the interface the salt fingers impart an unstable
buoyancy flux on the adjacent layers, which causes the layers to convect.
Developing an analogy with purely thermal convection at high Rayleigh number,
Turner (1967) argues on the basis of dimensional analysis that the relationship
between the saline Nusselt number and the Rayleigh number is of the form
Nu$ ~ F S D/(KSAS) = fF(c~T/~AS, ~,T)R~/3 , (4.2a~b)
where D cancels in (4.2h) and fF is some function of its three argments. Also,
argues Turner, the resultant heat flux, FT, is related to F S by
C~FT/~F S = gF(~AT/BAS, ~, T) . (4.3)
Turner obtained experimentally the explicit form of fF and gF for heat and salt
in water and found that for the range of edT/BAS considered, 2 < ~AT/~AS < I0, fF
is such that as 0~T/~S ~ I the salt flux is approximately 50 times as large as
if the same salinity difference were maintained across a region hounded by two
solid boundaries, and fF decreases slowly with increasing ~AT/~AS. The constancy
of gF indicates that~ independent of o~T/~AS, a constant fraction of the potential
energy released by the salt field is supplied to the temperature field. Linden
(1973) experimentally evaluated fF and gF using a different technique and deter-
mined the same fF but a different, yet still constant, gF" Which result is in
error is still not known.
If a uniform layer of hot, salty water is placed below a uniform layer of
relatively colder, fresher water, heat and salt are transferred upwards through
the thin interface primarily by diffusion, with the resulting unstable buoyancy
flux driving convection in the layers as before. For this case, known as the
diffusive situation, the relationships equivalent tO (4.2) and (4.3) are
Nu T E F T D/(KTAT) = fD(6AS/~AT, O, T)~/3 (4.4a,b)
and
6FS/a~ T = gD(6~S/~T , ~, T) (4.5)
for some functions fD and gD" Using the results of another series of experiments
by Turner (1965) with heat and salt in water , Huppert (1971) suggests that for
249
thi8 partlcular case
Nu T = 3.8 (SAS/cU~T) -2 1~/3 (4.6)
=/l 85 0 85(B S/ T) I < S S/ T < 2 (4.7)
8Fs/~F T ] tO.15 2 < 8AS/~AT. (4.8)
A deductive model of the diffusive interface has not as yet been obtained, though
a number of ad hoc arguments, some of them described by Turner (1974~ lead to
formulae in soma agreement with equation (4,8). A few experiments with two solutes,
rather than heat and salt t have been performed. For both salt-fingering and dif-
fusive cases, all the experiments indicate a constant value of the flux ratios,
(4.3) or (4.5), for a large range of BAS/o~T. The deduction explanation for this
fact is awaited and is one of the major theoretlcal prizes still to be gained. To
be more general, a major advance in the subject would be achieved on building a
mathematical model which predicts the heat and salt fluxes for all values of
and T.
Notwithstanding our current lack of knowledge, the important conceptual
statement that can already be made is that the above mechanisms can be extended to
include a series of convectlng layers, separated by fingering or diffusive inter-
faces, as the situation demands. This is the explanation of the profiles of Lake
Vanda, those obtained under the drifting Arctic Island T3 displayed in figure 6,
and of many other oceanographic examples. The main aim of this review is to
support the suggestion that a process which occurs so readily on earth must also
play a fundamental role in stellar convection.
5. The Buildin~ $f Layers
As suggested in the previous section, a series of convecting layers separated
by thin interfaces can be easily constructed in the laboratory by carefully placlng
one layer on top of another. They arise in natural situations by a large number of
different mechanisms. A few situations have received a fair amount of quantitative
analysis and will be described here.
Consider a fluid with a uniform salinity gradient increasing with depth
subjected to a constant heat Tlux, F H, at its base. Initially a growing overstable
oscillation occurs. Shortly thereafter a convectlng layer develops adjacent to the
bottom because hot fluid rising from the base can penetrate only a finite height
into the stable salinity gradient. As time proceeds the height of this layer, h,
grows according to h = (2Bt)~/Ns, (5.1)
2 dS where B ffi - ~gFH/(Pc) and N S ffi - gS"~ • (5.2)
The relationship (5.1) is a consequence of the conservation of heat and salt and
the experimentally observed fact that the density (but not the temperature or
salinity) is continuous across the top of the layer (Turner, 1968).
This growth does not, however, continue indeflnltely. There is a thermal
250
3 0 0
3 2 0
E
3 3 0 - -
t
340 1
Temperature (degrees Celsius) - - 1.0" , ,,O"
(o) Typicol remperoture Prof/le Sect/on
- - ( b J
Section o.f Profile Recorded ot High Gain
0,01" C
l< - - O . I 'C - 4
150
E --250
C~
f~ I
500
350
Figure 6. The temperature profile under the Arctic Ice Island T-3 (taken from Neal e_.t_tal, 1969).
boundary layer ahead of the advancing front and when a critical Rayleigh number,
Rc, is reached the region above the first layer ceases to grow. This can be
calculated to o c c u r when h = ---(~<~RcB31K)I/41N ~ . (5 .4)
The second layer then grows, the thermal boundary layer ahead of its advancing
front becomes unstable~ and so in time a series of layers is built up. Heat and
salt are transferred across the interfaces~ in the manner of the last section~ and
in the course of time some of the lower interfaces disappear because the density
difference across them tends to zero. A combined theoretical and experimental
investigation of the depths of all the layers and time scales for their formation
is currently being undertaken by Huppert & Linden (1977).
This heating a salinity gradient from below mechanism, but acting in reverse,
that is, cooling a sallnity gradient from above, produces the layers under T3
shown in figure 6.
The above mechanism involes an entirely one-dimensional model. Many natural
phenomena can be expected to be two - or even three-dimensional. As yet such
extensions are only in the early stages of investigation.
251
In a series of qualitative experiments, Turner &Chen (1974) show that even a
relatively small disturbance applied to one side of a thermsolutal fluid, thereby
introducing horizontal inhomogeneities, can have significant effects. For example,
the raising of a small flap at the wall of a vessel containing a thermosolutal
fluid induces, in the salt-flnger situation, a rapidly propagating wave motion
which is accompanied by the initiation of convection over large horizontal distances.
In the diffusive situation, the disturbance propagates horizontally more slowly and
can cause local overturning which leads to the initiation Of salt fingers.
Another example concerns the introduction of a small source of warm, salty
water into a uniform layer of relatively colder, fresher water of exactly the same
density. Fluld which conwnences to fall diffuses its heat to the surroundings,
thereby beco~ng heavier. Nelghbourlng fluld, having been warmed, is relatively
lighter and rises. Large vertical motions, both upwards and downwards, in the
form of plumes result. As the motion proceeds, the density difference between
each pl-me and the surroundings increases. Thus, starting only with fluid of
uniform density, solely by diffusion both heavier and lighter fluid are formed.
If the plumes impinge on horizontal boundaries, they spread out and build a series
of layers and interfaces through the entire vertical extent of the fluid.
A final example is afforded by introducing an insulated sloping boundary. In
a stably stratified fluid, stratified with respect to only one component, such a
sloping boundary induces a thin slow steady upwards motion adjacent to the boundary.
The flow provides a convective density flux equal to the diffusive flux in the
interior and allows the isopycnals, horizontal in the interior, to bend near the
boundary and intersect it at right angles. In a fluid stratified with respect to
two components, the curves of constant T and constant S must intersect the boundary
at right angles and no steady boundary layer flow can accomplish this. Alternatively,
it is not posslble for a single boundary-layer to give rise to convective T and S
fluxes which balance the unequal diffusive T and S fluxes in the interior. Instead,
a series of layers and interfaces from throughout the fluld, as shown in figure 7
to build that characteristic structure of a thermosolutal fluld.
Figure 7. A series of layers and interfaces set up in a laboratory tank by introducing a solid sloping boundary (from Linden and Weber, 1977)o
252
The size of layers and the tlme-scale of their initiation is dependent upon
the angle of the sloping boundary, or more generally their existence is due to its
presence. However, this specific example presents another illustration of the
basic point: initially small horizontal inhomogeneities in a stably stratified
fluid leads to large scale layering with vertical transports considerably in excess
of those calculated on a molecular basis.
6. The Destruction of LeT era
In the experiment of heating a salinity gradient from below, discussed in
section 4, the tendency of the lower layers to merge as the density difference
across them tends to zero was mentioned. Such merging is due to the continually
imposed flux of heat from the bottom. Layers can merge, or be destroyed by more
natural, internally imposed conditions, which will be described in this section.
Consider a three-layer system consisting of two semi-infinite layers of
uniform T and S between which there is a finite layer of intermediate properties.
All layers are assumed to be convecting~ with temperature and salinity fluxes
across the interfaces in accord with (4.2) and (4.3) or (4.4) and (4.5). There
is then a single-valued relationship between the temperature and.salinity in the
intermediate layer for which the flux through the lower interface equals that
through the upper. The conditions under which such equilibrium situations are
stable is partially answered by Huppert (1971). Assuming that merging takes place
without any vertlcal migration of the interfaces~ he shows that only if the
conditions across each interface are in the 'constant regime', gF or gD equals
constant, will the layer system persist. Otherwise one or other of the interfaces
will disappear and two semi-lnflnite layers separated by one interface remain.
The analysis can be extended to any number of intermediate layers to yield the
same result. Thus the prediction is that no stable system of diffusive layers of
hot salty water exist if BAS/~AT < 2. A controlled laboratory experiment to test
this prediction has yet to be performed, although Turner and Chen (1974) and
Linden (1976) have observed merging which they believe to be due to the above
mechanism. Turning to large scale measurements, we can at present report that no
series of layers has been observed under T3~ in the Red-Sea or elsewhere with
BASIaAT < 2.
Experiments by Linden (1976) indicate that another form of instability is
possible, whereby merging occurs by the vertical movement of one interface to
coalesce with its neighbour. A quantitative analysis of this situation has not yet
been performed. It would clearly be interesting to know which instability is
favoured under specified conditions because layer merging will need to be accounted
for in any future quantitative model building.
7. Su~aar~ and Conclusions
This review has attempted to bring out the following salient points. Fluids
253
Stratified with respect to two (or more) components can exhibit motions very
different to slngly-stratlfied fluids. Instabilities can arise even when the
overall density is statically (very) stable by drawing on the potential energy
Stored in one particular component. The typically observed signature of a thermo-
solutal fluid is a series of convecting layers separated by thin interfaces through
which properties are transported by either diffusion or the action of fingers.
This transport is very much larger than one based on consideration of purely
molecular diffusion across a quiescent region.
Large stars have a heated hellum-rich core surrounded by lighter hydrogen.
The Composition gradient in the core/envelope regions is thus of the diffusive
type and it would be expected that convection of this form predominates. Some
attempt has been made to incorporate this process in a semiconvection zone,
although quantitative calculations would benefit from a precise description of
physics in this zone.
The salt-flnger type of instability has been hypothesised to occur in the
outer layers of differentially rotating stars (Goldreich and Schubert~ 1967),
where the two components with different diffusivities are heat and angular
momentum. Turner (1974) appears to be of the opinion, however, that such an
effect would be obliterated by baroclinic instabillties which occur on a much
larger scale. This concluslon should be questioned in view of the large obser-
vational evidence for the existence of salt-fingers in the ocean, also subject to
baroclinlc instability.
We conclude by suggesting that what is achieved so easily in the laboratory
and the oceans might also be attained by the stars~
This survey benefited from a careful reading of a first draft of the manu-
script by Dr N. O. Weiss.
References
Baines, P. G. & Gill, A. E. 1969 On thermohaline convection with linear gradients, J. Fluid Mech. 37, 289-306
Goldrelch, P. & Schubert, G. 1967 Differential rotation in stars, Astrophys. J. 150, 571-587
Huppert, H. E. 1971 On the stability of a series of double-dlffusive layers, Deep-Sea Res. 18, 1005-1021
Huppert, H. E. & Turner, J. S. 1972 Double-diffuslve convection and its implications for the temperature and salinity structure of the ocean and Lake Vanda, J. Phys. Oceano~. 2, 456-461
Huppert, H. E. &Manins, P. C. 1973 Limiting conditions for salt-fingerlng at an interfacej Deep-Sea Res. 20, 315-323
254
Ruppert, R° E. & Moore, D. R. 1976 Nonlinear doubte-diffusive convection, ~. Fluid Mech. 78, 821-855
Huppert, H. E. & Linden, P. F. 197 ? On heating a salinity gradient from below, (work in progress)
Linden, P. F. 1973 On the structure of salt fingersp Deep-Se.a Res. 20, 325-340
Linden, P. F. 1976 The formation and destruction of fine-Structure by double- diffusive processes, Deep-Sea Res. 23, 895-908
Linden, P. F. & Weber, J. E. The formation of layers in a double-dlffusive system with a sloping boundary, J. Fluid Mech. (to appear)
Moore, D. W. a Spiegel, E. A. 1966 A thermally excited nonlinear oscillator, Astrophys. J. 143, 871-887
Neal, V. T., Neshyba, S. & Denner, W. 1969 Thermal stratification in the Arctic Ocean, Science, 166, 373-374
Shlrtcllffe, T. G. L. 1969 An experimental investigation of thermosolutal convection at marginal stability, J. Fluid Mech. 35, 677-688
Spiegel, E. A. 1971 Convection in stars. I. Basic Boussinesq convection, Ann. Rev. Astron. and Astrophys. 9, 323-352
......
Stern, M. E. 1960 The 'salt-fountaln' and thermohallne convection, Tellus, 12, 172-175
Stern, M. E. 1969 Collective instability of salt fingers, J. Fluid Mech. 35, 209-218
Straus, J. M. 1972 Finite amplitude doubly diffusive convection, J. Fluid Mech. 56, 353-374
Turner, J. S. 1965 The coupled turbulent transports of salt and heat across a sharp density interface, Inst. J. Heat Mass Transfer. 8, 759-767
Turner, J. S. 1967 Salt fingers across a density interface, Deep-Sea Res. 14, 599-611
Turner, J. S. 1968 The behaviour of a stable salinity gradient heated from below, J. Fluldldech. 33, 183-2OO
Turner, J. S. 1974 Double-diffusive phenomena, Ann. Rev. of Fluid Mech. 6, 37-56
Turner, J. S. & Chen, C. F. 1974 Two-dimensional effects in double-diffuslve convection, J. Fluid Mech. 63, 577-592
THE URCA CONVECTION
Giora 8haviv
Departement of Physics and Astronomy
Tel-Aviv University
Ramat Aviv, Israel
The possible role that B-decays may play in stellar collapse was first discussed
by Gamow and Schoenberg (1940, 194]). The above authors proposed this mechanism a
mean of extracting quickly the energy content of the star and transporting it out-
side. In this way they hypothesized that stellar collapse may proceed.
The URCA process is composed of B-decay and inverse B-decay scouring inside the
Star. Let (A, Z-i) be a B-decay unstable nucleus in vacuum (i.e. on earth), namely
(A, Z-l) + (A, Z) + e- + U (I)
where (A, Z) is a nucleus with Z protons and A nucleons (protons plus neutrons) and
9 is the emitted antineutrino. The energetics of the process is seen in Fig. I. Note
that AQ must include the rest mass energy of the electron. The transformation of a
neutron into a proton leads to a lower energy configuration. The transition can go
from the ground state of the nucleus (A, Z-l) to the ground state of the nucleus (A, Z)
(arrow ]) or from the ground state to an excited state (arrow 2) if such exists and
if the transition is allowed. Only under high temperatures of the order of about
MeV is the (A, Z-l) nucleus excited and then transitions from excited states are pos-
sible. There is no basic difference between ground state and excited states trans-
itions. The most important properties for convection are: The energy difference
between the two states is shared by the two emitted particles, the electron (inclu-
ding its rest mass) and the neutrino. The electron is usually more energetic than
the surrounding and hence is slowed down (quickly) and deposits its energy in the
surrounding. The neutrino has an extremely small cross-sectlon for interaction with
matter (about 10 -44 2 cm ) and hence escapes from the star. The energy carried by the
neutrino is a net loss to the star. The rate of the decay depends on the density
(we shall return to this question) but is usually of order of minutes and longer.
T AQ
± ( A
,Z-I
)
(A,Z
)
Figure I.
The energetics of the E-decay.
inve
rse
~-de
cay
Ep s
hell
Figure 2.
The density profile in the star
and the URCA shell.
m/M
tot
Pshe
ll
257
The matter in the star is practically fully ionized at the relevant densities
(above 106 gm/am 3) and the electrons are degenerate. The fermi energy, which is the
average energy needed to add an electron at thermal equilibrium, is a monotonic
function of the density (at constant temperature ~$ep I/3 at the relevant densities).
At a density of about 5 x I05 gm/cm 3 the fermi energy is about I/2 MeV, namely it
equals the rest mass energy of the electron. As the density continues to increase
a moment will come at which the degenerate electrons, which are pushed to higher and
higher energy levels will be energetic enough to cause the inverse process, namely :
(A,Z) + e- ÷ (A,Z-I) + v , (2)
Here ~ is the antineutrino which has the same nuclear properties as the neutrino
(cross section ~ I0 -~# emZ)and escapes from the star, in general, so long as the
densities are below nuclear ones. At P=Pshell when the inverse process (2) occurs we
get the URCA shell. The name URCA was given by Gamow and Schoenberg after the famous
Casino in South America where the gambler was bound to lose his money. The URCA shell
is quite narrow, i.e. the process occurs for Ap << Pshell" At densities p < p shell
the dominant process is B-decay while for p > p shell inverse S-decay is dominant.
The location of the URCA shell is given by the condition
2 ef = AQ - meC (3)
where AQ is the energy difference between the two nuclei.
Tsuruta and Cameron (1969) considered the effect of density variations on the rate of
URCA losses. The picture of Gamow and Schoenberg is static. As the star contracts
the location P=Pshell advances outward and gives rise to neutrino (and energy) losses.
In the case of Tsuruta and Cameron the same nucleus can oscillate (by means of general
stellar vibrations) around P=Pshell" When the nucleus (A,Z) moves to a higher density
it absorbs an electron and emits an anti-neutrino, when the product nucleus (A,Z-I)
moves to the low density, it finds that the phase space of the sea of electrons around
it is free and it B-decays releasing the electron and neutrino. The sum of the two
processes can therefore be written schematically as:
(A,Z)-~ (A,Z) + v+ v (4)
Paczynski (J972) considered the effect of convection on the URCA process. The funda-
mental problem and motivation was the difficulty in the theory of neutron star form-
ation. There are some statistical arguments (Gunn and Ostriker 197]) that hint at
the fact that stars in the range 4-10 ~should be the progenitors of pulsars. However,
when models of such stars are calculated it is found that due to the particular
behaviour of stars with nuclear shell burnings, the carbon ignites at high densities
109 - 1010 gm/cm 3 and low temperatures. The ignition of the 2C ÷ Mg reaction under
258
these conditions is dominated by the corrections to reaction rates due to the high
density and gives rise to detonation. Numerical calculations byvarious people have
shown that no remnant is left. On the other hand, if carbon ignition is not achle~ed
with a violent reaction rate, e.g. if a fast and efficient cooling mechanism would be
available~ then the collapse could be delayed to still higher density and the outcome
of the explosion is a neutron star. The fast and concentrated energy production by
the carbon gives rise to convection. Paczynski considered the effect this convection
has on the URCA pairs. The estimates of Paczynski, based on mixing lengh theory
and some properties of stellar convective cores~ yielded the following result
L ~ T 170 (5) c
where L is the neutrino luminosity and T the central temperature. c
This very high temperature sensitivity follows from straightforward application of
the expressions derived by Tsuruta and Cameron (|970) for stellar vibrations to
convective cores. The oscillations considered are very fast compared to the typical
~-decay times and hence the nuclei will be out of equilibrium (and not in equilibrium
as assumed by Tsuruta and Cameron (]970) and Paczynski (1973)). Next, one has to
conserve the number of decaying nuclei. Suppose a convective core inside which an
URCA shell exis~ is given and a nuel~s inverse-8 decays far away from the URCA
shell on the high density side. This nucleus Cannot (practically) B decay before
it crosses the URCA shell to the low density side. Hence the energy loss must be
evaluated first per cycle and not immediately per unit time.
The extreme sensitivity of the neutrino lossesto temperature (much more than the
nuclear reactions energy production) led Paczynski to the conclusion that the UKCA
neutrinos can cool the star sufficiently fast and control earbon burning. Consequently
he assumed stable carbon burning which delayed the collapse of the star and
yielded the desired conditions for the formation of pulsars~aczynski 1973).
A new development came when Bruenn (1973) showed that an accurate calculation shows
that the final outcome of the URCA pair may be heating and not cooling. Consider
first the B-decay. Assume that convection turn-over time scale is short compared
to 8-decay rate (a good assuption). The B-unstable nucleus will therefore be
carried by the convection way past the URCA shell to regions of low density. The
escaping neutrino causes of oourse an energy loss but the emitted electron may
(since the electrons are emitted with a certain spectrum of energies) have energy
well above the average energy of the electrons in the medium (~f at that place). The
fast electron is slowed down and transfers its extra energy to the medium i.e. it
heats the medium.
The same situation occurs when the (A,Z) nucleus is transferred by the convection
to the high density region. Again~ the inverse B-decay process is slow compared to
conveetion velocities and the inverse process may oecur at densities for which ef
259
is much greater than the energy difference between the nuclei. Since the energy diffe-
rence is fixed, so will be the energy of the absorbed electron. Under these conditions
the absorbed electron will come from deep in the sea of electrons. Thus a "holeUis
created below the fermi level. The subsequent thermalization of the distribution, name-
ly the relaxation to a new thermodynamic equilibrium with a smaller number of electrons
will convert some of the energy difference~ef - Eth > 0 (the difference between the
average energy and the energy of the absorbed electron) into thermal energy of the whole
sea of electrons. Said in other words, a high energy electron may jump into the hole
and give its extra energy to the rest. This particular behaviour of 8 decays is well
known to nuclear physicists, namely if you put a G unstable nucleus in a container the
radioactive decay heats the medium in spite of the fact that the neutrino escapes.
The basic argument raised by Bruenn can be demonstrated in the following way. Let
E be the internal energy of the matter in which the species Ni 8 decays. We have from
the first law :
DE DE DE dE = (-~-) dV + (-~--) aT + E (-~) DN.
T'Ni V'Ni i l V,T,N i
= - <E > - pdV v (6)
where the heat lost from the unit mass considered is replaced by <E > the average
energy of the emitted neutrinos. Bruenn assumes the process to occur at constant
volume. This is not the case in stars. The fast pressure equilibrium will give
rise to compression in the case of electron capture (there are fewer particles) and
to expansion in electron emission. In spite of thls neglec?, the qualitative result
is correct. The change in temperature due to the electron capture is therefore given
hy
f ~E DE = L - - { + (
DE ~E + ( - 7 ) ] / ( ~ )
e V,T V,Ni (7)
The sum of the second and third terms on the right hand side
DE 3E ("ST') = (~-Tb"-') = Ef + %c 2 (8)
e V,T e V,S
is exactly AQ. Since we consider only very degenerate matter for which kT <<El, we can
approximate the thermodynamic derivatives with those evaluated at T=O. Hence one gets:
260
2 where m c
e is the rest mass energy of the electron. One finally gets:
~E ~E dT = ( - <e~> + Ae ) / (-~--) a"T" > 0 (9)
V,N i
Where Ae = ef - eth is the difference between the electron fermi energy and the 2
electron capture threshold and eth = -meC " AQ. Cooling will occur only if
<eg> > Ae. The calculations by Bruenn have shown that for T < Tneut (0) the e-capture
will result in heating and vice versa. The effect occurs at p ~ 109 gm cm -3, for T ~ 10 ~ °K, namely higher than the carbon ignition temperatures and hence Bruenn
concluded that the URCA process cannot stabilize the carbon burning and the story
was back the beginning. Figure 3, taken from Regev (1975) demonstrates the basic
result. Close to the URCA shell the electrons have very little extra energy and the
cooling dominates, however, outside a very narrow strip A0 << Pshell heating domina-
tes and if a convective core extends over a sufficiently large density gradient the
total heating may overcome the total cooling and convection may have the opposite
effect: heating instead of cooling.
The pendulum swung in the opposite direction after Couch and Arnett (1974) intro-
duced the idea of a cycle. Consider a given mass element moved up and down by the
convective currents. The energy balance at the high density side is
A E ec = ec - AQ E ee (10)
and at the low density:
aFi = aQ - E -~ (11)
Here ~ = ef + met2 is the chemical potential of the medium. The index ec denotes
that it must be evaluated at the point at which the e-capture takes place and vice
versa, ~- is the chemir.al potential at the place of e-emlssion. Consider now the
full cycle. The mass unit starts at the high density side by absorbing an electron.
It then moves upward where it decays as soon as the density falls. We find that
the downward moving mass unit has one extra electron compared to the upward moving
mass-unit. Consequently there is a net transfer of electrons downward. (The upward
moving electrons are "hidden" in the form of a neutron.) Couch and Arnett add therefore
the two energy equations to get:
ee - ec AE ec + AE = ~ - ~ - (e +av ) (12)
Theynow reason that the difference between the fermi energies of the two locations
must be invested in maintaining the convective flow since the electron must be
brought back to the original place. Note that the electrons move downward and hence
261
u
O~
%
O .
1.6
1,4
1.2
1.0
Heating
Cooling
Heat ing
I i I
2 3 ~; 5 6 T / 1 0 8 "K
Figure 3.
The regions in the O T plane for which heating occurs. The calculation is carried
for the Mg 25 - Na 25 URCA pair with uniform abundance (X = I0-4"I). The broken line 2
indicates the place where AQ =m e c - the URCA shell.
262
release potential energy rather than absorb. Moreover, in hydrostatic equilibrium
the total chemical potential must he constant, hence 9 mp ~ const or d~ mp
where # is the gravitational potential and m the mass of the proton. The gravita- P
tional potential is determined by the C and 0 nuclei and not by the electrons and
much less so by the URCA pair.
The fact that the change in chemical potential of the electrons must be equal to the
change in gravitational potential due to the nuclei gives rise to the high degeneracy
of the electrons. The difference ec _ P is therefore not equal to the difference in
potential energy of the electrons m e A~ but to the difference in potential energy Of
the nuclei - and this in turn is not so relevant (see later).
Couch and Arnett argue that the convective blob (of given mass) contracts upon
e-capture (pressure equilibrium - just the term ignored by Bruenn) and becomes denser
than the surroundinglwhile the downward moving blob emits an electron and becomes
lighter than the surrounding&. Thus the convective flow has to carry denser matter
upward and lighter downward. This difference costs the extra energy that appears
in the term d~ = ec - ~ . A similar argument to the previous one shows that this
is not the case. Consequently, the final result of Bruenn remains valid.
Regev and Shaviv (|975) considered the question of convective stability of the URCA
process. If heating is important,then convection may start before the Schwarzschild
criterion is violated because a small perturbation in the bubble may heat it. It was
found that convection may start earlier than assumed before (according to the Schwarz-
sehild criterion). However, the rise times are quite long and it is impossible to give
a final answer as to what will happen without a detailed stellar evolution calculation.
The analysis was a local one.
Lazareff (]975) considered the details of the convection process with URCA heating
and concluded that no stationary convective core can exist. This conclusion is
correct but for completely different reasons. Lazareff assumed mixing-length theory
in which only rising bubbles exist and considered the detai~ of this motion. He
assumed that during this motion the URCA process releases heat and concluded that if
this process is integrated over the whole convective zone the total entropy will
increase in time and hence no stationary state exists. The error in this kind of
treatment is readily seen in the case of no URCA pair. One finds that the entropy
density continues to increase even in the case of normal stellar convection. The
problem appears because (a) the downward bubbles are not included,
(h) the work done hy the buoyancy force (absorbed by the
bubble but lost by the surrounding) is not accounted for and
(c) the mixing-length theory discusses the perturbed quantities
and not the actual quantities and hence integral theorems are unavailable.
ShaviV and Regev (1976) proceeded in two steps. First the motion of a single bubble
was analyzed and then the global properties of the convective zone were discussed.
26,3
The change in temperature of a blob in surroundings
by (Regev (1975)) :
aT I aT ( r , g ) = v ( r , ~ ) ~ v ( r , ~ ) d--f
ad
containing an URCA pair is given
, T , X ) (13)
C P
when ~ is the place of formation, v the velocity, X the composition at time t.
Quantities with asterisk denote the values inside the blob. C is the specific heat P
at constant pressure. The composition of the blob changes in time according to
dX I ~ ( r , ~ ) • X v(r,E) dr = - X I X I (r,~) + ~2 (X - X I (r,g)) (14)
where X; ~ is the mass-fraction of the first species of the URCA pair and X is the
total mass-fractlon of the pair.
The equations of motion are then integrated in order to find the motion. A typical
result is shown in figure 4. We find that (a) for most cases of rising blobs the URCA
heating "helps" the buoyancy forces and the velocity is increased. (b) downward moving
blobs are somewhat disturbed by the heating (c) a subadiabatic gradient may lead to
unstable upward-~noving blobs (in agreement with the local stability analysis of Regev
and Shaviv (]975)) but prevents blobs moving downward.The most important result is:
(d) the URCA losses have a negligible effect on the motion of the rising bloh.Actually,
as the blob starts to move, its velocity is small and the effect of the URCA on acce-
leration very large, but as soon as the velocity becomes large, the time scale becomes
too short to have any effect on the motion. We find therefore that the URCA losses are
the result of spreading the URCA isotopes uniformly over the whole convective core.
Consider now a convective core as a zone with q~(r) losses and q(r) heating per unit
mass and time. Clearly, close to the URCA shell~q~ is greater than the heating but far
away q(r) l which includes the nuclear and the URCA heating;is the dominant factor.
The two contributions have different spatial behaviour.
The energy equation is :
P d-tdE + PV.~ = pq - pq~ + pqf - V.F (15)
where F is the radiative flux through radius r and qf the rate of heat generated by
dissipation. The equation of motion is given by
d~ Vp - ~ - pV~ (16) 0 dt
where f is the frictional force per unit volume and ~ the gravitational potential.
The integration over the whole convective zone down tb the place where ~=o yieldspafter
264
u
E u
>
0 M
7
6
3 w
5
iIi
×I
! I I I ........ I i
6 7 8 9 40 44 42
r/40 "4 R e
-6
-8
X
O
-10
" -12
Figure 4.
The velocity of a blob in the case of Na 23 URCA pair. Line ] is log v
for an adiabatle blob and line 2 is log v for a blob with the URCA pair.
Line a is the equilibrium abundance of Na 23 in the surrounding medium,
This criterion holds approximately in the limit of zero viscosity and with the omis-
sion of radiative conduction terms as indicated. The dimensionless quantity ~2j
which arises in radiative cooling problems (Unno and Spiegel 1966), should be cho-
sen so as to minimize the right hand side of (3.12). The resulting value of ~ is
then inserted to give the local stability criterion. Of course, if we are led to
extreme values of ~ we should worry about the possible violation of physical con-
straints that have been removed in this simplified analysis. (I~nextremis, we could
Just solve the problem properly.)
To make the appropriate choice we observe that for y > 4/3 the right side
of (3.12) increases as ~ decreases. In that case, the instability criterion is
obtained with the largest possible values of ~, hence with modes of large horizon-
tel scale in the length unit [3p2K(K+O) ]-1/2. In stellar interlorsmost scales of
interest satisfy this condition and the conventional criterion would apply. In
transparent regions, however, it may be that geometrical constraints intervene and
large ~ cannot be achieved. In that case, the maxlmumvalues allowed for
should be taken, and here we should note that once ~ exceeds unity there is not a
large difference from the results at very large ~,
In cases where 7 < ~ the situation is reversed and the right hand side of 3'
(3.12) decreases as ~ decreases. The preferred value of ~ is now the smallest
one possible; that is we want the largest allowed value of k. If the particle mean
free path is much less than the photon mean free path we can choose small ~ with-
out worrying about the breakdown of fluid dynamics, But we do have to make sure
that we don't choose a k which is so large that diffusive effects wipe out the in-
stability. In fact this amounts tO finding the preferred mode in the usual way, but
here the choice determines not just diffusive corrections to the critical gradient,
but also the effective adiabatic gradient itself. Unfortunately, there is a compli-
cation that arises in this situation.
275
& The case y <~ will normally occur in ionization zo-~es and therefore has to
be treated with some care. In fact, Underhill (1949b) ~as evaluated the "adiabatic"
temperature gradient with partial ionization and in the presence of an important
radiation field. But that calculation was what I have been calling the conventional
criterion. That is, she has applied the condition of zero total (matter plus radia-
tion) entropy gradient which corresponds to the marginal stability conditions with
=m. Howeverj the possibility exists that finite values of ~ may be more correct
since the zones of partial ~onlzatlon tend to occur in stellar envelopes. We could
then ha~ a somewhat increased convective instability but the modes involved, being
radlatively leaky, might not carry heat effectively. It appears therefore that for
most purposes the standard convection criterion is good. However, it would be more
comfortable to have a detailed treatment of this problem, and I predict that there
soon will be one.
IV, PHOTON BUBBLES
In thinking about ordinary stellar convection we maybe guided by solar ob-
servations, but we have not such direct experience to guide us in photoconvection.
Instead, we may appeal to observations of a laboratory flow that is analogous to
photoconvection. We have already seen that the radiation in this problembehaves
(in the Eddington approximation) like a fluid flowing through a deformable porous
medium. This closely resembles the situation in a fluldlzed bad (Thorns 1973,
Prendergast and Spiegel 1973). Though the analogy is not a perfect one (Spiegel
1976), it can be used to suggest the qualitative nature of nonlinear photoconvection.
And one of the most striking implications of this analogy is that instead of convec-
tive thermals having relatively low densities, we should expect real bubbles
in photoconvectlon. These are filled with radiation and contain virtually no
matter. How this modification of the normal convective process may influence the
heat flux can only be crudely estimated (Thorns 1973), but there are also other
features of convection which are strongly affected. In partlcular~ bubbles feel
the full effect of gravity rather than the reduced gravity of ordinary eonvection~
hence large (that is, sonic) convective speeds may be anticipated.
In this section, I shall sketch an approach to the treatment of photon bubbles
borrowing heavily from the literature on fluidized beds (Jackson 1970, Rowe 1971).
In comparison to fluidlzation, this theory suffers from the disedvantage that we
have not yet seen a photon bubble. However, John Lin at Columbia is looking serious-
ly at the prospects for removing this drawback experimentally.
We wish then to study a photon bubble of radius r ° rising at speed V. We
shall assume that r O << H,~ where H, ~ R T/E~, and that the bubble may be taken to be
quasl-steady when described in its own reference frame. We may nevertheless intro-
duce the dynamical time scale r /V. Let us assume that this time is much shorter o
276
than the thermal time of e region of size r ° and then presume from this that
there is validity in neglecting thermal effects. Then we may tentatively set K = 0
in the basic equations.
Now let E and F be representative values for the ambient radiant energy
density and flux and let p, be a representative ambient matter density. The fol-
lowing dimensionless parameters are of interest:
V EV = V 2 (4.1) c =-f,~ =T,f2 .r,= p,cr o.
gr o
We assume that ~ << 1 and, from the analogy to fluidizatlon, we anticipate that
f2 is of order unity. If E and F may he estimated from their static values
(see (1.11)), we have E/F ~ e/T, for T >> i, where T is the optical depth. Hence
~ ~T. Then, when the bubble is only a few radii below the surface, T, ~ T and
we have that 8 ~ ET,, which is the regime we shall study here. A further restric-
tion to be used in the following analysis is ~ << I, but l shall mention at the
end what may happen at larger depths when 8 becomes of order unity.
If we nondimenslonalize the basic equations and make use of the foregoing
approximations, we obtain a greatly reduced set of equations. In dimensionful form
these are
(4.2) P ~t + u.Vu) - -Vp-gp~ + c F
(4.3) ~-E + V'(p~) = 0 ~t
(p/p~f) + u . V ( p / p "f) - o (4.4) 3-~
( 4 . s ) v.~" = o
(4.63 3 VE - ~c ~"
This description is about as primitive as it can be while still involving the ele-
ments of photohydrodynamice. Let us now seek approximate solutions for a bubble
rising at constant speed V. We presume that the medium is unstable, which is
true if = < 0.2.
Suppose, in first approximation, that the bubble is a spherical hole of
radius r • If the bubble does not greatly disturb the ambient density, we see that o
equations (4.5) and (4.6) are simply the transfer equations for a static medium with
a hole in it. This results because in t h e present approximation the radiation field
adjusts quickly to the state of the medium (~ << i); also the motion is so slow
that the difference between ~ and ~ may be neglected (~ << i)° We have also
assumed that the bubble radius is much less than the local scale height~ hence O
in (4.6) is approximately constant outside the hole. Equations (4.5) and (4..6) may
then he solved separately with D = 0 inside the spherical cavity.
277
Let us introduce a spherical coordinate system with origin at the center of
the hole and with % = 0 at the top of the hole. Far from the hole, the flux is
Fo~, where F ° is a constant given by the static solution and we have the condition
3p~F o ^ ( 4 . 7 ) VE ÷ - - - z a s r ÷ ~ .
c
M o r e o v e r , E and t h e component of ~ n o r m a l t o t h e b u b b l e s u r f a c e a r e c o n t i n u o u s
across the surface. Since (4.6) implies that E is constant inside thehole, we
have the boundary condition
0 1 1 r = r ( 4 . 8 ) E = E ° o
where E is the constant value of E inside the hole. Now (4.5) and (4.6) show o
that E is a harmonic function and, with conditions (4.7) and (4.8), we find
2 r o
oro - c o s e . (4.9) E - E ° - c o r
Since cE/(3p~) is a potentlal for ~, we have
(4.101 ~ ffi V-{Fo o r': [__.r. (~]2]coa e}. o
Alternatively, we can express F in terms o f a Stokes stream function:
1___!__ - 3T Fe i ~ (4.11) Fr = r2sin 8 ~8 ' = r sin 8 ~r '
w h e r e
( 4 . 1 2 ) r ~=_~1 For2[1 + 2(~)3]sln2e
The flux co, isis of ~e original unifo~ part plus a d~ole generated by the hole,
a reset familiar from analogue problems ~, for example, electroata~cs. ~e
radiative flow is shown in the neighborhood of ~e bubble in ~e figure on the
following page. Inside the bubble, the fl~ is 3Fo, if equation (4.6) ~y be wed.
~is last point is a ~llcate one as we have used ~e Eddin~on ~proxi~tion
for the transfer ~eo~. However, this approximation holds if the radiation is iso-
tropic ~d, when pot ° >> i, it probably is. ~e reason is that for T, >> 1 indi-
~dual photons will scatter off the b~ble surface (actually a l~er of ~ickness
(p~)-l) many times before escaping, hence the r~iation field inside ~e b~ble
should be reasonably isotroplc.
~e deformation in the radiation field produces an additional for~ on ~e
~tter. ~e total exte~al force density is 3
-gPz + 0~ F" -g*Pz " c ~pV cos 8
278
/
J
e %.. .'2
Streamlines for the radiative flux around a hole, according to (4.12). The solid circle ehc~s the original hole. ~ The dotted curve indicates the estimated deforma- tion of the hole obtained by setting h = 0 in (4.22) with ~ = 0 and V chosen as in (4.24).
279
(4.13)
v-V(p/p 7) 0, (4,14) ÷ =
(4.1s) v.(~) = o,
where
roF r 3,
and
where g* " g-gR and gR ffi Fo/C" The additional dipole force produces a fluid
circulation which causes the hole to rise and, in general, to deform. Let us study
these effects.
If we work in the bubble's frame and assume a stationary situation we have
the equations
v'Vv ~ -Vh-g,~-V~ ,
I have not changed notation to indicate the coordinate transformation except to
call v = u - V z the new velocity. The correction to ~ due to the motion of
the bubble is of order 6 and is neglected.
First we shall determine V on the assumption that the bubhle remains
spherical. This we do with the approximation p = const, whence
(4.1s) v-~ = 0.
We may therefore take ~ to be the incompressible flow around a spherical ob-
stacle. Such a flow has a vanishing normal component on the bubble boundary and
-V ~ as r -~ ~. Solutlons of this problem are well known and if it approaches
we also set
(4.19)
we find
V x v = O ,
r 3
(4.2O) ~ffi-VV[r cos 811 +~r31 ].
Moreover, because of (4.19) we may rewrite (4.13) as
(4.21) V [h+#+g, zq~2/2 ] = 0;
hence for r ~ ro,
280
(4.22)
h = gro{l-[~ + (l-s) ] cos 8}
- ~V {i+ (1-3 cos 2 @) +~ (1+3 cos 2 @)},
where ~ = g,/g and an arbitrary constant has been chosen so that
On r = r we have o
9 2 (4.23) h = gro[l- cos 8 -~f (l-cos 2 8)].
h(ro,O,O) = O.
For p = const, h = p. Alternatively, the choice p/p7 = const, which satisfies
(4.14), gives h ~ p(7-1)/y. In either case we would like to have h = 0 on r = r o since p = 0 inside the bubble (and E is continuous across the interface). But
(4.23) shows this to be impossible with the present approximate treatment. However,
we do have the freedom to choose f2 to match the pressure boundary conditions as
well as possible. In fluidization theory the procedure used by Davies and Taylor
(1950) for ordinary bubbles is usually adopted. In the present instance this comes
down to setting ~2h/~@2 = 0 at r = r o, 8 = O, whence f2 = 4/g (see also
Batchelor 1967). The argument for this is that h and ~h/~8 are already zero at
r = to, 8 = 0, and we would like to extend the region where h is very small as
far as possible. Let us adopt this choice; (Any other choice of this type would
also give a value of f of order unity. For example we might mlni~zethe inte-
gral of h 2 over the surface r = r .) Thus we have an estimate of the speed of o
rise of the bubble which can also be used to see the magnitude of the distortion of
the spherical hole by the dipole force. For the latter purpose we may simply com-
pute the surface on which h = 0 with
= 2 (4.24) v ~(gro)l/2.
For ~ = 0 this surface is the dotted line indicated in the Figure above. The
distortion of the hole is caused by the need to balance the fluld-dynamical pressure
~2/2 and it represents a problem which is also encountered in the theory of ordi-
nary gas bubbles in liquids (Moore 1959). As long as appreciable speeds occur next
to the bubble this difficulty arises. In an actual fluidization bubble the problem
is resolved by the formation of an indentation at the rear of the bubble. The in-
dentation fills with particles which effectively move with the bubble. This feature
has to be built into the theory in a self-consistent way.
With the present estimates a second problem arises, namely that for r >> r o and @ > 0 we encounter a region of negative h when ~ > 0. This difficulty does
not arise in fluldization theory since that subject is confined to ~ = 0. We
therefore have no experimental guide to the meaning of this result. There are some
speculations that might be offered here but perhaps the message is simply that bub-
bles only occur when ~ is very close to zero.
281
Now it is evident that the foregoing discussion does not really provide an
acceptable theory. It might be different if photon bubbles were an observed phen-
omenon that we were trying to understand qualitatively. But the real question is
whether photon bubbles actually exist and the answer will almost surely have to be
given experimentally. In spite of these worries, I would llke to close this
theoretical discussion of bubbles with one further qualitative remark about what
may happen at very large optical depths.
The total radiant energy density includes the usual energy density plus the
pressure, hence it is ~E. The energy flux divided by this energy density gives
a speed to be associated with the radiant fluid.
F
3 When ~ exceeds some critical value %T , V exceeds v R, and the bubble is
moving faster than the radiative fluid. In that case, the radiation does not ad-
Just quickly to the matter. Rather, we may expect the radiation associated with
the bubble to be swept along with the bubble, much as in the corresponding case
Of fluldlzatlon where one sees a trapped cloud of fluid circulating in and around
the bubble. When this occurs, I expect that photoconvective transport should be-
come very efficient. The optical depth at which this occurs is given approximately
by
T ~ lO 24/9 I~eff] 4/~ • [i--~5 j
V. CONCtUSION
The main questions considered here have to do with the nature of photoconvec-
tlon and the conclusion which is tentatively adQpted is that the two-fluid nature of
the process may make for some qualitative differences from basic Boussinesq convec-
tion. I have tried to sketch how photon bubbles may behave in a~alogy with fluidi-
zation bubbles. The analysis is sufficiently simple that one can easily see what
is going on, but there is one point about the results that I want to emphasize.
The bubble is not simply held open by an excess of radiation pressure inside it.
The radiative force is vital to the process and this is proportional to the flux.
The figure in ¶IV is helpful in seeing how this works: flux converges onto the
bubble from below and diverges upward from the bubble. This forces the fluid
flowing by the bubble to go around it which in turn causes the hole which produced
the flux pattern in the first place. This seems to be a dynamically consistent
situation, whether or not the equations have been completely solved. Whether the
thermodynamics of radiation interacting with matter (and which has not been dis-
cussed at all properly here) can spell the picture, seams difficult to decide, and
that is, to me, the biggest question to be faced at present. But i£ we put doubts
282
aside for now we may imagine some astroDhysicall7 interesting aspects of photon
bubbles.
The generation of large amplitude, complicated velocity fields in hot
stellar atmosphere is one of these. Another is suggested by the rapid separation
of particles of differing properties in bubbling, fluidlzed beds. In this process,
called elutriation, particles of relatively large drag are carried up through the
bed by the bubbles. Similarly we can imagine that particles with large scattering
cross section may be carried swiftly through stellar material By photon bubbles.
Moreover, there are some interesting consequences involved when bubbles collapse
near a stellar surface. The heatlngmay cause hot bursts of radiation (as
J. Pringle has suggested) or radiation of acoustic and shock noise. Also, non-
spherical collapse could squirt matter off the stellar surface at high speed, as a
preliminary computation by J. Theys confirmS.
But these are presently speculative topics and more immediate aimS should
also command attention in this subject.
We need a more complete stability theory, a study of flnlte-amplltude
stability, and some numerical simulation. In this respect, we should be aware of
related work on high density plasmas (Estabrook, Valeo, and Kruer, 1975), though
much of what I have said here leaves out plasma kinetic effects and assumes rela-
tively low density s such as is encountered in stars.
I should like to conclude by acknowledging my indebtedness to the many
people whose remarks have influenced aspects of the presentation and to list Just
a few of them: S. Childress, L.B. Lucy, K.H. Prendergast, and J.C. Theys. I am
~rateful to G. Baran for running his contour routine. And flnslly, I thank the
National Science Foundation for supporting the work reported here under Grant
NSF PHY-7505660.
REFERENCES
Batchelor, G.K. 1967, An Intrqductign t O Fluid Dyqamlcs, Cambridge Univ. Press, p. 475
Berthomleu, J. Provost, A., and Rocca, A. 1976, Astron. and Astrophye., 47, 413 Chandrasekhar, S. 1939, "An Introduction to Stellar Structure " Davies, R.M. and Taylor, G.I. 1950, Prec. Roy. Soc. Lon. A, 200, 375 Estabrcok, K.G., Valeo, E.J. and Knuer, W.L. 1975, Phys. Fluids, 18, 1151 Hearn, A.G. 1972, As tron. and Astrophys., 19, 417 Ream, A.G. 1973, Astren. and Astrophys., 23, 97 Hsleh, S.-H. and Spiegel, E.A. 1976, Ap. J~., 207, 244 Huang, S.-S. and Struve, O. 1960, Stellar Atmospheres, J.L. Greenstein, ed.,
Univ. of Chicago Press, p. 300 Jackson, R. 1971, Fluidization, J.F. Davidson and D. Harrison, eds., Academic Press,
p. 65 Joss, P.C., Salpeter, E.g., and Ostrlber, J.P. 1973, Ap, J., 191, 429 Moore, D.W. 1959, J.F.M., ~, 113 Prendergast, K.H. and Spiegel, E.A. 1973, Co mmentsA p. Space Phys,, ~, 43
283
Reimers, M.D. 1976, Physique des Mouvements dans. les Atmospheres Stellalres, R. Cayrel and M. Steinberg, eds., C.N.R.S., p. 42]
Rowe, P.N. 1976, Fluidization, J.F. Davidson and D. Harrison, eds., Academic Press, p. 121
Simon, R. 1963~ 3. QuaBt ~ Spectr0sc. Radiat. Transfer, ~, 1 Spiegel, E.A. 1976, Phys~iquedes Mouvements dans les Atmospheres .Stellaires,
R. Cayrel and M. Steinberg, eds., C.N.R.S., p. 19 Thorns, V.A. 1973, Report RC59, Dept. of Computer Science, Univ. of Reading Wegener, p.p. and Parlange, J.-Y. 1973, Ann. Rev. Fluid Mech. , ~, 79 Wentzel, D.G. 1970, Ap. J., 160, 373
CONVECTION IN THE HELIUM FLASH
A. J. Wickett
Department of Astrophysics and Nuclear Physics Laboratory,
University of Oxford
Abstract
The evolution of a star through the helium flash depends upon uncertain aspects
of convection theory. Observations place some constraints on the theory of convection
in stellar cores.
Evolution of Stars in the Mass ranse 0.6 M@ to i~ M@
When stars finish burning hydrogen to helium in the core, the burning region
moves outwards to form a shell source. Matter moves inwards through the burning region
into the core which becomes progressively hotter and denser.
As the density of the core increases, the electron gas becomes degenerate. The
hydrogen outside the hydrogen-burning shell forms a diffuse convective envelope of
low density and great spatial extent (radius ~ 5xlO10m). The star is a red giant.
Figure 1 shows the structure just before the helium flash.
The site of the Helium Flash
In the eorejthermal conduction by degenerate electrons is efficient and there
are no heat sources before helium burning starts, so to a first approximation the core
is isothermal. However heat is lost from the densest part of the core by the plasmon
neutrino process
Y plasmon ~ ~ + 9"
A slight temperature inversion therefore appears, which causes helium burning to ignite
some distance (- D.1 M@ ) from the centre. The peak temperature T m " 108K.
285
9.0
8,0
7.0 log p
6.0
(kg m-3
5.0
~1.0
3,0
2.0
1.O
0.0
-1.O
-2.0
-3.0
-4.0
6.0
7.o
log T
6.O
5.O
4.O
m
m
m
m
m
m
m
I .... i i i I ! | ....
- j
, , , I I ...... I I I I I 0.i 0.2 0.3 0.4 0.5 M 0.6 0.7
Me Figure !. Temperature and density profiles for star of 0.75 M@ and zero-age composition X = 0.7, Z = 0.0004 at start of helium flash
286
The rate of the reaction
34He ~ 12 C + Y
(the 3~ reaction) is proportional to about the 40th power of the temperature. Since the
degenerate electron gas can take up much heat without a large pressure increase, a
thermal runaway can occur.
Quasistatic evolution through the Helium Flash
The energy generation rate in the hellnm-burning shell is great enough to require
a convective zone outside it to transport the heat away. The peak temperature rises
a~d the convective zone extends; the rest of the star adjusts its structure adiabat-
ically. When the electron degeneracy is lifted in the burning region~expansion takes
over and the peak temperature falls. Figure 2 shows how the structure of the centre of
the core changes through the flash.
Calculations with an accurate equation of state (Thomas 1970, Demarque and Mengel
1971, Zimmermann 1970, Wickett 1976) show that if, in the convection zone, the temper-
ature gradient is adiabatic or slightly superadiabatic as prescribed by the standard
mixing-length theory and if the convective zone does not reach further towards the
centre than the initial helium ignition zone, the burning timesoale at the peak of the
flash is large compared with the convective timescale. Thus the traditional formulation
of convection theory leads to a quasistatic helium flash.
Convective Uncertainties
Edwards (1975) shows however that the traditional convective stability criterion
3tar < adiabatic ~> unstable
is modified in the presence of a very temperature-dependent energy generation rate.
Even a positive gradient (temperature rising with radial coordinate) is unstable if,
as is the case here, the reaction rats depends sufficiently strongly on the temperature.
It is not clear exactly how this oonclusion affects the evolution; however a calc-
ulation was performed in which the convective zone was extended to the centre and the
temperature gradient was the adiabatic one. The formulation is more fully described in
the author's thesis (1976). At the peak of the flash, the burning time
8,.2
log
T
8.1
8.o
7.9
7.8
7.7
7.6
6.5
Figure 2.
...... I .
....
....
....
....
..
t ......
.....
I i
~odel
.....
/Will
'
I .
..
.
I I
7.0
7.5
8.0
log~
(kg ~3) 8.5
Temperature-density plots for core of star of figure i
during helium flash
r
9.o
288
T T n=~
for constant density and no heat flow at the hottest point was equal to the convective
timescale
i c
T C U " C
U and 1 are the convective speed and mixing length evaluated by the usual theory. o c
The time for thermal runaway is considerably less than T n as the burning rate rises
sharply with temperature. So convection cannot keep up with the rate of increase of
heat production. Thus if this picture of the evolution is correct a thermal runaway
and explosion must occur.
D~namicCalculation
A hydrodynamic code including gravity and nuclear reactions for a spherically
symmetric model was used to follow the explosion.
A spherical detonation wave with peak temperatures ~ 2.5 x 109K and propagation
speed l0 times the sound speed in front and twice that behind passes through the core.
Half of the helium burns to neon (or species of similar mass number) releasing
~ 5 x 1043j. The entire star is disrupted with energy typical of a supernova of type I.
Observational Constraints
We assume that the 108 Galactic globular sluster stars (Allen 1973) have masses
distributed according to Salpeter's (1955) mass function
dN = M. -1"35 d in M.
with a lower limit of 0°l M@. The variation of age at helium flash, T hf" with mass
din T hf - - - -5.31 dlnM,
was calculated using Eggletonls (1971, 1972) code. For age i0 i0 years we are interested
in N, = 0.92 M® and find that Galactic globular cluster stars now undergo 5.5 x l0 "13
helium flashes per star per year or, 5 x lO -5 per year for the whole Galaxy.
289
Since one would expect to see a supernova remnant for l06 years (Woltjer 1972)
and no supernovae remnants or gas of any sort is seen in globular clusters, we con-
clude that the supernova rate is less than l0 -l# per star per year. So there is an
Upper limit of one supernova for 50 helium flashes.
Conclusions
It is not clear how to calculate the temperature gradient or the convective time-
Scale when the energy generation rate is very temperature-dependent. It is however
clear that the traditional prescription does not work.
As to the helium flash supernova model, the observations allow 2% of helium flashes
to be that violent. If this is the case then the small fraction might arise either from
rather narrow ranges of mass, rotation etc. being faveurable or, in some ways more
appealingly, from fluctuations in the convection. This implies that the evolution of
two stars with the same gross properties may be substantially different because of the
random nature of the convective process.
Acknowledgements
This work was supported by the United Kingdom Atomic Energy Authority and the
Science Research Council.
References
ALLEN, C. W., 1973. Astrophysical Quantities, 3rd ed., Athlone Press, London
DKMARQUE, P. and MENGEL, J. G., 1971. Ap. J. 164, 317
EDWARDS, A. C., 1975. M. N. R. A. S. 173, 207
EGGLETON, P. P., 1971. M. N. R. A. S. 151, 351
EGGLETON, P. P., 1972. M. N. R. A. S. 156, 361
SALPETER, E. E., 1955. Ap J. 121, 161
THOMAS, H.-C., 1970. Ap Space Sci. 6, 400
WICKETT, A. J., 1976. The Hydrodynamics of the Helium Flash, D. Phil. thesis,
University of Oxford
WOLTJER, L., 1972. Ann. Rev. Astr. Ap 10, 129
ZIMMERMANN, R. E., 1970. .The Hydrodynamics of a HeliumShell Flash in a Star of one
Solar Mass, Ph. Do thesis, University of California, Los Angeles
WAVE TRANSPORT IN STRATIFIED, ROTATING FLUIDS
M. E. McIntyre
Department of Applied Mathematics and Theoretical Physics,
University of Cambridge
SUMMARY
Momentum and energy transport by buoyancy-Coriolis waves is illustrated Dy means of a simple model example. The need for careful conslderation of a complete problem for mean-flow evolution is emphasised, especially when moving media are involved. Then a recent generallsation of the wave-action and pseudomomentum concepts is introduced, and used to exhibit in a very general way the roles of wave dissipation, forcing, or transience in the mean flow problem, for a certain class of "nearly-unidirectional" mean flows. This class includes differentially-rotating stellar interiors, which could well be systematically changed by wave transport of angular momentum. Similar results hold £or MHDand self-gravitating fluids. Finally the physical distinction between momentum and pseudomomentum is discussed.
i. INTRODUCTION
Some of the most spectacular natural manifestations of wave transport
effects are those believed on the basis of recent evidence to Occur in the
stratospheres of Earth and Venus I-4. Closely analogous effects appear
likely to influence the evolution of the rotation of stellar interiors 5,
and to be important in other astrophysical contexts 6. They are often
associated with rather complicated kinds of low-frequency fluid-dynamical
waves, in which buoyancy and Coriolis forces are essential. The waves set
up a "radiation stress" whereby the mean azimuthal velocity at one height
and latitude can undergo systematic acceleration at the expense of a
corresponding deceleration at a more or less distant location. Thus
transport of angular momentum by the waves is involved. This transport can
result in drastic changes to the pattern of differential rotation (which
in turn can drastically affect the wave propagation and lead to some
interesting feedback effects~.
291
An idealisation illustrating this kind of "radiation stress" phenomenon
is the model problem suggested in figure i. Inertio-gravity waves, which
are the simplest type of buoyancy-Coriolis wave, are being generated by a
slippery, corrugated boundary moving parallel to itself with constant
velocity c:
C
FIGURE I. Inertio-gravity waves being generated in a stably-stratified fluid (specific entropy increasing upward) by a rigidly-moving boundary. The frame of reference is rotating with constant angular velocity~= (O,O,A) and the effective gravity g = (O,O,g) = constant. (It can be shown that the waves have their crests or'lines of constant phase sloping forward, unlike the sound waves which might be generated if the boundary moved much faster.)
Here the Cartesian x direction plays the role of the azimuthal direction,
and the mean state is assumed independent of x. The mean pressure gradient
has no x-component; thus the fluid is free to accelerate in the x direction
in response to the radiation stress.
If the waves are being dissipated in some layer L at the top of the
picture, there is a systematic tendency for the mean flow to accelerate
there. So the wave-drag force which the boundary exerts on the fluid is not
felt at the boundary, as far as the mean flow is concerned; it is felt at
L. This is a typical radiation-stress effect.
If the waves were generated not at a boundary but by a moving system
of heat sources and sinks in some layer in the interior of the fluid, then
total momentum would be constant, and the mean acceleration at levels where
the waves are dissipated will be accompanied by a corresponding deceleration
where they are generated 3'4. The close connection between mean flow changes
and wave dissipation or forcing can be verified by detailed solution of
the appropriate sets of equations, but is not usually obvious from the
equations themselves.
Where the waves are dissipated will depend not only on the physics of
292
the dissipative processes but also upon the solution of the wave propagation
problem for the particular mean-flow profile involved. Other things being
equal, we usually get enhanced dissipation of the waves in places, if any,
where their intrinsic frequency (i.e. frequency in a frame of reference
moving with the local mean flow H ) is Doppler shifted towards zero - that
is, we tend to get enhanced dissipation near an actual or virtual "critical line "8 5(y,z) = c.
In this review I shall pay particular but not exclusive attention to
the class of problems exemplified by figure i. Their characteristic feature
is the existence of a coordinate x (cartesian or curvilinear) such that mean
quantities are independent of x, and "mean" can be defined as an average
with respect to x. Such problems, which I shall call "longitudinally
symmetric" happen to comprise an area of recent advances, and also serve
to illustrate some of the subtleties and pitfalls which can arise in thinking
in general terms about the transport of conservable quantities such as
energy and momentum by waves in material media,and most particularly waves
in moving media. (We are, of course, dealing with moving media par excellence
as soon as Coriolis forces are relevant.) In section 2 some of these points
are illustrated by describing in more detail what happens in the problem
of figure i. Most of the phenomena encountered can be found in one or other
of several related problems which have been discussed in the literature by Eliassen 9, Phillips IO, Matsuno II, Uryu 12, Grimshaw 13, and others.
In sections 3 and 4 I turn from illustrative example to general theory,
and survey some very recent developments which appear to be of quite wide significance, but which have proved to be especially powerful for longitud-
inally symmetric problems. A simple yet very general version of the
"wave-action" concept is involved, resulting from a synthesis and extension
of ideas from "classical field theory "14 and the more recent work of Eckart 15, Hayes 16, Dewar 17, and Bretherton 18. Equally relevant is the
pioneering work of Eliassen & Palm 19 and Charney & Drazin20; and a related
but not identical line of development is contained in the work of Soward 21 on the Braginskii dynamo problem. A remarkable feature of the general
results is that they enable useful statements to be made without requiring
validity of approximations of the "slowly-varying wavetrain" type and attendant concepts like "group velocity'. Also, they can be developed for
finite-amplitude waves 22. Their special value in longitudinally-symmetric
problems is that they lead to ways of expressing the problem for the
mean-flow changes which do directly exhibit the abovementioned general connection between those changes and wave dissipation or forcing 22-25.
Finally (section 5) I shall make some remarks about that elusive entity,
or rather, nonentity, wave "momentum'.
293
2. MORE ABOUT THE PROBLEM OF FIGURE 1
2.1 Equations
The simplest relevant set of model equations is the usual set for a
Boussinesq, incompressible, stratified fluid in a rotating £rame of
reference, with constant angular velocity JL :
~,t + ~'~ + 2J~^B + polyp - e~ = -X (2.1a)
8,t + U.V8 = -Q (2.1b)
V.u = O . (2.1c)
Here u = (u,v,w) is velocity, ( ),t stands for 6( )/6t, ~ is the unit vector
(O,O,i), e is the buoyancy acceleration given by minus the effective
gravity-plus-centrifugal acceleration (assumed constant in this model)
times the fractional departure of the density from its constant reference
value Po" For descriptive purposes we shall think of 8 as a measure of
temperature or potential temperature. The departure of the pressure from
the hydrostatic value associated with Po is denoted by p . The terms X =
(X,Y,Z) and Q may be thought of as representing arbitrary body forces and
~eating, which may or may not be functionally related to the fields of motion
but which in any case will be zero if the waves are neither dissipated nor
generated internally.
2.2 Excess momentum flux, and the mean-flow problem
Let an average with respect to x be denoted by an overbar: for instance
the mean velocity in the x-direction in figure 1 is 5(y,z). If we average
(2.1a) and make use of (2.1c) the result may be written in suffix notation
(i,j=i,2,3) as
ui,t + {uiuj + ~ij},j ÷ (2~A~)i - ezi = -{u[u~},j - xi " (2.2a) Po
(It will be convenient in what follows to use (x,y,z) and (u,v,w)
interch___angeably with (Xl,X2,X 3) and (Ul,U2,U3).) Here ( )" is defined as ( ) - ( ) , the departure from the mean, and (),j means d( )/6xj . Eq.(2.2a)
contains mean-flow quantities only, except for the term involving the
Reynolds stress uiu j . The equation tells us that uiu j is the excess mean ...... > i momentum flux due to the waves. Note that uiu j is a wave property, by which
I mean something which can be self'consistently evaluated as soon as you know the linear wave solution, i.e. when you know the fluctuating quantities ( )" to leading order.
294
It might be tempting to conclude that nothing more need be said: Eq.
(2.2a) states that the momentum transport by the waves is equal to uiu j ;
so "obviously" -ui~ is the stress whose divergence will give the mean
acceleration u,t, or at least the contribution to this acceleration
attributable to the waves. The average of Eq.(2.1b), namely
8,t + {ujS},j = -{u~8"},j - Q , (2.2b)
is irrelevant, one might think, because how, after all, can the excess heat
flux u{8" due to the waves affect momentum transport?
This conclusion would, however, be wrong (for reasons to appear
shortly), and the fact that it has appeared in the past literature
illustrates the dangers of "incomplete reasoning" about wave transport
effects on the basis of superficial consideration of a relevant-looking wave
property - in this case the excess momentum flux u~u~. Another illustration
will be encountered in section 2.6. In fact the only safe general recipe
for getting a self-consistent picture is to include a conslderation of the
complete problem for the mean flow and its solution correct to second order
in the wave amplitude a. In the present example, the wave properties u[u~
and ~ appear as forcing terms in the mean-flow problem; and both turn
out to play essential roles.
The result o£ averaging (2.1c) is
~.~ = 0 , (2.2c)
and this completes the set of equations, (2.2), for the mean quantities
and 8 . To obtain a well-determined model problem it is simplest to suppose
that the flow is bounded laterally by a pair of vertical walls y = O, b
on which the normal component of velocity vanishes, implying that
= O on y = O, b . (2.3)
We must beware, however, of assuming that ~ vanishes at z = O; in fact for
a rigidly-translating, corrugated boundary whose shape is described by a
given function h ,
z = h(x-ct, y) , (2.4)
where h=O(a), h=O, and c is a (real) constant, it can be shown that
+ O(a 3) at z = 0 . (2.5) = (v'h),y
This illustrates the fact that ~ , which is an average along a horizontal
line such as ~ in figure I, can represent a vertical mass flux, into or out
of the thin region betweeen ~ and the actual boundary, which is continuous
with a horizontal, O(a 2) mass flux within that region, associated with any
tendency for the disturbance velocity to be one way along troughs and the
other way along ridges in the boundary.
295
In fact, such a tendency turns out to be the rule rather than the
exception when Coriolis effects matter; for instance if h is of the form
a sin k(x-ct) then v" for conservative, plane inertio-gravity waves on a
uniformly stratified basic state of rest turns out to be exactly in
quadrature with w" and therefore exactly in phase with h at z=O. This can
easily be verified by setting 8,z = constant, 5 = 8,y = 0 , and X=O, Q=0,
and calculating the elementary plane-wave solutions ~ exp i(kx + mz -~t)
of the linearised disturbance equatlons derived from (2.1) (namely (3.2)
below). Other pertinent features of such plane-wave solutions are that 8",
being proportional to the vertical displacement through the basic stable
stratification 8 is (like h at z=O) in quadrature with the vertical ,Z '
velocity w'; also incompressibility dlctates that u" is in phase with w',
since (2.1c) implies iku" + imw" = O . Thus u'w', v'8" are nonzero, and v'w',
zero, in a plane inertio-gravity wave. The frequency of such a wave ~,
(= kc), satisfies the dispersion relation
~2 = (~ zk2 + 40~2m2)/(k2 + m 2) (2.6)
when H = O. (It should be noted that this implies that c 2 must lie between
4jl2/k 2 and O,z/k 2 for the inertio-gravity waves to be generated.)
2.3 Solution
I shall now describe, for the simplest relevant example, the result of solving the O(a 2) mean flow problem; ~ and Q will be set to zero, so that
we are talking about the effect on the mean flow of the waves alone. The
waves are supposed to have propagated upwards as far as L either because
they are being dissipated there or because a finite time has elapsed since
the bottom boundary started moving. Well below L we can take the waves to
have reached a steady state and the moticn to be conservative - we assume
that X'and Q'are zero there as well as ~ and Q. To keep life as slmple as
possible we shall assume that H = 0 initially, and follow its evolution as long as it can be considered to be O(a2). We also take e,z = constant +O(a 2)
for the moment.
The simplest kind of mathematical analysis for the waves (we omit the
details, since the results of section 4 will supersede them) makes the usual
kind of "slowly-varying" approximation, in which the plane wave solution
is locally valid. This involves inter alia an assumption that the layer L
is sufficiently deep compared with a vertical wavelength. We also take h
to be of the form a.f(y).sin k(x-ct) , where f(y) is a sufficiently slowly-varying function (which we assume vanishes at y=O,b). Then by the
properties of plane inertio-gravity waves previously mentioned, the important term on the right of the x-component of (2.2a) is -(u'w') and
Fz that on the right of (2.2b) is -~SVT,y . The remaining terms are not of
course exactly zero, because plane waves represent only the leading
approximation; but in fact it is consistent to neglect them. The response
of the mean flow to the forcing -(v'8"),y together with the forcing
represented by the inhomogeneous boundary condition (2.5) involves a mean
"secondary circulation" indicated schematically by the arrows in figure 2.
296
The picture assumes that the wave amplitude is a maximum near y=~band falls
monotonically to zero on either side, so that (~) changes sign once, ,Y
near y=~b. The mean flow feels an apparent "heating" on one side of the
channel, and "cooling" on the other (about which more will be said in section
2.5). This gives rise to an O(a 2) mean vertical velocity~which beautifully
satisfies the boundary condition (2.5) and, by Eq.(2.2c) , demands a mean
motion across the channel, i.e. a contribution to V, in the vicinity of the
layer L where the wave amplitude goes to zero with height.
"Z- f
1 WAVES
i
I
!
t
I
!
w
!
t
t
FIGURE 2. Left: end view (looking along the x axis) of the problem of figure i. Right: typical profile of the mean acceleration in the longitudinal or x direction. The left-hand picture indicates how the secondary circulation ~, ~ is closed by a mass flux "in the bottom boundary', associated with a positive correlation between the disturbance y-velocity, v', and the depth -h of the corrugations in the boundary.
The Coriolis force associated with this O(a 2) contribution to ~ accounts
for a contribution to H t which is generally comparable with that from the
Reynolds stress dzvergence-(u w ),z zn the x component of Eq.(2.2a). In fact
the two contributions, in the present simple problem, can be shown to stand
approximately in the ratio
-6s 2 Reynolds stress divergence _~ Coriolis force associated with wave heat flux 4_Q- 2
. (2.7)
The two contributions are comparable in magnitudewnenever the Coriolis term
is significant in the dispersion relation (2.6); indeed if k2<<m 2 in (2.6),
~2 e 4~2 and the two contributions are almost equal and opposite. In that
297
case an estimate of effective momentum transport from the Reynolds stress
-~-~walone could be too large by an order of magnitude. (It is always wrong
in order of magnitude in another example, namely that of quasi-geostrophic,
vertically propagating Rossby waves II'12 - in which case it is too small,
by a factor of order the Rossby number.) Only when both contributions are
accounted for will the calculated total rate of change of mean momentum
u,tdydz agree, as it must, with minus the horizontal wave-drag force F
exerted by the fluid on the lower boundary. F is defined correct to second
order as the integral with respect to y of
= P'h,xlz= O + O(a3) . (2.8)
This agreement (the detailed verification of which is omitted here) provides
a useful check on the correctness of the overall picture.
2.4 Lagrangian-mean flow and "radiation stress"
There are crucial differences between the foregoing picture, which is
based on Eulerian averaging, that is averaging at fixed values of (y,z),
and the same problem solved using a Lagrangianmean (definable approximately
as the mean following a fluid particle). It turns out that the Lagrangian-
mean secondary circulation is negligible sufficiently far below L. In a
region of steady waves, when Q=O, the fluid partlcles merely oscillate about
a constant mean level, and have no systematic tendency to migrate up or down.
This is no more than mlght be expected for adiabatic motion in stable
stratification. So in a Lagrangian-mean description there is no secondary
circulation linking the regions of wave generation anddlsslpation, and thus
no "Coriolis" contribution to the effective transport of momentum by the
waves. The analogue, in the Lagrangian-mean momentum equation, of the
Reynolds stress - ~ in the Eulerian-mean momentum equation (2o2a), thus
gives a more direct description of the momentum transport, as was recognised
by Bretherton 18,26, who suggested that it be identified as the radiation
stress. Its xz component, taking the place of -u'w r in the present,
Eulerian-mean description, is equal to the "wave-drag" force in the x
direction per unit area across amaterial surface whose undisturbed position
is a plane z = constant - which force is evidently the same as (2.8) when
z=O. These ideas have been further developed by Grimshaw 13 for the
slowly-varying case, and an exact "generalised Lagranglan-mean" descrip-
tion for arbitrary, finite-amplitude waves has been developed by Andrews
& McIntyre 22. The reader is referred to those papers for more discussion
of the differences between the Lagrangian and Eulerian-mean descriptions,
and to Bretherton 26 for a sufficient physical explanation, in terms of the
average Coriolis force on a thin piece of fluid bounded above by a corrugated
material surface and below by a flat, "Eulerian" control surface, as to why
the corresponding momentum fluxes differ in general; see also (4.7) and
(4.9) below.
2.5 More details about the Eulerian-mean secondary circulation
Returning to the example of figure 2, we describe in a little more detail
298
how, in the present description, the forcing term - ~ , y "gives rise"
to the Eulerian-mean vertical velocity W, since this will help motivate the
more general analysis of section 4. In the region below L, where we may
suppose the waves to be in a steady state and not dlsslpating (X'=O, Q'=O),
the forcing term -(v'e~,y is in fact balanced mainly by ~ 8,z on the left
of (2.2b), when we rewrite that equation with the aid of (2.2c) as
8,t + ~ 8,y + ~ 8,z = -{v'S'),y . (2.9)
The term ~ e ~, is O(a 4) and therefore negligible, because ~ is O(a2), and
e,y is also O(=a 2) for the following reason. We have taken ~=O(a2), which
implies that 8,y=O(a 2) because the x component of the curl of (2.2a) gives
(when X=O and ~,~ are O(a2))
~,y = -231U,z + O(a 2) . (2.10)
(This is known in geophysical fluid dynamics as the "thermal wind
equation" ) The other term 8 t on the left of (2.9) turns out to be negligible
unless we are within a distance of order the "Rossby height"
H R = 2~b/(e,z)%
£rom the layer L where the waves are unsteady or dlssipating 9. This point
will be further explained in section 4.6 below. Thus, sufficiently far below
L we have a balance between W~,z and -(~,y , which implies that
~ - ~ y , (2 ii)
where we have defined
= v'8"/(O,z) , (2.12)
again using (2.10) and our temporary assumption that H=O(a 2) in order to
neglect e,y z .
There is another didactic point to be made here, incidentally: it has
o£ten been assumed in the literature, for instance in connection with
thermodynamic arguments, that the nonzero value of ~-~ signifies a tendency
for the waves to transport heat across the channel. Even more than with , this is true but misleading. There is no tendency at all for the
mean temperature to rise on one side and fall on the other, if we are
sufficiently far below L. The adiabatic heating or cooling associated with closely compensates the divergence of ~ This compensation is
intrinsic to the nature of the wave motion, as is underlined by the already-mentioned consideration that indlvidual fluid particles are not
being heated or cooled below L, because the motion is adiabatic there.
The right-hand half of figure 2 schematically indicates the profile
o£ the mean acceleration H If the layer L is shallower than the Rossby It" height H R, then additional contributions to ~ and ~arise in a layer of depth
H R centred on L. These adjust the values of e,t and 5,t in such a way as
299
to keep the thermal-wind equation (2.10) satisfied; there is
a circulation only in a layer of depth H R (see Eq.(4.12) 'room" for such below).
It turns out that Eq.(2.11) still holds for mean flow profiles H(y,z)
and 8(y,z) which vary sufficiently slowly and are such that the Richardson number ~,z/(U,z )2 is large. Then
V e ~,z (2.13)
can be significant below L. The associated Coriolis force does not, however,
lead to an acceleration of the mean flow well below L, because it turns out
that it is always cancelled by an equal and opposite Reynolds stress
divergence if the waves are steady and conservative. The fact that this
cancellation must take place will be seen as a corollary of the much more
general results to be described in section 4.
2.6 Energetics
Our simple example is also quite instructive as regards questions of
energy transport. Suppose that the waves are dzsslpating in the layer L and
heating the fluid there. (The amount of heat involved does not change
significantly within the Boussinesq approximation, but that is beside the
point.) What is the source of this energy? Obviously, the work done by the
agency moving the bottom boundary. How does the energy get from the boundary
up to L ? Answer: there is a vertical energy flux p'w" due to the waves. Indeed, the rate of working by the boundary
-~C = -cp h,x z=O = P w z=O ' (2.14)
in virtue of (2.8) and the fact that w'= -ch
here are wave properties. ,x" All the quantities involved
What we must not forget, however, is that this simple picture, while
correct to O(a2), depends crucially on the circumstance that 5 is zero, apart
from the O(a 2) contribution indicated in figure 2. If we look at the
similar problem in which the boundary is brought to rest
and the fluid is moving past it with velocity H = -c + O(a2), the picture
is quite different. Clearly the boundary can now do no work. The source of
energy is now the kinetic energy of the mean flow near L, whose density is changing at a rate Po times
(~u2),t = UU,t =-cu,t + O(a4) • (2.15)
The integral of (2.15) over the yz domain is, indeed, equa I to Fc, by the remarks above Eq.(2.8).
I n t h i s p r o b ] e m there is no need for the waves to transport
any energy into L at all; and indeed it does turn out that there is no net transport, despite the fact that p'w" is still the same as before. Note first
that in the region of steady, conservative waves below L, the work done
300
across a material surface corrugated by the waves is obviously zero, because
in the present p r o b ] e m such a surface is immobile. Alternatively,
in an Eulerian-mean description of the energy budget correct to O(a2), based
on the model equations (2.1) and (2.2), the total energy tlux across a
horizontal control surface contains two terms which combine to cancel the
contribution Ip'w'dy . The first comes from the O(a 2) part of the advection
by w" of the leading contribution ~po(U + u') 2 to the total kinetic energy:
PoW'(~52 + 5u" + ~u "2) = po H u'w" + O(a 2) . (2.16)
(This contribution to the total Eulerian energy £1ux has been drawn
attention to in the literature just about as often as it has been forgotten
aboutL) The second contribution is the mean pressure-working f~dy
associated with the y-dependent part of the mean pressure whose gradient
balances the Coriolis force associated with H .
The general conclusion to be drawn is that, whenever moving media are
involved, we must expect that a solution to the O(a 2) mean flow problem will
be essential to a self-consistent picture of the way in which waves
contribute to the energy budget. We must also remember that, as always, use
of the energy concept requires us to pay attention to £rames of re£erence!
There is no such thing as "the" net energy transport due to the waves; and
the transport can be identified with p'u" only if the medium is everywhere
at rest.
None of this affects the quite separate fact that the wave property
p'~" is the quantity usually related to the group velocity (when that concept
is applicable). It usually turns out that
p'u" = Ex(group velocity relative to the local mean flow) (2.17)
for plane waves, where E is intrinsic wave-energy density, a wave property
which in the present problem takes the form
E =]~po ( u.2 + S.2/~,z) (2.18)
(The second term is the "available potential energy "27 associated with
vertical displacements of particles in the stable stratification; there is
no internal energy because our model assumes incompressible flow. Brether-
ton & Garrett 28 have analysed the idea of "wave-energy" as a physical concept
in some depth, and in particular have established condltions under which
E can be uniquely defined in a general manner independent of the mathematical
formulation of the wave problem. I shall not go into that here except to
say that, roughly speaking, E is the work you would do in setting up the
disturbance, in a frame of reference in which the mean flow is at rest -
which clearly makes approximate sense in problems of moving media only when
the mean state varies sufficiently little over a wavelength.) I said that (2.17) "usually" holds, by the way, because there are some exotic cases such
as Rossby waves where the two sides differ by an identlcally nondivergent
term. However, in the case of our plane inertio-gravity waves the two sides
301
can be verified to be equal.
3. THE GENERALISED WAVE-ACTION PRINCIPLE
I deliberately avoided writing down the full conservation equations
for the energy budget, partly because the details get quite complicated for
all but the very simplest mean flow structures. Besides, I want to leave
space to introduce another kind of conservable quantity, generalised
wave-action, which apart from its wider significance will lead to a more
powerful approach to problems of the kind just dlscussed. This approach will
depend in no essential way upon any "slowly-varying" approximations.
Wave-action is an O(a 2) wave property which, in its most general form,
satisfies a conservation relation, apart from source terms in X" and Q',
for any mean-flow structure whatever. This remarkable property is to be
contrasted with the equation for wave-energy, whose "right-hand side"
contains a complex of terms representing exchange of energy with the mean
flow:
~E + ~. (~E + p'u') = 5t
. . . . . • - -
= po[-5i,juiuj - (~ij-zizj)e,iuj e - 2 , zt+UjS,zj )]
-Po[~'.~" + 8"Q'/8,z] • (3.1)
wave-energy fails to be conserved as soon as you have a moving medium. Eq.
(3.1) is just for the Boussinesq case, and is derived by dotting the
linearised versions of (2.1a) with PoU" and of (2.1b) with poe'/e,z and adding. We should always keep in mind that (3.1), as implied by the
discussion just given in section 2.6, represents only a part of the whole
energy budget. The linearised equations corresponding to (2.1) are
Dtu + u'.~7u + 2 NAu ÷ pj " - 6)" = " (3.2a)
DtS'+ u'.~z~ = -Q" (3.2b)
V.~" = O , (3.2c)
where the linearised material derivative
Dt( ) = ( ),t + ~.V( ) .
Here we shall define the generalised wave-action correct to O(a 2) only.
(It can be defined exactly, for finite-amplitude disturbances, once one has the generalised Lagrangian-mean description 22, but that is beyond the scope
of this review.) Two preliminaries are needed. The first is to introduce
the O(a) particle displacement field ~(x,t), which is defined to satisfy
302
~7.~ : O, ~ = O, and
Dt~ = u i , (3.3a)
where u ~ is the Lagrangian disturbance velocity
u" u = ~ + [ . V ~ ( 3 . 3 b )
For the problem of figure i, (3.3) simplifies, correct to O(a2),to
Ot~ = v', Ot~ = w', Ot~ = u" + ~u y+ ~U,z , (3.4)
where ~, ~, and ~ are the components of ~.
The second preliminary is a formal device (see Hayes 16 and Bretherton 25)
which is introduced for the sake of the greatest possible generality: we
reinterpret the Eulerian averaging operator ( ) as an ensemble average, over
an ensemble of wave solutions distinguished by a single, smoothly varying
parameter o~ . In a stochastic problem o~ would range over a "sample space';
but random waves are merely one possible case. In the determznlstic problem
of figure i, for example, we can generate a suitable ensemble just by
translating the boundary and the wave pattern a distance ~ in the x
direction. Then (--~ may be trivially re-defined in terms of an integral over
o~ rather than over x. For the axisymmetric mean flows important in
astrogeophysical applications the principle is the same but the details less
trivial 22'25. Quite generally, we nave the basic property
{( ),~} = {( )},~= 0 , (3.5)
whenever the ensemble of disturbance fields depends differentiably upon the
parameter c~ , which we shall take to be the case.
Instead of dotting the linear ised momentum equation (3.2a) with PoU ",
we now take its dot product with the derivative Po~,o~ and average. After some manipulation there results 22'23
Correct to O(a2), (4.4) can be regarded as a set of equations for the five unknowns
{~,t , V*, ~*, §,t ' P,t} , (4.6)
since Uy, U z, ~,y and e,z can be regarded as known "coefficients" apart from contributions O(a2). We are of course thinking of the linearised wave problem as having been solved, so that the wave properties on the right are
known forcing terms.
The transformation (4.3) is dependent on our choice of coordinates; a coordinate-independent preliminary transformation may be used instead, at the cost of getting more complicated-looking versions of the results 32.
4.3 Excess momentum fluxes and generalised Eliassen-Palm relations
We now multiply the x component of (3.2a) by ~ and then by ~ 26, and average. The resulting pair of relations reduces, after a little manipulation in which we use (3.4), (3.9) (with ~ replaced by x) , and our
assumption that ~ = (~,O,O) + O(aZ), to
u'v" + pjl~7, x p- _ Uz ~ = ~ + _IUy(-~),t + (~u'),t + O(a4) (4.7a)
u w + po -I ~,x p" - Uy~-~ = ~ + ½Uz(~),t + (~u'),t + O(a4)" (4.7b)
The second term in (4.7b) is to be compared with (2.8), the wave-drag force - are the excess momentum fluxes per unit area, and indeed ~,xp" and - p;
(or minus the "radiation stress" components) in one of the forms in which they appear in Lagrangian-mean analogues of (2.2a) 13'18'22. Eqs.(4.7)
relate these to the "Eulerian" excess momentum fluxes u'v" and ~-~. Note that -~,x p~ and - ,x~ are also equal to the y and z components of the nonadvective part of the flux of the pseudomomentum component PI; see
and ~ times the first plus ~ times the second of (3.4) gives
, (4.8)
307
i ~ i , t = ~ + ~"w"~ (4.9)
These may he used to eliminate ~v? and ~w" from (4.7). The result is a pair
of relations like (4.7) except that Sxy and Sxz appear in place of u'v" and ~ , and the remaining terms, apart from ~xp-~ and ~ , are either of the form [--5, t or contain a factor X" or Q'. If the first relation is differentiated with respect to y and the second with respect to z, and (4.2) used to eliminate the two terms in p" , there results
By multiplying (3.2b) by 0", averaging, and differentiating with respect to z, we get
G,z = I~/~,z),~ + ~-~/~,z],z + o(a4) 1 (4.10b)
4.4 Deductions
The results (4.10) imply that the O(a 2) forcing terms on the right of the mean-flow problem (4.4) can be expressed as a sum of terms each of which falls into one of three categories:
(i) the Eulerian-mean external forcing terms ~ and Q (which we shall choose to regard as unconnected with the waves),
(ii) wave terms of the time-differentiated form ( ),t ,and
(iii) wave terms all involving X', Q', or q" , that is, all depending explicitl Z on the forcing or dissipation of the waves.
This immediately shows that in the proDlem of figure l, for instance, the forcing of mean-flow changes vanishes below the layer L, where the waves are steady and conservative, for any initial pro£iles of mean flow and stratification, no matter how complicated, and no matter whether or not approximate, "slowly-varying" descriptions of the waves are valid.
It also carries the implication that although strictly conservative waves or instabilities (X', Q" and q" zero) can change the mean flow if they themselves are growing or decaying in amplitude, such changes are temporary
in that no net change to the mean flow persists if the waves propagate cut of the region of interest. This is almost obvious 23 from the fact that all the O(a 2) wave terms on the right of (4.4) can then be written in the form (-~,t ,with the aid of (4.10). However, if avery small amount of dissipation
308
is present, its effects can be greatly enhanced by the occurrence of
mean-flow changes that would otherwise be temporary. For a striking example
of this, see reference ii. A change in the mean-flow profile due to wave
transience can bring about an approach to "critical-line" conditions
somewhere, giving the dissipation terms a chance to take over locally, in
turn causing further and more permanent mean-flow changes. Theoretical work
on such highly nonlinear feedback effects, which could easily be important
in the evolution of stellar differential rotation, for example, is still in its infancy 7'8.
4.5 Extensions
Precisely analogous results, enabling the same qualitative conclusions
about mean-flow evolution to be drawn without solving wave problems in
detail, have been derived for:
I) rotationally as well as translationally invariant mean flows, with
mean velocity predominantly in the longitudinal (azimuthal) direction 22-2~ 32
2) a fluid with a general equation of state p = S(8,p); the Boussinesq
approximation is not essential 22'25'32
3) a self-gravitating fluid 22
4) a conducting fluid, with mean magnetic field as well as mean velocity
predominantly longitudinal 31.
4.6 Simplifications for the problem of section 2
It is of interest in connection with the previous discussion to exhibit
the approximate form taken by the transformed mean-flow problem for the
almost-plane inertio-gravity waves of figure i. A self-consistent set of
approximations requires that the Richardson number ~,z/(H,z)2 be large, and
the depth of the layer L, and other scales of mean variation in the vertical,
including the Rossbyheight H R= 2_O.b/(~,z )i , large compared to the vertical
radian wavelength m -I. The horizontal scale b is then large also, compared with k -1, assuming that both terms in the dispersion relation (2.6) are not
too different in magnitude. The upshot of such approximations is that all
the terms of the form (-~,y or (--) ~ in (4.10) become negligible, and (4.4)
can be shown to simplify, when X and ~ are zero, to
U,t - 2/i~* = - ~,x.X" - ~,X q" + poIPl,t (4.11a)
2]lu,t + P,ty = O (4.11b)
-8,t + P,tz = O (4.11c)
- -* (4. lld) ~,t + ~9,zW = O
V,y + w, z = O • (4.11e)
808
According to the second and third equations the mean flow stays in
geost~ophic and hydrostatic balance as it changes: knowledge of P,t implies
knowledge of ~,t and 8,t " (The thermal-wind equation (2.10) holds, with
its O(a 2) contribution negligible). If we eliminate 5 t and 8 t in favour of P,t in (4.11a) and (4.11d), and cross-differentiate to eliminate ~* and m* w via (4.11e), there results
The form of the elliptic operator on the left shows why the Rossby height
= 2J~b/(8 z)% takes over as the vertical scale of the response of the H R mean flow, whenever the forcing on the right of (4.12) is confined to a layer
L of depth smaller than H R .
5. PSEUDOMOMENTUM IS NOT MOMENTUM
It is surprising how often one seems to encounter the conceptual mistake
that since waves can transport momentum, they must possess it. No less than
Lord Rayleigh 36 appears to have been under this impression when he wrote
that "if the reflexion of a train of waves exercises a pressure upon the
reflector, it can only be because the train of waves itself involves
momentum'. Nowadays one often reads about "the momentum of the waves" and
abo~t waves "exchanging'(their) momentum with the mean flow, and suchlike;
and there appears to be a tendency to assume that this momentum which the
waves are supposed to have is to be identified with the wave property Pi defined in (4.1a), or rather the wave property
Pi = Eki/~+ , (5.1)
to which Pi can be shown to reduce in those "slowly-varying" circumstances where Bretherton & Garrett's 28 arguments apply.
On the other hand, as Brillouin 37 pointed out in 1925, Rayleigh's
statement is a non sequitur because in a material medium you can perfectly well have a nonzero flux of momentum unaccompanied by any momentum density
- try leaning against a brick wall. Two specific counterexamples to
Rayleigh's statement which I happen to know are the obvious one of waves in solids (phonons) 37'38 , and a simple fluid-dynamical example Ipublished
in 197340 . In the latter example, apacket of "inertia" (pure Coriolis) waves
propagates along a waveguide comprising an incompressible , homogeneous
liquid between rigid, parallel boundaries in a rotating frame of reference.
The mean momentum is zer____~o, for reasons of mass continuity. Nevertheless
310
there is a non-zero recoil force when the wave packet is reflected from an immersed obstacle.
That problem has the further interesting feature that the wave packet
does have a well-defined "fluid impulse', I, which gives the recoil force.
is not equal to the integral of P; indeed it can have the opposite sense]
(In fact the momentum flux due to the mean pressure ~ plays a leading role in this particular problem.)
An example of a somewhat different kind is the celebrated problem of
a packet of electromagnetic radiation in a refractive medium. This problem
has long been controversial, but has been convincingly clarified in recent years by Penfield & Haus 42, Gordon 39, and Peierls 38, to whose papers,
together with the review by Robinson 41, the reader is referred for some very
interesting history. In this problem, provided we neglect dispersion, there
is a definite, non-zero total momentum M i which travels with the waves.
Again, this is not generally equal to the integral of Pi (or rather its
electromagnetic counterpart, the "Minkowski quantity'), nor is it equal to
the electromagnetic part of the total momentum (the "Abraham quantity').
On the other hand recoil forces, are, this time, simply related to Pi (but not to Mi!) in at least some circumstances 39.
Brillouin's point is that waves don't have to possess momentum; the
examples show that in fact they sometimes do and sometimes don't - and that
when they do, the momentum is not necessarily related to recoil forces. The
wave property Pi may or may not be closely related to either; and whether
it is depends in fact on 91obal considerations - on the full O(a 2) mean
problem and its boundary conditions. So we must either say that Pi may
sometimes "be interpreted" as momentum, and sometimes not, depending on the
global problem - surely a most unsatisfactory conceptual structure - or we
must decide that Pi is simply an entity in its own right, not necessarily related to any momentum, whereupon the conceptual problems disappear. This
is why I like to have a separate name "pseudomomentum" for Pi, just as one
likes to have separate names for other pairs of quantities, like energy and
torque, which have the same dimensions but different physical natures. I
want to use the term "momentum" in its ordinary, elementary sense, of course
(which we use when thinking intuitively about forces and accelerations).
The terminology follows the usage of workers in solid-state physics, to whom all this has naturally been more obvious than to most. Blount and Gordon 39
have carried the terminology, and the distinction between momentum and
pseudomomentum, into classical electrodynamics, and shown how it helps
clarify the issues of the so-called Abraham-Minkowski "controversy" over
electromagnetic waves referred to above.
"But, the reader may say, you said in section 3 that Hayes
conservation relation results from replacing certain space or time
derivatives by ( ),~ in the variational derivation of the conservation
relation for the energy-momentum tensor. So the result of going in the
reverse direction, which is just (4.2) without the "dissipative" terms, is
nothing but the conservation law satisfied by the "momentum" part of the
311
energy-momentum tensor itself. So surely Pi i_ss a momentum."
My reply is that that would follow if the same mathematical formalism
always represented the same physical entity. Here, however, the blanket
term "energy-momentum tensor" tends to obscure the fact that different
variational principles, springing from different basic formulations of the
physical problem, can be used as starting point. Different basic formul- ations carry different implications about the kinds of invariance
properties associated with conservation laws. Certainly Eq. (4.2) is
associated with invariance under translations in space, just as is
conservation of momentum. But the translational invariances referred to
are in fact quite different, a point well made by Peierls 38. Conservation
of the i-component of momentum, for instance, depends on invariance of the
basic physical problem, including force potentials (e.g.gravitational),
under translations in the x i direction. Conservation of Pi depends on
translational invariance of the mean flow, insofar as it enters into the disturbance problem. This condition does not necessarily involve external
force potentials. Other necessary conditions for the two kinds of conserv-
ation law to hold are also quite different. For instance, whether or not
momentum is conserved has nothing to do with whether or not the motion is
adiabatic, while conservation of Pi does depend on the motion being
adiabatic since it requires, inter alia, that q" = O in (4.2).
It is the description in terms of particle displacements (and Hamilton's
principle in the classical sense) that lies behind the appearance of
pseudomomentum rather than momentum in the "energy-momentum" tensor. By
contrast, if we form components of the Eulerian-mean energy-momentum tensor
from the usual pure-Eulerian variational principle in fluid dynamics, namely the Clebsch-Herivel-Lin principle 30, then in place of Pi and its flux
Qij we get the Eulerian-mean density and flux of momentum ,
pu i and P6"ij - Puiuj , (5.2)
to within an identically nondivergent contribution. This should be no
surprise in view of the foregoing remarks on translational invariance.
Conservation of pseudomomentum, as distinct from momentum, is connected
with invariance to a displacement of the disturbance pattern while mean
particle positions are kept fixed, as distinct from a displacement of the whole system, particles as well as disturbance pattern 38. The idea of "fixed
mean particle positions" cannot be directly expressed within a purely
field-theoretic or Eulerian description, which does not keep track of where
fluid particles are. But it is implicit in a description of the disturbance
in terms of particle displacements (from "mean particle positions'). (What
this means for finite-amplitude disturbances is dealt with in reference 22.)
One possible reason why momentum and pseudomomentumhave sometimes been
mistaken for one another may be that in certain examples, even~ore idealised
than those already cited, not only are both quantities conserved (requiring
X'and Q~to be zero in the case of pseudomomentum), but also their conservation
relations reduce to the same form. If in these examples we generate the waves
312
starting from an initial state in which momentum and pseudomomentumare both
zero, it can happen that they evolve in parallel and remain equal. The
simplest example is the trivial one of an electromagnetic wave in vacuo.
Here there is no medium present to make the translational symmetry
operations different. But there are also examples involving waves in media,
all of them longitudinally-symmetric problems, in which the lon@itudinal
components of pseudomomentum and mean momentum evolve in parallel. Perhaps
the most celebrated example is that of Stokes" periodic waves on the surface of an infinite, inviscid ocean. The initial conditions of no motion are
hidden in the assumption of irrotational motion~ Approximate longitudinal
symmetry would be enough for approximate conservation of pseudomomentum;
but we need also that there be no mean horizontal pressure gradient and that,
concomitantly, the mean mass continuity equation (which constrains the
distribution of mean momentum but not that of pseudomomentum) plays no
significant role. Exactly the same considerations explain why a further such
example is provided by the problem of section 2, in the case when H R << H, the scale of the layer L. Eqs. (4.11) then imply that, for conservative
waves,
poU,t = Pl,t (X=O, Q=O, HR<<H). (5.3)
These examples are very special, and in any case the question of whether
or when there happens to be a momentum density equal to Pi is not the most
relevant one in practice. Statements which are more useful and general can
be made about the fluxes of momentum and pseudomomentum. Especially when a Lagrangian-mean description is used, the excess momentum flux due to the
waves is often simply related, although not usually equal, to the flux of pseudomomentum. It is basically this fact which accounts for examples of
the kind just mentioned. It is also why (4.2) could be used to eliminate
the terms in p" during the derivation of (4.10a) from (4.7). The reasons
for the existence of such relations are hinted at by Eqs. (2.14), (2.17),
(5.1), and the well-known argument about the relation between wave-drag,
phase speed, and the rate of working across a material surface.
More explicitly, in many "slowly-varying'situations it turns out that
the mean flow can be defined in such a way that (i) the excess momentum flux is the only wave term in the leading
approximation to the O(a 2) mean-flow problem (Eqs.(4.11) provide an example
of this), and (2) the excess momentum flux is then either equal to the pseudomomentum
flux, or differs from it by a contribution Cij which in some cases does not cause systematic mean-flow changes because it can be balanced quasi-
statically by the reaction of the medium. When MHD effects are not involved, and the fluid is compressible, Cij is
an isotropic~ pressure-like contribution which can be thought of as a kind
of acoustic "hard-spring" effect coming from the nonlinearity of the equation of state Ig'37'44. Analytically this results from redefining the
mean pressure in such a way as to avoid having a wave term in the equation
of state for the mean flow. For electromagnetic waves in refractive media Cij is again isotropic 42'39 and comes from "electrostrictive" and "mag-
The same remark applies in the classical theory of sound waves.
313
netostrictive effects 41. In MHD problems Cij is not, however, isotropic 17.
Whether or not such statements are helpful or misleading depends on whether or not, in the problem in question, the difference Cij between the pseudomomentum and excess momentum fluxes has time to be balanced quasi- statically by the mean stress in the medium (it usually does have enough time in "slowly-varying; situations), as well as on whether that mean stress happens to affect the answer to the particular question being posed. (To take a classical example, Ci~ is relevant to the force exerted by the absorption of acoustic waves znto the end wall of a closed container, but not into an absorber immersed within a larger volume of fluid 43,44. It is
presumably the former situation more than the latter, incidentally, to which the problem of solar wind acceleration by Alfv4n radiation pressure is analogous.) When (5.1) holds, and "group velocity" is meaningful, it is often true that the analogue of (2.17) holds also, namely that Qij equals Pi times the jth component of the group velocity. So there are some slowly-varying situations (those in which the questions being asked permit Cij to be ignored in some sense) where one can say mnemonically that the
waves transport momentum as if a local momentum density equal to Pi were being carried along through a vacuum at the group velocity. But the difficulty of saying in general terms when this mnemonic is applicable, and
when it isn't, brings us back in the end to the point made earlier: the only safe and completely general recipe for studying wave transport effects is to consider not only the "wave properties~ which can be evaluated from the O(a), linearised problem, but also a self-consistent analysis, correct to O(a2), of whatever global mean-flow problem is relevant.
REFERENCES
i. Lindzen,R.S. 1973 Boundary-layer Meteorol.4, 327-43 2. Holton,J.R. 1975 The dynamic meteorology of the stratosphere and
where A=(dlnOo/dr)-(i/Yl)(dlnPo/dr). The operation : [(37)/r2-(34)] gives the expres-
sion of ~r/~r. Unfortunately, the resulting expression is not so simple. We shall
be satisfied with a crude approximation in which we introduce an effective number of
nodes n such that ~2(r2~r)/Sr2~-n26r and ~2(rZ6P/p0)3r2~-n2~P/P O. If we neglect A, we
obtain from equation (37),
~r/~r ~ - {n2/[n2+E(£+l)]} (6P/PO) (38)
Physical processes of stability. With the help of the results in the preceding two
sections, all the integrands in the work integrals given by equations (17)-(22) can be
transformed into the forms proportional to the power of pulsation.
After a number of partial integrations, we obtain finally,
WM= ~I ~t0'[2u}- r2Pt ,Z y3-± ~ ~-~ -
W T = (i/2)L(Vad -I - K T) I~T/T012r= R
- I m2)V' IK~R°+{%-I ) - ' - 2+u~R+'~TR}FC011+T/~0t~dV
323
+ I POeN0[ (~N)T, ad + (Y3-1)-1+2aT-Vad-r(l-Vad,P+eN,p/2) ]IST/ToI 2dV
+I I'dv ~0eV0[2(UT -£T ~'6T )-i-{~ In(r2p0gV0)+
+ I [FR0(Vad-1-V0-1 )+FcOVad-I ]H0[[ a (~T/T0)/ar] 2+{% (%+1)/r~} 16T/ToI 2 ]dV
d I-V ST 2 ,R ,R 9 aT 2 (4o)
where approximations (5) and (6) have been used, aT, ~ etc. are defined by, [equations
(28)-(31)],
~Sr ST nZ/(y3,11 ~T ' a r = ~T ~ 0 % - n2+£(%+I) T~ '
SH ST , 8 .ST. V_a~)~T H_O - 8 (ST) --H0 = HT TO0 + HT ~LTo0) = (aT + qad TO ÷ Vad ~TO0 '
, ~ ,~T,.
~0 ~0 +~T~'~o ~ =ra~H~ ~rl% HT
' ~ 8 .ST, ~u I aT , 8 caT. V~d-IVad.P) aT ( £T u0 To + = + + + "rr%' = l-iLoT 2-i~T ~0 l-im~
and the other symbols have been defined before except that
~N,P = d(IngN0)/dlnP0 " and Vad,P = d(inVad)/dlnP0
The turbulent pressure work represented by the first integral in equation (39) can be
positive or negative depending upon the mode (% and n) and the relative convection time
mT, while the turbulent stress work represented by the second integral is always nega-
tive. The ratio between them is of the order of (~T) -l which can be larger or smaller
than unity depending upon the mode and the stellar structure. If the integrands of the
thermodynamical work W T are estimated to be of the order of (Fco/~0) I6T/T012 and FCO ~
PoU0 in convection zones, the ratio WM/W T becomes to be of the order of ~T or (~T) 2
The mechanical work should, therefore, be taken into account in the stellar stability
calculation.
In equation (40), the first llne on the right hand side shows the spherical effect and
the <-mechanism. If K increases strongly with increasing temperature and density (nega-
tive KT) , larger radiation loss results at lower density higher entropy state, causing
the destabillzation of the oscillation. The sphericity has a similar effect, though
not so pronounced. The second llne shows the correction to the K and other effects. The
third llne shows primarily the e mechanism which is the destabilizing effect due to high
324
temperature sensitivity of the nuclear burning process. The fourth line shows the simi-
lar effect by the viscous dissipation, but its work like the turbulent pressure work can
be positive or negative. The term proportional to FRO in the fifth line describes the
Cowling-Souffrin-Spiegel (or radiative Cowling) mechanism (Graff 1976), and the term
proportional to FC0 describes the turbulent conduction mechanism (or convective Cowlin~
mechanism) which seems to be found by the present analysis. The sixth line shows the
corrections to these two mechanisms. Because of the latter corrections, the radiative
Cowlin~ mechanism may not work for high frequency modes, while the convective Cowling
mechanism may remain effective. If a fluid element moves up (or down) slowly (small ~) in a suDeradiahatic layer, the
temperature inside becomes higher (or lower) than outside, and the density inside be-
comes lower (or higher) than outside because of the pressure balance. Both the radia-
tion and the turbulent conduction decrease (or increase) the thermal energy inside
where the entropy is higher (or lower). This explains the radiative and convective
Cowling mechanisms. Both mechanisms are due to the superadiabaticity, but they differ
greatly in efficiency, because the gain (or loss) in thermal energy is of the same or-
der as the convective flux for the convective Cowling mechanism while it is smaller by
a factor of (?0-Vad) for the radiative Cowling mechanism. Although none of the mecha-
nisms can be neglected, the K mechanism and the convective Cowling mechanism are the
main destabilizing mechanisms of pulsation in a convective star. Various approximations have been employed in the derivation of equations (39) and (40).
Many of them should be easily avoided in numerical work of stellar stability.
The author thanks Dr Y. 0saki for discussion.
References. Ando, H., Osaki, Y. 1975, Publ. Astron. Soe.Japan, 27, 581 Baker, N., Kippenhahn, R. 1962, Z. Astrophys., 54, ll4 Biermann, L. 1932, Z. Astrophys., 5, 117 Boury, A., Gabriel, M°, Noels, A.,--Scuflaire, R.,Ledoux, P. 1975,Astron.Astrophys.
41, 279
Christy, R.F. 1964, Rev.Mod.Phys., 36, 555 Cox, J.P. 1963, Astrophys.J., 138, 487 Dilke, F.W.W., Gough, D.O. 1972, Nature, __240, 262 Dziembowski, W., Sienklewicz, R. 1973, Acta Astron.,23, 273 Gabriel, M., Scuflaire, R., Noels,A., Boury,A. 1975, Astron. Astrophys.,40, 33 Gough, D.0. 1976, Astrophys. J., 2i].4, 196 Graff, Ph. 1976) Astron. Astrophys., 49,299 Kato) S. ]968) Publ. Astron. Soc. Japan, __20, 59 Lighthill) M.J. 1952, Proc. Roy. Soc., A 211,564 Mark, J.W.-K. ]976, Astrophys. J., 206,]-~-~-- Nakano, T. 1972, Ann. Phys., 73, 326 Osaki, Y. 1974, Astrophys.J., ]89, 469 Parker, E.N. ]974, Solar Phys., 36,249 Roberts, B. ]976, Astrophys. J., 204, 268 Shibahashi, H., Osaki, Y., Unno)W', Puhl Astron. Soc. Sapan, 2__7, 40J Stein, R.F. 1967, Solar Phys., 2, 385 Unno, W. 1964, Transaction IAU XilB, Academic Press, New York, p.555. Unno, W. ]967, Publ. Astron. Soc. Japan, 19, ]40(ef.Proc.Astron.Soc.Australia,!,379 ,
(By definition f(n)(x) = f[f(n-])(x)] and f(]) E f) . The distance between the tans-
forms~f== x I and x 2 is multiplied after each application of f by a quantity larger than
min l~x 1 that is larger than I, thus it increases at least exponentially. This means
that, at some time, the two image points will be separated by the discontinuity of
f, and" the subsequent trajectories starting from xl or x 2 will be completely diffe-
rent from each other. This is a version of the mixing property : a small fluctuation
in the initial conditions yields,after some time, a huge difference in the arrival
points ; in other terms unless one knows the initial conditions with an infinite
accuracy, the motion is unpredictible after some ~n~te time.
Let us come back now to a more realistic description of the Lorenz attractor
I have already noted that it cannot be considered as a surface in the usual sense,
since two sheets cannot merge owing to the deterministic character of the equations
of the motion. To understand what really happens, it is only necessary to replace the
shaded line where the two sheets collapse by a small surface parallel to this line
(as a thin stick).Now the Poincar~ transform is no longer given by a function of one
variable, but by a plane transform: that is,given a starting point inside the stick,
one wonders what is the next crossing point of the trajectory inside this stick.Essen-
tially (although things are a little bit more complicated), the Poincar~ transform
looks very much like a Baker's transform (15) : the stick is first cut in two
pieces, (this cutting remembers of course very much the discontinuity in f), each
piece is stretched and the two resultant pieces are put together inside the stick
(see fig. 4). The transform of the coordinate parallel to the stick is very similar
to the one dimensional transform represented in fig. 3, but now the Poincar6 trans-
form has been made invertible as the coordinate perpendicular to x allows one to
~Actually Lasota and Yorke (Trans. of the A.M.S. 186, 481 (1973)) show that there
is an absolutely continuous invarlant measure for such f, and f is ergodic for this
measure.
343
distinguish between the arrival points with the same x. By a repeated action of the
transform pictured on Fig. 4, one obtains a section of the attractor, which is the "ob-
ject" stable under an infinite number of applications of Poincar~ transform. If one
cuts this section by a line perpendicular to the axis of the stick, one easily sees
that, after each application of the transform, the central part of the stick is de-
leted, then the central part of the remaining segments is deleted, and so on. This
is precisely the way in which one generates Cantor sets. Accordingly the Lorenz attrac-
tor has the structure of a Cat, tot set perpendicular to its "surface". It is a surface
with a number of sheets which has the power of a continuum.
A
E ~ I F ~ .......... I
Let us notice that these properties of the Lorenz attractor have actually not
been proved from a rigorous study of the system (1), although it is a very reasonable
extrapolation from the computer studies.
Another important feature of the Lorenz attractor is that it appears by an inver-
ted bifurcation from a pair of stable fixed points : in a domain of values of the
parameters (r,~,b), one reaches~from some initial conditlons,one of the stable fixed
points, or/and some other Lorenz attractor.This manner of occurrence of turbulence is
well known, for instance in Poiseuille flow(16); in a range of values of the
Reynolds number the laminar flow is linearly stable, but unstable against perturbations
with a finite amplitude, and at the upper limit of this domain the laminar flow
becomes linearly unstable. It must be stressed that the stability of convection
flows is much less well known than the one of the Poiseuille flow, so that it is not
$44
~lemr if the Lorenz system describes, even in a rough way, the bifurcation toward
turbulence of convective flows. From this point of view the existence (or the non
existence) of two sorts of flows for the same value of the Rayleigh number would be
an important test.
2. The theories of ,La~dau-HoFf and of Ruelle-Takens
Landau (17) and Hopf have given a quite convincing picture of the onset of turbu-
lence. In order to understand their idea, it is necessary to introduce the notion
of quasiperiodic funation. Let us consider a dynamical system with a periodic limit
cycle. The existence of such an oscillatory behaviour is known to occur (18) in con-
vective flows.
The theory (19) shows that, by a certain type of bifurcation (called the Hopf
bifurcation), this limit cycle may give rise to amotionwith two ~ncommensurate frequen-
cies, say ~| and ~2 (these frequencies are incommensurate if no non zero integers P and q
exist such thatp~| = q~2). Then any function of time in this flow should be ~asi-
periodic. To define such a function, let us consider a function of two variables,
say t I and t2, which is periodic of period 27 with respect to each of the~ variables:
f(t I + 2~n, t 2+ 2~m) = f(tl,t2)
whatever the integers n and m are.
From this function we may build the quasiperiodic function ~(t) = f(mlt,~2t). II
ml and ~2 are incommensurate, this function will appear (at least at first sight) as
completely choatic, although it is not chaotic in the sense of the mixing property.
Its frequency spectrum is concentrated at the frequencies m1' ~2' and
more generally at any linear combination p~! + q~2 with integer coefficients.
The extension of this construction to a function that depends periodically with
the period 2~ on any set of variables, say t!, t2,...,tn, allows one to define the
most general quasiperiodic function which has not the mixing property.
One must take care that such a quasiperiodic behaviour is not structurally stable
(except for the case of a single period). If a parameter as, say, the Rayleigh number
varies in the domain of quasiperiodic behaviour, then a periodic (and structurally
stable) limit cycle should be reached every time when ~I and ~2 (which both depend
on the Rayleigh number) are commensurate. The non periodic behaviour is reached for
~solated values of the parameter R only (20). The idea of Landau and Hopf is the a following: when the Rayleigh number increases many new bifurcations occur, which
always correspond to frequencies 1~commensurate with the already existing ones. Then,
one can show that, if the quasiperiodic function depends actually on all these fre-
quencies, then it tends toward a chaotic signal (in the sense of the mixing @roperty)
after an infinite number of frequencies have appeared.
345
One may wonder first about the validity of this theory, as it is unclear whether
an infinite or finite Rayleigh (or Reynolds) number is required for the occurrence
of an infinite number of bifurcations.
Landau and Lifshitz (2|) relate the number of "degre~of freedom" of a turbulent
motion with its Reynolds number R e . Asymptotically this number should increase as
(Re)9/4 , so that the existence of an infinite number of degrees of freedom requires
an infinite Reynolds number. They assume that each "degree of freedom" is actually
connected with the freedom in the choice of the phase at a bifurcation where a new
frequency appears. Although this notion of "degree of freedom" is widely used in
theoretical physics, the way in which the (R)9/4 formula of Landau Lifshitz counts the e
number of bifurcations between R = 0 and a large value of R is rather unclear. e e
On the other hand followlngRuelle and Takens (l) , the Onset of turbulenc% as dese;ibed
by Landau and Lifshitz, cannot be a '~eneric" phenomenon. They show that, after the
occurrence of a few non cormnensurate frequencies, the next bifurcation is toward a
non periodic attractor. This non-perlodic attractor is built as follows : let us
consider the case of a quasiperiodic motion with four ~ncommensurate frequencies, if
one represents the trajectory by the motion of a point in a four dimensional space,
it intersects a 3-dimensional hyperplane (i.e. a usual 3-d space) following a full
torus. Again one considers that the motion describes a one to one application of
this torus into itself (that is, each point inside the torus has an image which is
the next crossing of the trajectory with the hyperplane). It is possible (22) to
find a transformation of the torus into itself that is continuous, invertible , and
which transforms the torus into a strange attractor after an infinite number of
applications (see fig, 5). This attractor is also structurally stable.
Therehave been attempt~ 23) to find if this picture of the onset of turbulence is
valid for Taylor instabilities, it is not completely clear whether the experimental
findings are or not in agreement with this theory.
346
3. Approach of chaos by succesivebifurcationfis
For some values of the parameters, the Lorenz system (I) has one (or two) perio-
dic limit cycle(24~ When a parameter such as r varies in this domain, this limit cycle
becomes very rapidly much more complicated by a mechanism of "cascading bifurcations'~
These bifurcations are rather striking as they describe a smooth transition from a
periodic limit cyclewith a single period toward a strange (i.e. non-periodic) attractor.
For r ~ rl, the period of the limit cycle is, say T ; for r just below rl(bi-
fureation point)the period is 2T ; but as the motion is anharmonic, the amplitude of
the Fourier component of frequency |/2T start from a zero value at rl, then increases
continuously as r becomes smaller than r I . In order to make clear possibility of this
mechanism of frequency division, it is enough to draw a closed trajectory (= the
limit cycle) in the space of thevariables x,y,z (Fig. 6), when r becomes just a little
smaller than rl, this closed curve with singl~ orb becomes a closed curve with two
orbs, as drawn (approximately)in fig. 7 .
i
J
/
347
There is a series of such splitting of the limit cycle when r decreases, so
that the period of the limit cycle becomes 2T, 4T, ... 2nT. There is an infinite num-
ber of such bifurcations when r decreases from r I to, say r . The limit r~ is appro-
ached in a geometric fashion : let r be the value of r which the limit cycle of 2 n
period 2nT becomes unstable giving birth to a limit cycle of period 2n+]T, then the
quantity (r2n - r )/(r2n+l-r~itends to a limit as n increases, which is an universal
number (= independent of the detail of the equations) approximately equal to
0.214I 693. At the end of these bifurcations (i.e. at r , and for lower values ofr )
the period of the motion is infinite,which means that this motion is chaotic. Actually,
the recurrence time for a given point on the limit cycle is the period. It is easy to
see that the autocorrelation function for such a periodic system musttake the
same finite value at separation time of one period, two periods,... N periods, which
forbids it to tend to zero at infinite separation times (so that a periodic system
is obviously not ahaot~c). On the contrary, if the period of the motion is infinite,
the autoeorrelation function may tend to zero for large separation times, and the
motion may be chaotic.
At the present time,these are only theoretical examples for this occurrence of
chaos by an infinite number of bifurcations in a finite domain of variation of the
parameters. However this is probably (25) the best understood case.
CONCLUSION
Even if the proofs of theorems are quite remote, there is some hope at the pre-
sent time for understanding the way in which turbulence may occur in flows (convective
or not). However let us emphasize again that this approach leaves aside the question
of spatial chaos. Further studies in this domain are needed in order to know
whether the spatial chaos Dccurs or not in the infinite Rayleigh (or Reynolds) number
limit,as required by the Landau theory of turbulence.
REFERENCES
I. D. Ruelle, F. Takens, Comm. Math. Phys. 2iO , 167 (I971)
2. P.R. Halmos, Ergodic theory, Chelsea Pub. Comp. New York (1956)
3. R. May, Nature 261, 459 (1976)
4. T.Y. Li and J.A. Yorke, Am. Math. Month. 8_~2, 985 (1975)
5. T. Rikitake, "Electromagnetism and the earth interior" Elsevier (1968)
Vol. 433: W. G. Faris, Sslf-Adjoint Operators. VII, 115 pages. 1975.
Vol. 434; P. Brenner, V. Thom(~e, and L. B. Wahlbin, Besov Spaces and Applications to Difference Methods for Initia/Value Problems. II, 154 pages. 1975.
Vol. 440: R. K. Getoor, Markov Processes: Ray Processes and Right Processes. V, 118 pages. 1975.
Vol. 442: C. H. Wilcox, Scattering Theory for the d'Alembert Equation in Exterior Domains. III, 184 pages. 1975.
Vol. 446: Partial Differential Equations and Re}ated Topics. Proceedings 1974. Edited by J. A. Goldstein. IV, 389 pages. 1975.
VoI. 446 : Spectral Theory and Differential Equations. Proceedings 1974. Edited by W. N. Everitt. XlI, 321 pages. 1975.
Vol. 449: Hyperfunctions and Theoretical Physics. Proceedings 1973. Edited by F. Pham. IV, 218 pages. 1975.
Vol. 458: P. Waltera, Ergodic Theory - Introductory Lectures. VI, 198 pages. 1975.
VoL 459: Fourier Integral Operators and Partial Differential Equa- tions. Proceedings 1974. Edited by J. Chazarain. VI, 372 pages. 1975.
Vo]. 461 : Computational Mechanics. Proceedings 1974. Edited by J. T. Oden. VII, 326 pages. 1975.
Vol. 464 : C. Rockland, Hypoellipticity and Eigenvalue Asymptotics. III, 1 71 pages. 1975.
VoL468: Dynamical Systems- Warwick1974. Proceedings 1973/74. Edited by A. Manning. X, 405 pages. 1975.
Vol. 470: R. Bowen, Equilibrium States and the Ergodic Theory of Anosov Diffeomorphisms. III, 108 pages. 1975.
VoL 474: Seminaire Pierre Lelong (Analyse) Annie 1973/74. Edit~ par P. Lelong. Vl, 182 pages. 1975.
VoI. 484: Differential Topology and Geometry. Proceedings 1974. Edited by G. P. Joubert, R. P. Mcussu, and R. H. Roussarie. IX, 287 pages. 1975.
VoI. 497 : H. M. Reimann und 1". Rychener, Funktionen beschr;~nkter mittlerer Oszillatlon. VI, 141 Seiten. 1975.
Vol. 489: .1. Bair and R. Fourneau, Etude G~cm~rique des Espaces Vectoriels. Une Introduction. VII, 185 pages. 1975.
Vol. 490: The Geometry of Metric and Linear Spaces. Proceedings 1974. Edited by L M. Kelly. X, 244 pages. 1975.
VoL 503: Applications of Methods of Functional Analysis to Problems in Mechanics. Proceedings 1975. Edited by P. Germain and B. Nayroles. XlX, 531 pages. 1976.
VoL 507: M. C. Reed, Abstract Non-Linear Wave Equations. VI, 128 pages. 1975.
V.cI. 509: D. E. Blair, Contact Manifolds in Riemannian Geometry. VI, 146 pages. 1976.
Vol. 515: B,4cklund Transformations. Nashville, Tennessee 1974. Proceedings, Edited by R. M. Miura. VIII, 295 pages. 1976.
VoL 516: M. L. Silverstein, Boundary Theory for Symmetric Markov Processes. XV!, 314 pages. 1976.
Vol. 518: S6minaire de Th~orie du Pctentiel, Proceedings Paris 1972-1974. Edit~ par F. Hirsch et G. MokobodzkL VI, 275 pages. 1976.
VoL 522: C. O. Bloom and N. D. Kazarinoff, Short Wave Radiation Problems in Inhomogeneous Media: Asymptotic Solutions. V, 104 pages. 1975,
Vol. 523: S. A. AIbeverio and R. J. Heegh-Krohn, Mathematical Theory of Feynman Path Integrals. IV, 139 pages. 1976.
Vol. 524: S6minaira Pierre Lelong (Analyse) Ann6e 1974/75. Edit6 par P. Lelong. V, 222 pages. 1976.
Vol. 525: Structural Stability, the Theory of Catastrophes, and Applications in the Sciences. Proceedings 1975. Edited by P. Hilton. VI, 408 pages. 1976.
Vol. 526: Probability in Banach Spaces. Proceedings 1975. Edited byA. Beck. VI, 290 pages. 1976.
Vol. 527: M. Denker, Ch. Grillenberger, and K. Sigmund, Ergodic Theory on Compact Spaces. IV, 360 pages. 1976.
Vol. 532: Th(~orie Ergodique. Proceedings 197311974. Edit~ par J.-P. Conze and M. S. Keane. VIII, 227 pages. 1976.
Vol. 543: Nonlinear Operators and the Calculus of Variations, Bruxelles 1975. Edited by J. P. Gossez, E. J. Lami Dozo, J. Mawhln, and L. Waelbroeck, Vii, 237 pages. 1976.
Vol. 552: C. G. Gibson, K. Wirthm011er, A. A. du Plessis and E. J. N. Looijenga. Topological Stability of Smooth Mappings. V, 155 pages. 1976.
Vol. 556: Approximation Theory. Bonn 1976. Proceedings. Edited by R. Schaback and K. Scherer. VII, 466 pages. 1976.
Vol. 559: J.-P. Caubet, Le Mouvement Brownlen Relativists. IX, 212 pages. 1976.
Vol. 561: Function Theoretic Methods for Partial Differential Equations. Darmstadt 1976. Proceedings. Edited by V. E. Meister, N. Weok and W. L. Wendland. XVlII, 520 pages. 1976.
Vol. 564:.Ordinary and Partial Differential Equations, Dundee 1976. Proceedings. Edited by W. N. Everitt and B. D. Sleeman. XVIII, 551 pages, 1976.
Vol. 565: Turbulence and Navier Stokes Equations. Proceedings 1975. Edited by R. Tomato. IX, 194 pages. 1976.
Vol. 566: Empirical Distributions and Processes. Oberwolfach 1976. Proceedings. Edited by P, Gaenssler and P. R~v~sz. VIi, 146 pages. 1976.
Vol. 570: Differential Geometrical Methods in Mathematical Physics, Bonn 1975. Proceedings. Edited by K. Bleul~r and A. Reetz.. Vltl, 576 pages. 1977.
VoL 572: Sparse Matrix Techniques, Copenhagen 1976. Edited by V, A, Barker. V, 164 pages. 1977.