Translation:On a Heuristic Point of View about the Creation and Conversion of Light

From Wikisource
Jump to navigation Jump to search
On a Heuristic Point of View about the Creation and Conversion of Light (1905)
by Albert Einstein, translated from German by Wikisource
59468On a Heuristic Point of View about the Creation and Conversion of Light1905Albert Einstein

6. On a Heuristic Point of View about the Creation and Conversion of Light

Maxwell's theory of electromagnetic processes in so-called empty space differs in a profound, essential way from the current theoretical models of gases and other matter. On the one hand, we consider the state of a material body to be determined completely by the positions and velocities of a finite number of atoms and electrons, albeit a very large number. By contrast, the electromagnetic state of a region of space is described by continuous functions and, hence, cannot be determined exactly by any finite number of variables. Thus, according to Maxwell's theory, the energy of purely electromagnetic phenomena (such as light) should be represented by a continuous function of space. By contrast, the energy of a material body should be represented by a discrete sum over the atoms and electrons; hence, the energy of a material body cannot be divided into arbitrarily many, arbitrarily small components. However, according to Maxwell's theory (or, indeed, any wave theory), the energy of a light wave emitted from a point source is distributed continuously over an ever larger volume.

The wave theory of light with its continuous spatial functions has proven to be an excellent model of purely optical phenomena and presumably will never be replaced by another theory. Nevertheless, we should consider that optical experiments observe only time-averaged values, rather than instantaneous values. Hence, despite the perfect agreement of Maxwell's theory with experiment, the use of continuous spatial functions to describe light may lead to contradictions with experiments, especially when applied to the generation and transformation of light.

In particular, black body radiation, photoluminescence, generation of cathode rays from ultraviolet light and other phenomena associated with the generation and transformation of light seem better modeled by assuming that the energy of light is distributed discontinuously in space. According to this picture, the energy of a light wave emitted from a point source is not spread continuously over ever larger volumes, but consists of a finite number of energy quanta that are spatially localized at points of space, move without dividing and are absorbed or generated only as a whole.

Subsequently, I wish to explain the reasoning and supporting evidence that led me to this picture of light, in the hope that some researchers may find it useful for their experiments.

A certain problem concerning the theory of "black body radiation".


We begin by applying Maxwell's theory of light and electrons to the following situation. Let there be a cavity with perfectly reflecting walls, filled with a number of freely moving electrons and gas molecules that interact via conservative forces whenever they come close, i.e., those collide with each other just as gas molecules in the kinetic theory of gases.[1]

In addition, let there be a number of electrons bound to spatially well-separated points by restoring forces that increase linearly with separation. These electrons also interact with the free molecules and electrons by conservative potentials when they approach very closely. We denote these electrons, which are bound at points of space, as "resonators", since they absorb and emit electromagnetic waves of a particular period.

According to the present theory of the generation of light, the radiation in the cavity must be identical to black body radiation (which may be found by assuming Maxwell's theory and dynamic equilibrium), at least if one assumes that resonators exist for every frequency under consideration.

Initially, let us neglect the radiation absorbed and emitted by the resonators and focus instead on the requirement of thermal equilibrium and its implications for the interaction (collisions) between molecules and electrons. According to the kinetic theory of gases, dynamic equilibrium requires that the average kinetic energy of a resonator equal the average kinetic energy of a freely moving gas molecule. Decomposing the motion of a resonator electron into three mutually perpendicular oscillations, we find that the average energy of such a linear oscillation is

where R is the absolute gas constant, N is the number of "real molecules" in a gram equivalent and T is the absolute temperature. Because of the time averages of the kinetic and potential energy, the energy is ⅔ as large as the kinetic energy of a single free gas molecule. Even if something (such as radiative processes) causes the time-averaged energy of a resonator to deviate from the value , collisions with the free electrons and gas molecules will return its average energy to by absorbing or releasing energy. Hence, in this situation, dynamic equilibrium can only exist when every resonator has an average energy .

We apply a similar consideration now to the interaction between the resonators and the ambient radiation within the cavity. For this case, Planck has derived the necessary condition for dynamic equilibrium [2]; treating the radiation as a completely random process.[3]

He found:

Here, is the average energy of a resonator of eigenfrequency ν (per oscillatory component), L is the speed of light, ν is the frequency, and ρν is the energy density of the cavity radiation of frequency between ν and ν + .

If the net radiative energy of frequency ν is not to continually increase or decrease, the following equality must hold

or, equivalently,

This condition for dynamic equilibrium not only lacks agreement with experiment, it also eliminates any possibility for equilibrium between matter and aether. The wider the range of frequencies of the resonators is chosen the bigger the radiation energy in the space becomes, and in the limit we obtain:

Planck's Derivation of the Fundamental Quantum


In the next section we want to show that the determination that Mr. Planck gave of the elementary quanta is to some extent independent of the "black body radiation" theory that he created.

Planck's formula [4] for ρν that suffices for all experiments so far goes


In the limit of large values of T/ν, that is for large wavelengths and radiation densities, this formula approaches the form:

One recognizes that this formula is the same as the one that was derived from Maxwell theory and electron theory. Equating the coefficients of the formula's:


that is, a hydrogen atom weighs 1/N gram = 1.62·10-24g. This is precisely the value found by Mr. Planck, which is in satisfactory agreement with values obtained in other ways.

This brings us to the conclusion: the larger the energy density and the wavelength of radiation the more suitable the theoretical basis that we used; but for small wavelengths and low radiation densities the basis fails completely.

In the following the "black body radiation" is to be considered in terms of what is experienced, without forming a picture of the creation and propagation of the radiation.

The Entropy of Radiation


The following discussion is contained in a famous work of Mr. Wien, and is only included here for the sake of completeness.

Let there be radiation taking up volume v. We assume that the observable properties of the radiation are determined completely when the radiation densities ρ(ν) are given for all frequencies. [5] Since we can regard radiations of different frequency as separable without doing work or transferring heat the entropy of the radiation can be expressed in the form

where φ is a function of the variables ρ and ν. φ can be reduced to a function of only one variable by expressing that the entropy of radiation between reflecting walls is not changed by adiabatic compression. We won't go into that however, but investigate right away how the function φ can be obtained from the radiation law of the black body.

In the case of "black body radiation" ρ is such a function of ν that for a given energy the entropy is a maximum, that is, that


From this it follows that for any choice of δρ as function of ν

Where λ is independent of ν. Thus is independent of ν

For the temperature increase of dT of a black body radiation of volume v = 1 the following equation is valid:

or, since is independent of ν:

Since dE is equal to the transferred heat, and the process is reversible we also have:

Equating formulas gives:

This is the black body radiation law. So it's possible to determine the black body radiation from the function φ. Conversely, through integration one can obtain φ from the black body radiation law keeping in mind that φ vanishes for ρ = 0.

Limiting law for the entropy of monochromatic radiation at low radiation density


Admittedly, the observations of "black body radiation" so far indicate that the law that Mr. Wien originally devised for the "black body radiation"

is not exactly valid. However, for large values of ν/T experiment completely confirms the law. We shall base our calculations on this formula, keeping in mind that the results will be valid within certain limitations only.

First, we get from this equation:

and then, using the relation obtained in the preceding section:

Let there be a radiation of energy E, with a frequency between ν and ν + . Let the radiation extend over volume v. The entropy of this radiation is:

We will limit ourselves to investigating the dependency of the radiation's entropy on the volume that is occupied. Let the entropy of the radiation be called S0 when it occupies the volume v0, then we get:

This equation shows that the entropy of monochromatic radiation of sufficiently low density varies with volume according to the same law as the entropy of an ideal gas or that of a dilute solution. In the following the equation just found will be interpreted in terms of the principle introduced by Mr. Boltzmann that says that the entropy of a system is a function of the probability of its state.

Molecular Theoretical investigation of the Volume Dependence of the Entropy of Gases and Dilute Solutions


In calculating Entropy on the grounds of molecular theory the word "probability" is often used in a meaning that isn't covered by the definition in probability theory. Especially the "cases of equal probability" are often set by hypothesis, where the applied theoretical representation is sufficiently definite to deduce probabilities without fixing them by hypothesis. I will show in a separate work that in considerations of thermal processes one obtains a complete result with the so-called "statistical probability". This way I hope to remove a logical difficulty that is in the way of fully implementing Boltzmann's principle. Here however only its general formulation and application in quite specific cases will be given.

When it's meaningful to talk about the probability of a state of a system, and additionally every increase of entropy can be described as a transition to a more probable state, the entropy S1 of a system is a function of the probability W1 of its instantaneous state. In the case of two systems S1 and S2, one can state:

If one considers these systems as a single system with entropy S and probability W, then:


The latter equation expresses that the states of the two systems are independent.

From these equations it follows:

and hence finally

The quantity C is also a universal constant; it follows from kinetic gas theory, where the constants R and N have the same meaning as above. Denoting the entropy at a particular starting state as S0, and the relative probability of a state with entropy S as W we have in general:

We now consider the following special case. Let a number (n) of movable points (for example molecules) be present in a volume v0, these points will be the subject of our considerations. Other than these, arbitrarily many other movable points can be present. As to the law that describes how the considered points move around in the space the only assumption is that no part of the space (and no direction) is favored over others. The number of the (first-mentioned) points that we are considering is so small that mutual interactions are negligible.

The system considered, which can be for example an ideal gas or a diluted solution, has a certain entropy. We take a part of the volume v0 with a size of v and we think of all n movable points displaced to that volume v, with otherwise no change of the system. Clearly this state has another entropy (S), and here we want to determine that entropy difference with the help of Boltzmann's principle.

We ask: how large is the probability of the last-mentioned state relative to the original state? Or, what is the probability that at some point in time all n independently moving points in a volume v0 have by chance ended up in the volume v?

For this probability, which is a "statistical probability" one obtains the value:

one derives from this, applying Boltzmann's principle:

It's noteworthy that for this derivation, from which the Boyle-Gay-Lussac law and the identical law of osmotic pressure can be easily derived thermodynamically [6], there is no need to make any assumption regarding the way the molecules move.

Interpretation of the Volume Dependence of the Entropy of Monochromatic Radiation using Boltzmann's Principle


In paragraph 4 we found for the dependence of Entropy of the monochromatic radiation on volume the expression:

This formula can be recast as follows:

Comparing this with the general formula that expresses Boltzmann's principle

we arrive at the following conclusion:

If monochromatic radiation of frequency ν and energy E is enclosed (by reflecting walls) in the volume v0, then the probability that at an arbitrary point in time all of the radiation energy located in a part v of the volume v0 is:

Subsequently we conclude:

In terms of heat theory monochromatic radiation of low density (within the realm of validity of Wien's radiation formula) behaves as if it consisted of independent energy quanta of the magnitude Rβν/N.

We also want to compare the average magnitude of the energy quanta of the "black body radiation" with the mean average energy of the center-of-mass-motion of a molecule at the same temperature. The latter is 3/2(R/N)T, and for the average energy of the Energy quanta Wien's formula gives:

The fact that monochromatic radiation (of sufficiently low density) behaves as regards to dependency of entropy on volume like a discontinuous medium that consists of energy quanta of magnitude Rβν/N suggests we should investigate whether the laws of generation and transformation of light are what they must be if light consisted of such energy quanta. In the following we will address that question.

Stokes' Rule


Let monochromatic light be transformed by photoluminence into light of another frequency, and let it be assumed that according to the result just obtained the generating as well as the generated light consists of energy quanta of magnitude (R/N)βν, where ν is the corresponding frequency. The transformation process can then be interpreted as follows. Each generating energy quantum of frequency ν1 is absorbed and generates—at least with sufficiently small density of the generating energy quanta—by itself a light quantum of frequency ν2; possibly other light quanta of frequency ν3, ν4 etc. as well as other form of energy (e.g heat) can be generated simultaneously. Through which intermedia processes the final result comes about is immaterial. If the photoluminescing substance isn't a continuous source of energy it follows from the energy principle that the energy of the generated energy quanta are not larger than the generating light quanta; therefore the following relation must hold:


As is well known this is Stokes' rule.

Especially noteworthy is that with weak illumination the amount of generated light must, other circumstances being equal, be proportional to the amount of exciting light, since every incident energy quantum will cause one elementary process of the above indicated kind, independent of the action of other exciting energy quanta. In particular there will be no lower limit of the intensity of the exciting light below which the light would be incapable of exciting light.

According to the way the understanding of the phenomena is laid down here deviations from Stokes' rule are conceivable in the following cases:

  1. When the number of energy quanta per unit of volume that are simultaneously involved in the transformation is so large that the energy quantum of the generated light can receive the energy of several exciting energy quanta.
  2. When the generating (or generated) light does not have the energy characteristics of "black body radiation" that is in the realm of validity of Wien's law, when for instance the exciting light is generated by a body of such high temperature that for the wavelengths considered Wien's law is no longer valid.

The last mentioned possibility merits special attention. According to the developed understanding it cannot be excluded that a "non-Wienian radiation", even in high dilution, would behave energetically differently from a "black body radiation" within the validity range of Wien's law.

On the Generation of Cathode Rays by Illumination of Solid Bodies


The usual understanding, that the energy of light is distributed over the space through which it travels in a continuous way encounters extraordinarily large difficulties in attempts to explain photo-electric phenomena, as has been presented in the groundbreaking article by Mr. Lenard. [7].

According to the understanding that the exciting light consists of energy quanta of energy (R/N)βν the generation of cathode rays by light can be conceived as follows. Quanta of energy penetrate the surface layer of the solid, and their energy is transformed, at least partially, in kinetic energy of electrons. The simplest picture is one where the light quantum gives its entire energy to a single electron; we assume that this will occur. However, it must not be excluded that electrons accept the energy of light quanta only partially. An electron that has been loaded with kinetic energy will have lost some of its energy when it arrives at the surface. Other than that we must assume that on leaving the solid every electron must do an amount of work P (characteristic of that solid). Electrons residing right at the surface, excited at right angles to it, will leave the solid with the largest normal velocity. The kinetic energy of such electrons is

If the body is charged to a positive potential Π and surrounded by conductors with potential zero and Π is just enough to prevent loss of electricity by the body, then we must have:

where ε is the electrical mass of the electron, or

where E is the charge of one gram equivalent of a single-valued ion and P' is the potential of this amount of negative electricity with respect to this body. [8]

If we set E = 9.6·103, then Π·10-8 is the potential in volts that the body will attain when it is irradiated in vacuum.

To see now whether the derived relation agrees with experiment to within an order of magnitude we set P' = 0, ν = 1.03·1015 (corresponding to the ultraviolet limit of the solar spectrum), and β = 4.866·10-11. We obtain Π·107 = 4.3 Volt, which agrees to within an order of magnitude with the results of Mr. Lenard. [9]

If the formula derived is correct, then Π, as a function of frequency of the excited light represented in Cartesian coordinates, must be a straight line, whose inclination is independent from the nature of the substance investigated. As far as I can see no contradiction exists between our understanding and the properties of photo-electric action observed by Mr. Lenard. If each energy quantum of the exciting light releases its energy independently from all others to the electrons, the distribution of velocities of the electrons, which means the quality of the generated cathode radiation, will be independent of the intensity of the exciting light; the number of electrons that exits the body, on the other hand, will, in otherwise equal circumstances, be proportional to the intensity of the exciting light. [10]

We expect that limits of validity of these rules will be similar in nature to the expected deviations from Stokes' rule.

In the preceding it has been assumed that the energy of at least some of the energy quanta of the generating light is transferred completely to a single electron. If one does not start with that natural supposition then instead of the above equation one obtains:

For cathode-luminescence, which constitutes the inverse process of the one just examined, one obtains by way of analogous consideration:

For the materials investigated by Mr. Lenard PE is always significantly larger than Rβν, as the voltage that the cathode rays have had to traverse to generate even visible light is in some cases several hundred, in other cases thousands of volts. [11]

Ionization of Gases by Ultraviolet Light


We have to assume that in ionization of a gas by ultraviolet light always one absorbed light energy quantum is used for the ionization of just one gas molecule. Firstly it follows that the ionization energy (that is, the theoretically necessary energy to ionize) of a molecule cannot be larger than the energy of an absorbed light energy quantum. Taking J as the (theoretical) ionization energy per gram equivalent, we have:

According to Lenard's measurements for air the largest wavelength that has an effect is about 1.9·10-5 cm, so

An upper limit for the ionization energy can also be obtained from the ionization voltage in rarefied gases. According to Stark [12] the smallest measured ionization voltage (for platinum anodes) is for air about 10 volt. [13] We have thus for J an upper limit 9.6·1012, which is nearly the same as the one just found. There is another consequence that in my mind is very important to verify. If every light energy quantum ionizes one molecule then the following relation must exist between the absorbed quantity of light L and the number j of thereby ionized gram molecules:

If our understanding reflects reality this relation must hold for every gas that (at the particular frequency) has no absorption that isn't accompanied by ionization.

Bern, march 17, 1905

  1. This assumption is equivalent to the condition that the mean kinetic energies of gas molecules and electrons are equal to each other when there is thermal equilibrium. As is known, using this condition Mr. Drude has theoretically derived the relation between thermal and electric conductivity of metals.
  2. M. Planck, Ann. d. Phys. 1 p.99. 1900.
  3. This condition can be formulated as follows. We expand the Z-component of the electric force (Z) in a given point in the space between the time coordinates of t=0 and t=T (where T is a large amount of time compared to all the vibration periods considered) in a Fourier series
    where and . Performing this expansion arbitrarily often with arbitrarily chosen initial times yields a range of different combinations for the quantities Aν and αν. Then for the frequencies of the different combinations of the quantities Aν and αν there are the (statistical) probabilities dW of the form:
    The radiation is then as unordered as imaginable, if
    That is if the probability of a particular value of A and α respectively is independent of the value of other values of A and x respectively. The more closely the demand is satisfied that the separate pairs of values Aν and αν depend on the emission and absorption process of separate resonators, the more closely will the examined case be one of being as unordered as imaginable.
  4. M. Planck, Ann. d. Phys. 4. p.561. 1901.
  5. This is an arbitrary assumption. The natural course of action is to stay with this simplest assumption until experiment forces us to abandon it.
  6. If E is the energy of the system, then one obtains:
  7. P. Lenard, Ann. d. Phys. 8. p.169 u. 170. 1902.
  8. If one assumes that in order to release an electron from a neutral molecule light must do a certain amount of work then one doesn't have to change the derived relation; one only has to think of P' as the sum of two terms.
  9. P. Lenard, Ann. d. Phys. 8. p165. u. 184 Taf. I, Fig.2 1902.
  10. P. Lenard, l. c. p.150 und p. 166-168.
  11. P. Lenard, Ann. d. Phys. 12. p.469. 1903.
  12. J. Stark, Die Elektricität in Gasen p. 57. Leipzig 1902.
  13. within the gas the ionization voltage for negative ions is nonetheless five times larger

 This work is a translation and has a separate copyright status to the applicable copyright protections of the original content.


This work is in the public domain in the United States because it was published before January 1, 1929.

The longest-living author of this work died in 1955, so this work is in the public domain in countries and areas where the copyright term is the author's life plus 68 years or less. This work may be in the public domain in countries and areas with longer native copyright terms that apply the rule of the shorter term to foreign works.

Public domainPublic domainfalsefalse


 The standard Wikisource licenses apply to the original work of the contributor(s).

This work is licensed under the terms of the GNU Free Documentation License.

The Terms of use of the Wikimedia Foundation require that GFDL-licensed text imported after November 2008 must also be dual-licensed with another compatible license. "Content available only under GFDL is not permissible" (§7.4). This does not apply to non-text media.

Public domainPublic domainfalsefalse

This work is released under the Creative Commons Attribution-ShareAlike 4.0 International license, which allows free use, distribution, and creation of derivatives, so long as the license is unchanged and clearly noted, and the original author is attributed—and if you alter, transform, or build upon this work, you may distribute the resulting work only under the same license as this one.

Public domainPublic domainfalsefalse