Kepler's Laws
We start here with a study of “celestial mechanics”, examining the interaction of bodies through gravitational forces. The fundamental problem at hand is to determine the trajectory of a particle. First, we’ll focus on finding the shape of the trajectory; in bound orbits, we’ll develop tools to find the position as a function of time.
The basic law underlying this is Newton’s law of universal gravitation, which is the classical description of gravity (we’ll see a bit of the modern modification to it—the study of general relativity—later on).
Consider two bodies of masses and , with position vectors and . The force exerted on body 2 due to body 1, described by Newton’s law of universal gravitation, is:
Here, is the universal gravitational constant, equal to .
The problem we’re going to tackle is Kepler’s two-body problem, which asks us to calculate and predict the motion of two bodies under the influence of their mutual gravitational attraction. The two bodies are assumed to be point masses, and the force acting on them is given by Newton’s law of gravitation.
We’ll see later what we can say about the other cases of the -body problem.
To solve for the trajectories of the two bodies, it is most convenient to work in the center of mass frame. The center of mass of the two-body system is given by
The position vectors of the two bodies with respect to the center of mass are given by
where is the total mass of the system, and is the vector joining the two bodies, also called the separation vector.
Working in the center of mass frame, , since there are no external forces. With this, we now only have to solve for the trajectory of the separation vector as a function of time. Note that the center of mass frame is an inertial frame, so we can apply Newton’s second law to it.
To simplify the algebra, we define the reduced mass of the two-body system as
such that . Recommended, we define the gravitational parameter as
The force acting on the two bodies are
Using Newton’s second law , we can write
Subtracting the two equations, we get
In the limit , we get that
We see that the trajectory of the separation vector is simply the trajectory of the smaller mass, , in the center of mass frame, while the larger mass is at rest. This is the case when one of the bodies is much heavier, for instance the motion of Earth around the Sun.
The separation vector also gives the trajectory of one of the bodies when viewed in the frame of the other body (which is a non-inertial frame).
Constants of Motion
There are a couple of observations we can make here to simplify the tasks. The first is to look for invariants—quantities that don’t change. These turn out to be quite helpful (and sometimes we can’t even solve the problem without them!). Because the force is along the line joining the two masses, it makes sense to think about the angular momentum, because the force is central in a sense.
In particular, because the force on the reduced mass is , the “angular momentum”, is constant.
The total orbital angular momentum is given by
where . The algebra is a little tricky, and offers no insight, so I’ve omitted it, but you should go through it once (you’ll actually get a term as well, but see the energy discussion later). Anyway, from our previous discussion, this is an invariant.
The total energy of the system is given by
but since the COM does not experience a net force, is constant, so we might as well discard it. Then our modified energy becomes,
which is an invariant because the gravitational force is conservative. Alternatively, is just 0 in the com frame, so the energy in the COM frame is simply this.
We define the specific angular momentum and specific energy as
Since both energy and angular momentum are invariants, so are and .
Since , lies perpendicular to the plane of motion, and hence defines the plane of motion, which must be fixed because is constant.
We define the eccentricity vector as
Taking its time derivative
we find that the eccentricity vector does not change with time, hence is also a constant of motion. It lies in the plane of motion () and points in the direction of the periapsis. The magnitude of the eccentricity vector is given by
You can derive the magnitude by taking the dot product with itself, and subbing in the specific energy. It is a little tedious, so I’ve omitted it for clarity.
I can’t quite give you a motivation for the eccentricity vector, there is some rhyme or reason to it, but justifying it is a great task in itself, so I must ask you to take my word for this.
The magnitude of the eccentricity vector (as we’ll see in a bit), determins the eccentricity of a planetary orbit, but this is sadly not very apparent from our definition, although suggestive from its name.
First Law
Kepler’s first law of planetary motion states that the orbit of a planet is an ellipse with the sun at one of the foci.
First, we can ask what the solutions for the trajectories of particles in such a system are. Also we’ll be using properties of ellipse somewhat freely for the next sections, so if you aren’t familiar with them, I’d recommend reading this wikipedia page. The sections of importance are mostly the cartesian co-ordinates one, and metric properties.
To do this we define a new quantity (we don’t necessarily need it, but its very useful later, so I might as well define it here), we call the angle between the eccentricity vector and the position vector the true anamoly, . We can use this to take the dot product:
Note that this, if you have seen it before, is the equation of a conic section in polar coordinates, with the focus being at origin, the eccentricity being and the length of the semi-latus rectum being , which gives the relation
The magnitude of determines the shape of the trajectory:
- : Circle
- : Ellipse
- : Parabola
- : Hyperbola
- : Straight line (degenerate conic section)
where the focus of the conic section lies at the origin of our system, i.e at the center of mass of the system.
The individual trajectories of the two bodies in the center of mass frame is therefore
Of course because the centre of mass is fixed, the trajectories of the two bodies however will be oriented opposite to each other in our frame, with an angle of radian between them, to keep the center of mass fixed.
We can actually determine the energy of the system can by just the eccentricity and length of semi-latus rectum of the trajectory. Consider the dot product, , using the magnitude of the eccentricity vector:
From here we can see that the sign of the total energy depends on the eccentricity of the trajectory.
- : An elliptical trajectory gives a negative total energy, and hence a bound orbit
- : A parabolic trajectory gives a zero total energy, and hence an unbound orbit
- : A hyperbolic trajectory gives a positive total energy, and hence an unbound orbit
First of all the sign of is the same as that of , of course. If we write down the expression for total energy:
The first term is non-negative. If , is unbounded for large values, because its fine for it be large, as the first term will accomodate for it. Contrastingly, this is not the case for as if grows too large, the last term goes , and the first term is , even though . Thus can’t grow too large, and is thus bounded.
You can also see this qualitatively in an energy diagram. For an extended discussion, see Kleppner & Kolenkow, An Introduction to Mechanics, 2nd ed, Section 10.4, page 382.
Therefore all bound orbits are ellipses, and all unbound orbits are hyperbolae or parabolae.
For our solar system, the orbits of the planets are bound. The sun is much more massive than any of the planets, hence is nearly at the center of mass, or the focus of the elliptical orbits of all planets, which proves the first law. Of course this isn’t exactly true, the orbits influence each other a bit, but its still good enough.
Second Law
Kepler’s second law of planetary motion states that a line segment joining a planet and the sun sweeps out equal areas during equal intervals of time.
The proof of it relies on the fact that the cross product of two vectors gives the area of a parallelogram possessing sides of those vectors. Hence, the triangular area swept out in a short period of time is given by half the cross product of the and vectors, for some short piece of the orbit, .
which is constant. Thus, we are done.
Johannes Kepler’s laws improved the model of Copernicus. According to Copernicus:
- The planetary orbit is a circle with epicycles.
- The Sun is approximately at the center of the orbit.
- The speed of the planet in the main orbit is constant.
Despite being correct in saying that the planets revolved around the Sun, Copernicus was incorrect in defining their orbits. Introducing physical explanations for movement in space beyond just geometry, Kepler correctly defined the orbit of planets as follows:
- The planetary orbit is not a circle with epicycles, but an ellipse.
- The Sun is not at the center but at a focal point of the elliptical orbit.
- Neither the linear speed nor the angular speed of the planet in the orbit is constant, but the area speed (closely linked historically with the concept of angular momentum) is constant.
Third Law
Kepler’s third law of planetary motion states that the square of the period of revolution of a planet is directly proportional to the cube of the semi-major axis of its orbit.
Consider two bodies moving in bound elliptical orbits, with the semi-major axis of the orbit being , and the semi-minor axis being . The lengths of the two axes are related by the eccentricity of the orbit, , as
The area of an ellipse is . Moreover, the length of the semi-latus rectum is , hence because as we had shown in eqn. (), .
Integrating the equation of areal velocity, from the previous law we have:
where is the time period of revolution.
Rearranging, we get
where is the period of one revolution. This is known as Newton’s form of the third law. If we work in the units of AU, sidereal years, and solar masses, we have . Hence the equation simplifies to
In our solar system, , we get the relation . This gives us the third and final law!