This paper will discuss the various methods and techniques used to look for integral solutions of equations. We cover the relatively elementary methods that can be used to solve equations of degree one in the full generality way possible and methods to generate more solutions to equations of degree two given a particular solution. Further, we introduce more advanced techniques that can be used to study equations of degree three. These include introducing elliptic curves and defining a group law on them. We further explain how to convert elliptic curves to the so-called Weirstrass normal form, which can be manipulated more straightforwardly. Finally, we demonstrate these techniques by finding positive solutions to a famous equation.
Introduction
Diophantine equations are equations, typically polynomial, where one seeks integer or natural solutions. Diophantine refers to Diophantus of Alexandria, one of the first mathematicians to introduce symbolism to algebra. Famous equations include:
Fermat famously conjectured this to have no positive integer solutions for n>2n > 2n>2. A conjecture which stood unproven for 358 years before being proven by Andrew Wiles in 19941.
The smallest solution in distinct positive integers to this equation is .
When the English mathematician G. H. Hardy remarked to the Indian number theorist Srinivasa Ramanujan that his taxi-cab number, 1729, seemed rather dull; Ramanujan replied that it was interesting, as 1729 is the smallest number that can be expressed as a sum of positive cubes in two different ways.
However, while some Diophantine equations have been studied throughout history, the general case is impossible to solve. Hilbert’s 10th problem2 asked for a general algorithm that could determine whether a Diophantine polynomial has an integer solution. After 21 years of work by Martin Davis, Yuri Matiyasevich, Hilary Putnam, and Julia Robinson, they proved the DPRM theorem3, which shows such an algorithm cannot exist.
Solving Diophantine equations of degree higher than two was a breakthrough in 20th-century mathematics. This paper will give an introduction to this theory and demonstrate it by finding positive integral solutions of the following equation:
This equation was originally solved in a paper “An Unusual Cubic Representation Problem4 which we will be using as a reference.
Section 2 discusses when and how equations of degree one can be solved for as many variables as possible. Section 3 discusses a connection between finding rational and integer solutions that will greatly benefit us in solving higher-degree equations. Section 4 explains how to solve homogeneous equations of degree two in three variables, covering some ideas that will help us later. In Section 5, we move to degree three and explain the group law of elliptic curves (equation (1.1) gives rise to a third-degree equation after multiplying by the common denominator) and their normal form and demonstrate both (the group law and the normal form) by solving the abovementioned equation.
Linear Diophantine Equation
Definition 2.1. A Linear Diophantine equation in n variables is an equation of the form:
with the coefficients ,
, being non-zero, and we are looking for integer solutions
.
Lemma 2.2.
A linear Diophantine equation in variables has a solution if and only if
where
stands for the greatest common divisor of the
and
is the “divides” relation.
Proof. The result follows from the following generalization of Bèzout’s identity: for all non-zero integers and any integer
, there exist
such that
.
It is sufficient to show this for as then:
Now, consider the set:
This is the set of all positive linear combinations of the . Since
,
is a non-empty subset of
. This means that
has a least element (well-ordering principle), say
. We claim
. For any
, by the division algorithm, we have
,
, since
and
is the least element of
,
. But
, so
is a linear combination of the
. But
is not in
, so it must be non-positive. Since it’s also non-negative, it must be 0. Since the remainder on the division of
by
is 0,
. This holds for all such
, so
divides every element of
. Now note that for every
, one of the following holds:
- If
, then
and hence
.
- If
, then
and hence
.
So divides each of the
and hence
. But note
is a linear combination of the
, which means that
. It follows that
.
Now, we demonstrate a method to solve a linear Diophantine equation in two variables by example.
Example 2.3.
Find a solution to the equation over the integers.
First, note that and
. This shows that this equation has solutions. Now, we will use the Euclidean algorithm on 14 and 38.
By rearranging and discarding the last equation:
Using (2.3) and (2.4) we obtain:
Using (2.2) and (2.6) we get:
Now, we multiply this equation by :
Hence, ,
is a solution. While
is a solution to the previous equation, so is
. Let us now find all the solutions, not just one of them.
Lemma 2.4.
If , where
has
as a solution, then the family of all solutions is given by:
Proof. We note that all pairs of numbers of this form are solutions. We shall now show that all solutions are of this form.
We know is a solution and assume
is another solution. We have:
Subtracting these gives:
Rearranging and dividing by we obtain:
Since and
are coprime:
Hence, is of the required form, and by substituting it in the original equation, we can show that
is also of the corresponding form.
Similarly, we can analyze linear equations in more variables, as illustrated by the following example:
Example 2.5.
Solve over the integers.
Define . Hence,
. This is a linear equation in two variables, something we know how to solve. Solving this, we get:
Now, we can solve the equation . Treating
as a constant, we get:
Hence, we conclude that our family of solutions is:
Remark 2.6:
Note that when defining , we use
and not
. This is because, in the second case,
could only have been a multiple of
. So, some solutions to the equation in
and
would not have corresponded to the solutions of the original equation.
Projectivization
Before we move on to higher-degree equations, it is essential to understand some introductory projective geometry.
Definition 3.1:
A homogeneous equation is an equation such that if
is a solution, then so is
.
Example 3.2:
The equation is homogeneous as:
Polynomial equations are homogeneous if and only if all (non-zero) terms in the equation have the same degree. Suppose we have an -degree polynomial equation with integer coefficients in
variables as follows:
Multiply all terms of this equation of degree by
to get a homogeneous equation:
We call the original equation the affine form and the transformed equation the projective form. The process of converting an equation to a projective form is called homogenization. Since we will primarily be concerned with affine equations in two variables, we shall use the variable names instead of
for the remainder of this section.
Example 3.3:
A few examples of homogenization:
- The projective form of
is
.
- The projective form of
is
.
- The projective form of
is
.
The inverse process, de-homogenization, is straightforward. Just substitute , and we get the affine form.
Since is a solution to all projective equations, we call it a trivial solution and disregard it. Also, since multiplying a solution by a constant gives another solution, we consider all proportional solutions the same. For the remainder of this paper, whenever we talk of solutions of a projective equation, we mean non-zero solutions up to their multiples.
Now, we note the correspondence between the solutions of the projective and affine equations. We see that if is a solution of the affine form,
is a solution of the projective form. Conversely, if
is a solution to the projective form,
is a solution to the affine form, provided
. In particular, rational solutions to the affine equation correspond one-to-one, with a few exceptions, to integer solutions (after potentially multiplying by a common denominator) of the projective equation. This connection is precisely why these two forms will help solve Diophantine equations; it is often easier to convert a projective equation of affine form and look for rational solutions than to look for integer solutions directly.
Now, we discuss what lines are in projective geometry and how they are related to the usual affine definition of a line.
Definition 3.4:
A projective point is any non-zero ordered triplet defined up to its non-zero multiples.
Example 3.5:
,
, and
all represent the same projective point.
Definition 3.6:
A projective line is the set of all solutions to the projective equation (for
,
,
not all 0). Further, we call three projective points collinear if they lie on a projective line.
Remark 3.7:
If a point lies on a line
, it may be denoted as
.
Remark 3.8:
Multiplying the equation of a projective line by any non-zero constant does not change it.
Theorem 3.9:
Projectivization preserves incidence relations, that is if the projective point lies on the projective line
(
and
, not both 0), then the affine point
lies on the affine line
if
. If instead
then the affine line
has slope
(possibly infinite).
Proof:
If , we divide by it to obtain the required relation:
If instead , we get:
Since the slope of the line is
, we are done.
Theorem 3.10:
Given any two distinct projective points, there is a unique projective line through both of them; given any two distinct projective lines, there is a unique projective point that lies on both.
Proof: Let the two points be and
. Consider the line:
The coefficients of this equation are 0 iff , which contradicts our assumption that
and
are distinct, so this equation does represent a line. We note that
and
lie on
. Suppose both
and
lie on another line
. We will show that
and
are the same line. Since
and
lie on
, we have:
We multiply the first equation by and the second by
and subtract them from each other:
Rearranging, we get:
Similarly, we can show that:
Since the coefficients of and
are proportional, they represent the same line.
Now consider two distinct lines and
.
Consider the following point:
.
By a similar argument as before, the coordinates of are not all 0. So it is, in fact, a projective point. We see that it lies on both
and
. Now, suppose another point, say
, lies on both lines. We know that only a unique line can pass through two distinct points. But through
and
, two lines,
and
, pass. Hence,
must be the same point as
.
Remark 3.11:
Parallel lines in affine geometry intersect at a point with in projective geometry.
Definition 3.12:
A line is said to intersect a curve
with multiplicity
at a point
if substituting the equation of
in the equation of
allows us to factor
where
is an equation of a line that intersects
at
(assuming the equation of
is not a factor of the equation of
).
Example 3.13:
Find the multiplicity of the intersection of the curve and the line
.
Substituting the equation of the line into the curve, we get:
Also, note that and
intersect at
. Since
was our factor after the substitution, we say
intersects
at
with multiplicity two.
Remark 3.14:
A way exists to define multiplicity for the intersection of any two general curves, but it is not required here. It has been described under the name “Intersection Numbers” in Section 3.3 of Algebraic Curves.
Remark 3.15:
When considering intersections up to multiplicity, we mean that an intersection of multiplicity is counted
times.
Theorem 3.16:
A projective line has at most intersections with an
degree projective curve unless the line is a factor of the curve.
Proof: This follows from substituting the equation of the line into the curve and using a similar argument as that for the proof that non-constant one-variable polynomials have at most roots. The details are left to the reader.
Remark 3.17:
If we allow for complex numbers, then two projective curves of degree and
with their equations having no common non-constant factor have exactly
intersections up to multiplicity. This result is Bèzout’s Theorem, which can be found in Section 5.3 of Algebraic Curves.
Definition 3.18:
A line is said to be tangent to a curve
at the point
if
is the intersection point of
and
with a multiplicity of at least two.
Definition 3.19:
A point is said to be an inflection point on a curve
if the tangent at
to
is a triple tangent, that is, it intersects
at
with multiplicity three.
Equations of Degree Two
Definition 4.1:
A three-variable homogeneous 2nd-degree Diophantine equation is an equation of the form:
where .
We will solve this equation for integers and
.
We start by de-homogenizing it by setting and looking for rational points (for the solutions where
, we can set either
or
to be 1). This equation is significantly more complicated than a linear equation, so let us start by analyzing a simple case:
Example 4.2:
Find all the rational solutions to , that is, find all rational points on the unit circle.
Note that is a rational solution. A unique line with a real slope (non-vertical) joining them exists for any other point on this circle. Let this line be
. By substituting this in the equation of the circle, we get:
Expanding and simplifying, we obtain:
Factoring, we have:
Thus, or
Substitute into to get:
Thus, the solutions are or
It is clear that there is a one-to-one correspondence between points on the circle (except ) and real values of
. We claim that there is a one-to-one correspondence between rational points on the circle (except
) and rational values of
. We can see that if
, the corresponding point
will be a rational point. Conversely, if
is a rational point, then let and
. Note that
. Hence,
(assuming
).
Hence, all rational points on the unit circle are of the form
or . It is worth noting that
which is to be expected as represents the vertical tangent to the unit circle at
; that is, a line whose “second” intersection (multiplicity two) with the unit circle is
. For the coming examples, we will assume that
.
The main idea from this example is that, given one rational point on a curve, we can use it to find the other rational points. We claim that the same method will work in the general case.
Theorem 4.3:
For any curve with
which has no non-constant factors and a rational point
on that curve, the set of all rational points on that curve is the set of second intersections (up to multiplicity) of lines with rational slope through
with that curve. We assume that vertical lines have a rational slope.
Proof: The line joining any two rational points must have a rational slope. We shall now prove that if a line with a rational slope passes through , then its second intersection with the curve is rational. Let the line be:
It suffices to prove that the -coordinate of the second intersection is rational. Substitute the value of
in
and simplify to get:
where
Since satisfies this equation, by Vieta’s theorem,
. Hence, the
-coordinate of the second intersection is
, which is rational. (This works because
cannot be identically 0, because if it were, the equation would have the line as a factor, which we assumed was not possible).
Remark 4.4:
If our equation could be factored into two rational linear factors, we would have reduced it to linear equations, something we already know how to solve.
This theorem allows us to generate the entire family of solutions to equations of degree two given a particular solution.
Corollary 4.5.
If a line has rational intersections, up to multiplicity, with an
degree rational curve, then it has an
th intersection that is also rational.
Proof. Since , the line has
rational points on it and hence has a rational slope. Now, we can substitute the equation of the line in the curve and use Vieta’s theorem again to prove this result.
Remark 4.6.
The case where the line is a tangent can be checked by taking a derivative to find the tangent and noting that the derivative will be rational and the tangent will have an intersection with multiplicity two with the curve.
Let us use this theorem to find the rational points on a more complicated curve.
Example 4.7.
Find all rational points on the curve .
We note that is a rational point on this curve. Consider the line
which passes through
for
. By substituting the value of
in the curve, we obtain:
Hence, the second intersection of the line with the curve is at . Since it also lies on
, we get that the set of all rational points on this curve is given by:
for all . In particular, note that if
, this is
itself and if
, it is
.
However, finding a particular solution may sometimes be difficult or even impossible. Consider the following example:
Example 4.8.
Find all rational points on the curve .
But is not rational. Hence, this curve has no rational point.
In general, it is always possible to determine if a given rational curve of degree two has a rational point in a finite number of steps. This follows from the Hasse-Minkowski theorem5. However, we shall not discuss it here. Further details can be found in Section 1.1 of Rational Points on Elliptic Curves.
Equations of Degree Three and Elliptic Curves
For equations of degree one, we analyzed as general a case as possible. For equations of degree two, we restricted ourselves to homogeneous equations and analyzed them. For equations of degree three (the so-called cubic equations), we enter a territory that, while not uncharted, is very difficult to navigate. Here, we shall restrict ourselves to one particular problem, originally studied in the paper An Unusual Cubic Representation Problem. The problem is to find positive integer solutions to:
In this section, we will first discuss what a group is and how the set of solutions to this equation forms a group. Then, we shall convert this equation to a more straightforward form (the so-called Weierstrass normal form). Finally, we shall use the group law to generate rational solutions to the equation in the normal form until we get a positive solution.
Group Law
Similar to the 2nd-degree case, we wish for a way to generate more rational points on a 2nd-degree curve, given a rational point on it. Unfortunately, as the following example demonstrates, we can’t use the same technique.
Example 5.1.
Find the rational solutions to .
We note that is a rational point on the curve. Consider a line
for
. Substituting it into our equation, we get:
And here lies the problem: the second factor is of the second degree, so there is no guarantee that it has rational roots.
Remark 5.2.
By Fermat’s Last Theorem, the only rational points on this curve are those where one of the variables is 0, that is, ,
,
. Hence, the only
for which the 2nd (and 3rd) intersections are rational is
.
Instead, we will use (4.5) to define an operation on the set of rational points, which allows us to generate more points given any two. This operation will satisfy the group axioms.
Definition 5.3.
A set with a binary operation
is called a group if:
for all
(Associativity).
- There exists
such that for all
,
(Existence of an Identity element).
- For all
, there exists
such that
(Existence of Inverses).
Further, if (Commutativity) for all
, we say that the structure
is an abelian group.
Example 5.4.
with the addition operation is an abelian group, with identity
.
is a group with operation multiplication and identity
.
Now consider the projective curve:
where . Assume that the cubic is non-singular (every point has a unique tangent) and has no non-constant factors. Take the set of all rational points on
. Now consider an operation
on this set such that
is the third intersection of the line through
with
(if
, we consider the “3rd” intersection of the tangent at
). Note that:
This is because all three statements mean that are collinear.
Definition 5.5.
We define addition on the rational points on as
for some fixed rational point
on
.
Theorem 5.6.
The set of all rational points on with operation
as defined above, is an abelian group.
Proof. Most requirements of being a group follow immediately; associativity is the only one that requires more work.
- Commutativity: Since the line through
and
is the same as the line through
and
, we have:
- Identity: We claim
is the required identity element. Using (5.2) with
,
, and
, we have:
- Inverses: For any given
, we claim that the corresponding
is
. First, we note that:
Now, we use (5.2):
Using (5.2) again:
- Associativity: Consider three points
on
and let
and
. Let
be the line through
and
, and let
be the line through
and
. Let
and
, and let
be the line through
and
and
be the line through
and
.
Now, we assume that are all pairwise distinct. The case where any of the points are equal has been skipped here; we invite the reader to fill in the details. Finally, consider
and
. Let
be the line through
and
, and
be the line through
and
.

If we could prove that , we would be done as then the following holds:
For the sake of contradiction, assume this was not the case. Now consider a cubic curve formed by multiplying the equations of ,
, and
together:
(the equation of is the product of the equations of
,
, and
) and similarly:
A point lies on if it lies on one of
,
, or
, and since all of them intersect
at three points each, which is the maximum (3.16) (we assumed that the equation of
has no non-constant factors),
,
,
,
,
,
,
,
, and
are the only intersections of
and
. Notably,
does not lie on
; similarly,
does not lie on
. Hence,
and
are distinct curves.
Consider the family, say , of cubic curves that are a linear combination of
and
. That is, all the curves whose equations are of the form:
where is the cubic equation of
and
is the equation of
. While we appear to have two degrees of freedom with this family, that of
and
, we only have one since multiplying by a constant does not change the curve. Since
,
,
,
,
,
,
, and
lie on both
and
, they lie on all curves in this family. Hence, this family is a subset of the family
of all cubic curves through
,
,
,
,
,
,
, and
(including the 0 curve, which has the equation
and passes through all projective points). We claim both families are the same. We will show this by showing that
also has one degree of freedom. And if
and both have the same degrees of freedom, then
.
An arbitrary cubic curve in three variables has ten coefficients, so it seems we would need ten equations to specify it. However, since multiplying by a constant does not change the curve, we only need nine. Since is a family of all curves that pass through eight distinct points, we already have eight equations; hence, only one more is needed to specify the curve completely. That is to say, the elements of
have one degree of freedom.
Clearly, and hence
, so:
At ,
and
vanish and
doesn’t, so
must be 0. Similarly,
can be shown to be 0. But
is non-singular, so it can’t be the 0 curve. So, we have a contradiction, which means our assumption is false. Thus,
, and we are done.
Remark 5.7.
The argument for via degrees of freedom can be formalized as follows:
is a vector space of dimension two (here, we consider equations that differ by a constant multiple as different), and
and
are linearly independent elements of
. So
is spanned by
and
.
Remark 5.8.
An alternate method to prove associativity (and the rest of the group law) is deriving an explicit formula for and verifying the conditions using the formulas. This is not reasonable to do by hand, but software can do it for us. This can be used to check the special cases where two or more of
,
,
,
,
,
,
, and
are the same.
Remark 5.9.
,
, and
are collinear.
In the next section, we will be transforming between different forms of an equation, and each form will be a projective curve of degree three for which its group law can be defined. These group laws for each of these curves will be related in a way that preserves the structure of the group:
Definition 5.10.
If is a group with operation
and
is a group with operation
, then a function
is called a group homomorphism if for all
,
. That is,
preserves the group structure.
Homomorphisms map the identity of to the identity of
.
Example 5.11.
Consider the groups and
. The function
:
is a homomorphism between these groups.
Weierstrass Form
Now that we have a group law, we multiply out the denominators to convert our equation to a form that is easier to work with:
We must be careful that this doesn’t introduce any new solutions by multiplying by 0. Thankfully, that would require or
or
, which is impossible for positive integers.
Definition 5.12.
An elliptic curve is any projective curve of degree three in three variables with integer coefficients and at least one rational point on it.
Note that there exists a rational point on (5.3). Hence, it is an elliptic curve. For our purposes, the most crucial fact about elliptic curves is that there exists a change of coordinates that takes any elliptic curve
with a rational point
to the form:
Such that the transformation is a group homomorphism between the group









In general, such transformations are not simple. Say our transformation is . We wish for
to be a bijection since we want to return to our original coordinates later. Since the identity element goes to the identity element,
. Now consider the line
in the transformed coordinates. If we substitute it in
, we obtain that it intersects the curve thrice at
. That is,
is an inflection point. But it may be that
is not an inflection point. So, the pre-image of
cannot be a line because no line intersects
thrice at
. So
could be a transformation that doesn’t send lines to lines, and such transformations are more challenging to work with than ones that do.
Thankfully, we do not need to worry about this case as (5.2) has as an inflection point. We will show that in such cases, a “line preserving” transformation exists that sends it to the normal form. Transformations that send lines to lines are of a specific form, as shown below (for the rest of this section, all variables are assumed to be rational).
Definition 5.13.
A projective transformation is a transformation from a coordinate system to
of the following form:
where
Remark 5.14.
The last condition is there to ensure that the transformation is invertible since it corresponds to:
This condition is equivalent to the three lines:
not being concurrent (all three not passing through a single projective point). The inverse of a projective transformation is a projective transformation.
Remark 5.15:
Since they are invertible, a projective transformation may be stated by either expressing in terms of
or vice versa.
Theorem 5.16:
A projective transformation sends lines to lines.
This theorem is demonstrated by the example below; generalizing this example involves quite a bit of algebra and has been skipped here. The reader is invited to fill in the details.
Example 5.17:
Consider the transformation:
Consider the set of all points on the lines







Now consider the line . Notice that:
So is mapped to
.
Now consider the general case . We wish to express this in terms of
. For this, we will note that the inverse of our transformation is given by:
We substitute these in the equation of and get that
maps to the line:
Remark 5.18:
It is worth noting that a projective transformation is almost entirely specified by the pre-images of ,
, and
. We have further choices with the scale; say the pre-image of
is
, then
may be equal to
for any “scale factor”
. Since scaling all three pre-images means scaling our projective point, which doesn’t change it, we can always fix one of our scale factors as unity. This means that while we have nine coefficients in a projective transformation, we only have eight degrees of freedom.
It is clear that this result can be generalized. That is:
Corollary 5.19:
Projective transformations send curves of degree to curves of degree
.
Another simple fact that follows from projective transformations being bijections is that incidence relations are preserved.
Lemma 5.20:
A point lies on a curve
if
lies on
where
is a projective transformation.
Proof: This fact follows from substituting the equations defining in the equation of
.
Corollary 5.21:
Two projective curves and
intersect at a point
if and only if
intersects
at
where
is a projective transformation.
Remark 5.22:
Not only do projective transformations preserve intersections, but they also preserve the multiplicity of intersections. This has been stated in Section 5.1 of Algebraic Curves.
Proof: and
intersect at a point
if and only if
lies on both
and
.
lies on
if and only if
lies on
and similarly for
. So
lies on both
and
if and only if
lies on both
and
, which happens if and only if
and
intersect at
.
Armed with these facts about projective transformations, we are ready to search for a transformation that sends an elliptic curve with an inflection point to the normal form.
Let be an elliptic curve and
be a rational point on it, which is also an inflection point. Let
be the triple tangent line to
at
. We wish for our transformation to send
to
. Since
lies on
and
then
must lie on their pre-images. And since
is a triple tangent to the normal form at
, its pre-image must also be a triple tangent to
at
. Hence, the pre-image of
is
, the pre-image of
is some line through
except
, and since the three lines can’t intersect at a single point, the pre-image of
is some line not through
.
In fact, we don’t need any more conditions on the pre-images of and
. Only some transformations reduce the equation to normal form. Still, any transformation such that the pre-images satisfy the given conditions will significantly simplify the equation of the elliptic curve to a point where a few simple transformations can finish the job.
Suppose after performing one such transformation goes to:
By the nature of our transformations, is a triple tangent at
. Substituting
we get:
Since we can factor out (x^3) (i.e., (x = 0) is the only line with no “z” term that passes through (0, 1, 0)), (B_0 = C_0 = D_0 = 0). So is of the form:
Now, consider the further transformation:
This removes the

The only thing left to do is to make the coefficient of (x^3) and (y^2 z) as 1, which can be achieved by the final transformation:
Composing the three transformations and renaming the variables, we obtain a transformation that takes to the normal form. Note that the final transformation still takes (P) to (0,1,0) and (l) to (z=0). Since the final transformation is a composition of projective transformations, which all take lines to lines and preserve incidence relations and tangents, and our group law was defined using incidence relations alone, it is clear that this is a homomorphism between the groups
(technically, the set of rational points on
with identity element as (P) and (
, +) with identity (0,1,0).
Remark 5.23:
Now, let us perform these transformations on (5.2). We have the inflection point (-1,0,1) and differentiating shows that the tangent to the curve at (-1,0,1) is (6a – b + 6c = 0). This is the pre-image of (z = 0). We take the pre-image (x = 0) to be (a + c = 0) and that of (y = 0) to be (a + b = 0). So, our transformations are:
A general method for converting curves to normal form without any known inflection point has been described in Section 1.3 of Rational Points on Elliptic Curves.
The inverse transformation here is:
We substitute this into (5.3) to get:
Now, we do a partial version of the next step. We eliminate the (xyz) and terms, but not the
term. This is done for no reason other than the fact that removing the
term makes our equation more complicated instead of less, and there is nothing to be gained by removing the term. Our transformations are:
Substituting these in (5.4), we get:
We should ideally get rid of the 91 here, but once again, that will make stuff more complicated for no gain, so we instead get rid of the denominators by the transformation:
Substituting this in (5.5) and rearranging, we get:
Finally, we rename (x”), (y”), and (z”) as (x), (y), and (z):
The conversion formulas between (5.3) and (5.6) are:
The original paper on this equation chose a different “normal” form than ours. Instead of making the term 0, they made the
term 0. Consider the following transformation:
Applying this on (5.6) and clearing the denominators gives:
This is the form used in that paper6 after substituting (N=4).
It is worth noting here that normal forms are generally not unique. We can always use a non-projective transformation to obtain a normal form, but even if we don’t, we can choose which inflection point to send to (0,1,0). For our equation, we could have used (1,-1,0) (0,1,-1) instead of (-1,0,1) just as well. We don’t have uniqueness even if we fix the point we send to (0,1,0) and restrict ourselves to projective transformations. We could always compose it with the further transformation:
This would give a new transformation. However, this new transformation still sends the same lines to (x=0), (y=0), and (z=0). It just scales them differently. If we do not care about scale, then normal forms are unique. Formally:
Theorem 5.24:
If











for
and
.
Since and
must be triple tangents through
in their respective coordinates, their pre-images must both be the triple tangent through
. Thus,
. Now consider the transformation
, which converts from coordinates
to
and thus converts between the two normal forms. We wish to show that
and
because this would imply that
and
, which is precisely what we aim to show.
Let be:
What we wish to show is equivalent to . Since we have already shown
, which implies that
. Now let the normal form in
coordinates be:
After the transformation, it becomes:
Since this is a normal form, the coefficient of is non-zero, and that of
is
. That is,
and
. We now have:
Now consider that the coefficient of is non-zero, and that of
is
. So
and
. We now have:
Now consider that the coefficient of and
must be
, which leads to
and
, respectively. Hence, we are done.
Remark 5.25.
The coefficients of and
are the same in the normal form (the equation may be multiplied by any required rational to make them 1). So, we have
. Since projective points do not change on multiplication by a constant, neither do projective transformations. So, we may force
from which it can be shown to give
and
for some
.
Finding a positive solution to the equation
We are now ready to start with our hunt for a positive solution to the equation:
We will start with a rational point of (5.6) and compute
where
with
summands. We will then convert these back to points on (5.8) and see if
are all positive or all negative. In either case, we can multiply by a common denominator and potentially
so that all three coordinates become positive integers, which will be our solution.
The only thing left to do is pick an . We clearly can not start from
as
because it is the identity. It is tempting to instead start with
or
. But since both of these are inflection points, so are their images, say
and
. Since
and
are both inflection points:
So , which means we are in a loop. This means we can’t use this process recursively if we start from an inflection point. Another idea is to add
and
to get a new point. However, we note that
; hence, this idea also fails us. We have no choice but to find a new rational point on (5.8). The original paper4 on this equation found both
and
as other rational points and used the former to start generating points; we chose to use the latter.
and its image after (5.7)
will be our starting points.
Now we can start our search. We de-homogenize (convert to affine form) to simplify algebra by setting .
Since this removes from our curve, we must redefine our addition law. We have
. It follows from (3.9)
can still be defined as the 3rd intersection of the line through
and
with (5.9) assuming that
(we do not need to worry about this case) and
is the reflection of
across the x-axis. After de-homogenizing, our starting point becomes
.
We will find manually, which requires drawing a tangent to (5.9) at
, which can be obtained by differentiating (implicit) (5.9). The tangent comes out to be
. Substituting it back, we get the third intersection as
. Reflecting it about the x-axis, we get
. This corresponds to
on our original curve, not a positive solution.
Beyond this, we will use formulas to compute the remaining points. Since we will repeatedly add , we derive the formula for
where
. The line through
and
is given by
where
and
. Substituting this in (5.9), we get:
and by applying Vieta’s theorem to obtain the sum of roots and noting that two of the roots are and
we get the third root to be, say
. Further, we have
. Reflecting this across the x-axis, we get that
is the point
.
Now, we let the machine do the rest. It is worth graphing (5.9) to see how the positive solutions of (5.8) look.

The red curve is (5.9), the grey region corresponds to positive solutions of (5.8), and the points have been plotted in blue (2S and 4S lie outside the image, 13S has been plotted in purple). We see 13S lies in the grey region, so it is our positive solution. Converting it using (5.7) and multiplying out by the common denominator to obtain our required solution
where:
We leave it to the reader to verify by hand that this is a solution.
Further Questions
The reader is undoubtedly left with many questions, so we will try to summarize them here.
- Is this the smallest solution to (5.1)?
It is not so; the solution found in the original paper4 was smaller. In fact, they found the smallest positive solution. To prove that their solution is the smallest, knowledge of the rank of elliptic curves7 is needed, which is beyond the scope of this paper. - When can we find a positive solution to equations like (5.1) with 4 replaced by some N, and how does their size vary?
This question is more challenging to answer; the original paper4 managed to find some conditions of N; in particular, N can’t be odd, but finding necessary and sufficient conditions is more challenging. They also computed the sizes of positive solutions for.
- When does a curve of degree three have a rational point on it?
No known algorithm can determine if a curve of degree three has a rational point in a finite number of steps7. - How does one solve Diophantine equations of degree higher than three?
Beyond the third degree lies the deep ocean, where particular cases like Fermat’s Last Theorem1 have been studied, but the general is unknown. - What happens if we introduce more variables?
This is also a dark forest, where little is known beyond the simple low-degree cases.
It seems incredible that such a simple-seeming question about integral points on curves has seemingly so little known about it.
Conclusion
This paper discusses the various methods used to solve Diophantine equations and explores equations of degrees one, two, and three. The theory of equations of degree one is fully explained here, and common ideas for degree two are also explored. We also briefly introduce degree three using tools from projective geometry, group theory, and the study of elliptic curves. We finally demonstrate these tools by solving an example equation.
References
- A. Wiles. Modular elliptic curves and Fermat’s last theorem. Ann. of Math. (2). 141(3), 443–551 (1995). [↩] [↩]
- D. Hilbert et al., Bulletin-American Mathematical Society, 2000, 37, 407–436. [↩]
- J. V. MATIJASEVIC, in ENUMERABLE SETS ARE DIOPHANTINE, pp.269–273. [↩]
- A. Bremner, A. Macleod. An unusual cubic representation problem. Annales Mathematicae et Informaticae. 43, 29–41 (2014). [↩] [↩] [↩] [↩]
- H. Hasse, Journal fur die reine und angewandte Mathematik, 1923, 152,
205–224. [↩] - A. Sharma and C. Rout, International Journal of Environmental Science,
2011, 2, 933–945. [↩] - J. H. Silverman, J. T. Tate. Rational Points on Elliptic Curves. [↩] [↩]