In the previous section, we introduced sequences and now we shall present notation and theorems concerning the sum of terms of a sequence. We begin with a definition, which, while intimidating, is meant to make our lives easier.
Given a sequence \(\left\ < a_\right\>_^<\infty>\) and numbers \(m\) and \(p\) satisfying \(k \leq m \leq p\), the summation from \(m\) to \(p\) of the sequence \(\left\
The variable \(n\) is called the index of summation. The number \(m\) is called the lower limit of summation while the number \(p\) is called the upper limit of summation.
In English, the definition of summation notation is simply defining a short-hand notation for adding up the terms of the sequence \(\left\ < a_\right\>_^<\infty>\) from \(a_\) through \(a_
\). The symbol \(\Sigma\) is the capital Greek letter sigma and is shorthand for "sum." The lower and upper limits of the summation tells us which term to start with and which term to end with, respectively. For example, using the sequence \(a_ = 2n-1\) for \(n \geq 1\), we can write the sum \(a_ +a_ + a_ + a_\) as
The index variable is considered a "dummy variable" in the sense that it may be changed to any letter without affecting the value of the summation. For instance,
One place you may encounter summation notation is in mathematical definitions. For example, summation notation allows us to define polynomials as functions of the form
for real numbers \(a_\), \(k = 0, 1, \ldots n\). The reader is invited to compare this with what is given in the definition of a polynomial function in Section 3.1.
Our next example gives us practice with this new notation.
1 This is indeed a geometric sequence with first term \(a = 1\) and common ratio \(r = −1\).
2 It is an arithmetic sequence with first term \(a = 1\) and common difference \(d = 1\).
3 To see why, try writing the summation using "\(n\)" as the index.
The following theorem presents some general properties of summation notation. While we shall not have much need of these properties in Algebra, they do play a great role in Calculus. Moreover, there is much to be learned by thinking about why the properties hold. We invite the reader to prove these results. To get started, remember, "When in doubt, write it out!"
Suppose \(\left\\right\>\) and \(\left\\right\>\) are sequences so that the following sums are defined.
\left(a_ \pm b_ \right) = \sum_^
a_ \pm \sum_^
b_ >\)
c \, a_ = c \sum_^
a_>\), for any real number \(c\).
a_ = \sum_^ a_ + \sum_^
a_>\), for any natural number \(m \leq j < j+1 \leq p\).
a_ = \sum_^ a_>\), for any whole number \(r\).
We now turn our attention to the sums involving arithmetic and geometric sequences. Given an arithmetic sequence \(a_ = a + (k-1) d\) for \(k \geq 1\), we let \(S_n\) denote the sum of the first \(n\) terms. To derive a formula for \(S_n\), we write it out in two different ways
\[\begin
S_n & = & a & + & (a + d) & + & \ldots & + & (a + (n-2)d) & + & (a + (n-1)d) \\
S_n & = & (a + (n-1)d) & + & (a + (n-2)d) & + & \ldots & + & (a + d) & + & a \\
\end\nonumber\]
If we add these two equations and combine the terms which are aligned vertically, we get
\[2S_n = (2a + (n-1)d) + (2a + (n-1)d) + \ldots + (2a + (n-1)d) + (2a + (n-1)d)\nonumber\]
The right hand side of this equation contains \(n\) terms, all of which are equal to \((2a + (n-1)d)\) so we get \(2S_n = n(2a + (n-1)d)\). Dividing both sides of this equation by \(2\), we obtain the formula
\[S_n = \dfrac (2a + (n-1)d)\nonumber\]
If we rewrite the quantity \(2a + (n-1)d\) as \(a + (a + (n-1)d) = a_ + a_\), we get the formula
\[S_n = n \left(\dfrac>\right)\nonumber\]
A helpful way to remember this last formula is to recognize that we have expressed the sum as the product of the number of terms \(n\) and the average of the first and \(n^>\) terms.
To derive the formula for the geometric sum, we start with a geometric sequence \(a_ = ar^\), \(k \geq 1\), and let \(S_n\) once again denote the sum of the first \(n\) terms. Comparing \(S_n\) and \(rS_n\), we get
\[\begin
S_n & = & a & + & ar & + & ar^2 & + & \ldots & + & ar^ & + & ar^ & & \\
r S_n & = & & & ar & + & ar^2 & + & \ldots & + & ar^ & + & ar^ & + & ar^ \\
\end\nonumber\]
Subtracting the second equation from the first forces all of the terms except \(a\) and \(ar^\) to cancel out and we get \(S_n - rS_n = a - ar^\). Factoring, we get \(S_n(1-r) = a \left(1-r^\right)\). Assuming \(r \neq 1\), we can divide both sides by the quantity \((1-r)\) to obtain
\[S_n = a \left( \dfrac\right)\nonumber\]
If we distribute \(a\) through the numerator, we get \(a - ar^ = a_ - a_\) which yields the formula
In the case when \(r=1\), we get the formula
Our results are summarized below.
While we have made an honest effort to derive the formulas in Theorem \( \PageIndex \), formal proofs require the machinery in Section 7.3.
4 This is a summation. It's just that we are adding negative values for the first few terms.
One way to stack wood for the winter (if you have a wood-burning stove) is to dry stack the wood so that each layer contains one fewer log than the layer below. If I dry stacked wood so that the bottom layer had 40 logs and the top layer ended with 18 logs, how long can I keep my house warm considering that I burn one log per hour?
This is a very common application of arithmetic summations. We have the following sum:\[ 40 + 39 + 38 + \cdots + 18 \nonumber \]Therefore, the initial term of our sequence this summation is based on is \( a_1 = 40 \). The common difference (how we get to each successive term) is \( d = -1 \). Finally, there are \( n = 40 - 18 + 1 = 23 \) layers of wood (why did we add the 1?). Thus, by Theorem \( \PageIndex \), we have\[ S_n = n \dfrac = 23 \dfrac = (23)(29) = 667 \nonumber \]pieces of wood. That's \( 667 \) hours of heat (or almost 28 days of warmth).
An application of the arithmetic sum formula which proves useful in Calculus results in formula for the sum of the first \(n\) natural numbers. The natural numbers themselves are a sequence 5 \(1, 2, 3, \ldots \) which is arithmetic with \(a = d = 1\). Applying Theorem \( \PageIndex \),
So, for example, the sum of the first \(100\) natural numbers 6 is \(\frac = 5050\).
This formula, and some related formulas, are so important that we need to make a theorem out of it.
The proofs of these will have to wait until we cover Mathematical Induction.
An important application of the geometric sum formula is the investment plan called an annuity. Annuities differ from the kind of investments we studied in Section 6.5 in that payments are deposited into the account on an on-going basis, and this complicates the mathematics a little. 7 Suppose you have an account with annual interest rate \(r\) which is compounded \(n\) times per year. We let \(i = \frac\) denote the interest rate per period. Suppose we wish to make ongoing deposits of \(P\) dollars at the end of each compounding period. Let \(A_\) denote the amount in the account after \(k\) compounding periods. Then \(A_ = P\), because we have made our first deposit at the end of the first compounding period and no interest has been earned. During the second compounding period, we earn interest on \(A_\) so that our initial investment has grown to \(A_(1+i) = P(1+i)\) in accordance with the Simple Interest formula. When we add our second payment at the end of the second period, we get
\[A_2 = A_1(1+i) + P = P(1+i) + P = P(1+i)\left(1 + \dfrac\right)\nonumber\]
The reason for factoring out the \(P(1+i)\) will become apparent in short order. During the third compounding period, we earn interest on \(A_\) which then grows to \(A_(1+i)\). We add our third payment at the end of the third compounding period to obtain
\[A_3 = A_2(1+i) + P = P(1+i)\left(1 + \dfrac\right)(1+i) + P = P(1+i)^2\left(1 + \dfrac + \dfrac\right)\nonumber\]
During the fourth compounding period, \(A_\) grows to \(A_(1+i)\), and when we add the fourth payment, we factor out \(P(1+i)^3\) to get
This pattern continues so that at the end of the \(k^>\) compounding, we get
The sum in the parentheses above is the sum of the first \(k\) terms of a geometric sequence with \(a = 1\) and \(r = \frac\). Using Theorem \( \PageIndex \), we get
If we let \(t\) be the number of years this investment strategy is followed, then \(k = nt\), and we get the formula for the future value of an ordinary annuity.
Suppose an annuity offers an annual interest rate \(r\) compounded \(n\) times per year. Let \(i = \frac\) be the interest rate per compounding period. If a deposit \(P\) is made at the end of each compounding period, the amount \(A\) in the account after \(t\) years is given by
The reader is encouraged to substitute \(i = \frac\) into the formula for the future value of an ordinary annuity and simplify. Some familiar equations arise which are cause for pause and meditation.
One last note: if the deposit \(P\) is made a the beginning of the compounding period instead of at the end, the annuity is called an annuity-due. We leave the derivation of the formula for the future value of an annuity-due as an exercise for the reader.
An ordinary annuity offers a \(6 \%\) annual interest rate, compounded monthly.
5 This is the identity function on the natural numbers!
6 There is an interesting anecdote which says that the famous mathematician Carl Friedrich Gauss was given this problem in primary school and devised a very clever solution.
7 The reader may wish to re-read the discussion on compound interest in Section 6.5 before proceeding
We close this section with a peek into Calculus by considering infinite sums, called series. Consider the number \(0.\overline\). We can write this number as
\[0.\overline = 0.9999. = 0.9 + 0.09 + 0.009 + 0.0009 + \ldots\nonumber\]
From Example \( \PageIndex \), we know we can write the sum of the first \(n\) of these terms as
Using Theorem \( \PageIndex \), we have
It stands to reason that \(0.\overline\) is the same value of \(1 - \frac>\) as \(n \to \infty\). Our knowledge of exponential expressions from Section 6.1 tells us that \(\frac> \to 0\) as \(n \to \infty\), so \(1 - \frac> \to 1\). We have just argued that \(0.\overline = 1\), which may cause some distress for some readers. 8 Any non-terminating decimal can be thought of as an infinite sum whose denominators are the powers of \(10\), so the phenomenon of adding up infinitely many terms and arriving at a finite number is not as foreign of a concept as it may appear. We end this section with a theorem concerning geometric series.
Given the sequence \(a_ = ar^\) for \(k \geq 1\), where \(|r| < 1\),
If \(|r| \geq 1\), the sum \(a + ar + ar^2 + \ldots\) is not defined.
The justification of the result in Theorem \( \PageIndex \) comes from taking the formula in Theorem \( \PageIndex \) for the sum of the first \(n\) terms of a geometric sequence and examining the formula as \(n \to \infty\). Assuming \(|r| \to 0\) as \(n \to \infty\). Hence as \(n \to \infty\),
As to what goes wrong when \(|r| \geq 1\), we leave that to Calculus as well, but will explore some cases in the exercises. Before we jump into exercises, it's in our best interest to introduce a little bit of mathematical language that is used frequently in Calculus.
The geometric series
is called convergent if \( |r| \lt 1 \) and, in such a case, it is said to converge to \( \frac \). If \( |r| \ge 1 \), the series is said to be divergent.
The formula for a convergent geometric series
only works if \( |r| \lt 1 \). Moreover, due to the fact that series will often be given to you with a lower limit not equal to \( 1 \), or with exponents not equal to \( k - 1 \), it is very important to rephrase the required "structure" of the geometric series form. Essentially, you want to manipulate your given series (through factoring) to the form
\[ \displaystyle b \sum r^
, \nonumber \]
where the power, \( p \), is \( 0 \) when evaluated at the lower limit of the summation.
That "Cautionary" statement needs some clarification, so let's see it in an example.
Determine if the series converges or diverges. If it is convergent, find its value.
Unfortunately, when we get the first term of the expansion, \[ \displaystyle \sum_^ <\infty>\dfrac>> = \dfrac + \cdots, \nonumber \]we can clearly see that those exponents are definitely not \( 0 \) and they don't even match each other! What do we do?
A common starting discussion in Calculus about the topic of limits concerns walking "half-distances." Suppose you are playing a game and the goal is to move your game piece so it eventually touches the edge of the board. The rules of the game are that you can always move in any direction, but you must move at most a distance that is half the current distance between your game piece and the edge. How can you win?
Okay, we need more discussion than that. Suppose you take the most direct route to the edge. On your first turn, you must half the distance from the current position to the edge. On your second turn, you move half the remaining distance. You repeat this process ad infinitum.
You can compute the total distance traveled using a series. Let \( D \) be the initial distance the piece is from the edge of the board, and let \( i \) be the number of the turn.
So, on turn \( i = 1 \), you move your piece a total of \( \frac D \) directly toward the edge of the board.
On your second turn, you move half of the remaining distance. Since the remaining distance is \( \frac D \), you move \( \frac \left( \frac D \right) = \frac D \). 10
On the third turn, you move half of the remaining distance. Let's see, you have moved \( \frac D + \frac D = \frac D \). Therefore, the remaining distance is \( \frac D \). So you move your piece \( \frac \left( \frac D \right) = \frac D \).
The fourth turn comes and you now see a pattern. To check, let's do one last computation. You have already moved your piece \( \frac D + \frac D + \frac D = \frac D \). So you only have \( \frac D \) to go. You move your piece half this distance, \( \frac \left( \frac D \right) = \frac D \), toward the edge.
This pattern continues on and on.
Building a table of our "step sizes," we get
\[ \begin<|c|c|>
\hline i & S_i \\
\hline 1 & \fracD \\
\hline 2 & \frac D \\
\hline 3 & \frac D \\
\hline 4 & \frac D \\
\hline \vdots & \vdots \\ \hline
\end \nonumber \]
Summing the sequence \( S_i \) in the table, we get\[ \text = \displaystyle \sum_^<\infty> D> = \displaystyle \sum_^<\infty>\right)^i>. \nonumber \]At this point, we have the common ratio, \( r = \frac \), and what looks to be the value of \( a \) (it looks like \( a = D \)); however, there is a problem. The form of this series does not match the form in Theorem \( \PageIndex \). We need the exponent on the common ratio, \( r = \frac \), to be \( i - 1 \). The easiest way to reduce the power on an expression is to factor off one power. That is, \( \frac = \frac \left( \frac> \right) \). Hence, our series becomes\[ \begin
\text & = & \displaystyle \sum_^<\infty>> D> \\
& = & \displaystyle \sum_^<\infty> \cdot \frac>> \\
& = & \displaystyle \sum_^<\infty> \left( \frac \right)^> \\
& = & \dfrac<1 - \frac> \\
& = & \dfrac \\
& = & D. \\
\end \nonumber \]Thus, given an infinite amount of turns, we will have moved the game piece a total distance of \( D \) (which is the full length of the initial distance the piece was from the edge of the board); however, there is no possibility for us to have an infinite number of moves! Moreover, pretending that we had \( 10^ \) turns, 11 we would still have half of a remaining distance before the piece would actually reach the true edge.
Hence, despite theoretically being able to move the piece the full distance \( D \) to the edge of the board, the moment we believe we have actually reached the edge of the board would be on some finite turn, say the \( n^> \) turn, and at that moment, we would still have a sliver of space left between the game piece and the edge - and we can only move half that remaining distance.
8 To make this more palatable, it is usually accepted that \(0 . \overline=\frac\) so that \(0 . \overline=3(0 . \overline)=3\left(\frac\right)=1\). Feel better?
9 When in doubt, write them out!
10 It is a great idea when dealing with sequences, summations, and series to not simplify expressions. This allows you to see the patterns within sums.
11 You should Google this number. The namesake is very familiar to you!
This page titled 7.2: Summation Notation is shared under a CC BY-NC-SA 3.0 license and was authored, remixed, and/or curated by Carl Stitz & Jeff Zeager via source content that was edited to the style and standards of the LibreTexts platform.