Polynomials and Polynomial Rings — Reading Notes
Video Lecture
If the video does not load in your browser, use:
Open video in a new tab.
Companion interactive notebook:
Open the Wolfram Cloud notebook.
Purpose, Context, and the Deep Idea
These notes support the Unit II theme: functions as algebraic objects.
In this chapter we focus on polynomials not merely as expressions to simplify,
but as objects that live in a structured algebraic system called a polynomial ring.
More precisely: if $K$ is a field and $f(x)\in K[x]$, then solving $f(x)=0$
over $K$ amounts to finding linear factors $(x-a)$ of $f(x)$ in $K[x]$.
- Define polynomials (formal viewpoint) and distinguish polynomial objects from polynomial functions.
- Define ring and field (working definitions used in this course).
- Work in $R[x]$ (addition, multiplication, composition) and interpret what “closed under an operation” means.
- State and prove the Division Algorithm for $F[x]$ when $F$ is a field.
- Derive the Remainder Theorem and the Factor Theorem.
- Relate roots and multiplicity to factorization and (over $\mathbb{R}$) to graph behavior.
Rings and Fields (Minimal Toolkit)
Definition (Ring). A ring is a set $R$ with two operations
(addition $+$ and multiplication $\cdot$) such that:
- $(R,+)$ is an abelian group (there is $0$, every element has an additive inverse, and addition is associative and commutative);
- multiplication is associative: $(ab)c=a(bc)$ for all $a,b,c\in R$;
- multiplication distributes over addition:
$a(b+c)=ab+ac$ and $(a+b)c=ac+bc$.
Many rings also have a multiplicative identity $1$; when we assume this, we say “ring with identity”.
Definition (Commutative ring and field).
A ring $R$ is commutative if $ab=ba$ for all $a,b\in R$.
A field is a commutative ring with identity $1\ne 0$
in which every nonzero element has a multiplicative inverse.
Equivalently: in a field you can divide by every nonzero element.
Examples.
$\mathbb{Z}$ is a commutative ring with identity, but not a field (e.g., $2$ has no inverse in $\mathbb{Z}$).
$\mathbb{Q}$ and $\mathbb{R}$ are fields.
The Division Algorithm in $F[x]$ requires dividing by the leading coefficient of the divisor,
which is possible exactly because $F$ is a field.
(LibreTexts mirror)
What is a polynomial?
Definition (Polynomial in one indeterminate).
Let $R$ be a commutative ring with identity (often a field $K$ in this course).
A polynomial in an indeterminate $x$ with coefficients in $R$ is a formal expression
$$p(x)=a_0+a_1x+a_2x^2+\cdots+a_nx^n,$$
where $n\ge 0$, each $a_i\in R$, and $a_n\ne 0$ unless $p$ is the zero polynomial.
The set of all such polynomials is denoted $R[x]$.
Reference: Wikipedia — Polynomial; Judson, Polynomials chapter.
(Wikipedia: Polynomial,
Judson (LibreTexts))
Example. $x^2-4x+7$ is a polynomial in $\mathbb{R}[x]$.
Non-example. $x^3-2x^{-2}+1$ is not a polynomial, because it uses a negative exponent.
Degree, leading coefficient, constant term.
If $p(x)=a_0+a_1x+\cdots+a_nx^n$ with $a_n\ne 0$, then:
- $\deg(p)=n$ is the degree;
- $a_n$ is the leading coefficient;
- $a_0$ is the constant term.
The zero polynomial has all coefficients $0$.
For degree bookkeeping, it is convenient to set $\deg(0)=-\infty$.
Formal viewpoint (polynomial as an object).
A rigorous way to remove ambiguity is to define a polynomial as a finite sequence of coefficients
$(a_0,a_1,a_2,\dots)$ with $a_i\in R$ and only finitely many nonzero terms.
In this viewpoint, $x$ is an indeterminate: it is a symbol used to label “which coefficient goes with which power”.
Proposition (Uniqueness of coefficients / canonical form).
In $R[x]$, a polynomial is uniquely determined by its coefficients:
$$\sum_{k=0}^n a_k x^k = \sum_{k=0}^m b_k x^k
\quad\Longleftrightarrow\quad
a_k=b_k \text{ for all } k\ge 0,$$
where missing coefficients are interpreted as $0$.
Proof.
In the formal definition, a polynomial is its coefficient sequence.
Equality is defined componentwise, so equality holds exactly when each coefficient agrees.
Conversely, if all coefficients agree, the sequences are identical, hence the polynomials are equal. $\square$
Reference: Judson, definition of $R[x]$ as sequences/coefficients and equality in $R[x]$.
(Judson PDF)
Given a polynomial $p\in R[x]$ and an element $a$ in a ring containing $R$, we can evaluate $p$ at $a$ by substitution to get $p(a)$.
If $R$ is a familiar number system (like $\mathbb{R}$), this defines a function $f(a)=p(a)$.
The polynomial and the function it defines are related, but not identical objects.
(Wikipedia: Polynomial ring)
The polynomial ring $R[x]$
We now define addition and multiplication so that $R[x]$ becomes a ring.
The rule is: add and multiply polynomials by combining coefficients exactly as you do when expanding expressions,
but interpret the result formally (as a new coefficient sequence).
Definition (Addition and multiplication).
Let
$p(x)=\sum_{i=0}^m a_i x^i$ and $q(x)=\sum_{j=0}^n b_j x^j$ in $R[x]$.
Define:
- Addition:
$$(p+q)(x)=\sum_{k\ge 0}(a_k+b_k)x^k,$$
where missing coefficients are treated as $0$. - Multiplication (Cauchy product):
$$(pq)(x)=\sum_{k\ge 0}\left(\sum_{i+j=k} a_i b_j\right)x^k.$$
Theorem. If $R$ is a commutative ring with identity, then $R[x]$ is a commutative ring with identity.
Proof (structure proof, coefficient-by-coefficient).
We verify the ring axioms using the definitions above.
- Additive structure. Under addition, coefficientwise addition makes $R[x]$ an abelian group:
associativity and commutativity follow from the same properties in $R$,
the zero polynomial is the additive identity,
and $-p$ is defined by $(-p)(x)=\sum (-a_k)x^k$. - Multiplication is associative. For $p,q,r\in R[x]$, compare the coefficient of $x^k$ in $(pq)r$ and in $p(qr)$.
Both coefficients are equal to $\sum_{i+j+\ell=k} a_i b_j c_\ell$, using associativity and distributivity in $R$. - Distributivity. The coefficient of $x^k$ in $p(q+r)$ equals
$\sum_{i+j=k} a_i (b_j+c_j)=\sum_{i+j=k} a_i b_j + \sum_{i+j=k} a_i c_j$,
which is the coefficient of $x^k$ in $pq+pr$. - Identity and commutativity. The constant polynomial $1$ acts as a multiplicative identity.
If $R$ is commutative, then $a_i b_j=b_j a_i$ implies $pq=qp$ coefficientwise.
Therefore $R[x]$ is a commutative ring with identity. $\square$
Reference: Standard construction of polynomial rings (Judson; Wikipedia polynomial ring).
(Judson PDF,
Wikipedia: Polynomial ring)
Composition (substitution). If $f(x),g(x)\in R[x]$, define
$$(f\circ g)(x)=f(g(x)).$$
This is again a polynomial because substitution uses only addition and multiplication in the ring.
(add, multiply, divide with remainder) without referring to graphs or numerical inputs.
That structural viewpoint will be the engine behind solving equations by factorization.
Degree arithmetic
Convention. We set $\deg(0)=-\infty$ so that max/inequality statements behave naturally.
Proposition. For $p,q\in R[x]$,
$$\deg(p+q)\le \max\{\deg(p),\deg(q)\}.$$
Proof.
Let $m=\deg(p)$ and $n=\deg(q)$. Then all coefficients of $p$ above degree $m$ are $0$, and similarly for $q$.
So for $k>\max\{m,n\}$, the coefficient of $x^k$ in $p+q$ is $0+0=0$.
Therefore $p+q$ has no nonzero terms above degree $\max\{m,n\}$, proving the inequality. $\square$
Proposition (Degree of a product).
If $R$ is an integral domain and $p,q\in R[x]$ are nonzero, then
$$\deg(pq)=\deg(p)+\deg(q).$$
Proof.
Write $p(x)=a_m x^m+\text{(lower terms)}$ and $q(x)=b_n x^n+\text{(lower terms)}$
with $a_m\ne 0$ and $b_n\ne 0$.
In the product, the coefficient of $x^{m+n}$ is $a_m b_n$.
Because $R$ is an integral domain, $a_m b_n\ne 0$, so $x^{m+n}$ appears with nonzero coefficient.
No term of degree greater than $m+n$ can appear (degrees add when multiplying monomials).
Hence $\deg(pq)=m+n$. $\square$
Reference: Wikipedia — Polynomial ring (degree properties); standard texts.
(Wikipedia: Polynomial ring)
it limits the number of roots and makes division with remainder meaningful.
Evaluation and the “polynomial vs. function” distinction
Let $R$ be a commutative ring and let $A$ be a ring containing $R$ (for example, $A=R$).
Given $a\in A$ we define an evaluation map $\operatorname{ev}_a:R[x]\to A$ by
$$\operatorname{ev}_a\left(\sum_{k=0}^n a_k x^k\right)=\sum_{k=0}^n a_k a^k.$$
Theorem (Evaluation is a ring homomorphism).
For each $a\in A$, the map $\operatorname{ev}_a:R[x]\to A$ satisfies:
$$\operatorname{ev}_a(p+q)=\operatorname{ev}_a(p)+\operatorname{ev}_a(q),\quad
\operatorname{ev}_a(pq)=\operatorname{ev}_a(p)\,\operatorname{ev}_a(q),\quad
\operatorname{ev}_a(1)=1.$$
Proof.
Additivity follows by substituting $a$ and using coefficientwise addition.
For multiplicativity, expand:
$$\operatorname{ev}_a(pq)=\sum_{k\ge 0}\left(\sum_{i+j=k} a_i b_j\right)a^k
=\sum_{i,j} a_i b_j a^{i+j}
=\left(\sum_i a_i a^i\right)\left(\sum_j b_j a^j\right)=\operatorname{ev}_a(p)\operatorname{ev}_a(q).$$
Finally $\operatorname{ev}_a(1)=1$. $\square$
Reference: Wikipedia — Polynomial ring (evaluation homomorphism/universal property).
(Wikipedia: Polynomial ring)
A polynomial $p\in R[x]$ is an algebraic object. The function $f(a)=p(a)$ is a different object.
Over infinite fields (like $\mathbb{R}$), different polynomials induce different functions,
but over finite fields distinct polynomials can induce the same function.
Proposition. If $K$ is an infinite field and $p,q\in K[x]$ define the same function $K\to K$,
then $p=q$ as polynomials.
Proof. If $p$ and $q$ define the same function, then $(p-q)(a)=0$ for all $a\in K$.
If $p\ne q$, then $p-q$ is a nonzero polynomial and therefore has only finitely many roots
(at most its degree; see Section “Roots and multiplicity”). This contradicts the fact that $K$ is infinite.
Hence $p-q=0$ and $p=q$. $\square$
Reference: Standard consequence of the “at most $n$ roots” theorem; see Judson polynomials chapter.
(Judson PDF)
Division Algorithm in $F[x]$ (existence and uniqueness)
Setting. For the Division Algorithm we assume $F$ is a field.
In $F[x]$ we can divide by any nonzero leading coefficient.
Theorem (Division Algorithm for polynomials over a field).
Let $F$ be a field. If $f(x),g(x)\in F[x]$ with $g(x)\ne 0$, then there exist unique polynomials
$q(x),r(x)\in F[x]$ such that
$$f(x)=g(x)q(x)+r(x),\qquad r(x)=0\ \text{or}\ \deg(r)<\deg(g).$$
Proof (existence).
Let $n=\deg(f)$ and $m=\deg(g)$.
If $n<m$, take $q=0$ and $r=f$.
Otherwise, write
$$f(x)=a_n x^n+\cdots,\qquad g(x)=b_m x^m+\cdots,$$
with $a_n,b_m\ne 0$. Since $F$ is a field, $b_m^{-1}$ exists.
Define a first step:
$$f_1(x)=f(x)-\left(a_n b_m^{-1}x^{n-m}\right)g(x).$$
The leading term cancels, so either $f_1=0$ or $\deg(f_1)<n$.
Repeat this process on $f_1$; degrees strictly decrease, so the algorithm terminates after finitely many steps,
producing polynomials $q$ (the sum of the monomials chosen) and $r$ (the final remainder), with $\deg(r)<m$.
Proof (uniqueness).
Suppose
$$f=gq+r=gq’+r’$$
with $r,r’=0$ or $\deg(r),\deg(r’)<\deg(g)$.
Then $g(q-q’)=r’-r$.
If $q\ne q’$, then the left-hand side is nonzero and
$$\deg\big(g(q-q’)\big)=\deg(g)+\deg(q-q’)\ge \deg(g),$$
while the right-hand side satisfies $\deg(r’-r)<\deg(g)$.
This is impossible. Hence $q=q’$ and then $r=r’$. $\square$
References:
Judson (Division Algorithm) and Conrad (Division theorem in $R[T]$).
(Judson §17.2 (LibreTexts),
Conrad: Division Theorem in $R[T]$ (PDF))
Teacher connection. This theorem is the formal justification for “polynomial long division”.
The proof is exactly the long-division idea, written in a way that makes the logic precise.
Remainder Theorem and Factor Theorem
Theorem (Remainder Theorem).
Let $F$ be a field, $f(x)\in F[x]$, and $a\in F$.
When $f(x)$ is divided by $(x-a)$, the remainder is $f(a)$.
Proof.
By the Division Algorithm, there exist unique $q(x)\in F[x]$ and $r\in F$ (a constant) such that
$$f(x)=(x-a)q(x)+r.$$
Evaluate at $x=a$:
$$f(a)=(a-a)q(a)+r=r.$$
Therefore the remainder is $f(a)$. $\square$
References: Judson (Division Algorithm implies Remainder Theorem); OpenStax also states this as a key concept.
(Judson §17.2,
OpenStax College Algebra 2e: Key Concepts (Remainder Theorem))
Theorem (Factor Theorem).
Let $F$ be a field, $f(x)\in F[x]$, and $a\in F$.
Then
$$f(a)=0 \quad\Longleftrightarrow\quad (x-a)\mid f(x).$$
Proof.
Divide $f(x)$ by $(x-a)$ to get $f(x)=(x-a)q(x)+r$ with $r=f(a)$ by the Remainder Theorem.
Now $f(a)=0$ if and only if $r=0$, and this holds if and only if
$$f(x)=(x-a)q(x),$$
meaning $(x-a)$ divides $f(x)$. $\square$
References: OpenStax Factor Theorem; Judson polynomials chapter.
(OpenStax Algebra 1: Factor Theorem,
Judson PDF)
To solve $f(x)=0$ over $F$, we are looking for $a\in F$ such that $f(a)=0$.
The Factor Theorem translates this into: find linear factors $(x-a)$ of $f(x)$ in $F[x]$.
Roots, multiplicity, and what graphs do near intercepts
Definition (Root). Let $F$ be a field and $0\ne f(x)\in F[x]$.
An element $a\in F$ is a root of $f$ if $f(a)=0$.
Definition (Multiplicity). A root $a$ has multiplicity $m\ge 1$ if
$$(x-a)^m\mid f(x)\quad\text{but}\quad (x-a)^{m+1}\nmid f(x).$$
Theorem (Degree bounds the number of roots).
If $F$ is a field and $0\ne f\in F[x]$ has degree $n$, then $f$ has at most $n$ roots in $F$.
Proof (induction on $n$).
If $n=0$, then $f$ is a nonzero constant and has no roots.
Assume the statement holds for degrees $<n$.
If $f$ has no roots, we are done. Otherwise let $a$ be a root.
By the Factor Theorem, $f(x)=(x-a)g(x)$ for some $g\in F[x]$ with $\deg(g)=n-1$.
Any root $b\ne a$ of $f$ must satisfy $0=f(b)=(b-a)g(b)$.
Since $b\ne a$, the factor $(b-a)\ne 0$, so $g(b)=0$.
Thus, all roots of $f$ other than $a$ are roots of $g$.
By induction, $g$ has at most $n-1$ roots, so $f$ has at most $1+(n-1)=n$ roots. $\square$
Reference: Standard theorem in the polynomials chapter of abstract algebra texts (Judson).
(Judson PDF)
Proposition (Multiplicity parity and sign change over $\mathbb{R}$).
Let $f\in\mathbb{R}[x]$ and let $a\in\mathbb{R}$. Suppose $a$ is a root of $f$ of multiplicity $m\ge 1$.
Then there exists a polynomial $h\in\mathbb{R}[x]$ with $h(a)\ne 0$ such that
$$f(x)=(x-a)^m h(x).$$
Then $f$ changes sign as $x$ passes through $a$ if and only if $m$ is odd.
Proof.
Because $h$ is a polynomial, it is continuous on $\mathbb{R}$.
Since $h(a)\ne 0$, set $\varepsilon=\frac{1}{2}|h(a)|$.
By continuity, there exists $\delta>0$ such that
$$|x-a|<\delta \ \Longrightarrow\ |h(x)-h(a)|<\varepsilon.$$
For such $x$, we have $|h(x)-h(a)|<|h(a)|$, which forces $h(x)$ to have the same sign as $h(a)$ (in particular, $h(x)\ne 0$).
Now take any $x$ with $0<|x-a|<\delta$.
If $x\in(a-\delta,a)$ then $(x-a)$ is negative, so $(x-a)^m$ is negative exactly when $m$ is odd.
If $x\in(a,a+\delta)$ then $(x-a)$ is positive, so $(x-a)^m$ is positive for every $m$.
Since $h(x)$ keeps a fixed sign near $a$, the product $f(x)=(x-a)^m h(x)$ changes sign from the left side of $a$ to the right side precisely when $m$ is odd. $\square$
Reference: sign-preserving property of continuous functions at a nonzero point (analysis) and the factorization
$f(x)=(x-a)^m h(x)$ for a root of multiplicity $m$ (algebra).
(Hunter, Introductory Real Analysis, Ch. 7;
OpenStax, Sec. 5.3.)
Over $\mathbb{R}$, sign-change corresponds to the graph crossing the $x$-axis (odd multiplicity),
while no sign-change corresponds to touching and turning (even multiplicity).
Higher multiplicity makes the graph “flatter” near the intercept because $(x-a)^m$ approaches $0$ more rapidly.
Worked example: solving $f(x)=0$ by factoring in $\mathbb{R}[x]$
Example. Solve the equation
$$f(x)=x^3-3x^2-4x+12=0.$$
Solution (with theorems we proved).
Test $a=3$:
$$f(3)=27-27-12+12=0.$$
By the Factor Theorem, $(x-3)$ divides $f(x)$.
Divide (or verify by multiplication) to get
$$f(x)=(x-3)(x^2-4).$$
Factor the quadratic:
$$x^2-4=(x-2)(x+2).$$
Hence
$$f(x)=(x-3)(x-2)(x+2).$$
Therefore the solutions are $x=3,2,-2$. $\square$
The calculation is not merely “symbol pushing.” It is an instance of a structural method:
evaluate, identify a linear factor, divide, and continue factoring.
Each step is justified by the Division Algorithm and the Factor Theorem.
CAS note: factor / solve / graph (and what to watch for)
Computer Algebra Systems (CAS) can factor polynomials, solve equations symbolically when possible,
and graph polynomial functions quickly. Use them to explore, then return to theory to justify.
Use the companion notebook for CAS demonstrations and exploration:
Open the Wolfram Cloud notebook.
A Wolfram account sign-in may be required.
Pick a polynomial in $\mathbb{R}[x]$ of degree $4$ or $5$ and use a CAS to:
(i) factor it (if possible over $\mathbb{Q}$ or $\mathbb{R}$),
(ii) approximate its real roots numerically,
(iii) graph it and mark intercept behavior.
Then explain: where did the Factor Theorem appear in your reasoning?
and factoring over $\mathbb{Q}$ can differ from factoring over $\mathbb{R}$ or $\mathbb{C}$.
(This is a preview of later algebra topics.)
Reading-completion exercises
-
Let $p(x)=2x^4-3x+1$ and $q(x)=x^2+5$ in $\mathbb{Q}[x]$.
Compute $p+q$ and $pq$ (fully expanded). State the degrees of each result. -
Use the Division Algorithm to divide $f(x)=x^4+1$ by $g(x)=x^2+1$ in $\mathbb{R}[x]$.
Write $f=gq+r$ with $\deg(r)<\deg(g)$. -
Let $f(x)=x^3-6x^2+11x-6$.
(a) Evaluate $f(1),f(2),f(3)$.
(b) Use the Factor Theorem to factor $f$ completely over $\mathbb{R}$. -
Prove: if $0\ne f\in F[x]$ has degree $n$, then $f$ cannot have more than $n$ distinct roots in $F$.
(You may use the Factor Theorem.) -
(Teacher reflection) Write one student misconception about “degree” or “like terms”
and explain how the formal definition of polynomial helps you correct it.
References (linked)
-
Wikipedia — Polynomial:
https://en.wikipedia.org/wiki/Polynomial -
Wikipedia — Polynomial ring (evaluation homomorphism, universal property):
https://en.wikipedia.org/wiki/Polynomial_ring -
Thomas W. Judson, Abstract Algebra: Theory and Applications — Chapter on Polynomials:
LibreTexts version,
PDF. -
Keith Conrad, The Division Theorem in $\mathbb{Z}$ and $R[T]$ (polynomial division with remainder):
PDF. -
OpenStax (Remainder and Factor Theorem statements for school algebra connection):
College Algebra 2e — Key Concepts,
Algebra 1 — Factor Theorem.