
Introduction

In this chapter, we will focus on conjugate gradient methods, a popular class of iterative algorithms widely used for solving large-scale optimization and linear algebra problems.

The original motivation comes from the study of (square) positive definite linear systems of $n$ equations,

$$
Q x = p \tag{1}
$$

where $x \in \mathbb{R}^n$, $Q \in \mathbb{R}^{n \times n}$, and $p \in \mathbb{R}^n$. We assume that $Q \succ 0$.

The solution to (1) is of course $x^\star = Q^{-1} p$. So why does it matter to study this problem? Because, for large $n$, direct computation (e.g., $O(n^3)$ work for Gaussian elimination) or even storing the dense matrix $Q$ (which takes $O(n^2)$ memory) becomes too costly in practice.

Instead, we seek iterative methods to solve (1). The linear conjugate gradient method is one such algorithm: it exploits the specific structure of the problem, and it is also closely related to least-squares problems, as we will recall next. The sketch below contrasts the direct and iterative approaches.
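To make the contrast concrete, here is a minimal sketch, assuming NumPy and SciPy are available. The system below is synthetic, and SciPy's `scipy.sparse.linalg.cg` stands in for the linear conjugate gradient method we will develop in this chapter.

```python
import numpy as np
from scipy.sparse.linalg import cg

# Synthetic positive definite system (illustrative data, not from the text).
rng = np.random.default_rng(0)
n = 1000
A = rng.standard_normal((n, n))
Q = A @ A.T + n * np.eye(n)   # Q = A A^T + n I, so Q is symmetric and Q > 0
p = rng.standard_normal(n)

# Direct solve: O(n^3) work, and the full dense Q must be held in memory.
x_direct = np.linalg.solve(Q, p)

# Conjugate gradient: only needs matrix-vector products Q @ v, so Q may be
# sparse or given implicitly as a LinearOperator; info == 0 means converged.
x_cg, info = cg(Q, p)

print(info, np.linalg.norm(x_cg - x_direct))
```

The practical point is that the iterative solver never factors $Q$: it only requires products $Qv$, which is what makes it attractive when $n$ is large.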
