02 — Differential Calculus

Comprehensive study guide for Bocconi Math Module 2 (cod. 30063), General Exam. Source materials: Alice Sicconi's Proofs - Differential Calculus.pdf, Linear Algebra and Differential Calculus.pdf (diff-calc half), lect1_calc-diff.pdf, lect2_calc_diff (2).pdf, Lect3_calc_diff (3).pdf, lect 4_calc_diff (3).pdf, lect5_calc_diff.pdf, TA41 _calc_diff (2).pdf, General_24524_ENG_SOL.pdf, Final exercises solution.pdf.

§1. Overview & Exam Relevance

Differential Calculus is the second block of the first partial and accounts for approximately 27% of the general exam (typically 1 MCQ worth 5 pts plus one open-ended question worth up to 20 pts — about 25–30 pts of the 150-pt exam). On open-ended problems it is tied with Linear Algebra as the richest source of statement-plus-proof questions.

Topic scope. The exam tests:

multivariable functions $f : \mathbb{R}^n \to \mathbb{R}$ and their regularity classes $C^{(1)}, C^{(2)}$ ;
partial derivatives $\partial f / \partial x_i$ , gradient $\nabla f$ , directional derivatives, and the Hessian $\nabla^2 f$ (equivalently $H_f$ );
Schwarz's theorem: symmetry of the Hessian for $C^{(2)}$ functions;
Second-order Taylor formula with Peano remainder at a point $\mathbf{x}_0 \in \mathbb{R}^n$ ;
classification of stationary points via the sign of the Hessian quadratic form $\mathbf{h}^T \nabla^2 f(\mathbf{x}_0) \mathbf{h}$ (Theorem 1391);
convex/concave functions on open convex sets, tangent-hyperplane inequality, and the Hessian-based characterisation (Theorem/Prop. 1486);
necessary and sufficient conditions for global extrema of convex/concave functions (Theorems 1497, 1591);
existence theorems: Weierstrass (on compact sets), Tonelli (on coercive/super-coercive functions);
Dini's theorem (implicit function theorem): local uniqueness and formula for $f'(x_0)$ from $g(x,y) = c$ .

Typical MCQ patterns (from May 2024 General exam).

MCQ2 (Mode A) / MCQ1 (Mode B): "The Taylor formula of order two for $f(x,y) = x^2 + x \cos y$ centred at $(1, -\pi/2)$ is …" — compute $f, \nabla f, \nabla^2 f$ at the point and assemble.
Other recurring MCQs (from TA and lecture pools): local-extrema classification of $f(x,y) = (y-1)(y-x^2)$ ; Dini-applicability at boundary points of $g(x,y)=0$ ; coercivity check.

Typical open-ended pattern (May 2024 Q8 Mode A / Q7 Mode B).

Part (a) — State the concavity / strict-concavity test (Prop. 1486) based on the sign of the Hessian matrix.
Part (b) — Given $f(x,y) = e^x(x-1)y + y^2$ : (b1) write $\nabla f$ and determine stationary points → $P = (0, 1/2)$ local min and $Q = (1, 0)$ saddle; (b2) write $\nabla^2 f$ and classify each stationary point using Theorem 1391; (b3) decide whether $f$ is convex on the whole $\mathbb{R}^2$ .

Why this topic is high-leverage.

Every theorem here uses Linear Algebra from §01: classifying a stationary point is classifying a quadratic form attached to the Hessian. Spectral/Sylvester–Jacobi carry over verbatim.
The Taylor formula is the engine behind every result in this chapter — the proof of Theorem 1391 literally starts by invoking Theorem 1481 (Taylor).
Together with Linear Algebra (≈27%), Differential Calculus pushes the first-partial content to ≈55% of the whole general exam.

§2. Definitions

2.1 Multivariable function $f : \mathbb{R}^n \to \mathbb{R}$

A multivariable (scalar) function of $n$ real variables is a map $f : A \subseteq \mathbb{R}^n \to \mathbb{R}$ , assigning to each point $\mathbf{x} = (x_1, \ldots, x_n) \in A$ exactly one real number $f(\mathbf{x})$ . The set $A$ is the domain; its image $f(A) \subseteq \mathbb{R}$ is the range.

Example. $f(x,y) = x + 3y^2 - \ln(x + 2y)$ has domain $\{(x,y) \in \mathbb{R}^2 : x + 2y > 0\}$ because $\ln$ requires a strictly positive argument (Final exercises solution.pdf Q2).

2.2 Open ball and open set

Given $\mathbf{x}_0 \in \mathbb{R}^n$ and $r > 0$ , the open ball (or neighbourhood) of centre $\mathbf{x}_0$ and radius $r$ is $U_r(\mathbf{x}_0) = B_r(\mathbf{x}_0) = \{\mathbf{x} \in \mathbb{R}^n : \|\mathbf{x} - \mathbf{x}_0\| < r\}.$ A set $A \subseteq \mathbb{R}^n$ is open if for every $\mathbf{x}_0 \in A$ there is $r > 0$ with $U_r(\mathbf{x}_0) \subseteq A$ . A point $\mathbf{x}_0 \in A$ is an interior point of $A$ if such an $r$ exists; otherwise it is a boundary point.

2.3 Limit and continuity

$\lim_{\mathbf{x} \to \mathbf{x}_0} f(\mathbf{x}) = L$ means: $\forall \varepsilon > 0 \ \exists \delta > 0$ such that $0 < \|\mathbf{x} - \mathbf{x}_0\| < \delta \Rightarrow |f(\mathbf{x}) - L| < \varepsilon$ .

$f$ is continuous at $\mathbf{x}_0$ if $\lim_{\mathbf{x} \to \mathbf{x}_0} f(\mathbf{x}) = f(\mathbf{x}_0)$ ; it is continuous on $A$ if it is continuous at every point of $A$ .

2.4 Partial derivative and gradient

The $i$ -th partial derivative of $f$ at $\mathbf{x}_0$ is $\frac{\partial f}{\partial x_i}(\mathbf{x}_0) \;=\; f'_{x_i}(\mathbf{x}_0) \;=\; \lim_{h \to 0} \frac{f(\mathbf{x}_0 + h \mathbf{e}_i) - f(\mathbf{x}_0)}{h},$ where $\mathbf{e}_i$ is the $i$ -th coordinate vector. Operationally, $\partial f / \partial x_i$ is obtained by differentiating $f$ with respect to $x_i$ while treating all other variables as constants.

The gradient $\nabla f(\mathbf{x}_0)$ is the row vector of all partial derivatives: $\nabla f(\mathbf{x}_0) \;=\; \bigl[f'_{x_1}(\mathbf{x}_0) \; f'_{x_2}(\mathbf{x}_0) \; \cdots \; f'_{x_n}(\mathbf{x}_0)\bigr].$ When written as a column, we use $\nabla f(\mathbf{x}_0)^T$ .

Example (lect2). For $f(x,y) = 3 x^2 e^{y^2}$ we have $f'_x = 6 x e^{y^2}$ , $f'_y = 3x^2 e^{y^2} \cdot 2y = 6 x^2 y e^{y^2}$ , so $\nabla f(x,y) = [6 x e^{y^2}, \ 6 x^2 y e^{y^2}]$ and $\nabla f(1,1) = [6e, \ 6e]$ .

2.5 Directional derivative

Given a unit vector $\mathbf{v} \in \mathbb{R}^n$ ( $\|\mathbf{v}\| = 1$ ), the directional derivative of $f$ at $\mathbf{x}_0$ in the direction $\mathbf{v}$ is $D_{\mathbf{v}} f(\mathbf{x}_0) \;=\; \lim_{h \to 0} \frac{f(\mathbf{x}_0 + h \mathbf{v}) - f(\mathbf{x}_0)}{h}.$ If $f$ is differentiable at $\mathbf{x}_0$ , then $D_{\mathbf{v}} f(\mathbf{x}_0) = \nabla f(\mathbf{x}_0) \cdot \mathbf{v}$ .

2.6 Differentiability and $C^{(1)}, C^{(2)}$ classes

$f$ is differentiable at $\mathbf{x}_0$ if there exists a row vector $\mathbf{L} \in \mathbb{R}^n$ such that $f(\mathbf{x}_0 + \mathbf{h}) = f(\mathbf{x}_0) + \mathbf{L} \cdot \mathbf{h} + o(\|\mathbf{h}\|) \qquad \text{as } \mathbf{h} \to \mathbf{0}.$ If such $\mathbf{L}$ exists, it is necessarily $\mathbf{L} = \nabla f(\mathbf{x}_0)$ .

$C^{(1)}(A)$ : $f$ has continuous first partial derivatives on $A$ . If $f \in C^{(1)}$ then $f$ is differentiable.
$C^{(2)}(A)$ : $f$ has continuous second-order partial derivatives on $A$ . This is the hypothesis of Schwarz's theorem and Theorem 1481 (Taylor).

2.7 Hessian $\nabla^2 f = H_f$

The Hessian matrix of $f$ at $\mathbf{x}_0$ is the $n \times n$ matrix of second partial derivatives: $\nabla^2 f(\mathbf{x}_0) \;=\; H_f(\mathbf{x}_0) \;=\; \begin{pmatrix} f''_{x_1 x_1}(\mathbf{x}_0) & f''_{x_1 x_2}(\mathbf{x}_0) & \cdots & f''_{x_1 x_n}(\mathbf{x}_0) \\ f''_{x_2 x_1}(\mathbf{x}_0) & f''_{x_2 x_2}(\mathbf{x}_0) & \cdots & f''_{x_2 x_n}(\mathbf{x}_0) \\ \vdots & \vdots & \ddots & \vdots \\ f''_{x_n x_1}(\mathbf{x}_0) & f''_{x_n x_2}(\mathbf{x}_0) & \cdots & f''_{x_n x_n}(\mathbf{x}_0) \end{pmatrix}.$

Schwarz's theorem (stated in lect1): if $f \in C^{(2)}(A)$ with $A$ open, then $\partial^2 f / (\partial x_i \partial x_j) = \partial^2 f / (\partial x_j \partial x_i)$ for all $i, j$ and all $\mathbf{x} \in A$ . Hence $\nabla^2 f$ is symmetric, and the classification tools from Linear Algebra (§01) apply directly.

Mnemonic (lect1 sidenote). The gradient is an operator (it sends a point to a vector). The Jacobian matrix contains all partials of a vector-valued operator. Since $\nabla f$ is itself a vector-valued operator $\mathbb{R}^n \to \mathbb{R}^n$ , its Jacobian is the Hessian: $\nabla^2 f$ = Jacobian of $\nabla f$ .

2.8 Stationary point

A point $\mathbf{x}_0 \in A$ is a stationary point (or critical point) of $f$ if $\nabla f(\mathbf{x}_0) = \mathbf{0}$ , i.e., every partial derivative vanishes at $\mathbf{x}_0$ .

Remark. Candidates for local extrema of a differentiable $f$ on an open set are exactly the stationary points (Fermat's theorem, §3.7). On a non-open set one must also check boundary points and exceptional points where $f$ is not differentiable (lect1).

2.9 Local and global extrema

Let $\mathbf{x}_0 \in A$ . The point $\mathbf{x}_0$ is:

a local minimum of $f$ if there is $r > 0$ with $f(\mathbf{x}_0) \le f(\mathbf{x})$ for every $\mathbf{x} \in U_r(\mathbf{x}_0) \cap A$ ;
a strong (strict) local minimum if $f(\mathbf{x}_0) < f(\mathbf{x})$ for every $\mathbf{x} \in U_r(\mathbf{x}_0) \cap A$ with $\mathbf{x} \neq \mathbf{x}_0$ ;
a global (resp. strong global) minimum if the corresponding inequality holds for every $\mathbf{x} \in A$ .

Maxima are defined by reversing the inequalities. A stationary point that is neither a local max nor a local min is a saddle point (e.g., $(0,0)$ for $f(x,y) = x^2 - y^2$ ).

2.10 Convex set, convex/concave function

A set $C \subseteq \mathbb{R}^n$ is convex if for every $\mathbf{x}, \mathbf{y} \in C$ and every $t \in [0,1]$ , $t\mathbf{x} + (1-t)\mathbf{y} \in C$ (the segment joining any two points stays in $C$ ).

Let $C \subseteq \mathbb{R}^n$ be convex. A function $f : C \to \mathbb{R}$ is convex on $C$ if $f\bigl(t\mathbf{x} + (1-t)\mathbf{y}\bigr) \le t f(\mathbf{x}) + (1-t) f(\mathbf{y}) \qquad \forall\, \mathbf{x}, \mathbf{y} \in C, \forall\, t \in [0,1].$ It is strictly convex if the inequality is strict whenever $\mathbf{x} \neq \mathbf{y}$ and $t \in (0,1)$ . $f$ is (strictly) concave iff $-f$ is (strictly) convex.

Tangent-hyperplane characterisation (lect4, for $f$ differentiable on an open convex set $C$ ):

$f$ is convex on $C$ $\iff$ $f(\mathbf{x}) \ge f(\mathbf{x}_0) + \nabla f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0)$ for all $\mathbf{x}, \mathbf{x}_0 \in C$ (tangent hyperplane sits below the graph);
$f$ is strictly convex on $C$ $\iff$ the inequality above is strict whenever $\mathbf{x} \neq \mathbf{x}_0$ .

The 1-variable analogues ( $f(x) \ge f(x_0) + f'(x_0)(x - x_0)$ ) and the second-derivative tests ( $f''(x) \ge 0$ for convex, $f''(x) > 0$ for strictly convex) are recovered as special cases $n = 1$ .

2.11 Coercive, super-coercive, level set

A level set of $f$ at height $c$ is $\{\mathbf{x} \in A : f(\mathbf{x}) = c\}$ (or $\le c$ for the sub-level set).

$f : \mathbb{R}^n \to \mathbb{R}$ is coercive on a set $A$ (Sicconi's convention = super-coercive towards $+\infty$ ) if $\lim_{\|\mathbf{x}\| \to \infty} f(\mathbf{x}) = +\infty$ . Equivalently, every sub-level set $\{f \le c\}$ is bounded. If $\lim_{\|\mathbf{x}\| \to \infty} f(\mathbf{x}) = -\infty$ we say $f$ is super-coercive towards $-\infty$ (the symmetric case used for existence of global maxima).

Composition rule (lect4). If the inner function $g : \mathbb{R}^n \to \mathbb{R}$ is continuous and coercive towards $-\infty$ , and the outer function $h : \mathbb{R} \to \mathbb{R}$ is strictly increasing, then $h \circ g$ is coercive (towards $\lim_{t \to -\infty} h(t)$ ). This is how $e^{-x^2 - y^2}$ gets its super-coercivity towards $0$ from above (hence $-e^{-x^2-y^2}$ is super-coercive towards $0$ , but $f = e^{-x^2-y^2}$ has supremum equal to $1$ , making $(0,0)$ the global maximiser).

2.12 Implicit function — setup for Dini

Let $g : C \subseteq \mathbb{R}^2 \to \mathbb{R}$ and $c \in \mathbb{R}$ . Consider the level curve $g(x,y) = c$ . We ask: does there exist a function $f : I \subseteq \mathbb{R} \to \mathbb{R}$ with $y = f(x)$ such that $g(x, f(x)) = c$ for all $x \in I$ ? If yes, we say $f$ is the implicit function defined by $g(x,y) = c$ near a specific point. Dini's theorem (§3.6) gives sufficient conditions for local existence, uniqueness, and a formula for $f'(x_0)$ .

§3. Theorems, Propositions & Proofs

3.1 Schwarz's theorem — symmetry of the Hessian

Statement. Let $A \subseteq \mathbb{R}^n$ be open and $f : A \to \mathbb{R}$ be $C^{(2)}$ on $A$ . Then $\frac{\partial^2 f}{\partial x_i \partial x_j}(\mathbf{x}) \;=\; \frac{\partial^2 f}{\partial x_j \partial x_i}(\mathbf{x}) \qquad \forall i, j, \ \forall\, \mathbf{x} \in A.$ In particular $\nabla^2 f(\mathbf{x})$ is a symmetric matrix.

Source: lect1_calc-diff.pdf (stated without proof, as is course convention).

Why it matters. Every theorem that follows in §3 uses the fact that $\nabla^2 f(\mathbf{x}_0)$ is symmetric, so all the sign-classification tools from Linear Algebra (eigenvalue signs, Sylvester–Jacobi leading principal minors) apply to the Hessian.

3.2 Theorem 1481 — Second-order Taylor formula with Peano remainder

Statement. Let $A \subseteq \mathbb{R}^n$ be open, $\mathbf{x}_0 \in A$ , and $f \in C^{(2)}(A)$ . Then for every $\mathbf{x} = \mathbf{x}_0 + \mathbf{h}$ in a neighbourhood $B(\mathbf{x}_0) \subseteq A$ , $\boxed{\, f(\mathbf{x}) \;=\; f(\mathbf{x}_0) \;+\; \nabla f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0) \;+\; \tfrac{1}{2}(\mathbf{x} - \mathbf{x}_0)^T \nabla^2 f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0) \;+\; o(\|\mathbf{x} - \mathbf{x}_0\|^2) \,}$ as $\mathbf{x} \to \mathbf{x}_0$ .

Source: lect2_calc_diff (2).pdf (statement); proof sketch in Proofs - Differential Calculus.pdf p.3 inside the proof of Theorem 1391.

Structure of the formula.

$f(\mathbf{x}_0)$ — the zero-order term.
$\nabla f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0)$ — the linear / first-order term (tangent hyperplane).
$\tfrac{1}{2}(\mathbf{x} - \mathbf{x}_0)^T \nabla^2 f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0)$ — the quadratic term (a quadratic form in $\mathbf{h} = \mathbf{x} - \mathbf{x}_0$ ).
$o(\|\mathbf{x} - \mathbf{x}_0\|^2)$ — the Peano remainder: a function $R(\mathbf{h})$ with $R(\mathbf{h}) / \|\mathbf{h}\|^2 \to 0$ as $\mathbf{h} \to \mathbf{0}$ .

The $n=1$ special case. $f(x) = f(x_0) + f'(x_0)(x - x_0) + \tfrac{1}{2} f''(x_0)(x - x_0)^2 + o\bigl((x - x_0)^2\bigr).$ Maclaurin is Taylor with $\mathbf{x}_0 = \mathbf{0}$ .

Remark on notation. The factor $1/2$ in the quadratic term comes from $1/2!$ . For general order- $k$ expansion the coefficient would be $1/k!$ , but the course stops at second order.

3.3 Theorem 1391 — Second-order conditions for unconstrained local max/min (with proof)

Statement. Let $A \subseteq \mathbb{R}^n$ be open, $f \in C^{(2)}(A)$ , and $\mathbf{x}_0 \in A$ be a stationary point (i.e., $\nabla f(\mathbf{x}_0) = \mathbf{0}$ ). Then:

(1) Necessary condition. If $\mathbf{x}_0$ is a local maximiser (resp. minimiser) of $f$ , then the quadratic form $\mathbf{h}^T \nabla^2 f(\mathbf{x}_0) \mathbf{h}$ is negative semi-definite (resp. positive semi-definite).

(2) Sufficient condition. If the quadratic form $\mathbf{h}^T \nabla^2 f(\mathbf{x}_0) \mathbf{h}$ is negative definite (resp. positive definite), then $\mathbf{x}_0$ is a strong local maximiser (resp. strong local minimiser) of $f$ .

Consequently, if $\nabla^2 f(\mathbf{x}_0)$ is indefinite, $\mathbf{x}_0$ is a saddle point.

Source: Proofs - Differential Calculus.pdf p.3–4 (handwritten theorem 1391, "only (S)" — only the sufficient half is proved in the compendium, following Sicconi's syllabus).

Proof of (2) — sufficient condition, maximiser case.

Assume $\nabla^2 f(\mathbf{x}_0)$ defines a negative-definite quadratic form. We show $\mathbf{x}_0$ is a strong local maximiser.

Step 1 — Apply Taylor (Theorem 1481). Since $f \in C^{(2)}$ on a neighbourhood $U(\mathbf{x}_0)$ , for every $\mathbf{x} \in U(\mathbf{x}_0)$ , $f(\mathbf{x}) \;=\; f(\mathbf{x}_0) + \nabla f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0) + \tfrac{1}{2}(\mathbf{x} - \mathbf{x}_0)^T \nabla^2 f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0) + o(\|\mathbf{x} - \mathbf{x}_0\|^2), \quad \mathbf{x} \to \mathbf{x}_0.$ (With $\nabla^2 f(\mathbf{x}_0)$ symmetric, by Schwarz.) Because $\mathbf{x}_0$ is stationary, $\nabla f(\mathbf{x}_0) = \mathbf{0}$ , so the linear term vanishes: $f(\mathbf{x}) - f(\mathbf{x}_0) \;=\; \tfrac{1}{2}(\mathbf{x} - \mathbf{x}_0)^T \nabla^2 f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0) + o(\|\mathbf{x} - \mathbf{x}_0\|^2).$

Step 2 — Substitute $\mathbf{h} = \mathbf{x} - \mathbf{x}_0$ . $f(\mathbf{x}) - f(\mathbf{x}_0) \;=\; \tfrac{1}{2}\,\mathbf{h}^T \nabla^2 f(\mathbf{x}_0)\, \mathbf{h} \;+\; o(\|\mathbf{h}\|^2) \qquad \mathbf{h} \to \mathbf{0}.$

Step 3 — Renormalise. Assume $\mathbf{x} \neq \mathbf{x}_0$ , so $\mathbf{h} \neq \mathbf{0}$ , and multiply–divide the right-hand side by $\|\mathbf{h}\|^2$ : $f(\mathbf{x}) - f(\mathbf{x}_0) \;=\; \|\mathbf{h}\|^2 \cdot \left[\, \tfrac{1}{2}\,\frac{\mathbf{h}^T \nabla^2 f(\mathbf{x}_0)\, \mathbf{h}}{\|\mathbf{h}\|^2} \;+\; \frac{o(\|\mathbf{h}\|^2)}{\|\mathbf{h}\|^2}\, \right] \;=\; \|\mathbf{h}\|^2 \cdot \left[\, \tfrac{1}{2}\, \mathbf{v}^T \nabla^2 f(\mathbf{x}_0) \mathbf{v} + o(1)\, \right],$ where $\mathbf{v} = \mathbf{h} / \|\mathbf{h}\|$ is a unit vector, and we used the definition of little- $o$ to write $o(\|\mathbf{h}\|^2) / \|\mathbf{h}\|^2 = o(1)$ as $\mathbf{h} \to \mathbf{0}$ .

Step 4 — Maximise the quadratic form over the unit sphere. Consider the continuous function $Q(\mathbf{v}) = \mathbf{v}^T \nabla^2 f(\mathbf{x}_0) \mathbf{v} \qquad \text{on} \qquad S = \{\mathbf{v} \in \mathbb{R}^n : \|\mathbf{v}\| = 1\},$ which is the unit sphere — a compact set (closed and bounded). By Weierstrass, $Q$ attains a maximum $q^{\ast}$ on $S$ . Because $\nabla^2 f(\mathbf{x}_0)$ is negative definite, $Q(\mathbf{v}) < 0$ for every $\mathbf{v} \neq \mathbf{0}$ ; in particular $Q(\mathbf{v}) < 0$ for every $\mathbf{v} \in S$ (note $\mathbf{0} \notin S$ ), so $q^{\ast} \;=\; \max_{\mathbf{v} \in S} Q(\mathbf{v}) \;<\; 0.$

Step 5 — Bound $f(\mathbf{x}) - f(\mathbf{x}_0)$ strictly below zero. Substituting into Step 3: $f(\mathbf{x}) - f(\mathbf{x}_0) \;\le\; \|\mathbf{h}\|^2 \Bigl[\tfrac{1}{2} q^{\ast} + o(1)\Bigr].$ We want this bracket to be negative. Pick any $\varepsilon > 0$ ; by the definition of $o(1)$ , there is $\delta > 0$ such that $0 < \|\mathbf{h}\| < \delta$ implies $|o(1)| < \varepsilon$ , i.e., $-\varepsilon < o(1) < \varepsilon$ . Choosing the concrete value $\overline{\varepsilon} = -\tfrac{1}{4} q^{\ast}$ (which is strictly positive because $q^{\ast} < 0$ ), we get for $\|\mathbf{h}\| < \overline{\delta}$ : $\tfrac{1}{2} q^{\ast} + o(1) \;<\; \tfrac{1}{2} q^{\ast} - \tfrac{1}{4} q^{\ast} \;=\; \tfrac{1}{4} q^{\ast} \;<\; 0.$ Therefore $f(\mathbf{x}) - f(\mathbf{x}_0) \;\le\; \underbrace{\|\mathbf{h}\|^2}_{>0} \cdot \underbrace{\bigl(\tfrac{1}{4} q^{\ast}\bigr)}_{<0} \;<\; 0,$ i.e., $f(\mathbf{x}) < f(\mathbf{x}_0)$ for every $\mathbf{x} \neq \mathbf{x}_0$ in a neighbourhood of $\mathbf{x}_0$ . Hence $\mathbf{x}_0$ is a strong local maximiser. $\blacksquare$

Minimiser case. Replace "negative definite" by "positive definite"; then $q^{\ast} > 0$ (in fact $q_\star = \min Q > 0$ by the same Weierstrass argument applied to $-Q$ ), and the inequality flips to $f(\mathbf{x}) > f(\mathbf{x}_0)$ .

Saddle case. If $\nabla^2 f(\mathbf{x}_0)$ is indefinite there exist unit vectors $\mathbf{v}_+, \mathbf{v}_-$ with $Q(\mathbf{v}_+) > 0$ and $Q(\mathbf{v}_-) < 0$ . Taylor along each direction shows $f$ increases along $\mathbf{v}_+$ and decreases along $\mathbf{v}_-$ , so $\mathbf{x}_0$ is a saddle point.

Necessary condition (part 1), sketch. If $\mathbf{x}_0$ is a local maximiser, then $f(\mathbf{x}) \le f(\mathbf{x}_0)$ nearby, so the Taylor remainder $\tfrac{1}{2}\mathbf{h}^T \nabla^2 f(\mathbf{x}_0)\mathbf{h} + o(\|\mathbf{h}\|^2) \le 0$ . Dividing by $\|\mathbf{h}\|^2$ and taking $\mathbf{h} \to \mathbf{0}$ along any direction $\mathbf{v}$ gives $\mathbf{v}^T \nabla^2 f(\mathbf{x}_0) \mathbf{v} \le 0$ , i.e., negative semi-definite.

Summary table.

| Sign of $\nabla^2 f(\mathbf{x}_0)$ | Conclusion | Role | |---|---|---| | positive definite | strong local min | sufficient | | negative definite | strong local max | sufficient | | indefinite | saddle | sufficient | | positive semi-definite (not def.) | local min is possible, not guaranteed | necessary only | | negative semi-definite (not def.) | local max is possible, not guaranteed | necessary only |

Critical pitfall. Semi-definite $\Rightarrow$ cannot conclude. Need a direct inspection (as in Example 4.3 below, $f(x,y) = x^2 y$ at $(0,y)$ ).

3.4 Theorem/Proposition 1486 — Concavity/strict-concavity test via the Hessian

Statement. Let $C \subseteq \mathbb{R}^n$ be an open convex set and $f \in C^{(2)}(C)$ . Then: (a) $f$ is convex on $C$ $\iff$ $\nabla^2 f(\mathbf{x})$ is positive semi-definite for every $\mathbf{x} \in C$ . (b) $f$ is strictly convex on $C$ $\iff$ $\nabla^2 f(\mathbf{x})$ is positive definite for every $\mathbf{x} \in C$ .

Symmetric statements for concave / strictly concave by flipping all signs.

Source: Proofs - Differential Calculus.pdf (ref to prop 1486); lect 4_calc_diff (3).pdf (statement); General_24524_ENG_SOL.pdf Q8(a) Mode A / Q7(a) Mode B cites "see the textbook, proposition 1486" for the concavity test.

Proof (convex case ⇐, the direction most used on exams).

Assume $\nabla^2 f(\mathbf{x})$ is positive semi-definite for every $\mathbf{x} \in C$ . Pick $\mathbf{x}, \mathbf{x}_0 \in C$ . By the 2nd-order Taylor formula with Lagrange remainder (the integral / mean-value form, course-level), there exists $\xi$ on the segment $[\mathbf{x}_0, \mathbf{x}]$ (which stays in $C$ because $C$ is convex) such that $f(\mathbf{x}) \;=\; f(\mathbf{x}_0) + \nabla f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0) + \tfrac{1}{2}(\mathbf{x} - \mathbf{x}_0)^T \nabla^2 f(\xi)(\mathbf{x} - \mathbf{x}_0).$ Since $\nabla^2 f(\xi)$ is PSD, the last term is $\ge 0$ , so $f(\mathbf{x}) \;\ge\; f(\mathbf{x}_0) + \nabla f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0).$ This is the tangent-hyperplane inequality for convex functions, which (lect4) is equivalent to convexity of $f$ on $C$ .

Strict case (b), ⇐. If $\nabla^2 f(\mathbf{x})$ is positive definite everywhere, then for $\mathbf{x} \neq \mathbf{x}_0$ the last term is strictly positive, giving the strict tangent-hyperplane inequality — equivalent to strict convexity.

Converse (⇒), sketch. Convexity implies $f(\mathbf{x}) \ge f(\mathbf{x}_0) + \nabla f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0)$ . Apply Taylor, subtract the linear part, divide by $\|\mathbf{x} - \mathbf{x}_0\|^2$ : the lower bound $\ge 0$ transfers to the quadratic form $\mathbf{v}^T \nabla^2 f(\mathbf{x}_0) \mathbf{v} \ge 0$ for every unit direction $\mathbf{v}$ , i.e., PSD. $\blacksquare$

One-variable shadow. The standard "convex $\iff$ $f''(x) \ge 0$ " is exactly the $n = 1$ case.

Lecture 4 sidenote. For a strictly convex function the Hessian can fail to be positive definite at isolated points (classic example $f(x) = x^4$ has $f''(0) = 0$ but is strictly convex on $\mathbb{R}$ ). The textbook equivalence (b) as stated holds under the $C^{(2)}$ assumption; in practice the PSD-everywhere plus "PD somewhere" condition is what is checked.

3.5 Theorem 1497 — Global extrema of differentiable convex/concave functions

Statement. Let $C \subseteq \mathbb{R}^n$ be an open convex set, $f : C \to \mathbb{R}$ differentiable, and $\mathbf{x}_0 \in C$ . (1) If $f$ is concave on $C$ , then $\mathbf{x}_0 \text{ is a global maximiser for } f \iff \nabla f(\mathbf{x}_0) = \mathbf{0}.$ (2) If $f$ is convex on $C$ , then $\mathbf{x}_0 \text{ is a global minimiser for } f \iff \nabla f(\mathbf{x}_0) = \mathbf{0}.$

Source: Proofs - Differential Calculus.pdf p.5 (handwritten theorem 1497).

Proof of (1), $\Rightarrow$ (global max $\Rightarrow \nabla f(\mathbf{x}_0) = \mathbf{0}$ ).

Since $\mathbf{x}_0$ is a global maximiser, it is also a local maximiser. By Fermat's theorem (§3.7), and because $C$ is open (so $\mathbf{x}_0$ is an interior point), $\nabla f(\mathbf{x}_0) = \mathbf{0}$ .

Proof of (1), $\Leftarrow$ ( $\nabla f(\mathbf{x}_0) = \mathbf{0} \Rightarrow$ global max).

Since $f$ is concave and differentiable on the open convex set $C$ , the tangent-hyperplane characterisation of concavity gives $f(\mathbf{x}) \;\le\; f(\mathbf{x}_0) + \nabla f(\mathbf{x}_0)(\mathbf{x} - \mathbf{x}_0) \qquad \forall\, \mathbf{x} \in C.$ (Concave: the tangent hyperplane sits above the graph.) Because $\nabla f(\mathbf{x}_0) = \mathbf{0}$ , the linear term vanishes and $f(\mathbf{x}) \le f(\mathbf{x}_0) \qquad \forall\, \mathbf{x} \in C.$ Hence $\mathbf{x}_0$ is a global maximiser.

Proof of (2). Identical, with $\le$ replaced by $\ge$ , concave replaced by convex, maximiser by minimiser. $\blacksquare$

Why this is the single most useful theorem for open-ended exams. If you can prove the Hessian is PSD/PD everywhere on $C$ , then any stationary point is automatically a global minimiser (not just local) — no neighbourhood argument required. This is exactly the punchline of Final exercises Q2(b): the Hessian of $x + 3y^2 - \ln(x+2y)$ is PD throughout the domain, so $(1/3, 1/3)$ is a global min.

3.6 Theorem 1591 — Global maximum of a concave function (extension to continuous case)

Statement. Let $C \subseteq \mathbb{R}^n$ be a convex set (not necessarily open), $f : C \to \mathbb{R}$ be continuous on $C$ , concave on $C$ , and differentiable on $\text{int}(C)$ . Let $\hat{\mathbf{x}} \in \text{int}(C)$ . Then $\hat{\mathbf{x}} \text{ is a global maximiser of } f \text{ on } C \iff \nabla f(\hat{\mathbf{x}}) = \mathbf{0}.$ Symmetric statement for convex functions and global minimisers.

Source: Sicconi's course reference (textbook theorem 1591). Not proved in the compendium; follows from 1497 plus the standard continuous-extension argument.

Remark. This upgrades Theorem 1497 from open convex sets to arbitrary convex sets, provided the candidate stationary point lies in the interior. Points on the boundary still need separate treatment.

3.7 Fermat's theorem — necessary condition for interior extrema

Statement. Let $A \subseteq \mathbb{R}^n$ and $f : A \to \mathbb{R}$ . If $\mathbf{x}_0$ is an interior point of $A$ , $f$ is differentiable at $\mathbf{x}_0$ , and $\mathbf{x}_0$ is a local extremum (min or max) of $f$ , then $\nabla f(\mathbf{x}_0) = \mathbf{0}.$

Source: lect 4_calc_diff (3).pdf (applied inside the proof of Theorem 1497).

Proof. Without loss of generality, assume $\mathbf{x}_0$ is a local maximiser. Fix any direction $\mathbf{v} \in \mathbb{R}^n$ with $\|\mathbf{v}\| = 1$ , and consider the one-variable function $\varphi(t) := f(\mathbf{x}_0 + t \mathbf{v})$ , defined on a small interval $(-\delta, \delta)$ (possible because $\mathbf{x}_0$ is interior). Then $t = 0$ is a local maximum of $\varphi$ , so by the one-variable Fermat theorem $\varphi'(0) = 0$ . By the chain rule, $\varphi'(0) = \nabla f(\mathbf{x}_0) \cdot \mathbf{v}$ . Since this must equal $0$ for every direction $\mathbf{v}$ , we get $\nabla f(\mathbf{x}_0) = \mathbf{0}$ . $\blacksquare$

Converse is false. Not every stationary point is an extremum — saddles are also stationary. Fermat is a necessary, not sufficient, condition.

3.8 Dini's theorem (Implicit function theorem, local 2D version)

Statement. Let $g : A \subseteq \mathbb{R}^2 \to \mathbb{R}$ and let $P = (x_0, y_0) \in A$ satisfy $g(x_0, y_0) = c$ . Suppose: (i) $g \in C^{(1)}$ in a neighbourhood $B(x_0, y_0) \subseteq A$ ; (ii) $\dfrac{\partial g}{\partial y}(x_0, y_0) \neq 0$ .

Then there exist neighbourhoods $U(x_0) \subseteq \mathbb{R}$ and $V(y_0) \subseteq \mathbb{R}$ (with $U \times V \subseteq B(x_0, y_0)$ ) and a unique function $f : U(x_0) \to V(y_0)$ with $f(x_0) = y_0$ such that $g(x, f(x)) = c \qquad \forall x \in U(x_0).$ Furthermore $f$ is differentiable at $x_0$ and $\boxed{\, f'(x_0) \;=\; -\, \frac{\partial g / \partial x \,(x_0, y_0)}{\partial g / \partial y \,(x_0, y_0)} \,}.$

Source: lect5_calc_diff.pdf "Dini's Theorem: Implicit Theorem, local version"; Final exercises solution.pdf Q3 applies the formula with the minus sign.

Idea. The non-vanishing of $\partial g / \partial y$ at $P$ says the level curve $g = c$ is non-vertical at $P$ — no vertical tangent — so locally we can invert the $y$ -coordinate and express $y$ as a function of $x$ . The derivative formula comes from differentiating $g(x, f(x)) = c$ implicitly and using the chain rule: $\frac{\partial g}{\partial x}(x, f(x)) + \frac{\partial g}{\partial y}(x, f(x)) \cdot f'(x) \;=\; 0 \;\Longrightarrow\; f'(x) \;=\; -\, \frac{g_x}{g_y}.$

Global existence/uniqueness (lect5). If instead of just $P$ we work on a rectangular domain $A \times B \subseteq \mathbb{R}^2$ (with $B$ an interval) and we have (1) $\inf_{y \in B} g(x, y) < c < \sup_{y \in B} g(x, y)$ for every $x \in A$ ; (2) $g$ continuous in $y$ (for each fixed $x$ ); (3) $g$ strictly monotonic in $y$ (for each fixed $x$ ) — equivalently $\partial g / \partial y$ has constant sign,

then there exists a unique global $f : A \to B$ with $g(x, f(x)) = c$ for every $x \in A$ .

Caveat (lect5 MCQ). If $\partial g / \partial y (x_0, y_0) = 0$ (vertical tangent), Dini fails. Example: an ellipse at its leftmost / rightmost points fails the vertical-line test: a small move to the right leaves the ellipse entirely, so no function $y = f(x)$ is definable there.

3.9 Weierstrass theorem and coercive-function existence (Tonelli)

Weierstrass (stated in lect4). If $K \subseteq \mathbb{R}^n$ is compact (closed and bounded) and $f : K \to \mathbb{R}$ is continuous, then $f$ attains both its maximum and its minimum on $K$ : $\exists\, \mathbf{x}_{\max}, \mathbf{x}_{\min} \in K$ with $f(\mathbf{x}_{\max}) = \max_K f$ , $f(\mathbf{x}_{\min}) = \min_K f$ .

Tonelli / coercive theorem (stated in lect4). If $f : \mathbb{R}^n \to \mathbb{R}$ is continuous and coercive ( $\lim_{\|\mathbf{x}\|\to\infty} f(\mathbf{x}) = +\infty$ ), then $f$ attains a global minimum. If $f$ is continuous and super-coercive towards $-\infty$ ( $\lim_{\|\mathbf{x}\|\to\infty} f(\mathbf{x}) = -\infty$ ), then $f$ attains a global maximum.

Why coercivity alone gives existence. A continuous coercive $f$ has bounded sub-level sets. Pick any $c > \inf f$ ; the sub-level set $K = \{f \le c\}$ is non-empty, closed (continuity), and bounded (coercivity) — hence compact. By Weierstrass, $f|_K$ attains its minimum; that minimum is automatically the global minimum of $f$ on $\mathbb{R}^n$ (points outside $K$ have $f$ -values $> c \ge \min_K f$ ).

Composition rule (lect4). If $g : \mathbb{R}^n \to \mathbb{R}$ is continuous with $g(\mathbf{x}) \to -\infty$ as $\|\mathbf{x}\| \to \infty$ , and $h : \mathbb{R} \to \mathbb{R}$ is continuous and strictly increasing, then $h \circ g$ inherits the coercivity behaviour of $g$ filtered through $h$ . Example: $f(x,y) = e^{-x^2 - y^2}$ : inner $-x^2 - y^2 \to -\infty$ (super-coercive towards $-\infty$ ), outer $e^t$ strictly increasing, so $f \to e^{-\infty} = 0^+$ as $\|(x,y)\| \to \infty$ ; hence the supremum $1$ must be attained at a stationary point, which is $(0,0)$ — a global maximum.

Relaxation of constraints (lect4). When the admissible set is not compact and $f$ not coercive, one can sometimes relax the constraint and solve the unconstrained problem, then check whether the unconstrained optimum lies in the original set. If yes, it is the constrained optimum too; if no, the method fails and Lagrange multipliers or boundary analysis are needed.

§4. Worked Examples

Example 4.1 — Gradient, Hessian, and stationary points of $f(x,y) = e^x(x-1)\,y + y^2$

Source: May 2024 Q8 (Mode A) / Q7 (Mode B), General_24524_ENG_SOL.pdf.

Gradient. Treat $y$ as a constant: $f'_x = \frac{\partial}{\partial x}\bigl[y\, e^x(x-1)\bigr] = y\bigl[e^x(x-1) + e^x\bigr] = y \cdot e^x \cdot x = x y e^x.$ Treat $x$ as a constant: $f'_y = \frac{\partial}{\partial y}\bigl[y\, e^x(x-1) + y^2\bigr] = e^x(x-1) + 2y.$ $\boxed{\nabla f(x,y) \;=\; \bigl[\,x y e^x, \quad 2y + e^x(x-1)\,\bigr].}$

Stationary points. Solve $\nabla f(x,y) = \mathbf{0}$ : $\begin{cases} x y e^x = 0 \\ 2y + e^x(x-1) = 0 \end{cases}$ The first equation gives $x = 0$ or $y = 0$ (since $e^x > 0$ always).

Case $x = 0$ . The second equation becomes $2y + e^0 \cdot (0 - 1) = 2y - 1 = 0$ , so $y = 1/2$ . Stationary point $P = (0, 1/2)$ .
Case $y = 0$ . The second equation becomes $0 + e^x(x - 1) = 0 \Rightarrow x - 1 = 0 \Rightarrow x = 1$ (since $e^x > 0$ ). Stationary point $Q = (1, 0)$ .

Hessian. Compute second derivatives: $f''_{xx} = \frac{\partial}{\partial x}(x y e^x) = y e^x + x y e^x = y e^x(x + 1),$ $f''_{xy} = \frac{\partial}{\partial y}(x y e^x) = x e^x \quad (= f''_{yx} \text{ by Schwarz}),$ $f''_{yy} = \frac{\partial}{\partial y}\bigl(2y + e^x(x-1)\bigr) = 2.$ $\nabla^2 f(x,y) \;=\; \begin{pmatrix} y e^x (x+1) & x e^x \\ x e^x & 2 \end{pmatrix}.$

Classify $P = (0, 1/2)$ . $\nabla^2 f(0, 1/2) = \begin{pmatrix} (1/2) e^0 (0+1) & 0 \\ 0 & 2 \end{pmatrix} = \begin{pmatrix} 1/2 & 0 \\ 0 & 2 \end{pmatrix}.$ Leading principal minors: $\det A_1 = 1/2 > 0$ and $\det A_2 = 1 > 0$ . Positive definite. By Theorem 1391, $P = (0, 1/2)$ is a strong local minimiser.

Classify $Q = (1, 0)$ . $\nabla^2 f(1, 0) = \begin{pmatrix} 0 \cdot e \cdot 2 & 1 \cdot e \\ e & 2 \end{pmatrix} = \begin{pmatrix} 0 & e \\ e & 2 \end{pmatrix}.$ $\det A_1 = 0$ , $\det A_2 = 0 \cdot 2 - e^2 = -e^2 < 0$ . Since $\det A_2 \neq 0$ , we can finalise: one eigenvalue is positive, the other negative (product $= \det A_2 < 0$ ), so the Hessian is indefinite. By Theorem 1391, $Q = (1, 0)$ is a saddle point — not a local optimiser.

Global convexity of $f$ on $\mathbb{R}^2$ ? By Prop. 1486 (§3.4), $f$ is convex on $\mathbb{R}^2$ iff $\nabla^2 f(x,y)$ is PSD at every point. We found $\nabla^2 f(1, 0)$ is indefinite — so $f$ is not convex on $\mathbb{R}^2$ . (Official May 2024 solution: "The function $f$ is not convex on all $\mathbb{R}^2$ , because $\nabla^2 f(x,y)$ is not positive semidefinite on all $(x,y) \in \mathbb{R}^2$ . For example, this does not happen with point $(1, 0)$ .")

Example 4.2 — Second-order Taylor of $f(x,y) = x^2 + x \cos y$ at $(1, -\pi/2)$

Source: May 2024 MCQ2 (Mode A) / MCQ1 (Mode B).

Step 1 — function value. $f(1, -\pi/2) = 1^2 + 1 \cdot \cos(-\pi/2) = 1 + 0 = 1$ .

Step 2 — gradient. $f'_x = 2x + \cos y$ , $f'_y = -x \sin y$ . At $(1, -\pi/2)$ :

$f'_x(1, -\pi/2) = 2 + \cos(-\pi/2) = 2 + 0 = 2$ ;
$f'_y(1, -\pi/2) = -1 \cdot \sin(-\pi/2) = -1 \cdot (-1) = 1$ .

Thus $\nabla f(1, -\pi/2) = [2 \; 1]$ .

Step 3 — Hessian. $f''_{xx} = 2$ , $f''_{xy} = -\sin y$ , $f''_{yy} = -x \cos y$ . At $(1, -\pi/2)$ :

$f''_{xx} = 2$ ; $f''_{xy} = -\sin(-\pi/2) = -(-1) = 1$ ; $f''_{yy} = -1 \cdot \cos(-\pi/2) = -1 \cdot 0 = 0$ .

Thus $\nabla^2 f(1, -\pi/2) = \begin{pmatrix} 2 & 1 \\ 1 & 0 \end{pmatrix}$ .

Step 4 — assemble Taylor formula. With $\mathbf{h} = (x - 1, \; y + \pi/2)$ : $f(x,y) \;=\; 1 + [2 \; 1]\begin{pmatrix} x-1 \\ y + \pi/2 \end{pmatrix} + \tfrac{1}{2}[x-1 \; \; y + \pi/2]\begin{pmatrix} 2 & 1 \\ 1 & 0 \end{pmatrix}\begin{pmatrix} x-1 \\ y + \pi/2 \end{pmatrix} + o\bigl(\|(x-1, y+\pi/2)\|^2\bigr).$

Step 5 — expand. Linear part: $2(x-1) + (y + \pi/2)$ . Quadratic form: $\tfrac{1}{2}[2(x-1)^2 + 2(x-1)(y+\pi/2)] = (x-1)^2 + (x-1)(y+\pi/2)$ . After simplification the official answer is: $f(x,y) \;=\; \tfrac{\pi}{2} x + x^2 + x y + o\bigl(\|(x-1, y+\pi/2)\|^2\bigr).$

Answer. None of the listed options matches exactly ⇒ the correct MCQ choice is "none of the others" (Mode A option D, Mode B option C).

Example 4.3 — A case where the Hessian is semi-definite (`lect 4_calc_diff (3).pdf`)

$f(x,y) = x^2 y$ . Find all extrema.

Stationary points. $\nabla f = [2xy, \; x^2] = [0 \; 0]$ gives $x^2 = 0 \Rightarrow x = 0$ , and then $2 \cdot 0 \cdot y = 0$ for every $y$ — so the entire $y$ -axis is stationary: $(0, y)$ for every $y \in \mathbb{R}$ .

Hessian. $\nabla^2 f = \begin{pmatrix} 2y & 2x \\ 2x & 0 \end{pmatrix}$ , so $\nabla^2 f(0, y) = \begin{pmatrix} 2y & 0 \\ 0 & 0 \end{pmatrix}$ .

If $y > 0$ : Hessian is positive semi-definite (eigenvalues $2y, 0$ ), not definite. Theorem 1391 inconclusive.
If $y < 0$ : Hessian is negative semi-definite, not definite. Theorem 1391 inconclusive.
If $y = 0$ : Hessian is the zero matrix, the most degenerate case.

Direct inspection. Write $f(x, y) = x^2 y$ near $(0, y_0)$ .

If $y_0 > 0$ : in a small nbhd, $y$ stays positive, so $f(x, y) = x^2 y \ge 0 = f(0, y_0)$ , with equality only on the $y$ -axis. Hence $(0, y_0)$ is a weak (non-strict) local minimum.
If $y_0 < 0$ : symmetrically, $(0, y_0)$ is a weak local maximum.
If $y_0 = 0$ : the origin. For any neighbourhood, pick $y > 0$ : $f(x,y) > 0$ ; pick $y < 0$ : $f(x,y) < 0$ . So $(0,0)$ is a saddle point — neither max nor min.

Lesson. Semi-definite Hessian ⇒ you must argue by direct inspection or higher-order expansion. Theorem 1391 is silent.

Example 4.4 — $f(x,y) = x + 3 y^2 - \ln(x + 2y)$ (domain, stationary point, convexity, Taylor)

Source: Final exercises solution.pdf Q2.

Domain. The logarithm requires $x + 2y > 0$ , so $\text{Dom}(f) = \{(x, y) \in \mathbb{R}^2 : x + 2y > 0\}.$ This is the open half-plane above the line $x + 2y = 0$ ; it is open and convex.

Gradient. $f'_x = 1 - \frac{1}{x + 2y}, \qquad f'_y = 6y - \frac{2}{x + 2y}.$ $\nabla f(x,y) = \Bigl[\,1 - \tfrac{1}{x+2y},\ \ 6y - \tfrac{2}{x+2y}\,\Bigr].$

Stationary points. Set $\nabla f = \mathbf{0}$ : $\begin{cases} 1 - \frac{1}{x+2y} = 0 \\ 6y - \frac{2}{x+2y} = 0 \end{cases} \iff \begin{cases} x + 2y = 1 \\ 6y = \frac{2}{x+2y} = 2 \end{cases} \iff \begin{cases} x + 2y = 1 \\ y = 1/3 \end{cases} \iff x = 1 - 2/3 = 1/3,\ y = 1/3.$ Unique stationary point: $(1/3, 1/3)$ . Check domain: $1/3 + 2/3 = 1 > 0$ . $\checkmark$

Hessian. Let $u = x + 2y$ . Then $\frac{1}{u}$ has derivatives $\partial_x = -1/u^2$ and $\partial_y = -2/u^2$ . $f''_{xx} = \partial_x\Bigl[1 - u^{-1}\Bigr] = u^{-2} = (x + 2y)^{-2},$ $f''_{xy} = \partial_y\Bigl[1 - u^{-1}\Bigr] = 2 u^{-2} = 2(x+2y)^{-2},$ $f''_{yy} = \partial_y\bigl[6y - 2 u^{-1}\bigr] = 6 + 4 u^{-2} = 6 + 4(x+2y)^{-2}.$ $\nabla^2 f(x,y) = \begin{pmatrix} (x+2y)^{-2} & 2(x+2y)^{-2} \\ 2(x+2y)^{-2} & 6 + 4(x+2y)^{-2} \end{pmatrix}.$

At $(1/3, 1/3)$ where $x + 2y = 1$ : $\nabla^2 f(1/3, 1/3) = \begin{pmatrix} 1 & 2 \\ 2 & 10 \end{pmatrix}.$ Leading principal minors: $\det A_1 = 1 > 0$ , $\det A_2 = 10 - 4 = 6 > 0$ . Positive definite ⇒ by Theorem 1391, $(1/3, 1/3)$ is a strong local minimiser.

Is it also a global minimiser? Check PSD-ness of $\nabla^2 f(x,y)$ everywhere on the domain:

$f''_{xx} = (x+2y)^{-2} > 0$ since the denominator is positive (domain condition).
$\det \nabla^2 f(x,y) = (x+2y)^{-2} \cdot [6 + 4(x+2y)^{-2}] - 4(x+2y)^{-4} = 6(x+2y)^{-2} + 4(x+2y)^{-4} - 4(x+2y)^{-4} = 6(x+2y)^{-2} > 0$ .

Both leading principal minors positive everywhere — Hessian is positive definite on the whole (convex, open) domain. By Prop. 1486, $f$ is strictly convex on $\text{Dom}(f)$ . By Theorem 1497, the unique stationary point $(1/3, 1/3)$ is the global minimiser.

Value at the minimum. $f(1/3, 1/3) = 1/3 + 3 \cdot (1/9) - \ln(1) = 1/3 + 1/3 - 0 = 2/3$ .

Second-order Taylor at $(1/3, 1/3)$ . With $\nabla f(1/3, 1/3) = (0, 0)$ (stationary point) and $\nabla^2 f(1/3, 1/3) = \bigl(\begin{smallmatrix}1 & 2\\ 2 & 10\end{smallmatrix}\bigr)$ : $f(x, y) \;=\; \tfrac{2}{3} + \tfrac{1}{2}\Bigl[(x - \tfrac{1}{3})^2 + 10(y - \tfrac{1}{3})^2 + 4(x - \tfrac{1}{3})(y - \tfrac{1}{3})\Bigr] + o\bigl(\|(x - 1/3, y - 1/3)\|^2\bigr).$

Example 4.5 — Dini's theorem applied to $g(x,y) = 2 y e^x + x e^y - 2$ at $g = -2$

Source: Final exercises solution.pdf Q3.

Find $f'(0)$ where $y = f(x)$ is defined implicitly by $g(x, y) = -2$ .

Step 1 — locate the base point $(x_0, y_0)$ . We need $x_0 = 0$ and $y_0$ such that $g(0, y_0) = -2$ . Compute $g(0, y) = 2y e^0 + 0 \cdot e^y - 2 = 2y - 2$ . Setting $2y - 2 = -2$ gives $y = 0$ . So $(x_0, y_0) = (0, 0)$ .

Step 2 — verify Dini's hypothesis. Compute $\partial g / \partial y$ : $\frac{\partial g}{\partial y}(x, y) = 2 e^x + x e^y.$ At $(0, 0)$ : $g_y(0, 0) = 2 e^0 + 0 \cdot e^0 = 2 \neq 0$ . $\checkmark$ Dini applies.

Step 3 — compute $\partial g / \partial x$ . $\frac{\partial g}{\partial x}(x, y) = 2 y e^x + e^y.$ At $(0, 0)$ : $g_x(0, 0) = 2 \cdot 0 \cdot e^0 + e^0 = 1$ .

Step 4 — apply Dini's formula. $f'(0) \;=\; -\,\frac{g_x(0, 0)}{g_y(0, 0)} \;=\; -\,\frac{1}{2}.$

$\boxed{\, f'(0) \;=\; -\tfrac{1}{2}. \,}$

Critical: do not drop the minus sign.

Example 4.6 — Existence of a global maximiser via composition-coercivity (lect4)

Find the global maximiser of $f(x,y) = e^{-x^2 - y^2}$ on $\mathbb{R}^2$ .

Step 1 — existence. The inner function $g(x,y) = -x^2 - y^2$ is continuous with $g(x,y) \to -\infty$ as $\|(x,y)\| \to \infty$ . The outer function $h(t) = e^t$ is strictly increasing and continuous. By the composition rule (§3.9), $f$ is continuous and $f(x,y) \to e^{-\infty} = 0$ at infinity, i.e., $f$ is super-coercive towards $0$ from above. In particular, on any sub-level set $\{f \ge c\}$ for small $c > 0$ , $f$ is bounded — and $f \le 1 = f(0,0)$ so the supremum $1$ is attained. Tonelli-type argument: $f$ has a global maximum.

Step 2 — candidates. Compute $\nabla f$ : $\nabla f(x,y) = \bigl[\,-2x\, e^{-x^2 - y^2},\ \ -2y\, e^{-x^2 - y^2}\,\bigr].$ Setting $\nabla f = \mathbf{0}$ : the exponential is never $0$ , so $-2x = 0 \Rightarrow x = 0$ and $-2y = 0 \Rightarrow y = 0$ . Unique stationary point $(0, 0)$ .

Step 3 — conclusion. Since the global maximum exists (Tonelli) and there is only one candidate, $(0, 0)$ is the global maximum of $f$ on $\mathbb{R}^2$ , with $f(0, 0) = 1$ .

Example 4.7 — Stationary-point classification with three cases (lect3 TA)

$f(x, y) = (y - 1)(y - x^2)$ . Identify all extrema. (MCQ trap: the function expands to $y^2 - y x^2 - y + x^2$ .)

Gradient. $f'_x = -2xy + 2x$ , $f'_y = 2y - x^2 - 1$ .

Stationary points. Solve $-2xy + 2x = 0$ and $2y - x^2 - 1 = 0$ .

From first: $2x(1 - y) = 0 \Rightarrow x = 0$ or $y = 1$ .
Substitute into second: $2y = x^2 + 1$ $2 y = x^{2} + 1$ , so $y = (x^2 + 1)/2$ $y = (x^{2} + 1) /2$ .
- Case $x = 0$ : $y = 1/2$ . Point $(0, 1/2)$ .
- Case $y = 1$ : $1 = (x^2 + 1)/2 \Rightarrow x^2 = 1 \Rightarrow x = \pm 1$ . Points $(1, 1)$ and $(-1, 1)$ .

Three stationary points: $(0, 1/2), (1, 1), (-1, 1)$ .

Hessian. $f''_{xx} = -2y + 2$ , $f''_{xy} = -2x$ , $f''_{yy} = 2$ . $\nabla^2 f(x, y) = \begin{pmatrix} -2y + 2 & -2x \\ -2x & 2 \end{pmatrix}.$

At $(0, 1/2)$ : $\nabla^2 f(0, 1/2) = \bigl(\begin{smallmatrix} 1 & 0 \\ 0 & 2 \end{smallmatrix}\bigr)$ . $\det A_1 = 1 > 0$ , $\det A_2 = 2 > 0$ . Positive definite ⇒ strong local minimiser.

At $(1, 1)$ : $\nabla^2 f(1, 1) = \bigl(\begin{smallmatrix} 0 & -2 \\ -2 & 2 \end{smallmatrix}\bigr)$ . $\det A_1 = 0$ , $\det A_2 = 0 \cdot 2 - 4 = -4 < 0$ . Indefinite (eigenvalue product = $-4 < 0$ ) ⇒ saddle point.

At $(-1, 1)$ : $\nabla^2 f(-1, 1) = \bigl(\begin{smallmatrix} 0 & 2 \\ 2 & 2 \end{smallmatrix}\bigr)$ . $\det A_2 = -4 < 0$ . Indefinite ⇒ saddle point.

Answer. The function has a unique local minimiser and two saddle points — matches MCQ choice (C) "a unique local minimiser".

§5. Solution Methods

Each method is a named algorithm with input → steps → output → pitfalls. Cross-references indicate May 2024 exam use.

M-DC-1 — Compute the gradient $\nabla f$ and Hessian $\nabla^2 f$

Used on: May 2024 MCQ2 / MCQ1 (Taylor), Q8 / Q7 (stationary points), Final exercises Q2.

Input. A function $f : \mathbb{R}^n \to \mathbb{R}$ of class at least $C^{(2)}$ .

Steps.

Compute each $f'_{x_i}$ : differentiate $f$ with respect to $x_i$ , treating the other variables as constants. Assemble the row vector $\nabla f = [f'_{x_1} \cdots f'_{x_n}]$ .
For each pair $(i, j)$ , compute $f''_{x_i x_j} = \partial / \partial x_j \bigl(f'_{x_i}\bigr)$ .
Assemble $\nabla^2 f$ as the $n \times n$ matrix with entry $(i, j)$ equal to $f''_{x_i x_j}$ .
Schwarz sanity check: verify $f''_{x_i x_j} = f''_{x_j x_i}$ (symmetry). If not, you have a calculation error.

Output. $\nabla f(\mathbf{x})$ and $\nabla^2 f(\mathbf{x})$ as functions of $\mathbf{x}$ .

Pitfalls.

Forgetting the product and chain rules in expressions like $e^x(x-1)$ or $\ln(x + 2y)$ .
Miscomputing $f''_{yy}$ by forgetting that the remaining $x$ -dependence is held constant.
Not using Schwarz: if $f''_{xy} \neq f''_{yx}$ symbolically (for a $C^{(2)}$ function), you have almost certainly made a sign error.
Writing the gradient as a column when the convention is a row (affects matrix products in Taylor).

M-DC-2 — Find all stationary points (solve $\nabla f = \mathbf{0}$ )

Used on: May 2024 Q8 / Q7, Final exercises Q2, lecture examples.

Input. The gradient $\nabla f(\mathbf{x})$ as an $n$ -tuple of expressions, and the domain.

Steps.

Write the system $\nabla f = \mathbf{0}$ as $n$ scalar equations $f'_{x_i}(\mathbf{x}) = 0$ .
Factor carefully. If an equation factors as $A(\mathbf{x}) \cdot B(\mathbf{x}) = 0$ with $A, B$ not identically zero on the domain, split into cases $A = 0$ and $B = 0$ .
Within each case, substitute into the remaining equations and solve.
Check the domain. Discard any algebraic solutions that lie outside $\text{Dom}(f)$ (e.g., require $x + 2y > 0$ for a log term).
List all surviving stationary points.

Output. A finite list of points $\mathbf{x}^{(1)}, \ldots, \mathbf{x}^{(k)}$ where $\nabla f = \mathbf{0}$ .

Common case splits.

$x y e^x = 0$ (May 2024 Q8): $e^x > 0$ always, so $xy = 0 \Rightarrow x = 0$ or $y = 0$ .
$e^{x+2y} = 2\lambda x - 2\lambda$ (lect4 constrained): rule out $e^{\text{anything}} = 0$ (impossible), keep the algebraic case.
Curves of stationary points (Example 4.3): if one factor is always zero (e.g., $x^2 = 0 \Rightarrow x = 0$ but then $2xy = 0$ automatically for every $y$ ), the stationary set is a curve, not a discrete set. Each point on the curve must be analysed separately (often by direct inspection, since the Hessian is semi-definite).

Pitfalls.

Forgetting domain constraints. Final exercises Q2 yields the candidate $(1/3, 1/3)$ which happens to lie in the domain; had it lain on $x + 2y = 0$ , we would have rejected it (boundary of domain, $\ln$ undefined).
Dropping solutions from $y = 0$ or $x = 0$ branches. Always enumerate both.
Dividing by $\lambda$ / by a factor that might be zero. Either confirm the factor is non-zero (cross-check) or track the $\lambda = 0$ case separately.

M-DC-3 — Classify a stationary point via the Hessian

Used on: May 2024 Q8 / Q7 (b2), Final exercises Q2 (a), lecture TA problems.

Input. A stationary point $\mathbf{x}_0$ and the Hessian $\nabla^2 f(\mathbf{x}_0)$ (a symmetric $n \times n$ matrix).

Steps.

Compute the leading principal minors $\det A_1, \det A_2, \ldots, \det A_n$ of $\nabla^2 f(\mathbf{x}_0)$ .
Apply Sylvester–Jacobi (§01 M-LA-5):
- All $> 0$ ⇒ positive definite ⇒ strong local min.
- Alternating starting $<0$ ( $-, +, -, \ldots$ ) ⇒ negative definite ⇒ strong local max.
- Some minor $= 0$ : test inconclusive; fall through to step 3.
- All nonzero but neither pattern ⇒ indefinite ⇒ saddle.
If Sylvester is inconclusive, compute all principal minors (not just leading), or compute eigenvalues:
- All eigenvalues $\ge 0$ , not all $>0$ ⇒ positive semi-definite ⇒ Theorem 1391 is inconclusive, fall through to step 4.
- All $\le 0$ , some $= 0$ ⇒ negative semi-definite ⇒ inconclusive.
- Mixed signs ⇒ indefinite ⇒ saddle.
Direct inspection fallback. Compute $f(\mathbf{x}) - f(\mathbf{x}_0)$ near $\mathbf{x}_0$ and determine its sign by factoring or by a Taylor expansion of higher order.

Output. One of: strong local min, strong local max, saddle, inconclusive (semi-definite case).

Pitfalls.

Confusing semi-definite with definite. Semi-definite is only a necessary condition for a local extremum — not sufficient.
Sign of alternating pattern. The negative-definite pattern starts negative ( $\det A_1 < 0$ , $\det A_2 > 0$ , $\det A_3 < 0$ , …). All-negative is not negative-definite for $n \ge 2$ .
Forgetting the case $\det A_2 < 0$ . In 2D this means the two eigenvalues have opposite signs — automatically indefinite, regardless of $\det A_1$ . (Example 4.1, $(1,0)$ : $\det A_1 = 0$ but $\det A_2 = -e^2 < 0$ ⇒ indefinite ⇒ saddle.)

M-DC-4 — Check global convexity/concavity via the Hessian

Used on: May 2024 Q8 / Q7 (b3), Final exercises Q2 (b).

Input. A $C^{(2)}$ function $f$ defined on an open convex set $C$ , and its Hessian $\nabla^2 f(\mathbf{x})$ .

Steps.

Study the sign of $\nabla^2 f(\mathbf{x})$ as a function of $\mathbf{x}$ ranging over $C$ , not just at one point.
If $\nabla^2 f(\mathbf{x})$ is positive definite (resp. PSD) for every $\mathbf{x} \in C$ , then $f$ is strictly convex (resp. convex) on $C$ by Prop. 1486.
If there is even one point $\mathbf{x}^\ast \in C$ where $\nabla^2 f(\mathbf{x}^\ast)$ fails to be PSD (i.e., it is indefinite, or negative (semi-)definite, or has a negative eigenvalue), then $f$ is not convex on $C$ .

Output. One of: $f$ strictly convex on $C$ , $f$ convex on $C$ , $f$ not convex on $C$ (with a witness point). Symmetric statements for concave.

Consequence for global extrema.

If $f$ is strictly convex on an open convex domain and has a stationary point $\mathbf{x}_0$ , then by Theorem 1497 $\mathbf{x}_0$ is the unique global minimiser.
If $f$ is strictly concave, the unique global maximiser is any stationary point.

Pitfalls.

Testing the Hessian only at the stationary point. That tells you local behaviour, not global. Global (strict) convexity requires PSD/PD everywhere on the domain.
Ignoring the convexity of the domain. Prop. 1486 requires $C$ to be a convex set. For $\{x + 2y > 0\}$ , $C$ is a half-plane — convex $\checkmark$ .
Witness point. To disprove convexity on all of $\mathbb{R}^2$ , cite one specific point where the Hessian fails PSD (May 2024 Q8 b3: " $(1,0)$ ").

M-DC-5 — Write the second-order Taylor expansion at $(a, b)$

Used on: May 2024 MCQ2 / MCQ1, Final exercises Q2(c).

Input. A $C^{(2)}$ function $f$ and an expansion point $\mathbf{x}_0 = (a, b)$ .

Steps.

Compute $f(a, b)$ — zero-order term.
Compute $\nabla f(a, b) = [f'_x(a,b), \; f'_y(a,b)]$ — linear coefficient.
Compute $\nabla^2 f(a, b) = \bigl(\begin{smallmatrix} f''_{xx} & f''_{xy} \\ f''_{xy} & f''_{yy} \end{smallmatrix}\bigr)$ — quadratic coefficient.
Assemble: $f(x, y) = f(a,b) + \nabla f(a,b) \begin{pmatrix} x - a \\ y - b \end{pmatrix} + \tfrac{1}{2}[x - a \; y - b] \nabla^2 f(a,b) \begin{pmatrix} x - a \\ y - b \end{pmatrix} + o\bigl(\|(x-a, y-b)\|^2\bigr).$
If asked, simplify the polynomial by multiplying out.

Output. Explicit quadratic polynomial in $(x - a, y - b)$ plus the Peano remainder.

Pitfalls.

Forgetting the $1/2$ factor on the quadratic term.
Losing the factor 2 on the cross term: when you expand the matrix product, the $xy$ -cross term has coefficient $f''_{xy} \cdot 2 / 2 = f''_{xy}$ (because $a_{ij} + a_{ji} = 2 f''_{xy}$ , divided by 2 of the Taylor factor).
Sign errors on $f'_y$ when $y_0 = -\pi/2$ : $\sin(-\pi/2) = -1$ , $\cos(-\pi/2) = 0$ .

M-DC-6 — Apply Dini's theorem to find $f'(x_0)$ from $g(x, y) = c$

Used on: Final exercises Q3 (canonical example), lect5 exercises.

Input. A $C^{(1)}$ function $g : A \subseteq \mathbb{R}^2 \to \mathbb{R}$ , a constant $c$ , and (ideally) a known $x_0$ .

Steps.

Locate the base point $(x_0, y_0)$ satisfying $g(x_0, y_0) = c$ . If only $x_0$ is given, substitute and solve for $y_0$ .
Verify Dini's hypothesis: compute $g_y(x_0, y_0) = \partial g/\partial y \,(x_0, y_0)$ and check $g_y(x_0, y_0) \neq 0$ . If it is zero, Dini's theorem does not apply — you cannot guarantee a local implicit function.
Compute $g_x(x_0, y_0)$ .
Apply the formula: $f'(x_0) \;=\; -\,\frac{g_x(x_0, y_0)}{g_y(x_0, y_0)}.$
Sign check. The minus sign is not optional.

Output. The numerical value of $f'(x_0)$ (or a symbolic expression).

Pitfalls.

Forgetting the minus sign — the single most common error. The formula is $f'(x_0) = -g_x / g_y$ .
Verifying $g_y \neq 0$ at the wrong point (e.g., at $(x_0, 0)$ when actually $y_0 \neq 0$ ): always substitute the correct $(x_0, y_0)$ .
Missing the existence question: Dini gives local uniqueness in a neighbourhood. If the problem asks for a global function, check the extra hypotheses of the uniqueness theorem (strict monotonicity in $y$ ).
Applying Dini at a point not on the level curve: if $g(x_0, y_0) \neq c$ , there is nothing to expand around — re-derive $y_0$ .

M-DC-7 — Existence of global extrema via coercivity and Weierstrass

Used on: lect4 "Maximisers with Tonelli's Theorem", auxiliary in Final exercises.

Input. A continuous function $f : A \subseteq \mathbb{R}^n \to \mathbb{R}$ and a (possibly unbounded) admissible set $A$ .

Steps (case $A = \mathbb{R}^n$ ).

Check continuity of $f$ .
Coercivity test: examine $\lim_{\|\mathbf{x}\| \to \infty} f(\mathbf{x})$ $lim_{∥ x ∥ \to \infty} f (x)$ .
- If $= +\infty$ : $f$ is coercive ⇒ by Tonelli, $f$ attains a global minimum.
- If $= -\infty$ : $f$ is super-coercive towards $-\infty$ ⇒ attains a global maximum.
- If neither (e.g., oscillating or going to a finite limit): the existence theorems do not apply directly — try relaxation, Lagrange, or boundary analysis.
Composition rule: if $f = h \circ g$ , check if the outer $h$ is strictly increasing (preserves coercivity of $g$ ) or strictly decreasing (flips it).
Find all stationary and exceptional points (where $\nabla f = \mathbf{0}$ , plus points where $f$ is not differentiable).
Evaluate $f$ at each candidate and pick the extremum.

Steps (case $A \subseteq \mathbb{R}^n$ compact).

Weierstrass applies directly: $f$ attains its max and min on $A$ .
Candidates: interior stationary points of $f|_A$ , boundary points, and non-differentiability points.
Evaluate $f$ at each; the biggest/smallest wins.

Steps (relaxation for non-compact, non-coercive problems).

Solve the unconstrained problem $\min/\max f$ on $\mathbb{R}^n$ .
If the unconstrained optimum lies in $A$ , it is also the constrained optimum.
If not, the constrained problem likely has no solution via this method — switch to Lagrange or boundary analysis.

Pitfalls.

Unbounded set + continuous function is not enough. Weierstrass requires compact (closed and bounded).
Forgetting that a minimum may not exist. $f(x) = e^x$ on $\mathbb{R}$ is continuous but not coercive (goes to $0$ at $-\infty$ ); $\inf = 0$ is not attained.
Sub-vs. super-coercive confusion. Coercive = " $+\infty$ at infinity" (guarantees minimum). Super-coercive towards $-\infty$ = " $-\infty$ at infinity" (guarantees maximum). Different sign, different conclusion.

§6. Practice Problems with Solutions

Problem 6.1 — May 2024 MCQ2 (Mode A) / MCQ1 (Mode B): second-order Taylor

The Taylor formula of order two for $f(x, y) = x^2 + x \cos y$ , centred at $(1, -\pi/2)$ , is …

Solution. See Example 4.2. The answer is "none of the others" (option D Mode A / option C Mode B), since the simplification yields $f(x,y) = \tfrac{\pi}{2} x + x^2 + x y + o\bigl(\|(x-1, y+\pi/2)\|^2\bigr)$ , which does not match options A–C.

Problem 6.2 — May 2024 Q8 (Mode A) / Q7 (Mode B): Hessian classification and global convexity

(a) Let $C \subseteq \mathbb{R}^n$ be a convex set and $f : C \to \mathbb{R}$ be continuously twice differentiable. State the Concavity / Strict Concavity test on the basis of the sign of the Hessian matrix. (b) Consider $f(x, y) = e^x(x-1)\,y + y^2$ . (b1) Write the gradient vector of $f$ and determine the stationary points. (b2) Write the Hessian matrix and explain the nature of the stationary points. (b3) Is $f$ convex on all $\mathbb{R}^2$ ? Justify.

(a) Statement of Prop. 1486 (concavity / strict concavity test). Let $C \subseteq \mathbb{R}^n$ be an open convex set and $f \in C^{(2)}(C)$ . Then:

$f$ is concave on $C$ $\iff$ $\nabla^2 f(\mathbf{x})$ is negative semi-definite for every $\mathbf{x} \in C$ .
$f$ is strictly concave on $C$ $\iff$ $\nabla^2 f(\mathbf{x})$ is negative definite for every $\mathbf{x} \in C$ .

Analogously for convexity (positive semi-definite / positive definite).

(b1) See Example 4.1. Gradient $\nabla f(x,y) = [x y e^x, \; 2y + e^x(x-1)]$ . Stationary points: $P = (0, 1/2)$ and $Q = (1, 0)$ .

(b2) Hessian $\nabla^2 f(x,y) = \begin{pmatrix} y e^x (x+1) & x e^x \\ x e^x & 2 \end{pmatrix}$ .

At $P = (0, 1/2)$ : $\nabla^2 f(P) = \bigl(\begin{smallmatrix} 1/2 & 0 \\ 0 & 2 \end{smallmatrix}\bigr)$ — positive definite (both leading minors positive). By Theorem 1391, $P$ is a strong local minimiser.
At $Q = (1, 0)$ : $\nabla^2 f(Q) = \bigl(\begin{smallmatrix} 0 & e \\ e & 2 \end{smallmatrix}\bigr)$ , $\det = -e^2 < 0$ — indefinite. $Q$ is a saddle (not an optimiser).

(b3) $f$ is not convex on $\mathbb{R}^2$ . A witness is $(1, 0)$ , where $\nabla^2 f(1, 0)$ is indefinite — not positive semi-definite. By Prop. 1486, this rules out global convexity.

Source: General_24524_ENG_SOL.pdf Q8/Q7.

Problem 6.3 — Final exercises Q2: full optimization on a log-domain

Given $f(x, y) = x + 3 y^2 - \ln(x + 2y)$ . (a) Determine any local max/min point. (b) Can we deduce whether they are global extrema? (c) Find the 2nd-order Taylor formula in a neighbourhood of $(1/3, 1/3)$ .

Solution. See Example 4.4.

(a) Domain: $\{x + 2y > 0\}$ . Unique stationary point $(1/3, 1/3)$ . Hessian there $\bigl(\begin{smallmatrix}1 & 2 \\ 2 & 10\end{smallmatrix}\bigr)$ is PD ⇒ strong local minimiser.
(b) Yes. The Hessian is PD everywhere on the (open, convex) domain, so $f$ is strictly convex on its domain. By Theorem 1497, the unique stationary point is the unique global minimiser.
(c) With $f(1/3, 1/3) = 2/3$ , $\nabla f(1/3, 1/3) = \mathbf{0}$ , $\nabla^2 f(1/3, 1/3) = \bigl(\begin{smallmatrix}1 & 2 \\ 2 & 10\end{smallmatrix}\bigr)$ : $f(x, y) = \tfrac{2}{3} + \tfrac{1}{2}\Bigl[(x - \tfrac{1}{3})^2 + 10(y - \tfrac{1}{3})^2 + 4(x - \tfrac{1}{3})(y - \tfrac{1}{3})\Bigr] + o\bigl(\|(x - 1/3, y - 1/3)\|^2\bigr).$

Source: Final exercises solution.pdf pp.4–6, Problem 2.

Problem 6.4 — Final exercises Q3: Dini's theorem

Given $g(x, y) = 2 y e^x + x e^y - 2$ . Find $f'(0)$ of the function $y = f(x)$ implicitly defined by $g(x, y) = -2$ .

Solution. See Example 4.5. $f'(0) = -1/2$ .

Source: Final exercises solution.pdf p.7, Problem 3.

Problem 6.5 — lect 4: find the extrema of $f(x,y) = x^2 + y^2 + x + y$

(a) Find the gradient. (b) Find the stationary point(s). (c) Determine whether it is a point of extrema.

Solution.

(a) $\nabla f(x,y) = [2x + 1, \; 2y + 1]$ .
(b) $\nabla f = \mathbf{0}$ gives $x = -1/2, y = -1/2$ . Unique stationary point $(-1/2, -1/2)$ .
(c) $\nabla^2 f(x, y) = \bigl(\begin{smallmatrix}2 & 0 \\ 0 & 2\end{smallmatrix}\bigr)$ for every $(x,y)$ . $\det A_1 = 2, \det A_2 = 4$ — positive definite everywhere. So $f$ is strictly convex on $\mathbb{R}^2$ (convex open domain). By Theorem 1497, $(-1/2, -1/2)$ is the strong global minimiser.

Source: lect 4_calc_diff (3).pdf.

Problem 6.6 — lect3: $f(x, y) = x^2 y - y^2 x - 3 x$

Find the points of local extrema.

Solution.

Gradient. $f'_x = 2 x y - y^2 - 3$ , $f'_y = x^2 - 2 x y$ .

Stationary points. From $f'_y = 0$ : $x(x - 2y) = 0$ , so $x = 0$ or $x = 2y$ .

Case $x = 0$ : $f'_x = 0 - y^2 - 3 = 0 \Rightarrow y^2 = -3$ — no real solution.
Case $x = 2y$ : $f'_x = 2 \cdot 2y \cdot y - y^2 - 3 = 4 y^2 - y^2 - 3 = 3 y^2 - 3 = 0 \Rightarrow y^2 = 1 \Rightarrow y = \pm 1$ . Corresponding $x = \pm 2$ . Stationary points $(2, 1)$ and $(-2, -1)$ .

Hessian. $f''_{xx} = 2y, f''_{xy} = 2x - 2y, f''_{yy} = -2x$ . So $\nabla^2 f(x,y) = \bigl(\begin{smallmatrix}2y & 2x - 2y \\ 2x - 2y & -2x\end{smallmatrix}\bigr)$ .

At $(2, 1)$ : $\bigl(\begin{smallmatrix}2 & 2 \\ 2 & -4\end{smallmatrix}\bigr)$ , $\det A_1 = 2, \det A_2 = -8 - 4 = -12 < 0$ — indefinite ⇒ saddle.
At $(-2, -1)$ : $\bigl(\begin{smallmatrix}-2 & -2 \\ -2 & 4\end{smallmatrix}\bigr)$ , $\det A_1 = -2, \det A_2 = -8 - 4 = -12 < 0$ — indefinite ⇒ saddle.

Answer. Both stationary points are saddles; $f$ has no local extrema.

Source: Lect3_calc_diff (3).pdf.

Problem 6.7 — Final exercises Q1 recap with Differential-Calculus lens

$f(x, y, z) = k x^2 + k y^2 - 4 z^2 - 2 x z$ . Find conditions on $k$ for the origin to be a strong global maximum.

Solution. The origin is stationary for every quadratic form (all partial derivatives vanish at $\mathbf{0}$ ). For it to be a strong global max, we need $f(\mathbf{x}) < 0$ for every $\mathbf{x} \neq \mathbf{0}$ — i.e., the associated matrix $A$ must be negative definite. $A = \bigl(\begin{smallmatrix}k & 0 & -1 \\ 0 & k & 0 \\ -1 & 0 & -4\end{smallmatrix}\bigr)$ . Apply Sylvester–Jacobi with alternating pattern $\det A_1 < 0, \det A_2 > 0, \det A_3 < 0$ : $k < 0, \quad k^2 > 0\,\text{ (trivially, }k \neq 0\text{)}, \quad -4 k^2 - k < 0 \iff k(4k + 1) > 0.$ Combining: $k < -1/4$ . (This links Diff-Calc to §01 Linear Algebra — the same $A$ appears in Final Q1b, eigenvalue signing.)

Source: Final exercises solution.pdf pp.1–3, Problem 1.

Problem 6.8 — lect5: Dini on $g(x, y) = x^2 - 2y - e^y = 0$ , find $f'(1)$

Setup. Verify $g$ is in the range of $y \mapsto g(x, y)$ : as $y \to -\infty$ , $g \to x^2 - (-\infty) - 0 = +\infty$ ; as $y \to +\infty$ , $g \to x^2 - \infty - \infty = -\infty$ . So $0 \in (\text{range})$ and by the existence theorem there is $y = f(x)$ . Uniqueness: $g_y(x, y) = -2 - e^y < 0$ everywhere ⇒ $g$ is strictly decreasing in $y$ ⇒ $f$ is unique.

Find extrema of $f$ . Dini gives $f'(x) = -g_x / g_y = -2x / (-2 - e^y) = 2x / (2 + e^y)$ . Stationary points: $f'(x) = 0 \iff x = 0$ . First-derivative test: $f'$ changes from $-$ (when $x<0$ ) to $+$ (when $x>0$ ), so $x = 0$ is a strong local+global minimiser of $f$ .

1st-order Taylor at $(1, 0)$ . With $y = f(x)$ , $f(1) = 0$ (from solving $1 - 0 - 1 = 0$ ), $f'(1) = 2 \cdot 1 / (2 + 1) = 2/3$ : $f(x) = f(1) + f'(1)(x - 1) + o(x - 1) = \tfrac{2}{3}(x - 1) + o(x - 1).$

Source: lect5_calc_diff.pdf.

§7. Common Pitfalls

Below are the errors Sicconi flags most often and the traps the Bocconi exam-writers exploit. Read this list the day before the exam.

Domain constraints.
- For $\ln(u)$ : require $u > 0$ .
- For $\sqrt{u}$ : require $u \ge 0$ (and $u > 0$ for differentiability).
- For $1/u$ : require $u \neq 0$ .
- Before declaring a candidate a stationary point, always verify it lies in $\text{Dom}(f)$ . Algebraic solutions outside the domain must be rejected.
Strict vs non-strict convexity.
- Strictly convex: positive definite Hessian (plus domain convex, open).
- Convex: positive semi-definite Hessian.
- Difference matters for uniqueness of the global minimum: strict convexity ⇒ at most one global min; convex ⇒ possibly a whole flat region of minima.
Local vs global without convexity.
- Theorem 1391 (2nd-order conditions) gives local extrema only. It cannot distinguish a local from a global without extra info.
- To upgrade to global, you need convexity (Theorem 1497) or coercivity (Tonelli) or compactness (Weierstrass).
- May 2024 Q8 b2 → b3: (b2) proves $P = (0, 1/2)$ is a local min. (b3) explains the Hessian fails PSD elsewhere ⇒ cannot conclude global convexity ⇒ cannot conclude $P$ is global min via this route.
Semi-definite Hessian: Theorem 1391 is inconclusive.
- Positive semi-definite Hessian at $\mathbf{x}_0$ is only a necessary condition for $\mathbf{x}_0$ being a local min — not sufficient.
- Example 4.3 ( $f = x^2 y$ on the $y$ -axis) illustrates: Hessian is PSD, but $\mathbf{x}_0 = (0, 0)$ is a saddle. Fall back to direct inspection or higher-order Taylor.
Dini's formula sign error.
- The formula is $f'(x_0) = -g_x / g_y$ , with minus. Forgetting the minus sign flips the answer.
- Always write the minus sign first.
Hessian symmetry: Schwarz must hold.
- If you compute $f''_{xy}$ and $f''_{yx}$ and they disagree for a $C^{(2)}$ function, you have an arithmetic error.
- Conversely, use Schwarz to save time: once you compute $f''_{xy}$ , the $(2,1)$ entry of the Hessian is automatic.
Negative-definite Sylvester pattern: alternating, not all-negative.
- Pattern: $\det A_1 < 0$ , $\det A_2 > 0$ , $\det A_3 < 0$ , $\det A_4 > 0$ , …
- Not all-negative. Consistent with " $-A$ positive definite ⇒ eigenvalues of $-A$ all $>0$ ⇒ eigenvalues of $A$ all $<0$ ", and the leading minors of $-A$ are $(-1)^k \det A_k$ .
Taylor coefficients: the $1/2$ and the factor 2 on the cross term.
- Quadratic form $\tfrac{1}{2} \mathbf{h}^T H \mathbf{h}$ expands to $\tfrac{1}{2}[h_1^2 f''_{xx} + 2 h_1 h_2 f''_{xy} + h_2^2 f''_{yy}]$ . The cross-term coefficient is $f''_{xy}$ (not $2 f''_{xy}$ and not $f''_{xy}/2$ ).
Boundary points are not stationary in the gradient sense.
- On a closed set with non-empty boundary, the max/min may live on the boundary where $\nabla f \neq \mathbf{0}$ .
- Fermat requires $\mathbf{x}_0$ to be an interior point. Apply boundary analysis separately.
Coercivity gives existence, not uniqueness.
- Tonelli says a global min exists — not that it is unique.
- Uniqueness requires strict convexity (or a direct argument that the stationary point is unique and other candidates are ruled out).
Composition of coercive functions with monotone outer.
- $h$ strictly increasing + $g$ coercive towards $+\infty$ ⇒ $h \circ g$ coercive towards $\lim_{+\infty} h$ .
- $h$ strictly increasing + $g$ super-coercive towards $-\infty$ ⇒ $h \circ g$ tends to $\lim_{-\infty} h$ at infinity.
- If $h$ is strictly decreasing, the conclusions flip.
Ignoring the convexity of the domain when applying Prop. 1486.
- Prop. 1486 requires $C$ to be an open convex set. Half-planes, open balls, and all of $\mathbb{R}^n$ qualify; annuli and punctured disks do not (not convex).
"The function is strictly concave ⇒ at most one global max".
- Strict convexity / concavity ensures uniqueness of the global extremum if one exists. Existence still needs Weierstrass / Tonelli.
Reading the MCQ options carefully.
- Exam wording tests common distractors: wrong Dini sign; missing factor of $1/2$ in Taylor; using Hessian of $f(x,y)$ at $(0,0)$ instead of the required $(1, -\pi/2)$ .
- If the simplified Taylor polynomial doesn't match any of options A–C, the correct answer is very likely "none of the others" — trust the calculation and resist the urge to "round" to a listed option.

Final checklist before the exam (differential calculus slice)

[ ] I can state and prove Theorem 1391 (sufficient condition for strong local max/min), including the Weierstrass-on-unit-sphere step.
[ ] I can state Theorem 1481 (2nd-order Taylor with Peano remainder) and use it in computations.
[ ] I can state Prop. 1486 (Hessian sign ⇔ convexity/concavity) and use it on both sides.
[ ] I can state and prove Theorem 1497 (∇f = 0 ⇔ global extremum for convex/concave differentiable functions), including the use of Fermat and the tangent-hyperplane inequality.
[ ] I can state Theorem 1591 and know it extends 1497 to non-open convex sets via interior points.
[ ] I can state Fermat's theorem (∇f = 0 at interior local extrema) and sketch its proof.
[ ] I can state Dini's theorem and apply the formula $f'(x_0) = -g_x / g_y$ without dropping the sign.
[ ] I can state Weierstrass (compact ⇒ max/min) and Tonelli (coercive ⇒ min / super-coercive towards $-\infty$ ⇒ max).
[ ] I can run M-DC-1 through M-DC-7 on a $\mathbb{R}^2$ problem in under 20 minutes.
[ ] I know that semi-definite Hessian leaves Theorem 1391 inconclusive, and I have a direct-inspection fallback.
[ ] I can distinguish "local", "strong local", "global", "strong global", and the role of convexity in upgrading each.

End of Differential Calculus study guide.

Differential Calculus

02 — Differential Calculus

§1. Overview & Exam Relevance

§2. Definitions

2.1 Multivariable function f:Rn→Rf : \mathbb{R}^n \to \mathbb{R}f:Rn→R

2.2 Open ball and open set

2.3 Limit and continuity

2.4 Partial derivative and gradient

2.5 Directional derivative

2.6 Differentiability and C(1),C(2)C^{(1)}, C^{(2)}C(1),C(2) classes

2.7 Hessian ∇2f=Hf\nabla^2 f = H_f∇2f=Hf​

2.8 Stationary point

2.9 Local and global extrema

2.10 Convex set, convex/concave function

2.11 Coercive, super-coercive, level set

2.12 Implicit function — setup for Dini

§3. Theorems, Propositions & Proofs

3.1 Schwarz's theorem — symmetry of the Hessian

3.2 Theorem 1481 — Second-order Taylor formula with Peano remainder

3.3 Theorem 1391 — Second-order conditions for unconstrained local max/min (with proof)

3.4 Theorem/Proposition 1486 — Concavity/strict-concavity test via the Hessian

3.5 Theorem 1497 — Global extrema of differentiable convex/concave functions

3.6 Theorem 1591 — Global maximum of a concave function (extension to continuous case)

3.7 Fermat's theorem — necessary condition for interior extrema

3.8 Dini's theorem (Implicit function theorem, local 2D version)

3.9 Weierstrass theorem and coercive-function existence (Tonelli)

§4. Worked Examples

Example 4.1 — Gradient, Hessian, and stationary points of f(x,y)=ex(x−1) y+y2f(x,y) = e^x(x-1)\,y + y^2f(x,y)=ex(x−1)y+y2

Example 4.2 — Second-order Taylor of f(x,y)=x2+xcos⁡yf(x,y) = x^2 + x \cos yf(x,y)=x2+xcosy at (1,−π/2)(1, -\pi/2)(1,−π/2)

Example 4.3 — A case where the Hessian is semi-definite (lect 4_calc_diff (3).pdf)

Example 4.4 — f(x,y)=x+3y2−ln⁡(x+2y)f(x,y) = x + 3 y^2 - \ln(x + 2y)f(x,y)=x+3y2−ln(x+2y) (domain, stationary point, convexity, Taylor)

Example 4.5 — Dini's theorem applied to g(x,y)=2yex+xey−2g(x,y) = 2 y e^x + x e^y - 2g(x,y)=2yex+xey−2 at g=−2g = -2g=−2

Example 4.6 — Existence of a global maximiser via composition-coercivity (lect4)

Example 4.7 — Stationary-point classification with three cases (lect3 TA)

§5. Solution Methods

M-DC-1 — Compute the gradient ∇f\nabla f∇f and Hessian ∇2f\nabla^2 f∇2f

M-DC-2 — Find all stationary points (solve ∇f=0\nabla f = \mathbf{0}∇f=0)

M-DC-3 — Classify a stationary point via the Hessian

M-DC-4 — Check global convexity/concavity via the Hessian

M-DC-5 — Write the second-order Taylor expansion at (a,b)(a, b)(a,b)

M-DC-6 — Apply Dini's theorem to find f′(x0)f'(x_0)f′(x0​) from g(x,y)=cg(x, y) = cg(x,y)=c

M-DC-7 — Existence of global extrema via coercivity and Weierstrass

§6. Practice Problems with Solutions

Problem 6.1 — May 2024 MCQ2 (Mode A) / MCQ1 (Mode B): second-order Taylor

Problem 6.2 — May 2024 Q8 (Mode A) / Q7 (Mode B): Hessian classification and global convexity

Problem 6.3 — Final exercises Q2: full optimization on a log-domain

Problem 6.4 — Final exercises Q3: Dini's theorem

Problem 6.5 — lect 4: find the extrema of f(x,y)=x2+y2+x+yf(x,y) = x^2 + y^2 + x + yf(x,y)=x2+y2+x+y

Problem 6.6 — lect3: f(x,y)=x2y−y2x−3xf(x, y) = x^2 y - y^2 x - 3 xf(x,y)=x2y−y2x−3x

Problem 6.7 — Final exercises Q1 recap with Differential-Calculus lens

Problem 6.8 — lect5: Dini on g(x,y)=x2−2y−ey=0g(x, y) = x^2 - 2y - e^y = 0g(x,y)=x2−2y−ey=0, find f′(1)f'(1)f′(1)

§7. Common Pitfalls

Final checklist before the exam (differential calculus slice)

2.1 Multivariable function $f : \mathbb{R}^n \to \mathbb{R}$

2.6 Differentiability and $C^{(1)}, C^{(2)}$ classes

2.7 Hessian $\nabla^2 f = H_f$

Example 4.1 — Gradient, Hessian, and stationary points of $f(x,y) = e^x(x-1)\,y + y^2$

Example 4.2 — Second-order Taylor of $f(x,y) = x^2 + x \cos y$ at $(1, -\pi/2)$

Example 4.3 — A case where the Hessian is semi-definite (`lect 4_calc_diff (3).pdf`)

Example 4.4 — $f(x,y) = x + 3 y^2 - \ln(x + 2y)$ (domain, stationary point, convexity, Taylor)

Example 4.5 — Dini's theorem applied to $g(x,y) = 2 y e^x + x e^y - 2$ at $g = -2$

M-DC-1 — Compute the gradient $\nabla f$ and Hessian $\nabla^2 f$

M-DC-2 — Find all stationary points (solve $\nabla f = \mathbf{0}$ )

M-DC-5 — Write the second-order Taylor expansion at $(a, b)$

M-DC-6 — Apply Dini's theorem to find $f'(x_0)$ from $g(x, y) = c$

Problem 6.5 — lect 4: find the extrema of $f(x,y) = x^2 + y^2 + x + y$

Problem 6.6 — lect3: $f(x, y) = x^2 y - y^2 x - 3 x$

Problem 6.8 — lect5: Dini on $g(x, y) = x^2 - 2y - e^y = 0$ , find $f'(1)$