Comprehensive study guide for Bocconi Math Module 2 (cod. 30063), General Exam.Source materials: Alice Sicconi's Proofs - Differential Calculus.pdf, Linear Algebra and Differential Calculus.pdf (diff-calc half), lect1_calc-diff.pdf, lect2_calc_diff (2).pdf, Lect3_calc_diff (3).pdf, lect 4_calc_diff (3).pdf, lect5_calc_diff.pdf, TA41 _calc_diff (2).pdf, General_24524_ENG_SOL.pdf, Final exercises solution.pdf.
§1. Overview & Exam Relevance
Differential Calculus is the second block of the first partial and accounts for approximately 27% of the general exam (typically 1 MCQ worth 5 pts plus one open-ended question worth up to 20 pts — about 25–30 pts of the 150-pt exam). On open-ended problems it is tied with Linear Algebra as the richest source of statement-plus-proof questions.
Topic scope. The exam tests:
multivariable functions f:Rn→R and their regularity classes C(1),C(2);
partial derivatives ∂f/∂xi, gradient ∇f, directional derivatives, and the Hessian ∇2f (equivalently Hf);
Schwarz's theorem: symmetry of the Hessian for C(2) functions;
Second-order Taylor formula with Peano remainder at a point x0∈Rn;
classification of stationary points via the sign of the Hessian quadratic form hT∇2f(x0)h (Theorem 1391);
convex/concave functions on open convex sets, tangent-hyperplane inequality, and the Hessian-based characterisation (Theorem/Prop. 1486);
necessary and sufficient conditions for global extrema of convex/concave functions (Theorems 1497, 1591);
existence theorems: Weierstrass (on compact sets), Tonelli (on coercive/super-coercive functions);
Dini's theorem (implicit function theorem): local uniqueness and formula for f′(x0) from g(x,y)=c.
Typical MCQ patterns (from May 2024 General exam).
MCQ2 (Mode A) / MCQ1 (Mode B): "The Taylor formula of order two for f(x,y)=x2+xcosy centred at (1,−π/2) is …" — compute f,∇f,∇2f at the point and assemble.
Other recurring MCQs (from TA and lecture pools): local-extrema classification of f(x,y)=(y−1)(y−x2); Dini-applicability at boundary points of g(x,y)=0; coercivity check.
Part (a) — State the concavity / strict-concavity test (Prop. 1486) based on the sign of the Hessian matrix.
Part (b) — Given f(x,y)=ex(x−1)y+y2:
(b1) write ∇f and determine stationary points → P=(0,1/2) local min and Q=(1,0) saddle;
(b2) write ∇2f and classify each stationary point using Theorem 1391;
(b3) decide whether f is convex on the whole R2.
Why this topic is high-leverage.
Every theorem here uses Linear Algebra from §01: classifying a stationary point is classifying a quadratic form attached to the Hessian. Spectral/Sylvester–Jacobi carry over verbatim.
The Taylor formula is the engine behind every result in this chapter — the proof of Theorem 1391 literally starts by invoking Theorem 1481 (Taylor).
Together with Linear Algebra (≈27%), Differential Calculus pushes the first-partial content to ≈55% of the whole general exam.
§2. Definitions
2.1 Multivariable function f:Rn→R
A multivariable (scalar) function of n real variables is a map f:A⊆Rn→R, assigning to each point x=(x1,…,xn)∈A exactly one real number f(x). The set A is the domain; its image f(A)⊆R is the range.
Example.f(x,y)=x+3y2−ln(x+2y) has domain {(x,y)∈R2:x+2y>0} because ln requires a strictly positive argument (Final exercises solution.pdf Q2).
2.2 Open ball and open set
Given x0∈Rn and r>0, the open ball (or neighbourhood) of centre x0 and radius r is
Ur(x0)=Br(x0)={x∈Rn:∥x−x0∥<r}.
A set A⊆Rn is open if for every x0∈A there is r>0 with Ur(x0)⊆A. A point x0∈A is an interior point of A if such an r exists; otherwise it is a boundary point.
2.3 Limit and continuity
limx→x0f(x)=L means: ∀ε>0∃δ>0 such that 0<∥x−x0∥<δ⇒∣f(x)−L∣<ε.
f is continuous at x0 if limx→x0f(x)=f(x0); it is continuous on A if it is continuous at every point of A.
2.4 Partial derivative and gradient
The i-th partial derivative of f at x0 is
∂xi∂f(x0)=fxi′(x0)=limh→0hf(x0+hei)−f(x0),
where ei is the i-th coordinate vector. Operationally, ∂f/∂xi is obtained by differentiating f with respect to xi while treating all other variables as constants.
The gradient∇f(x0) is the row vector of all partial derivatives:
∇f(x0)=[fx1′(x0)fx2′(x0)⋯fxn′(x0)].
When written as a column, we use ∇f(x0)T.
Example (lect2). For f(x,y)=3x2ey2 we have fx′=6xey2, fy′=3x2ey2⋅2y=6x2yey2, so ∇f(x,y)=[6xey2,6x2yey2] and ∇f(1,1)=[6e,6e].
2.5 Directional derivative
Given a unit vector v∈Rn (∥v∥=1), the directional derivative of f at x0 in the direction v is
Dvf(x0)=limh→0hf(x0+hv)−f(x0).
If f is differentiable at x0, then Dvf(x0)=∇f(x0)⋅v.
2.6 Differentiability and C(1),C(2) classes
f is differentiable at x0 if there exists a row vector L∈Rn such that
f(x0+h)=f(x0)+L⋅h+o(∥h∥)as h→0.
If such L exists, it is necessarily L=∇f(x0).
C(1)(A): f has continuous first partial derivatives on A. If f∈C(1) then f is differentiable.
C(2)(A): f has continuous second-order partial derivatives on A. This is the hypothesis of Schwarz's theorem and Theorem 1481 (Taylor).
2.7 Hessian ∇2f=Hf
The Hessian matrix of f at x0 is the n×n matrix of second partial derivatives:
∇2f(x0)=Hf(x0)=fx1x1′′(x0)fx2x1′′(x0)⋮fxnx1′′(x0)fx1x2′′(x0)fx2x2′′(x0)⋮fxnx2′′(x0)⋯⋯⋱⋯fx1xn′′(x0)fx2xn′′(x0)⋮fxnxn′′(x0).
Schwarz's theorem (stated in lect1): if f∈C(2)(A) with A open, then ∂2f/(∂xi∂xj)=∂2f/(∂xj∂xi) for all i,j and all x∈A. Hence ∇2f is symmetric, and the classification tools from Linear Algebra (§01) apply directly.
Mnemonic (lect1 sidenote). The gradient is an operator (it sends a point to a vector). The Jacobian matrix contains all partials of a vector-valued operator. Since ∇f is itself a vector-valued operator Rn→Rn, its Jacobian is the Hessian: ∇2f = Jacobian of ∇f.
2.8 Stationary point
A point x0∈A is a stationary point (or critical point) of f if ∇f(x0)=0, i.e., every partial derivative vanishes at x0.
Remark. Candidates for local extrema of a differentiable f on an open set are exactly the stationary points (Fermat's theorem, §3.7). On a non-open set one must also check boundary points and exceptional points where f is not differentiable (lect1).
2.9 Local and global extrema
Let x0∈A. The point x0 is:
a local minimum of f if there is r>0 with f(x0)≤f(x) for every x∈Ur(x0)∩A;
a strong (strict) local minimum if f(x0)<f(x) for every x∈Ur(x0)∩A with x=x0;
a global (resp. strong global) minimum if the corresponding inequality holds for every x∈A.
Maxima are defined by reversing the inequalities. A stationary point that is neither a local max nor a local min is a saddle point (e.g., (0,0) for f(x,y)=x2−y2).
2.10 Convex set, convex/concave function
A set C⊆Rn is convex if for every x,y∈C and every t∈[0,1], tx+(1−t)y∈C (the segment joining any two points stays in C).
Let C⊆Rn be convex. A function f:C→R is convex on C if
f(tx+(1−t)y)≤tf(x)+(1−t)f(y)∀x,y∈C,∀t∈[0,1].
It is strictly convex if the inequality is strict whenever x=y and t∈(0,1). f is (strictly) concave iff −f is (strictly) convex.
Tangent-hyperplane characterisation (lect4, for f differentiable on an open convex set C):
f is convex on C⟺f(x)≥f(x0)+∇f(x0)(x−x0) for all x,x0∈C (tangent hyperplane sits below the graph);
f is strictly convex on C⟺ the inequality above is strict whenever x=x0.
The 1-variable analogues (f(x)≥f(x0)+f′(x0)(x−x0)) and the second-derivative tests (f′′(x)≥0 for convex, f′′(x)>0 for strictly convex) are recovered as special cases n=1.
2.11 Coercive, super-coercive, level set
A level set of f at height c is {x∈A:f(x)=c} (or ≤c for the sub-level set).
f:Rn→R is coercive on a set A (Sicconi's convention = super-coercive towards +∞) if lim∥x∥→∞f(x)=+∞. Equivalently, every sub-level set {f≤c} is bounded. If lim∥x∥→∞f(x)=−∞ we say f is super-coercive towards −∞ (the symmetric case used for existence of global maxima).
Composition rule (lect4). If the inner function g:Rn→R is continuous and coercive towards −∞, and the outer function h:R→R is strictly increasing, then h∘g is coercive (towards limt→−∞h(t)). This is how e−x2−y2 gets its super-coercivity towards 0 from above (hence −e−x2−y2 is super-coercive towards 0, but f=e−x2−y2 has supremum equal to 1, making (0,0) the global maximiser).
2.12 Implicit function — setup for Dini
Let g:C⊆R2→R and c∈R. Consider the level curve g(x,y)=c. We ask: does there exist a function f:I⊆R→R with y=f(x) such that g(x,f(x))=c for all x∈I? If yes, we say f is the implicit function defined by g(x,y)=c near a specific point. Dini's theorem (§3.6) gives sufficient conditions for local existence, uniqueness, and a formula for f′(x0).
§3. Theorems, Propositions & Proofs
3.1 Schwarz's theorem — symmetry of the Hessian
Statement. Let A⊆Rn be open and f:A→R be C(2) on A. Then
∂xi∂xj∂2f(x)=∂xj∂xi∂2f(x)∀i,j,∀x∈A.
In particular ∇2f(x) is a symmetric matrix.
Source: lect1_calc-diff.pdf (stated without proof, as is course convention).
Why it matters. Every theorem that follows in §3 uses the fact that ∇2f(x0) is symmetric, so all the sign-classification tools from Linear Algebra (eigenvalue signs, Sylvester–Jacobi leading principal minors) apply to the Hessian.
3.2 Theorem 1481 — Second-order Taylor formula with Peano remainder
Statement. Let A⊆Rn be open, x0∈A, and f∈C(2)(A). Then for every x=x0+h in a neighbourhood B(x0)⊆A,
f(x)=f(x0)+∇f(x0)(x−x0)+21(x−x0)T∇2f(x0)(x−x0)+o(∥x−x0∥2)
as x→x0.
Source: lect2_calc_diff (2).pdf (statement); proof sketch in Proofs - Differential Calculus.pdf p.3 inside the proof of Theorem 1391.
Structure of the formula.
f(x0) — the zero-order term.
∇f(x0)(x−x0) — the linear / first-order term (tangent hyperplane).
21(x−x0)T∇2f(x0)(x−x0) — the quadratic term (a quadratic form in h=x−x0).
o(∥x−x0∥2) — the Peano remainder: a function R(h) with R(h)/∥h∥2→0 as h→0.
The n=1 special case.f(x)=f(x0)+f′(x0)(x−x0)+21f′′(x0)(x−x0)2+o((x−x0)2).Maclaurin is Taylor with x0=0.
Remark on notation. The factor 1/2 in the quadratic term comes from 1/2!. For general order-k expansion the coefficient would be 1/k!, but the course stops at second order.
3.3 Theorem 1391 — Second-order conditions for unconstrained local max/min (with proof)
Statement. Let A⊆Rn be open, f∈C(2)(A), and x0∈A be a stationary point (i.e., ∇f(x0)=0). Then:
(1) Necessary condition. If x0 is a local maximiser (resp. minimiser) of f, then the quadratic form hT∇2f(x0)h is negative semi-definite (resp. positive semi-definite).
(2) Sufficient condition. If the quadratic form hT∇2f(x0)h is negative definite (resp. positive definite), then x0 is a strong local maximiser (resp. strong local minimiser) of f.
Consequently, if ∇2f(x0) is indefinite, x0 is a saddle point.
Source: Proofs - Differential Calculus.pdf p.3–4 (handwritten theorem 1391, "only (S)" — only the sufficient half is proved in the compendium, following Sicconi's syllabus).
Proof of (2) — sufficient condition, maximiser case.
Assume ∇2f(x0) defines a negative-definite quadratic form. We show x0 is a strong local maximiser.
Step 1 — Apply Taylor (Theorem 1481). Since f∈C(2) on a neighbourhood U(x0), for every x∈U(x0),
f(x)=f(x0)+∇f(x0)(x−x0)+21(x−x0)T∇2f(x0)(x−x0)+o(∥x−x0∥2),x→x0.(With ∇2f(x0) symmetric, by Schwarz.) Because x0 is stationary, ∇f(x0)=0, so the linear term vanishes:
f(x)−f(x0)=21(x−x0)T∇2f(x0)(x−x0)+o(∥x−x0∥2).
Step 3 — Renormalise. Assume x=x0, so h=0, and multiply–divide the right-hand side by ∥h∥2:
f(x)−f(x0)=∥h∥2⋅[21∥h∥2hT∇2f(x0)h+∥h∥2o(∥h∥2)]=∥h∥2⋅[21vT∇2f(x0)v+o(1)],
where v=h/∥h∥ is a unit vector, and we used the definition of little-o to write o(∥h∥2)/∥h∥2=o(1) as h→0.
Step 4 — Maximise the quadratic form over the unit sphere. Consider the continuous function
Q(v)=vT∇2f(x0)vonS={v∈Rn:∥v∥=1},
which is the unit sphere — a compact set (closed and bounded). By Weierstrass, Q attains a maximum q∗ on S. Because ∇2f(x0) is negative definite, Q(v)<0 for every v=0; in particular Q(v)<0 for every v∈S (note 0∈/S), so
q∗=maxv∈SQ(v)<0.
Step 5 — Bound f(x)−f(x0) strictly below zero. Substituting into Step 3:
f(x)−f(x0)≤∥h∥2[21q∗+o(1)].We want this bracket to be negative. Pick any ε>0; by the definition of o(1), there is δ>0 such that 0<∥h∥<δ implies ∣o(1)∣<ε, i.e., −ε<o(1)<ε. Choosing the concrete value ε=−41q∗ (which is strictly positive because q∗<0), we get for ∥h∥<δ:
21q∗+o(1)<21q∗−41q∗=41q∗<0.
Therefore
f(x)−f(x0)≤>0∥h∥2⋅<0(41q∗)<0,
i.e., f(x)<f(x0) for every x=x0 in a neighbourhood of x0. Hence x0 is a strong local maximiser. ■
Minimiser case. Replace "negative definite" by "positive definite"; then q∗>0 (in fact q⋆=minQ>0 by the same Weierstrass argument applied to −Q), and the inequality flips to f(x)>f(x0).
Saddle case. If ∇2f(x0) is indefinite there exist unit vectors v+,v− with Q(v+)>0 and Q(v−)<0. Taylor along each direction shows f increases along v+ and decreases along v−, so x0 is a saddle point.
Necessary condition (part 1), sketch. If x0 is a local maximiser, then f(x)≤f(x0) nearby, so the Taylor remainder 21hT∇2f(x0)h+o(∥h∥2)≤0. Dividing by ∥h∥2 and taking h→0 along any direction v gives vT∇2f(x0)v≤0, i.e., negative semi-definite.
Summary table.
| Sign of ∇2f(x0) | Conclusion | Role |
|---|---|---|
| positive definite | strong local min | sufficient |
| negative definite | strong local max | sufficient |
| indefinite | saddle | sufficient |
| positive semi-definite (not def.) | local min is possible, not guaranteed | necessary only |
| negative semi-definite (not def.) | local max is possible, not guaranteed | necessary only |
Critical pitfall. Semi-definite ⇒ cannot conclude. Need a direct inspection (as in Example 4.3 below, f(x,y)=x2y at (0,y)).
3.4 Theorem/Proposition 1486 — Concavity/strict-concavity test via the Hessian
Statement. Let C⊆Rn be an open convex set and f∈C(2)(C). Then:
(a) f is convex on C⟺∇2f(x) is positive semi-definite for every x∈C.
(b) f is strictly convex on C⟺∇2f(x) is positive definite for every x∈C.
Symmetric statements for concave / strictly concave by flipping all signs.
Source: Proofs - Differential Calculus.pdf (ref to prop 1486); lect 4_calc_diff (3).pdf (statement); General_24524_ENG_SOL.pdf Q8(a) Mode A / Q7(a) Mode B cites "see the textbook, proposition 1486" for the concavity test.
Proof (convex case ⇐, the direction most used on exams).
Assume ∇2f(x) is positive semi-definite for every x∈C. Pick x,x0∈C. By the 2nd-order Taylor formula with Lagrange remainder (the integral / mean-value form, course-level), there exists ξ on the segment [x0,x] (which stays in C because C is convex) such that
f(x)=f(x0)+∇f(x0)(x−x0)+21(x−x0)T∇2f(ξ)(x−x0).
Since ∇2f(ξ) is PSD, the last term is ≥0, so
f(x)≥f(x0)+∇f(x0)(x−x0).
This is the tangent-hyperplane inequality for convex functions, which (lect4) is equivalent to convexity of f on C.
Strict case (b), ⇐. If ∇2f(x) is positive definite everywhere, then for x=x0 the last term is strictly positive, giving the strict tangent-hyperplane inequality — equivalent to strict convexity.
Converse (⇒), sketch. Convexity implies f(x)≥f(x0)+∇f(x0)(x−x0). Apply Taylor, subtract the linear part, divide by ∥x−x0∥2: the lower bound ≥0 transfers to the quadratic form vT∇2f(x0)v≥0 for every unit direction v, i.e., PSD. ■
One-variable shadow. The standard "convex ⟺f′′(x)≥0" is exactly the n=1 case.
Lecture 4 sidenote. For a strictly convex function the Hessian can fail to be positive definite at isolated points (classic example f(x)=x4 has f′′(0)=0 but is strictly convex on R). The textbook equivalence (b) as stated holds under the C(2) assumption; in practice the PSD-everywhere plus "PD somewhere" condition is what is checked.
3.5 Theorem 1497 — Global extrema of differentiable convex/concave functions
Statement. Let C⊆Rn be an open convex set, f:C→Rdifferentiable, and x0∈C.
(1) If f is concave on C, then
x0 is a global maximiser for f⟺∇f(x0)=0.
(2) If f is convex on C, then
x0 is a global minimiser for f⟺∇f(x0)=0.
Since x0 is a global maximiser, it is also a local maximiser. By Fermat's theorem (§3.7), and because C is open (so x0 is an interior point), ∇f(x0)=0.
Proof of (1), ⇐ (∇f(x0)=0⇒ global max).
Since f is concave and differentiable on the open convex set C, the tangent-hyperplane characterisation of concavity gives
f(x)≤f(x0)+∇f(x0)(x−x0)∀x∈C.(Concave: the tangent hyperplane sits above the graph.) Because ∇f(x0)=0, the linear term vanishes and
f(x)≤f(x0)∀x∈C.
Hence x0 is a global maximiser.
Proof of (2). Identical, with ≤ replaced by ≥, concave replaced by convex, maximiser by minimiser. ■
Why this is the single most useful theorem for open-ended exams. If you can prove the Hessian is PSD/PD everywhere on C, then any stationary point is automatically a global minimiser (not just local) — no neighbourhood argument required. This is exactly the punchline of Final exercises Q2(b): the Hessian of x+3y2−ln(x+2y) is PD throughout the domain, so (1/3,1/3) is a global min.
3.6 Theorem 1591 — Global maximum of a concave function (extension to continuous case)
Statement. Let C⊆Rn be a convex set (not necessarily open), f:C→R be continuous on C, concave on C, and differentiable on int(C). Let x^∈int(C). Then
x^ is a global maximiser of f on C⟺∇f(x^)=0.
Symmetric statement for convex functions and global minimisers.
Source: Sicconi's course reference (textbook theorem 1591). Not proved in the compendium; follows from 1497 plus the standard continuous-extension argument.
Remark. This upgrades Theorem 1497 from open convex sets to arbitrary convex sets, provided the candidate stationary point lies in the interior. Points on the boundary still need separate treatment.
3.7 Fermat's theorem — necessary condition for interior extrema
Statement. Let A⊆Rn and f:A→R. If x0 is an interior point of A, f is differentiable at x0, and x0 is a local extremum (min or max) of f, then
∇f(x0)=0.
Source: lect 4_calc_diff (3).pdf (applied inside the proof of Theorem 1497).
Proof. Without loss of generality, assume x0 is a local maximiser. Fix any direction v∈Rn with ∥v∥=1, and consider the one-variable function φ(t):=f(x0+tv), defined on a small interval (−δ,δ) (possible because x0 is interior). Then t=0 is a local maximum of φ, so by the one-variable Fermat theorem φ′(0)=0. By the chain rule, φ′(0)=∇f(x0)⋅v. Since this must equal 0 for every direction v, we get ∇f(x0)=0. ■
Converse is false. Not every stationary point is an extremum — saddles are also stationary. Fermat is a necessary, not sufficient, condition.
3.8 Dini's theorem (Implicit function theorem, local 2D version)
Statement. Let g:A⊆R2→R and let P=(x0,y0)∈A satisfy g(x0,y0)=c. Suppose:
(i) g∈C(1) in a neighbourhood B(x0,y0)⊆A;
(ii) ∂y∂g(x0,y0)=0.
Then there exist neighbourhoods U(x0)⊆R and V(y0)⊆R (with U×V⊆B(x0,y0)) and a unique function f:U(x0)→V(y0) with f(x0)=y0 such that
g(x,f(x))=c∀x∈U(x0).
Furthermore f is differentiable at x0 and
f′(x0)=−∂g/∂y(x0,y0)∂g/∂x(x0,y0).
Source: lect5_calc_diff.pdf "Dini's Theorem: Implicit Theorem, local version"; Final exercises solution.pdf Q3 applies the formula with the minus sign.
Idea. The non-vanishing of ∂g/∂y at P says the level curve g=c is non-vertical at P — no vertical tangent — so locally we can invert the y-coordinate and express y as a function of x. The derivative formula comes from differentiating g(x,f(x))=c implicitly and using the chain rule:
∂x∂g(x,f(x))+∂y∂g(x,f(x))⋅f′(x)=0⟹f′(x)=−gygx.
Global existence/uniqueness (lect5). If instead of just P we work on a rectangular domain A×B⊆R2 (with B an interval) and we have
(1) infy∈Bg(x,y)<c<supy∈Bg(x,y) for every x∈A;
(2) g continuous in y (for each fixed x);
(3) g strictly monotonic in y (for each fixed x) — equivalently ∂g/∂y has constant sign,
then there exists a unique globalf:A→B with g(x,f(x))=c for every x∈A.
Caveat (lect5 MCQ). If ∂g/∂y(x0,y0)=0 (vertical tangent), Dini fails. Example: an ellipse at its leftmost / rightmost points fails the vertical-line test: a small move to the right leaves the ellipse entirely, so no function y=f(x) is definable there.
3.9 Weierstrass theorem and coercive-function existence (Tonelli)
Weierstrass (stated in lect4). If K⊆Rn is compact (closed and bounded) and f:K→R is continuous, then f attains both its maximum and its minimum on K: ∃xmax,xmin∈K with f(xmax)=maxKf, f(xmin)=minKf.
Tonelli / coercive theorem (stated in lect4). If f:Rn→R is continuous and coercive (lim∥x∥→∞f(x)=+∞), then f attains a global minimum. If f is continuous and super-coercive towards −∞ (lim∥x∥→∞f(x)=−∞), then f attains a global maximum.
Why coercivity alone gives existence. A continuous coercive f has bounded sub-level sets. Pick any c>inff; the sub-level set K={f≤c} is non-empty, closed (continuity), and bounded (coercivity) — hence compact. By Weierstrass, f∣K attains its minimum; that minimum is automatically the global minimum of f on Rn (points outside K have f-values >c≥minKf).
Composition rule (lect4). If g:Rn→R is continuous with g(x)→−∞ as ∥x∥→∞, and h:R→R is continuous and strictly increasing, then h∘g inherits the coercivity behaviour of g filtered through h. Example: f(x,y)=e−x2−y2: inner −x2−y2→−∞ (super-coercive towards −∞), outer et strictly increasing, so f→e−∞=0+ as ∥(x,y)∥→∞; hence the supremum 1 must be attained at a stationary point, which is (0,0) — a global maximum.
Relaxation of constraints (lect4). When the admissible set is not compact and f not coercive, one can sometimes relax the constraint and solve the unconstrained problem, then check whether the unconstrained optimum lies in the original set. If yes, it is the constrained optimum too; if no, the method fails and Lagrange multipliers or boundary analysis are needed.
§4. Worked Examples
Example 4.1 — Gradient, Hessian, and stationary points of f(x,y)=ex(x−1)y+y2
Source: May 2024 Q8 (Mode A) / Q7 (Mode B), General_24524_ENG_SOL.pdf.
Gradient. Treat y as a constant:
fx′=∂x∂[yex(x−1)]=y[ex(x−1)+ex]=y⋅ex⋅x=xyex.
Treat x as a constant:
fy′=∂y∂[yex(x−1)+y2]=ex(x−1)+2y.∇f(x,y)=[xyex,2y+ex(x−1)].
Stationary points. Solve ∇f(x,y)=0:
{xyex=02y+ex(x−1)=0
The first equation gives x=0 or y=0 (since ex>0 always).
Case x=0. The second equation becomes 2y+e0⋅(0−1)=2y−1=0, so y=1/2. Stationary point P=(0,1/2).
Case y=0. The second equation becomes 0+ex(x−1)=0⇒x−1=0⇒x=1 (since ex>0). Stationary point Q=(1,0).
Hessian. Compute second derivatives:
fxx′′=∂x∂(xyex)=yex+xyex=yex(x+1),fxy′′=∂y∂(xyex)=xex(=fyx′′ by Schwarz),fyy′′=∂y∂(2y+ex(x−1))=2.∇2f(x,y)=(yex(x+1)xexxex2).
Classify P=(0,1/2).∇2f(0,1/2)=((1/2)e0(0+1)002)=(1/2002).
Leading principal minors: detA1=1/2>0 and detA2=1>0. Positive definite. By Theorem 1391, P=(0,1/2) is a strong local minimiser.
Classify Q=(1,0).∇2f(1,0)=(0⋅e⋅2e1⋅e2)=(0ee2).detA1=0, detA2=0⋅2−e2=−e2<0. Since detA2=0, we can finalise: one eigenvalue is positive, the other negative (product =detA2<0), so the Hessian is indefinite. By Theorem 1391, Q=(1,0) is a saddle point — not a local optimiser.
Global convexity of f on R2?
By Prop. 1486 (§3.4), f is convex on R2 iff ∇2f(x,y) is PSD at every point. We found ∇2f(1,0) is indefinite — so f is not convex on R2. (Official May 2024 solution: "The function f is not convex on all R2, because ∇2f(x,y) is not positive semidefinite on all (x,y)∈R2. For example, this does not happen with point (1,0).")
Example 4.2 — Second-order Taylor of f(x,y)=x2+xcosy at (1,−π/2)
Source: May 2024 MCQ2 (Mode A) / MCQ1 (Mode B).
Step 1 — function value.f(1,−π/2)=12+1⋅cos(−π/2)=1+0=1.
Step 2 — gradient.fx′=2x+cosy, fy′=−xsiny. At (1,−π/2):
fx′(1,−π/2)=2+cos(−π/2)=2+0=2;
fy′(1,−π/2)=−1⋅sin(−π/2)=−1⋅(−1)=1.
Thus ∇f(1,−π/2)=[21].
Step 3 — Hessian.fxx′′=2, fxy′′=−siny, fyy′′=−xcosy. At (1,−π/2):
Step 4 — assemble Taylor formula. With h=(x−1,y+π/2):
f(x,y)=1+[21](x−1y+π/2)+21[x−1y+π/2](2110)(x−1y+π/2)+o(∥(x−1,y+π/2)∥2).
Step 5 — expand. Linear part: 2(x−1)+(y+π/2). Quadratic form: 21[2(x−1)2+2(x−1)(y+π/2)]=(x−1)2+(x−1)(y+π/2). After simplification the official answer is:
f(x,y)=2πx+x2+xy+o(∥(x−1,y+π/2)∥2).
Answer. None of the listed options matches exactly ⇒ the correct MCQ choice is "none of the others" (Mode A option D, Mode B option C).
Example 4.3 — A case where the Hessian is semi-definite (lect 4_calc_diff (3).pdf)
f(x,y)=x2y. Find all extrema.
Stationary points.∇f=[2xy,x2]=[00] gives x2=0⇒x=0, and then 2⋅0⋅y=0 for every y — so the entire y-axis is stationary: (0,y) for every y∈R.
Hessian.∇2f=(2y2x2x0), so ∇2f(0,y)=(2y000).
If y>0: Hessian is positive semi-definite (eigenvalues 2y,0), not definite. Theorem 1391 inconclusive.
If y<0: Hessian is negative semi-definite, not definite. Theorem 1391 inconclusive.
If y=0: Hessian is the zero matrix, the most degenerate case.
Direct inspection. Write f(x,y)=x2y near (0,y0).
If y0>0: in a small nbhd, y stays positive, so f(x,y)=x2y≥0=f(0,y0), with equality only on the y-axis. Hence (0,y0) is a weak (non-strict) local minimum.
If y0<0: symmetrically, (0,y0) is a weak local maximum.
If y0=0: the origin. For any neighbourhood, pick y>0: f(x,y)>0; pick y<0: f(x,y)<0. So (0,0) is a saddle point — neither max nor min.
Lesson. Semi-definite Hessian ⇒ you must argue by direct inspection or higher-order expansion. Theorem 1391 is silent.
Example 4.4 — f(x,y)=x+3y2−ln(x+2y) (domain, stationary point, convexity, Taylor)
Source: Final exercises solution.pdf Q2.
Domain. The logarithm requires x+2y>0, so
Dom(f)={(x,y)∈R2:x+2y>0}.
This is the open half-plane above the line x+2y=0; it is open and convex.
Hessian. Let u=x+2y. Then u1 has derivatives ∂x=−1/u2 and ∂y=−2/u2.
fxx′′=∂x[1−u−1]=u−2=(x+2y)−2,fxy′′=∂y[1−u−1]=2u−2=2(x+2y)−2,fyy′′=∂y[6y−2u−1]=6+4u−2=6+4(x+2y)−2.∇2f(x,y)=((x+2y)−22(x+2y)−22(x+2y)−26+4(x+2y)−2).
At (1/3,1/3) where x+2y=1:
∇2f(1/3,1/3)=(12210).
Leading principal minors: detA1=1>0, detA2=10−4=6>0. Positive definite ⇒ by Theorem 1391, (1/3,1/3) is a strong local minimiser.
Is it also a global minimiser? Check PSD-ness of ∇2f(x,y)everywhere on the domain:
fxx′′=(x+2y)−2>0 since the denominator is positive (domain condition).
Both leading principal minors positive everywhere — Hessian is positive definite on the whole (convex, open) domain. By Prop. 1486, f is strictly convex on Dom(f). By Theorem 1497, the unique stationary point (1/3,1/3) is the global minimiser.
Value at the minimum.f(1/3,1/3)=1/3+3⋅(1/9)−ln(1)=1/3+1/3−0=2/3.
Second-order Taylor at (1/3,1/3). With ∇f(1/3,1/3)=(0,0) (stationary point) and ∇2f(1/3,1/3)=(12210):
f(x,y)=32+21[(x−31)2+10(y−31)2+4(x−31)(y−31)]+o(∥(x−1/3,y−1/3)∥2).
Example 4.5 — Dini's theorem applied to g(x,y)=2yex+xey−2 at g=−2
Source: Final exercises solution.pdf Q3.
Find f′(0) where y=f(x) is defined implicitly by g(x,y)=−2.
Step 1 — locate the base point (x0,y0). We need x0=0 and y0 such that g(0,y0)=−2. Compute g(0,y)=2ye0+0⋅ey−2=2y−2. Setting 2y−2=−2 gives y=0. So (x0,y0)=(0,0).
Step 2 — verify Dini's hypothesis. Compute ∂g/∂y:
∂y∂g(x,y)=2ex+xey.
At (0,0): gy(0,0)=2e0+0⋅e0=2=0. ✓ Dini applies.
Step 3 — compute ∂g/∂x.∂x∂g(x,y)=2yex+ey.
At (0,0): gx(0,0)=2⋅0⋅e0+e0=1.
Example 4.6 — Existence of a global maximiser via composition-coercivity (lect4)
Find the global maximiser of f(x,y)=e−x2−y2 on R2.
Step 1 — existence. The inner function g(x,y)=−x2−y2 is continuous with g(x,y)→−∞ as ∥(x,y)∥→∞. The outer function h(t)=et is strictly increasing and continuous. By the composition rule (§3.9), f is continuous and f(x,y)→e−∞=0 at infinity, i.e., f is super-coercive towards 0 from above. In particular, on any sub-level set {f≥c} for small c>0, f is bounded — and f≤1=f(0,0) so the supremum 1 is attained. Tonelli-type argument: f has a global maximum.
Step 2 — candidates. Compute ∇f:
∇f(x,y)=[−2xe−x2−y2,−2ye−x2−y2].
Setting ∇f=0: the exponential is never 0, so −2x=0⇒x=0 and −2y=0⇒y=0. Unique stationary point (0,0).
Step 3 — conclusion. Since the global maximum exists (Tonelli) and there is only one candidate, (0,0) is the global maximum of f on R2, with f(0,0)=1.
Example 4.7 — Stationary-point classification with three cases (lect3 TA)
f(x,y)=(y−1)(y−x2). Identify all extrema. (MCQ trap: the function expands to y2−yx2−y+x2.)
Gradient.fx′=−2xy+2x, fy′=2y−x2−1.
Stationary points. Solve −2xy+2x=0 and 2y−x2−1=0.
From first: 2x(1−y)=0⇒x=0 or y=1.
Substitute into second: 2y=x2+1, so y=(x2+1)/2.
Case x=0:y=1/2. Point (0,1/2).
Case y=1:1=(x2+1)/2⇒x2=1⇒x=±1. Points (1,1) and (−1,1).
At (−1,1):∇2f(−1,1)=(0222). detA2=−4<0. Indefinite ⇒ saddle point.
Answer. The function has a unique local minimiser and two saddle points — matches MCQ choice (C) "a unique local minimiser".
§5. Solution Methods
Each method is a named algorithm with input → steps → output → pitfalls. Cross-references indicate May 2024 exam use.
M-DC-1 — Compute the gradient ∇f and Hessian ∇2f
Used on: May 2024 MCQ2 / MCQ1 (Taylor), Q8 / Q7 (stationary points), Final exercises Q2.
Input. A function f:Rn→R of class at least C(2).
Steps.
Compute each fxi′: differentiate f with respect to xi, treating the other variables as constants. Assemble the row vector ∇f=[fx1′⋯fxn′].
For each pair (i,j), compute fxixj′′=∂/∂xj(fxi′).
Assemble ∇2f as the n×n matrix with entry (i,j) equal to fxixj′′.
Schwarz sanity check: verify fxixj′′=fxjxi′′ (symmetry). If not, you have a calculation error.
Output.∇f(x) and ∇2f(x) as functions of x.
Pitfalls.
Forgetting the product and chain rules in expressions like ex(x−1) or ln(x+2y).
Miscomputing fyy′′ by forgetting that the remaining x-dependence is held constant.
Not using Schwarz: if fxy′′=fyx′′ symbolically (for a C(2) function), you have almost certainly made a sign error.
Writing the gradient as a column when the convention is a row (affects matrix products in Taylor).
M-DC-2 — Find all stationary points (solve ∇f=0)
Used on: May 2024 Q8 / Q7, Final exercises Q2, lecture examples.
Input. The gradient ∇f(x) as an n-tuple of expressions, and the domain.
Steps.
Write the system ∇f=0 as n scalar equations fxi′(x)=0.
Factor carefully. If an equation factors as A(x)⋅B(x)=0 with A,B not identically zero on the domain, split into cases A=0 and B=0.
Within each case, substitute into the remaining equations and solve.
Check the domain. Discard any algebraic solutions that lie outside Dom(f) (e.g., require x+2y>0 for a log term).
List all surviving stationary points.
Output. A finite list of points x(1),…,x(k) where ∇f=0.
Common case splits.
xyex=0 (May 2024 Q8): ex>0 always, so xy=0⇒x=0 or y=0.
ex+2y=2λx−2λ (lect4 constrained): rule out eanything=0 (impossible), keep the algebraic case.
Curves of stationary points (Example 4.3): if one factor is always zero (e.g., x2=0⇒x=0 but then 2xy=0 automatically for every y), the stationary set is a curve, not a discrete set. Each point on the curve must be analysed separately (often by direct inspection, since the Hessian is semi-definite).
Pitfalls.
Forgetting domain constraints. Final exercises Q2 yields the candidate (1/3,1/3) which happens to lie in the domain; had it lain on x+2y=0, we would have rejected it (boundary of domain, ln undefined).
Dropping solutions from y=0 or x=0 branches. Always enumerate both.
Dividing by λ / by a factor that might be zero. Either confirm the factor is non-zero (cross-check) or track the λ=0 case separately.
M-DC-3 — Classify a stationary point via the Hessian
Used on: May 2024 Q8 / Q7 (b2), Final exercises Q2 (a), lecture TA problems.
Input. A stationary point x0 and the Hessian ∇2f(x0) (a symmetric n×n matrix).
Steps.
Compute the leading principal minors detA1,detA2,…,detAn of ∇2f(x0).
Some minor =0: test inconclusive; fall through to step 3.
All nonzero but neither pattern ⇒ indefinite ⇒ saddle.
If Sylvester is inconclusive, compute all principal minors (not just leading), or compute eigenvalues:
All eigenvalues ≥0, not all >0 ⇒ positive semi-definite ⇒ Theorem 1391 is inconclusive, fall through to step 4.
All ≤0, some =0 ⇒ negative semi-definite ⇒ inconclusive.
Mixed signs ⇒ indefinite ⇒ saddle.
Direct inspection fallback. Compute f(x)−f(x0) near x0 and determine its sign by factoring or by a Taylor expansion of higher order.
Output. One of: strong local min, strong local max, saddle, inconclusive (semi-definite case).
Pitfalls.
Confusing semi-definite with definite. Semi-definite is only a necessary condition for a local extremum — not sufficient.
Sign of alternating pattern. The negative-definite pattern starts negative (detA1<0, detA2>0, detA3<0, …). All-negative is not negative-definite for n≥2.
Forgetting the case detA2<0. In 2D this means the two eigenvalues have opposite signs — automatically indefinite, regardless of detA1. (Example 4.1, (1,0): detA1=0 but detA2=−e2<0 ⇒ indefinite ⇒ saddle.)
M-DC-4 — Check global convexity/concavity via the Hessian
Used on: May 2024 Q8 / Q7 (b3), Final exercises Q2 (b).
Input. A C(2) function f defined on an open convex set C, and its Hessian ∇2f(x).
Steps.
Study the sign of ∇2f(x)as a function of x ranging over C, not just at one point.
If ∇2f(x) is positive definite (resp. PSD) for every x∈C, then f is strictly convex (resp. convex) on C by Prop. 1486.
If there is even one point x∗∈C where ∇2f(x∗) fails to be PSD (i.e., it is indefinite, or negative (semi-)definite, or has a negative eigenvalue), then f is not convex on C.
Output. One of: f strictly convex on C, f convex on C, f not convex on C (with a witness point). Symmetric statements for concave.
Consequence for global extrema.
If f is strictly convex on an open convex domain and has a stationary point x0, then by Theorem 1497 x0 is the unique global minimiser.
If f is strictly concave, the unique global maximiser is any stationary point.
Pitfalls.
Testing the Hessian only at the stationary point. That tells you local behaviour, not global. Global (strict) convexity requires PSD/PD everywhere on the domain.
Ignoring the convexity of the domain. Prop. 1486 requires C to be a convex set. For {x+2y>0}, C is a half-plane — convex ✓.
Witness point. To disprove convexity on all of R2, cite one specific point where the Hessian fails PSD (May 2024 Q8 b3: "(1,0)").
M-DC-5 — Write the second-order Taylor expansion at (a,b)
Used on: May 2024 MCQ2 / MCQ1, Final exercises Q2(c).
Input. A C(2) function f and an expansion point x0=(a,b).
Steps.
Compute f(a,b) — zero-order term.
Compute ∇f(a,b)=[fx′(a,b),fy′(a,b)] — linear coefficient.
If asked, simplify the polynomial by multiplying out.
Output. Explicit quadratic polynomial in (x−a,y−b) plus the Peano remainder.
Pitfalls.
Forgetting the 1/2 factor on the quadratic term.
Losing the factor 2 on the cross term: when you expand the matrix product, the xy-cross term has coefficient fxy′′⋅2/2=fxy′′ (because aij+aji=2fxy′′, divided by 2 of the Taylor factor).
Sign errors on fy′ when y0=−π/2: sin(−π/2)=−1, cos(−π/2)=0.
M-DC-6 — Apply Dini's theorem to find f′(x0) from g(x,y)=c
Used on: Final exercises Q3 (canonical example), lect5 exercises.
Input. A C(1) function g:A⊆R2→R, a constant c, and (ideally) a known x0.
Steps.
Locate the base point (x0,y0) satisfying g(x0,y0)=c. If only x0 is given, substitute and solve for y0.
Verify Dini's hypothesis: compute gy(x0,y0)=∂g/∂y(x0,y0) and check gy(x0,y0)=0. If it is zero, Dini's theorem does not apply — you cannot guarantee a local implicit function.
Compute gx(x0,y0).
Apply the formula:
f′(x0)=−gy(x0,y0)gx(x0,y0).
Sign check. The minus sign is not optional.
Output. The numerical value of f′(x0) (or a symbolic expression).
Pitfalls.
Forgetting the minus sign — the single most common error. The formula isf′(x0)=−gx/gy.
Verifying gy=0 at the wrong point (e.g., at (x0,0) when actually y0=0): always substitute the correct (x0,y0).
Missing the existence question: Dini gives local uniqueness in a neighbourhood. If the problem asks for a global function, check the extra hypotheses of the uniqueness theorem (strict monotonicity in y).
Applying Dini at a point not on the level curve: if g(x0,y0)=c, there is nothing to expand around — re-derive y0.
M-DC-7 — Existence of global extrema via coercivity and Weierstrass
Used on: lect4 "Maximisers with Tonelli's Theorem", auxiliary in Final exercises.
Input. A continuous function f:A⊆Rn→R and a (possibly unbounded) admissible set A.
Steps (case A=Rn).
Check continuity of f.
Coercivity test: examine lim∥x∥→∞f(x).
If =+∞: f is coercive ⇒ by Tonelli, f attains a global minimum.
If =−∞: f is super-coercive towards −∞ ⇒ attains a global maximum.
If neither (e.g., oscillating or going to a finite limit): the existence theorems do not apply directly — try relaxation, Lagrange, or boundary analysis.
Composition rule: if f=h∘g, check if the outer h is strictly increasing (preserves coercivity of g) or strictly decreasing (flips it).
Find all stationary and exceptional points (where ∇f=0, plus points where f is not differentiable).
Evaluate f at each candidate and pick the extremum.
Steps (case A⊆Rn compact).
Weierstrass applies directly: f attains its max and min on A.
Candidates: interior stationary points of f∣A, boundary points, and non-differentiability points.
Evaluate f at each; the biggest/smallest wins.
Steps (relaxation for non-compact, non-coercive problems).
Solve the unconstrained problem min/maxf on Rn.
If the unconstrained optimum lies in A, it is also the constrained optimum.
If not, the constrained problem likely has no solution via this method — switch to Lagrange or boundary analysis.
Pitfalls.
Unbounded set + continuous function is not enough. Weierstrass requires compact (closed and bounded).
Forgetting that a minimum may not exist.f(x)=ex on R is continuous but not coercive (goes to 0 at −∞); inf=0 is not attained.
Sub-vs. super-coercive confusion. Coercive = "+∞ at infinity" (guarantees minimum). Super-coercive towards −∞ = "−∞ at infinity" (guarantees maximum). Different sign, different conclusion.
§6. Practice Problems with Solutions
Problem 6.1 — May 2024 MCQ2 (Mode A) / MCQ1 (Mode B): second-order Taylor
The Taylor formula of order two for f(x,y)=x2+xcosy, centred at (1,−π/2), is …
Solution. See Example 4.2. The answer is "none of the others" (option D Mode A / option C Mode B), since the simplification yields f(x,y)=2πx+x2+xy+o(∥(x−1,y+π/2)∥2), which does not match options A–C.
Problem 6.2 — May 2024 Q8 (Mode A) / Q7 (Mode B): Hessian classification and global convexity
(a) Let C⊆Rn be a convex set and f:C→R be continuously twice differentiable. State the Concavity / Strict Concavity test on the basis of the sign of the Hessian matrix.
(b) Consider f(x,y)=ex(x−1)y+y2.
(b1) Write the gradient vector of f and determine the stationary points.
(b2) Write the Hessian matrix and explain the nature of the stationary points.
(b3) Is f convex on all R2? Justify.
(a)Statement of Prop. 1486 (concavity / strict concavity test). Let C⊆Rn be an open convex set and f∈C(2)(C). Then:
f is concave on C⟺∇2f(x) is negative semi-definite for every x∈C.
f is strictly concave on C⟺∇2f(x) is negative definite for every x∈C.
Analogously for convexity (positive semi-definite / positive definite).
(b1) See Example 4.1. Gradient ∇f(x,y)=[xyex,2y+ex(x−1)]. Stationary points: P=(0,1/2) and Q=(1,0).
(b2) Hessian ∇2f(x,y)=(yex(x+1)xexxex2).
At P=(0,1/2): ∇2f(P)=(1/2002) — positive definite (both leading minors positive). By Theorem 1391, P is a strong local minimiser.
At Q=(1,0): ∇2f(Q)=(0ee2), det=−e2<0 — indefinite. Q is a saddle (not an optimiser).
(b3)f is not convex on R2. A witness is (1,0), where ∇2f(1,0) is indefinite — not positive semi-definite. By Prop. 1486, this rules out global convexity.
Source: General_24524_ENG_SOL.pdf Q8/Q7.
Problem 6.3 — Final exercises Q2: full optimization on a log-domain
Given f(x,y)=x+3y2−ln(x+2y).
(a) Determine any local max/min point.
(b) Can we deduce whether they are global extrema?
(c) Find the 2nd-order Taylor formula in a neighbourhood of (1/3,1/3).
Solution. See Example 4.4.
(a) Domain: {x+2y>0}. Unique stationary point (1/3,1/3). Hessian there (12210) is PD ⇒ strong local minimiser.
(b) Yes. The Hessian is PD everywhere on the (open, convex) domain, so f is strictly convex on its domain. By Theorem 1497, the unique stationary point is the unique global minimiser.
(c) With f(1/3,1/3)=2/3, ∇f(1/3,1/3)=0, ∇2f(1/3,1/3)=(12210):
f(x,y)=32+21[(x−31)2+10(y−31)2+4(x−31)(y−31)]+o(∥(x−1/3,y−1/3)∥2).
Source: Final exercises solution.pdf pp.4–6, Problem 2.
Problem 6.4 — Final exercises Q3: Dini's theorem
Given g(x,y)=2yex+xey−2. Find f′(0) of the function y=f(x) implicitly defined by g(x,y)=−2.
Solution. See Example 4.5. f′(0)=−1/2.
Source: Final exercises solution.pdf p.7, Problem 3.
Problem 6.5 — lect 4: find the extrema of f(x,y)=x2+y2+x+y
(a) Find the gradient.
(b) Find the stationary point(s).
(c) Determine whether it is a point of extrema.
Solution.
(a) ∇f(x,y)=[2x+1,2y+1].
(b) ∇f=0 gives x=−1/2,y=−1/2. Unique stationary point (−1/2,−1/2).
(c) ∇2f(x,y)=(2002) for every (x,y). detA1=2,detA2=4 — positive definite everywhere. So f is strictly convex on R2 (convex open domain). By Theorem 1497, (−1/2,−1/2) is the strong global minimiser.
Source: lect 4_calc_diff (3).pdf.
Problem 6.6 — lect3: f(x,y)=x2y−y2x−3x
Find the points of local extrema.
Solution.
Gradient.fx′=2xy−y2−3, fy′=x2−2xy.
Stationary points. From fy′=0: x(x−2y)=0, so x=0 or x=2y.
Case x=0:fx′=0−y2−3=0⇒y2=−3 — no real solution.
Case x=2y:fx′=2⋅2y⋅y−y2−3=4y2−y2−3=3y2−3=0⇒y2=1⇒y=±1. Corresponding x=±2. Stationary points (2,1) and (−2,−1).
Hessian.fxx′′=2y,fxy′′=2x−2y,fyy′′=−2x. So ∇2f(x,y)=(2y2x−2y2x−2y−2x).
At (2,1): (222−4), detA1=2,detA2=−8−4=−12<0 — indefinite ⇒ saddle.
At (−2,−1): (−2−2−24), detA1=−2,detA2=−8−4=−12<0 — indefinite ⇒ saddle.
Answer. Both stationary points are saddles; f has no local extrema.
Source: Lect3_calc_diff (3).pdf.
Problem 6.7 — Final exercises Q1 recap with Differential-Calculus lens
f(x,y,z)=kx2+ky2−4z2−2xz. Find conditions on k for the origin to be a strong global maximum.
Solution. The origin is stationary for every quadratic form (all partial derivatives vanish at 0). For it to be a strong global max, we need f(x)<0 for every x=0 — i.e., the associated matrix A must be negative definite. A=(k0−10k0−10−4). Apply Sylvester–Jacobi with alternating pattern detA1<0,detA2>0,detA3<0:
k<0,k2>0 (trivially, k=0),−4k2−k<0⟺k(4k+1)>0.
Combining: k<−1/4. (This links Diff-Calc to §01 Linear Algebra — the same A appears in Final Q1b, eigenvalue signing.)
Source: Final exercises solution.pdf pp.1–3, Problem 1.
Problem 6.8 — lect5: Dini on g(x,y)=x2−2y−ey=0, find f′(1)
Setup. Verify g is in the range of y↦g(x,y): as y→−∞, g→x2−(−∞)−0=+∞; as y→+∞, g→x2−∞−∞=−∞. So 0∈(range) and by the existence theorem there is y=f(x). Uniqueness: gy(x,y)=−2−ey<0 everywhere ⇒ g is strictly decreasing in y ⇒ f is unique.
Find extrema of f. Dini gives f′(x)=−gx/gy=−2x/(−2−ey)=2x/(2+ey). Stationary points: f′(x)=0⟺x=0. First-derivative test: f′ changes from − (when x<0) to + (when x>0), so x=0 is a strong local+global minimiser of f.
1st-order Taylor at (1,0). With y=f(x), f(1)=0 (from solving 1−0−1=0), f′(1)=2⋅1/(2+1)=2/3:
f(x)=f(1)+f′(1)(x−1)+o(x−1)=32(x−1)+o(x−1).
Source: lect5_calc_diff.pdf.
§7. Common Pitfalls
Below are the errors Sicconi flags most often and the traps the Bocconi exam-writers exploit. Read this list the day before the exam.
Domain constraints.
For ln(u): require u>0.
For u: require u≥0 (and u>0 for differentiability).
For 1/u: require u=0.
Before declaring a candidate a stationary point, always verify it lies in Dom(f). Algebraic solutions outside the domain must be rejected.
Difference matters for uniqueness of the global minimum: strict convexity ⇒ at most one global min; convex ⇒ possibly a whole flat region of minima.
Local vs global without convexity.
Theorem 1391 (2nd-order conditions) gives local extrema only. It cannot distinguish a local from a global without extra info.
To upgrade to global, you need convexity (Theorem 1497) or coercivity (Tonelli) or compactness (Weierstrass).
May 2024 Q8 b2 → b3: (b2) proves P=(0,1/2) is a local min. (b3) explains the Hessian fails PSD elsewhere ⇒ cannot conclude global convexity ⇒ cannot conclude P is global min via this route.
Semi-definite Hessian: Theorem 1391 is inconclusive.
Positive semi-definite Hessian at x0 is only a necessary condition for x0 being a local min — not sufficient.
Example 4.3 (f=x2y on the y-axis) illustrates: Hessian is PSD, but x0=(0,0) is a saddle. Fall back to direct inspection or higher-order Taylor.
Dini's formula sign error.
The formula is f′(x0)=−gx/gy, with minus. Forgetting the minus sign flips the answer.
Always write the minus sign first.
Hessian symmetry: Schwarz must hold.
If you compute fxy′′ and fyx′′ and they disagree for a C(2) function, you have an arithmetic error.
Conversely, use Schwarz to save time: once you compute fxy′′, the (2,1) entry of the Hessian is automatic.
Negative-definite Sylvester pattern: alternating, not all-negative.
Not all-negative. Consistent with "−A positive definite ⇒ eigenvalues of −A all >0 ⇒ eigenvalues of A all <0", and the leading minors of −A are (−1)kdetAk.
Taylor coefficients: the 1/2 and the factor 2 on the cross term.
Quadratic form 21hTHh expands to 21[h12fxx′′+2h1h2fxy′′+h22fyy′′]. The cross-term coefficient is fxy′′ (not 2fxy′′ and not fxy′′/2).
Boundary points are not stationary in the gradient sense.
On a closed set with non-empty boundary, the max/min may live on the boundary where ∇f=0.
Fermat requires x0 to be an interior point. Apply boundary analysis separately.
Coercivity gives existence, not uniqueness.
Tonelli says a global min exists — not that it is unique.
Uniqueness requires strict convexity (or a direct argument that the stationary point is unique and other candidates are ruled out).
Composition of coercive functions with monotone outer.
h strictly increasing + g coercive towards +∞ ⇒ h∘g coercive towards lim+∞h.
h strictly increasing + g super-coercive towards −∞ ⇒ h∘g tends to lim−∞h at infinity.
If h is strictly decreasing, the conclusions flip.
Ignoring the convexity of the domain when applying Prop. 1486.
Prop. 1486 requires C to be an openconvex set. Half-planes, open balls, and all of Rn qualify; annuli and punctured disks do not (not convex).
"The function is strictly concave ⇒ at most one global max".
Strict convexity / concavity ensures uniqueness of the global extremum if one exists. Existence still needs Weierstrass / Tonelli.
Reading the MCQ options carefully.
Exam wording tests common distractors: wrong Dini sign; missing factor of 1/2 in Taylor; using Hessian of f(x,y) at (0,0) instead of the required (1,−π/2).
If the simplified Taylor polynomial doesn't match any of options A–C, the correct answer is very likely "none of the others" — trust the calculation and resist the urge to "round" to a listed option.
Final checklist before the exam (differential calculus slice)
[ ] I can state and prove Theorem 1391 (sufficient condition for strong local max/min), including the Weierstrass-on-unit-sphere step.
[ ] I can state Theorem 1481 (2nd-order Taylor with Peano remainder) and use it in computations.
[ ] I can state Prop. 1486 (Hessian sign ⇔ convexity/concavity) and use it on both sides.
[ ] I can state and prove Theorem 1497 (∇f = 0 ⇔ global extremum for convex/concave differentiable functions), including the use of Fermat and the tangent-hyperplane inequality.
[ ] I can state Theorem 1591 and know it extends 1497 to non-open convex sets via interior points.
[ ] I can state Fermat's theorem (∇f = 0 at interior local extrema) and sketch its proof.
[ ] I can state Dini's theorem and apply the formula f′(x0)=−gx/gy without dropping the sign.
[ ] I can state Weierstrass (compact ⇒ max/min) and Tonelli (coercive ⇒ min / super-coercive towards −∞ ⇒ max).
[ ] I can run M-DC-1 through M-DC-7 on a R2 problem in under 20 minutes.
[ ] I know that semi-definite Hessian leaves Theorem 1391 inconclusive, and I have a direct-inspection fallback.
[ ] I can distinguish "local", "strong local", "global", "strong global", and the role of convexity in upgrading each.