Chapter 9  Multilinear Algebra and Determinants

We begin this chapter by investigating bilinear forms and quadratic forms on a vector space. Then we will move on to multilinear forms. We will show that the vector space of alternating 𝑛-linear forms has dimension one on a vector space of dimension 𝑛. This result will allow us to give a clean basis-free definition of the determinant of an operator.

This approach to the determinant via alternating multilinear forms leads to straightforward proofs of key properties of determinants. For example, we will see that the determinant is multiplicative, meaning that det(𝑆𝑇) = (det 𝑆)(det 𝑇) for all operators 𝑆 and 𝑇 on the same vector space. We will also see that 𝑇 is invertible if and only if det 𝑇 β‰  0. Another important result states that the determinant of an operator on a complex vector space equals the product of the eigenvalues of the operator, with each eigenvalue included as many times as its multiplicity. The chapter concludes with an introduction to tensor products.

standing assumptions for this chapter

β€’ 𝐅 denotes 𝐑 or 𝐂.
β€’ 𝑉 and π‘Š denote finite-dimensional nonzero vector spaces over 𝐅.

[Photo by Matthew Petroff, CC BY-SA] The Mathematical Institute at the University of GΓΆttingen. This building opened in 1930, when Emmy Noether (1882–1935) had already been a research mathematician and faculty member at the university for 15 years (the first eight years without salary). Noether was fired by the Nazi government in 1933. By then Noether and her collaborators had created many of the foundations of modern algebra, including an abstract algebra viewpoint that contributed to the development of linear algebra.

9A  Bilinear Forms and Quadratic Forms

Bilinear Forms

A bilinear form on 𝑉 is a function from 𝑉 Γ— 𝑉 to 𝐅 that is linear in each slot separately, meaning that if we hold either slot fixed then we have a linear function in the other slot. Here is the formal definition.

9.1 definition: bilinear form

A bilinear form on 𝑉 is a function 𝛽: 𝑉 Γ— 𝑉 β†’ 𝐅 such that
  𝑣 ↦ 𝛽(𝑣, 𝑒)  and  𝑣 ↦ 𝛽(𝑒, 𝑣)
are both linear functionals on 𝑉 for every 𝑒 ∈ 𝑉.

Recall that the term linear functional, used in the definition above, means a linear function that maps into the scalar field 𝐅. Thus the term bilinear functional would be more consistent terminology than bilinear form, which unfortunately has become standard.

For example, if 𝑉 is a real inner product space, then the function that takes an ordered pair (𝑒, 𝑣) ∈ 𝑉 Γ— 𝑉 to βŸ¨π‘’, π‘£βŸ© is a bilinear form on 𝑉. If 𝑉 is a nonzero complex inner product space, then this function is not a bilinear form because the inner product is not linear in the second slot (complex scalars come out of the second slot as their complex conjugates).

If 𝐅 = 𝐑, then a bilinear form differs from an inner product in that an inner product requires symmetry [meaning that 𝛽(𝑣, 𝑀) = 𝛽(𝑀, 𝑣) for all 𝑣, 𝑀 ∈ 𝑉] and positive definiteness [meaning that 𝛽(𝑣, 𝑣) > 0 for all 𝑣 ∈ 𝑉\{0}], but these properties are not required for a bilinear form.

9.2 example: bilinear forms

β€’ The function 𝛽: 𝐅³ Γ— 𝐅³ β†’ 𝐅 defined by
  𝛽((π‘₯_1, π‘₯_2, π‘₯_3), (𝑦_1, 𝑦_2, 𝑦_3)) = π‘₯_1 𝑦_2 βˆ’ 5π‘₯_2 𝑦_3 + 2π‘₯_3 𝑦_1
  is a bilinear form on 𝐅³.
β€’ Suppose 𝐴 is an 𝑛-by-𝑛 matrix with 𝐴_{𝑗,π‘˜} ∈ 𝐅 in row 𝑗, column π‘˜.
  Define a bilinear form 𝛽_𝐴 on 𝐅^𝑛 by

    𝛽_𝐴((π‘₯_1, …, π‘₯_𝑛), (𝑦_1, …, 𝑦_𝑛)) = βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{𝑛} 𝐴_{𝑗,π‘˜} π‘₯_𝑗 𝑦_π‘˜.

  The first bullet point is a special case of this bullet point with 𝑛 = 3 and

    𝐴 = ⎑ 0  1  0 ⎀
        ⎒ 0  0 βˆ’5 βŽ₯
        ⎣ 2  0  0 ⎦ .

β€’ Suppose 𝑉 is a real inner product space and 𝑇 ∈ β„’(𝑉). Then the function 𝛽: 𝑉 Γ— 𝑉 β†’ 𝐑 defined by
  𝛽(𝑒, 𝑣) = βŸ¨π‘’, π‘‡π‘£βŸ©
  is a bilinear form on 𝑉.
β€’ If 𝑛 is a positive integer, then the function 𝛽: 𝒫_𝑛(𝐑) Γ— 𝒫_𝑛(𝐑) β†’ 𝐑 defined by
  𝛽(𝑝, π‘ž) = 𝑝(2) β‹… π‘žβ€²(3)
  is a bilinear form on 𝒫_𝑛(𝐑).
β€’ Suppose πœ‘, 𝜏 ∈ 𝑉′. Then the function 𝛽: 𝑉 Γ— 𝑉 β†’ 𝐅 defined by
  𝛽(𝑒, 𝑣) = πœ‘(𝑒) β‹… 𝜏(𝑣)
  is a bilinear form on 𝑉.
β€’ More generally, suppose that πœ‘_1, …, πœ‘_𝑛, 𝜏_1, …, 𝜏_𝑛 ∈ 𝑉′. Then the function 𝛽: 𝑉 Γ— 𝑉 β†’ 𝐅 defined by
  𝛽(𝑒, 𝑣) = πœ‘_1(𝑒) β‹… 𝜏_1(𝑣) + β‹― + πœ‘_𝑛(𝑒) β‹… 𝜏_𝑛(𝑣)
  is a bilinear form on 𝑉.

A bilinear form on 𝑉 is a function from 𝑉 Γ— 𝑉 to 𝐅. Because 𝑉 Γ— 𝑉 is a vector space, this raises the question of whether a bilinear form can also be a linear map from 𝑉 Γ— 𝑉 to 𝐅. Note that none of the bilinear forms in 9.2 are linear maps except in some special cases in which the bilinear form is the zero map. Exercise 3 shows that a bilinear form 𝛽 on 𝑉 is a linear map on 𝑉 Γ— 𝑉 only if 𝛽 = 0.

9.3 definition: 𝑉^(2)

The set of bilinear forms on 𝑉 is denoted by 𝑉^(2).

With the usual operations of addition and scalar multiplication of functions, 𝑉^(2) is a vector space.

For 𝑇 an operator on an 𝑛-dimensional vector space 𝑉 and a basis 𝑒_1, …, 𝑒_𝑛 of 𝑉, we used an 𝑛-by-𝑛 matrix to provide information about 𝑇. We now do the same thing for bilinear forms on 𝑉.

9.4 definition: matrix of a bilinear form, β„³(𝛽)

Suppose 𝛽 is a bilinear form on 𝑉 and 𝑒_1, …, 𝑒_𝑛 is a basis of 𝑉. The matrix of 𝛽 with respect to this basis is the 𝑛-by-𝑛 matrix β„³(𝛽) whose entry β„³(𝛽)_{𝑗,π‘˜} in row 𝑗, column π‘˜ is given by
  β„³(𝛽)_{𝑗,π‘˜} = 𝛽(𝑒_𝑗, 𝑒_π‘˜).
If the basis 𝑒_1, …, 𝑒_𝑛 is not clear from the context, then the notation β„³(𝛽, (𝑒_1, …, 𝑒_𝑛)) is used.

Recall that 𝐅^{𝑛,𝑛} denotes the vector space of 𝑛-by-𝑛 matrices with entries in 𝐅 and that dim 𝐅^{𝑛,𝑛} = 𝑛² (see 3.39 and 3.40).

9.5 dim 𝑉^(2) = (dim 𝑉)²

Suppose 𝑒_1, …, 𝑒_𝑛 is a basis of 𝑉. Then the map 𝛽 ↦ β„³(𝛽) is an isomorphism of 𝑉^(2) onto 𝐅^{𝑛,𝑛}. Furthermore, dim 𝑉^(2) = (dim 𝑉)².

Proof  The map 𝛽 ↦ β„³(𝛽) is clearly a linear map of 𝑉^(2) into 𝐅^{𝑛,𝑛}. For 𝐴 ∈ 𝐅^{𝑛,𝑛}, define a bilinear form 𝛽_𝐴 on 𝑉 by

  𝛽_𝐴(π‘₯_1 𝑒_1 + β‹― + π‘₯_𝑛 𝑒_𝑛, 𝑦_1 𝑒_1 + β‹― + 𝑦_𝑛 𝑒_𝑛) = βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{𝑛} 𝐴_{𝑗,π‘˜} π‘₯_𝑗 𝑦_π‘˜

for π‘₯_1, …, π‘₯_𝑛, 𝑦_1, …, 𝑦_𝑛 ∈ 𝐅 (if 𝑉 = 𝐅^𝑛 and 𝑒_1, …, 𝑒_𝑛 is the standard basis of 𝐅^𝑛, this 𝛽_𝐴 is the same as the bilinear form 𝛽_𝐴 in the second bullet point of Example 9.2). The linear map 𝛽 ↦ β„³(𝛽) from 𝑉^(2) to 𝐅^{𝑛,𝑛} and the linear map 𝐴 ↦ 𝛽_𝐴 from 𝐅^{𝑛,𝑛} to 𝑉^(2) are inverses of each other because 𝛽_{β„³(𝛽)} = 𝛽 for all 𝛽 ∈ 𝑉^(2) and β„³(𝛽_𝐴) = 𝐴 for all 𝐴 ∈ 𝐅^{𝑛,𝑛}, as you should verify. Thus both maps are isomorphisms and the two spaces that they connect have the same dimension.
Hence dim 𝑉^(2) = dim 𝐅^{𝑛,𝑛} = 𝑛² = (dim 𝑉)².

Recall that 𝐢^t denotes the transpose of a matrix 𝐢. The matrix 𝐢^t is obtained by interchanging the rows and the columns of 𝐢.

9.6 composition of a bilinear form and an operator

Suppose 𝛽 is a bilinear form on 𝑉 and 𝑇 ∈ β„’(𝑉). Define bilinear forms 𝛼 and 𝜌 on 𝑉 by
  𝛼(𝑒, 𝑣) = 𝛽(𝑒, 𝑇𝑣)  and  𝜌(𝑒, 𝑣) = 𝛽(𝑇𝑒, 𝑣).
Let 𝑒_1, …, 𝑒_𝑛 be a basis of 𝑉. Then
  β„³(𝛼) = β„³(𝛽)β„³(𝑇)  and  β„³(𝜌) = β„³(𝑇)^t β„³(𝛽).

Proof  If 𝑗, π‘˜ ∈ {1, …, 𝑛}, then

  β„³(𝛼)_{𝑗,π‘˜} = 𝛼(𝑒_𝑗, 𝑒_π‘˜) = 𝛽(𝑒_𝑗, 𝑇𝑒_π‘˜) = 𝛽(𝑒_𝑗, βˆ‘_{π‘š=1}^{𝑛} β„³(𝑇)_{π‘š,π‘˜} 𝑒_π‘š)
              = βˆ‘_{π‘š=1}^{𝑛} 𝛽(𝑒_𝑗, 𝑒_π‘š) β„³(𝑇)_{π‘š,π‘˜} = (β„³(𝛽)β„³(𝑇))_{𝑗,π‘˜}.

Thus β„³(𝛼) = β„³(𝛽)β„³(𝑇). The proof that β„³(𝜌) = β„³(𝑇)^t β„³(𝛽) is similar.

The result below shows how the matrix of a bilinear form changes if we change the basis. The formula in the result below should be compared to the change-of-basis formula for the matrix of an operator (see 3.84). The two formulas are similar, except that the transpose 𝐢^t appears in the formula below and the inverse 𝐢^{βˆ’1} appears in the change-of-basis formula for the matrix of an operator.

9.7 change-of-basis formula

Suppose 𝛽 ∈ 𝑉^(2). Suppose 𝑒_1, …, 𝑒_𝑛 and 𝑓_1, …, 𝑓_𝑛 are bases of 𝑉. Let
  𝐴 = β„³(𝛽, (𝑒_1, …, 𝑒_𝑛))  and  𝐡 = β„³(𝛽, (𝑓_1, …, 𝑓_𝑛))
and 𝐢 = β„³(𝐼, (𝑒_1, …, 𝑒_𝑛), (𝑓_1, …, 𝑓_𝑛)). Then
  𝐴 = 𝐢^t 𝐡𝐢.

Proof  The linear map lemma (3.4) tells us that there exists an operator 𝑇 ∈ β„’(𝑉) such that 𝑇𝑓_π‘˜ = 𝑒_π‘˜ for each π‘˜ = 1, …, 𝑛. The definition of the matrix of an operator with respect to a basis implies that
  β„³(𝑇, (𝑓_1, …, 𝑓_𝑛)) = 𝐢.
Define bilinear forms 𝛼, 𝜌 on 𝑉 by
  𝛼(𝑒, 𝑣) = 𝛽(𝑒, 𝑇𝑣)  and  𝜌(𝑒, 𝑣) = 𝛼(𝑇𝑒, 𝑣) = 𝛽(𝑇𝑒, 𝑇𝑣).
Then 𝛽(𝑒_𝑗, 𝑒_π‘˜) = 𝛽(𝑇𝑓_𝑗, 𝑇𝑓_π‘˜) = 𝜌(𝑓_𝑗, 𝑓_π‘˜) for all 𝑗, π‘˜ ∈ {1, …, 𝑛}. Thus

  𝐴 = β„³(𝜌, (𝑓_1, …, 𝑓_𝑛))
    = 𝐢^t β„³(𝛼, (𝑓_1, …, 𝑓_𝑛))
    = 𝐢^t 𝐡𝐢,

where the second and third lines each follow from 9.6.

9.8 example: the matrix of a bilinear form on 𝒫_2(𝐑)

Define a bilinear form 𝛽 on 𝒫_2(𝐑) by 𝛽(𝑝, π‘ž) = 𝑝(2) β‹… π‘žβ€²(3). Let
  𝐴 = β„³(𝛽, (1, π‘₯ βˆ’ 2, (π‘₯ βˆ’ 3)Β²))  and  𝐡 = β„³(𝛽, (1, π‘₯, π‘₯Β²))
and 𝐢 = β„³(𝐼, (1, π‘₯ βˆ’ 2, (π‘₯ βˆ’ 3)Β²), (1, π‘₯, π‘₯Β²)). Then

  𝐴 = ⎑ 0 1 0 ⎀    𝐡 = ⎑ 0 1  6 ⎀    𝐢 = ⎑ 1 βˆ’2  9 ⎀
      ⎒ 0 0 0 βŽ₯        ⎒ 0 2 12 βŽ₯        ⎒ 0  1 βˆ’6 βŽ₯
      ⎣ 0 1 0 ⎦        ⎣ 0 4 24 ⎦        ⎣ 0  0  1 ⎦ .

Now the change-of-basis formula 9.7 asserts that 𝐴 = 𝐢^t 𝐡𝐢, which you can verify with matrix multiplication using the matrices above.

Symmetric Bilinear Forms

9.9 definition: symmetric bilinear form, 𝑉^(2)_sym

A bilinear form 𝜌 ∈ 𝑉^(2) is called symmetric if
  𝜌(𝑒, 𝑀) = 𝜌(𝑀, 𝑒)
for all 𝑒, 𝑀 ∈ 𝑉. The set of symmetric bilinear forms on 𝑉 is denoted by 𝑉^(2)_sym.

9.10 example: symmetric bilinear forms

β€’ If 𝑉 is a real inner product space and 𝜌 ∈ 𝑉^(2) is defined by 𝜌(𝑒, 𝑀) = βŸ¨π‘’, π‘€βŸ©, then 𝜌 is a symmetric bilinear form on 𝑉.
β€’ Suppose 𝑉 is a real inner product space and 𝑇 ∈ β„’(𝑉). Define 𝜌 ∈ 𝑉^(2) by
  𝜌(𝑒, 𝑀) = βŸ¨π‘’, π‘‡π‘€βŸ©.
  Then 𝜌 is a symmetric bilinear form on 𝑉 if and only if 𝑇 is a self-adjoint operator (the previous bullet point is the special case 𝑇 = 𝐼).
β€’ Suppose 𝜌: β„’(𝑉) Γ— β„’(𝑉) β†’ 𝐅 is defined by
  𝜌(𝑆, 𝑇) = tr(𝑆𝑇).
  Then 𝜌 is a symmetric bilinear form on β„’(𝑉) because trace is a linear functional on β„’(𝑉) and tr(𝑆𝑇) = tr(𝑇𝑆) for all 𝑆, 𝑇 ∈ β„’(𝑉); see 8.56.

9.11 definition: symmetric matrix

A square matrix 𝐴 is called symmetric if it equals its transpose.

An operator on 𝑉 may have a symmetric matrix with respect to some but not all bases of 𝑉. In contrast, the next result shows that a bilinear form on 𝑉 has a symmetric matrix with respect to either all bases of 𝑉 or with respect to no bases of 𝑉.

9.12 symmetric bilinear forms are diagonalizable

Suppose 𝜌 ∈ 𝑉^(2). Then the following are equivalent.
(a) 𝜌 is a symmetric bilinear form on 𝑉.
(b) β„³(𝜌, (𝑒_1, …, 𝑒_𝑛)) is a symmetric matrix for every basis 𝑒_1, …, 𝑒_𝑛 of 𝑉.
(c) β„³(𝜌, (𝑒_1, …, 𝑒_𝑛)) is a symmetric matrix for some basis 𝑒_1, …, 𝑒_𝑛 of 𝑉.
(d) β„³(𝜌, (𝑒_1, …, 𝑒_𝑛)) is a diagonal matrix for some basis 𝑒_1, …, 𝑒_𝑛 of 𝑉.

Proof  First suppose (a) holds, so 𝜌 is a symmetric bilinear form. Suppose 𝑒_1, …, 𝑒_𝑛 is a basis of 𝑉 and 𝑗, π‘˜ ∈ {1, …, 𝑛}. Then 𝜌(𝑒_𝑗, 𝑒_π‘˜) = 𝜌(𝑒_π‘˜, 𝑒_𝑗) because 𝜌 is symmetric. Thus β„³(𝜌, (𝑒_1, …, 𝑒_𝑛)) is a symmetric matrix, showing that (a) implies (b).

Clearly (b) implies (c).

Now suppose (c) holds and 𝑒_1, …, 𝑒_𝑛 is a basis of 𝑉 such that β„³(𝜌, (𝑒_1, …, 𝑒_𝑛)) is a symmetric matrix. Suppose 𝑒, 𝑀 ∈ 𝑉. There exist π‘Ž_1, …, π‘Ž_𝑛, 𝑏_1, …, 𝑏_𝑛 ∈ 𝐅 such that 𝑒 = π‘Ž_1 𝑒_1 + β‹― + π‘Ž_𝑛 𝑒_𝑛 and 𝑀 = 𝑏_1 𝑒_1 + β‹― + 𝑏_𝑛 𝑒_𝑛. Now

  𝜌(𝑒, 𝑀) = 𝜌(βˆ‘_{𝑗=1}^{𝑛} π‘Ž_𝑗 𝑒_𝑗, βˆ‘_{π‘˜=1}^{𝑛} 𝑏_π‘˜ 𝑒_π‘˜)
          = βˆ‘_{𝑗=1}^{𝑛} βˆ‘_{π‘˜=1}^{𝑛} π‘Ž_𝑗 𝑏_π‘˜ 𝜌(𝑒_𝑗, 𝑒_π‘˜)
          = βˆ‘_{𝑗=1}^{𝑛} βˆ‘_{π‘˜=1}^{𝑛} π‘Ž_𝑗 𝑏_π‘˜ 𝜌(𝑒_π‘˜, 𝑒_𝑗)
          = 𝜌(βˆ‘_{π‘˜=1}^{𝑛} 𝑏_π‘˜ 𝑒_π‘˜, βˆ‘_{𝑗=1}^{𝑛} π‘Ž_𝑗 𝑒_𝑗)
          = 𝜌(𝑀, 𝑒),

where the third line holds because β„³(𝜌) is a symmetric matrix. The equation above shows that 𝜌 is a symmetric bilinear form, proving that (c) implies (a).

At this point, we have proved that (a), (b), (c) are equivalent. Because every diagonal matrix is symmetric, (d) implies (c). To complete the proof, we will show that (a) implies (d) by induction on 𝑛 = dim 𝑉.

If 𝑛 = 1, then (a) implies (d) because every 1-by-1 matrix is diagonal. Now suppose 𝑛 > 1 and the implication (a) ⟹ (d) holds for one less dimension. Suppose (a) holds, so 𝜌 is a symmetric bilinear form. If 𝜌 = 0, then the matrix of 𝜌 with respect to every basis of 𝑉 is the zero matrix, which is a diagonal matrix. Hence we can assume that 𝜌 β‰  0, which means there exist 𝑒, 𝑀 ∈ 𝑉 such that 𝜌(𝑒, 𝑀) β‰  0. Now
  2𝜌(𝑒, 𝑀) = 𝜌(𝑒 + 𝑀, 𝑒 + 𝑀) βˆ’ 𝜌(𝑒, 𝑒) βˆ’ 𝜌(𝑀, 𝑀).
Because the left side of the equation above is nonzero, the three terms on the right cannot all equal 0. Hence there exists 𝑣 ∈ 𝑉 such that 𝜌(𝑣, 𝑣) β‰  0.

Let π‘ˆ = {𝑒 ∈ 𝑉 : 𝜌(𝑒, 𝑣) = 0}. Thus π‘ˆ is the null space of the linear functional 𝑒 ↦ 𝜌(𝑒, 𝑣) on 𝑉. This linear functional is not the zero linear functional because 𝑣 βˆ‰ π‘ˆ. Thus dim π‘ˆ = 𝑛 βˆ’ 1. By our induction hypothesis, there is a basis 𝑒_1, …, 𝑒_{π‘›βˆ’1} of π‘ˆ such that the symmetric bilinear form 𝜌|_{π‘ˆΓ—π‘ˆ} has a diagonal matrix with respect to this basis.
Because 𝑣 βˆ‰ π‘ˆ , the list 𝑒 1 , … , 𝑒 π‘›βˆ’1 , 𝑣 is a basis of 𝑉 . Suppose π‘˜ ∈ {1 , … , π‘›βˆ’1} . Then 𝜌(𝑒 π‘˜ , 𝑣) = 0 by the construction of π‘ˆ . Because 𝜌 is symmetric, we also have 𝜌(𝑣 , 𝑒 π‘˜ ) = 0 . Thus the matrix of 𝜌 with respect to 𝑒 1 , … , 𝑒 π‘›βˆ’1 , 𝑣 is a diagonal matrix, completing the proof that (a) implies (d). Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 352 Spans: True Boxes: True Text: Section 9A Bilinear Forms and Quadratic Forms 339 The previous result states that every symmetric bilinear form has a diagonal matrix with respect to some basis. If our vector space happens to be a real inner product space, then the next result shows that every symmetric bilinear form has a diagonal matrix with respect to some orthonormal basis. Note that the inner product here is unrelated to the bilinear form. 9.13 diagonalization of a symmetric bilinear form by an orthonormal basis Suppose 𝑉 is a real inner product space and 𝜌 is a symmetric bilinear form on 𝑉 . Then 𝜌 has a diagonal matrix with respect to some orthonormal basis of 𝑉 . Proof Let 𝑓 1 , … , 𝑓 𝑛 be an orthonormal basis of 𝑉 . Let 𝐡 = β„³ (𝜌 , ( 𝑓 1 , … , 𝑓 𝑛 )) . Then 𝐡 is a symmetric matrix (by 9.12). Let 𝑇 ∈ β„’ (𝑉) be the operator such that β„³ (𝑇 , ( 𝑓 1 , … , 𝑓 𝑛 )) = 𝐡 . Thus 𝑇 is self-adjoint. The real spectral theorem (7.29) states that 𝑇 has a diagonal matrix with respect to some orthonormal basis 𝑒 1 , … , 𝑒 𝑛 of 𝑉 . Let 𝐢 = β„³ (𝐼 , (𝑒 1 , … , 𝑒 𝑛 ) , ( 𝑓 1 , … , 𝑓 𝑛 )) . Thus 𝐢 βˆ’1 𝐡𝐢 is the matrix of 𝑇 with respect to the basis 𝑒 1 , … , 𝑒 𝑛 (by 3.84). Hence 𝐢 βˆ’1 𝐡𝐢 is a diagonal matrix. Now 𝑀(𝜌 , (𝑒 1 , … , 𝑒 𝑛 )) = 𝐢 t 𝐡𝐢 = 𝐢 βˆ’1 𝐡𝐢 , where the first equality holds by 9.7 and the second equality holds because 𝐢 is a unitary matrix with real entries ( which implies that 𝐢 βˆ’1 = 𝐢 t ; see 7.57 ) . Now we turn our attention to alternating bilinear forms. Alternating multilinear forms will play a major role in our approach to determinants later in this chapter. 9.14 definition: alternating bilinear form, 𝑉 (2) alt A bilinear form 𝛼 ∈ 𝑉 (2) is called alternating if 𝛼(𝑣 , 𝑣) = 0 for all 𝑣 ∈ 𝑉 . The set of alternating bilinear forms on 𝑉 is denoted by 𝑉 (2) alt . 9.15 example: alternating bilinear forms β€’ Suppose 𝑛 β‰₯ 3 and 𝛼 ∢ 𝐅 𝑛 Γ— 𝐅 𝑛 β†’ 𝐅 is defined by 𝛼((π‘₯ 1 , … , π‘₯ 𝑛 ) , (𝑦 1 , … , 𝑦 𝑛 )) = π‘₯ 1 𝑦 2 βˆ’ π‘₯ 2 𝑦 1 + π‘₯ 1 𝑦 3 βˆ’ π‘₯ 3 𝑦 1 . Then 𝛼 is an alternating bilinear form on 𝐅 𝑛 . β€’ Suppose πœ‘ , 𝜏 ∈ 𝑉 β€² . Then the bilinear form 𝛼 on 𝑉 defined by 𝛼(𝑒 , 𝑀) = πœ‘(𝑒)𝜏(𝑀) βˆ’ πœ‘(𝑀)𝜏(𝑒) is alternating. Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 353 Spans: True Boxes: True Text: 340 Chapter 9 Multilinear Algebra and Determinants The next result shows that a bilinear form is alternating if and only if switching the order of the two inputs multiplies the output by βˆ’1 . 9.16 characterization of alternating bilinear forms A bilinear form 𝛼 on 𝑉 is alternating if and only if 𝛼(𝑒 , 𝑀) = βˆ’π›Ό(𝑀 , 𝑒) for all 𝑒 , 𝑀 ∈ 𝑉 . Proof First suppose that 𝛼 is alternating. If 𝑒 , 𝑀 ∈ 𝑉 , then 0 = 𝛼(𝑒 + 𝑀 , 𝑒 + 𝑀) = 𝛼(𝑒 , 𝑒) + 𝛼(𝑒 , 𝑀) + 𝛼(𝑀 , 𝑒) + 𝛼(𝑀 , 𝑀) = 𝛼(𝑒 , 𝑀) + 𝛼(𝑀 , 𝑒). Thus 𝛼(𝑒 , 𝑀) = βˆ’π›Ό(𝑀 , 𝑒) , as desired. To prove the implication in the other direction, suppose 𝛼(𝑒 , 𝑀) = βˆ’π›Ό(𝑀 , 𝑒) for all 𝑒 , 𝑀 ∈ 𝑉 . Then 𝛼(𝑣 , 𝑣) = βˆ’π›Ό(𝑣 , 𝑣) for all 𝑣 ∈ 𝑉 , which implies that 𝛼(𝑣 , 𝑣) = 0 for all 𝑣 ∈ 𝑉 . Thus 𝛼 is alternating. 
Now we show that the vector space of bilinear forms on 𝑉 is the direct sum of the symmetric bilinear forms on 𝑉 and the alternating bilinear forms on 𝑉.

9.17 𝑉^(2) = 𝑉^(2)_sym βŠ• 𝑉^(2)_alt

The sets 𝑉^(2)_sym and 𝑉^(2)_alt are subspaces of 𝑉^(2). Furthermore,
  𝑉^(2) = 𝑉^(2)_sym βŠ• 𝑉^(2)_alt.

Proof  The definition of symmetric bilinear form implies that the sum of any two symmetric bilinear forms on 𝑉 is a symmetric bilinear form on 𝑉, and any scalar multiple of a symmetric bilinear form on 𝑉 is a symmetric bilinear form on 𝑉. Thus 𝑉^(2)_sym is a subspace of 𝑉^(2). Similarly, the verification that 𝑉^(2)_alt is a subspace of 𝑉^(2) is straightforward.

Next, we want to show that 𝑉^(2) = 𝑉^(2)_sym + 𝑉^(2)_alt. To do this, suppose 𝛽 ∈ 𝑉^(2). Define 𝜌, 𝛼 ∈ 𝑉^(2) by

  𝜌(𝑒, 𝑀) = (𝛽(𝑒, 𝑀) + 𝛽(𝑀, 𝑒))/2  and  𝛼(𝑒, 𝑀) = (𝛽(𝑒, 𝑀) βˆ’ 𝛽(𝑀, 𝑒))/2.

Then 𝜌 ∈ 𝑉^(2)_sym and 𝛼 ∈ 𝑉^(2)_alt, and 𝛽 = 𝜌 + 𝛼. Thus 𝑉^(2) = 𝑉^(2)_sym + 𝑉^(2)_alt.

Finally, to show that the intersection of the two subspaces under consideration equals {0}, suppose 𝛽 ∈ 𝑉^(2)_sym ∩ 𝑉^(2)_alt. Then 9.16 implies that
  𝛽(𝑒, 𝑀) = βˆ’π›½(𝑀, 𝑒) = βˆ’π›½(𝑒, 𝑀)
for all 𝑒, 𝑀 ∈ 𝑉, which implies that 𝛽 = 0. Thus 𝑉^(2) = 𝑉^(2)_sym βŠ• 𝑉^(2)_alt, as implied by 1.46.

Quadratic Forms

9.18 definition: quadratic form associated with a bilinear form, π‘ž_𝛽

For 𝛽 a bilinear form on 𝑉, define a function π‘ž_𝛽: 𝑉 β†’ 𝐅 by π‘ž_𝛽(𝑣) = 𝛽(𝑣, 𝑣). A function π‘ž: 𝑉 β†’ 𝐅 is called a quadratic form on 𝑉 if there exists a bilinear form 𝛽 on 𝑉 such that π‘ž = π‘ž_𝛽.

Note that if 𝛽 is a bilinear form, then π‘ž_𝛽 = 0 if and only if 𝛽 is alternating.

9.19 example: quadratic form

Suppose 𝛽 is the bilinear form on 𝐑³ defined by
  𝛽((π‘₯_1, π‘₯_2, π‘₯_3), (𝑦_1, 𝑦_2, 𝑦_3)) = π‘₯_1 𝑦_1 βˆ’ 4π‘₯_1 𝑦_2 + 8π‘₯_1 𝑦_3 βˆ’ 3π‘₯_3 𝑦_3.
Then π‘ž_𝛽 is the quadratic form on 𝐑³ given by the formula
  π‘ž_𝛽(π‘₯_1, π‘₯_2, π‘₯_3) = π‘₯_1Β² βˆ’ 4π‘₯_1 π‘₯_2 + 8π‘₯_1 π‘₯_3 βˆ’ 3π‘₯_3Β².

The quadratic form in the example above is typical of quadratic forms on 𝐅^𝑛, as shown in the next result.

9.20 quadratic forms on 𝐅^𝑛

Suppose 𝑛 is a positive integer and π‘ž is a function from 𝐅^𝑛 to 𝐅. Then π‘ž is a quadratic form on 𝐅^𝑛 if and only if there exist numbers 𝐴_{𝑗,π‘˜} ∈ 𝐅 for 𝑗, π‘˜ ∈ {1, …, 𝑛} such that

  π‘ž(π‘₯_1, …, π‘₯_𝑛) = βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{𝑛} 𝐴_{𝑗,π‘˜} π‘₯_𝑗 π‘₯_π‘˜

for all (π‘₯_1, …, π‘₯_𝑛) ∈ 𝐅^𝑛.

Proof  First suppose π‘ž is a quadratic form on 𝐅^𝑛. Thus there exists a bilinear form 𝛽 on 𝐅^𝑛 such that π‘ž = π‘ž_𝛽. Let 𝐴 be the matrix of 𝛽 with respect to the standard basis of 𝐅^𝑛. Then for all (π‘₯_1, …, π‘₯_𝑛) ∈ 𝐅^𝑛, we have the desired equation

  π‘ž(π‘₯_1, …, π‘₯_𝑛) = 𝛽((π‘₯_1, …, π‘₯_𝑛), (π‘₯_1, …, π‘₯_𝑛)) = βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{𝑛} 𝐴_{𝑗,π‘˜} π‘₯_𝑗 π‘₯_π‘˜.

Conversely, suppose there exist numbers 𝐴_{𝑗,π‘˜} ∈ 𝐅 for 𝑗, π‘˜ ∈ {1, …, 𝑛} such that

  π‘ž(π‘₯_1, …, π‘₯_𝑛) = βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{𝑛} 𝐴_{𝑗,π‘˜} π‘₯_𝑗 π‘₯_π‘˜

for all (π‘₯_1, …, π‘₯_𝑛) ∈ 𝐅^𝑛. Define a bilinear form 𝛽 on 𝐅^𝑛 by

  𝛽((π‘₯_1, …, π‘₯_𝑛), (𝑦_1, …, 𝑦_𝑛)) = βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{𝑛} 𝐴_{𝑗,π‘˜} π‘₯_𝑗 𝑦_π‘˜.

Then π‘ž = π‘ž_𝛽, as desired.
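In matrix language, 9.20 says π‘ž(π‘₯) = π‘₯^t 𝐴π‘₯. The following Python sketch (our own illustration, using the matrix of 𝛽 from Example 9.19) also checks the decomposition 9.17 numerically: only the symmetric part (𝐴 + 𝐴^t)/2 contributes to π‘ž, and that symmetric part is exactly the matrix of the symmetric form 𝜌 that appears in Example 9.22 below.

    import numpy as np

    # Matrix of the bilinear form beta from Example 9.19 (not symmetric).
    A = np.array([[1.0, -4.0,  8.0],
                  [0.0,  0.0,  0.0],
                  [0.0,  0.0, -3.0]])

    def q(x):
        # Quadratic form from 9.20: q(x) = sum_{j,k} A_{j,k} x_j x_k = x^t A x.
        return x @ A @ x

    rho = (A + A.T) / 2   # symmetric part, as in the proof of 9.17
    alt = (A - A.T) / 2   # alternating part

    x = np.array([2.0, -1.0, 3.0])
    assert np.isclose(q(x), x @ rho @ x)  # q is unchanged if A is replaced by rho
    assert np.isclose(x @ alt @ x, 0.0)   # the alternating part vanishes on the diagonal
    print(q(x))                           # 33.0 = 2^2 - 4(2)(-1) + 8(2)(3) - 3(3)^2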
Although quadratic forms are defined in terms of an arbitrary bilinear form, the equivalence of (a) and (b) in the result below shows that a symmetric bilinear form can always be used.

9.21 characterizations of quadratic forms

Suppose π‘ž: 𝑉 β†’ 𝐅 is a function. The following are equivalent.
(a) π‘ž is a quadratic form.
(b) There exists a unique symmetric bilinear form 𝜌 on 𝑉 such that π‘ž = π‘ž_𝜌.
(c) π‘ž(πœ†π‘£) = πœ†Β²π‘ž(𝑣) for all πœ† ∈ 𝐅 and all 𝑣 ∈ 𝑉, and the function (𝑒, 𝑀) ↦ π‘ž(𝑒 + 𝑀) βˆ’ π‘ž(𝑒) βˆ’ π‘ž(𝑀) is a symmetric bilinear form on 𝑉.
(d) π‘ž(2𝑣) = 4π‘ž(𝑣) for all 𝑣 ∈ 𝑉, and the function (𝑒, 𝑀) ↦ π‘ž(𝑒 + 𝑀) βˆ’ π‘ž(𝑒) βˆ’ π‘ž(𝑀) is a symmetric bilinear form on 𝑉.

Proof  First suppose (a) holds, so π‘ž is a quadratic form. Hence there exists a bilinear form 𝛽 such that π‘ž = π‘ž_𝛽. By 9.17, there exist a symmetric bilinear form 𝜌 on 𝑉 and an alternating bilinear form 𝛼 on 𝑉 such that 𝛽 = 𝜌 + 𝛼. Now
  π‘ž = π‘ž_𝛽 = π‘ž_𝜌 + π‘ž_𝛼 = π‘ž_𝜌.
If πœŒβ€² ∈ 𝑉^(2)_sym also satisfies π‘ž_{πœŒβ€²} = π‘ž, then π‘ž_{πœŒβ€² βˆ’ 𝜌} = 0; thus πœŒβ€² βˆ’ 𝜌 ∈ 𝑉^(2)_sym ∩ 𝑉^(2)_alt, which implies that πœŒβ€² = 𝜌 (by 9.17). This completes the proof that (a) implies (b).

Now suppose (b) holds, so there exists a symmetric bilinear form 𝜌 on 𝑉 such that π‘ž = π‘ž_𝜌. If πœ† ∈ 𝐅 and 𝑣 ∈ 𝑉 then
  π‘ž(πœ†π‘£) = 𝜌(πœ†π‘£, πœ†π‘£) = πœ†πœŒ(𝑣, πœ†π‘£) = πœ†Β²πœŒ(𝑣, 𝑣) = πœ†Β²π‘ž(𝑣),
showing that the first part of (c) holds.

If 𝑒, 𝑀 ∈ 𝑉, then
  π‘ž(𝑒 + 𝑀) βˆ’ π‘ž(𝑒) βˆ’ π‘ž(𝑀) = 𝜌(𝑒 + 𝑀, 𝑒 + 𝑀) βˆ’ 𝜌(𝑒, 𝑒) βˆ’ 𝜌(𝑀, 𝑀) = 2𝜌(𝑒, 𝑀).
Thus the function (𝑒, 𝑀) ↦ π‘ž(𝑒 + 𝑀) βˆ’ π‘ž(𝑒) βˆ’ π‘ž(𝑀) equals 2𝜌, which is a symmetric bilinear form on 𝑉, completing the proof that (b) implies (c).

Clearly (c) implies (d).

Now suppose (d) holds. Let 𝜌 be the symmetric bilinear form on 𝑉 defined by

  𝜌(𝑒, 𝑀) = (π‘ž(𝑒 + 𝑀) βˆ’ π‘ž(𝑒) βˆ’ π‘ž(𝑀))/2.

If 𝑣 ∈ 𝑉, then

  𝜌(𝑣, 𝑣) = (π‘ž(2𝑣) βˆ’ π‘ž(𝑣) βˆ’ π‘ž(𝑣))/2 = (4π‘ž(𝑣) βˆ’ 2π‘ž(𝑣))/2 = π‘ž(𝑣).

Thus π‘ž = π‘ž_𝜌, completing the proof that (d) implies (a).

9.22 example: symmetric bilinear form associated with a quadratic form

Suppose π‘ž is the quadratic form on 𝐑³ given by the formula
  π‘ž(π‘₯_1, π‘₯_2, π‘₯_3) = π‘₯_1Β² βˆ’ 4π‘₯_1 π‘₯_2 + 8π‘₯_1 π‘₯_3 βˆ’ 3π‘₯_3Β².
A bilinear form 𝛽 on 𝐑³ such that π‘ž = π‘ž_𝛽 is given by Example 9.19, but this bilinear form is not symmetric, as promised by 9.21(b). However, the bilinear form 𝜌 on 𝐑³ defined by
  𝜌((π‘₯_1, π‘₯_2, π‘₯_3), (𝑦_1, 𝑦_2, 𝑦_3)) = π‘₯_1 𝑦_1 βˆ’ 2π‘₯_1 𝑦_2 βˆ’ 2π‘₯_2 𝑦_1 + 4π‘₯_1 𝑦_3 + 4π‘₯_3 𝑦_1 βˆ’ 3π‘₯_3 𝑦_3
is symmetric and satisfies π‘ž = π‘ž_𝜌.

The next result states that for each quadratic form we can choose a basis such that the quadratic form looks like a weighted sum of squares of the coordinates, meaning that there are no cross terms of the form π‘₯_𝑗 π‘₯_π‘˜ with 𝑗 β‰  π‘˜.

9.23 diagonalization of quadratic form

Suppose π‘ž is a quadratic form on 𝑉.
(a) There exist a basis 𝑒_1, …, 𝑒_𝑛 of 𝑉 and πœ†_1, …, πœ†_𝑛 ∈ 𝐅 such that
  π‘ž(π‘₯_1 𝑒_1 + β‹― + π‘₯_𝑛 𝑒_𝑛) = πœ†_1 π‘₯_1Β² + β‹― + πœ†_𝑛 π‘₯_𝑛²
for all π‘₯_1, …, π‘₯_𝑛 ∈ 𝐅.
(b) If 𝐅 = 𝐑 and 𝑉 is an inner product space, then the basis in (a) can be chosen to be an orthonormal basis of 𝑉.

Proof
(a) There exists a symmetric bilinear form 𝜌 on 𝑉 such that π‘ž = π‘ž_𝜌 (by 9.21). Now there exists a basis 𝑒_1, …, 𝑒_𝑛 of 𝑉 such that β„³(𝜌, (𝑒_1, …, 𝑒_𝑛)) is a diagonal matrix (by 9.12). Let πœ†_1, …, πœ†_𝑛 denote the entries on the diagonal of this matrix. Thus

  𝜌(𝑒_𝑗, 𝑒_π‘˜) = { πœ†_𝑗 if 𝑗 = π‘˜,  0 if 𝑗 β‰  π‘˜ }

for all 𝑗, π‘˜ ∈ {1, …, 𝑛}. If π‘₯_1, …, π‘₯_𝑛 ∈ 𝐅, then

  π‘ž(π‘₯_1 𝑒_1 + β‹― + π‘₯_𝑛 𝑒_𝑛) = 𝜌(π‘₯_1 𝑒_1 + β‹― + π‘₯_𝑛 𝑒_𝑛, π‘₯_1 𝑒_1 + β‹― + π‘₯_𝑛 𝑒_𝑛)
    = βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{𝑛} π‘₯_𝑗 π‘₯_π‘˜ 𝜌(𝑒_𝑗, 𝑒_π‘˜)
    = πœ†_1 π‘₯_1Β² + β‹― + πœ†_𝑛 π‘₯_𝑛²,

as desired.

(b) Suppose 𝐅 = 𝐑 and 𝑉 is an inner product space. Then 9.13 tells us that the basis in (a) can be chosen to be an orthonormal basis of 𝑉.

Exercises 9A

1 Prove that if 𝛽 is a bilinear form on 𝐅, then there exists 𝑐 ∈ 𝐅 such that 𝛽(π‘₯, 𝑦) = 𝑐π‘₯𝑦 for all π‘₯, 𝑦 ∈ 𝐅.

2 Let 𝑛 = dim 𝑉. Suppose 𝛽 is a bilinear form on 𝑉. Prove that there exist πœ‘_1, …, πœ‘_𝑛, 𝜏_1, …, 𝜏_𝑛 ∈ 𝑉′ such that
  𝛽(𝑒, 𝑣) = πœ‘_1(𝑒) β‹… 𝜏_1(𝑣) + β‹― + πœ‘_𝑛(𝑒) β‹… 𝜏_𝑛(𝑣)
for all 𝑒, 𝑣 ∈ 𝑉.
  This exercise shows that if 𝑛 = dim 𝑉, then every bilinear form on 𝑉 is of the form given by the last bullet point of Example 9.2.

3 Suppose 𝛽: 𝑉 Γ— 𝑉 β†’ 𝐅 is a bilinear form on 𝑉 and also is a linear functional on 𝑉 Γ— 𝑉. Prove that 𝛽 = 0.

4 Suppose 𝑉 is a real inner product space and 𝛽 is a bilinear form on 𝑉. Show that there exists a unique operator 𝑇 ∈ β„’(𝑉) such that
  𝛽(𝑒, 𝑣) = βŸ¨π‘’, π‘‡π‘£βŸ©
for all 𝑒, 𝑣 ∈ 𝑉.
  This exercise states that if 𝑉 is a real inner product space, then every bilinear form on 𝑉 is of the form given by the third bullet point in 9.2.

5 Suppose 𝛽 is a bilinear form on a real inner product space 𝑉 and 𝑇 is the unique operator on 𝑉 such that 𝛽(𝑒, 𝑣) = βŸ¨π‘’, π‘‡π‘£βŸ© for all 𝑒, 𝑣 ∈ 𝑉 (see Exercise 4). Show that 𝛽 is an inner product on 𝑉 if and only if 𝑇 is an invertible positive operator on 𝑉.

6 Prove or give a counterexample: If 𝜌 is a symmetric bilinear form on 𝑉, then {𝑣 ∈ 𝑉 : 𝜌(𝑣, 𝑣) = 0} is a subspace of 𝑉.

7 Explain why the proof of 9.13 (diagonalization of a symmetric bilinear form by an orthonormal basis on a real inner product space) fails if the hypothesis that 𝐅 = 𝐑 is dropped.

8 Find formulas for dim 𝑉^(2)_sym and dim 𝑉^(2)_alt in terms of dim 𝑉.

9 Suppose that 𝑛 is a positive integer and 𝑉 = {𝑝 ∈ 𝒫_𝑛(𝐑) : 𝑝(0) = 𝑝(1)}. Define 𝛼: 𝑉 Γ— 𝑉 β†’ 𝐑 by
  𝛼(𝑝, π‘ž) = βˆ«β‚€ΒΉ π‘π‘žβ€².
Show that 𝛼 is an alternating bilinear form on 𝑉.

10 Suppose that 𝑛 is a positive integer and
  𝑉 = {𝑝 ∈ 𝒫_𝑛(𝐑) : 𝑝(0) = 𝑝(1) and 𝑝′(0) = 𝑝′(1)}.
Define 𝜌: 𝑉 Γ— 𝑉 β†’ 𝐑 by
  𝜌(𝑝, π‘ž) = βˆ«β‚€ΒΉ π‘π‘žβ€³.
Show that 𝜌 is a symmetric bilinear form on 𝑉.

9B  Alternating Multilinear Forms

Multilinear Forms

9.24 definition: 𝑉^π‘š

For π‘š a positive integer, define 𝑉^π‘š by
  𝑉^π‘š = 𝑉 Γ— β‹― Γ— 𝑉  (π‘š times).
Now we can define π‘š -linear forms as a generalization of the bilinear forms that we discussed in the previous section. 9.25 definition: π‘š -linear form, 𝑉 (π‘š) , multilinear form β€’ For π‘š a positive integer, an π‘š - linear form on 𝑉 is a function 𝛽 ∢ 𝑉 π‘š β†’ 𝐅 that is linear in each slot when the other slots are held fixed. This means that for each π‘˜ ∈ {1 , … , π‘š} and all 𝑒 1 , … , 𝑒 π‘š ∈ 𝑉 , the function 𝑣 ↦ 𝛽(𝑒 1 , … , 𝑒 π‘˜βˆ’1 , 𝑣 , 𝑒 π‘˜ + 1 , … , 𝑒 π‘š ) is a linear map from 𝑉 to 𝐅 . β€’ The set of π‘š -linear forms on 𝑉 is denoted by 𝑉 (π‘š) . β€’ A function 𝛽 is called a multilinear form on 𝑉 if it is an π‘š -linear form on 𝑉 for some positive integer π‘š . In the definition above, the expression 𝛽(𝑒 1 , … , 𝑒 π‘˜βˆ’1 , 𝑣 , 𝑒 π‘˜ + 1 , … , 𝑒 π‘š ) means 𝛽(𝑣 , 𝑒 2 , … , 𝑒 π‘š ) if π‘˜ = 1 and means 𝛽(𝑒 1 , … , 𝑒 π‘šβˆ’1 , 𝑣) if π‘˜ = π‘š . A 1 -linear form on 𝑉 is a linear functional on 𝑉 . A 2 -linear form on 𝑉 is a bilinear form on 𝑉 . You can verify that with the usual addition and scalar multiplication of functions, 𝑉 (π‘š) is a vector space. 9.26 example: π‘š -linear forms β€’ Suppose 𝛼 , 𝜌 ∈ 𝑉 (2) . Define a function 𝛽 ∢ 𝑉 4 β†’ 𝐅 by 𝛽(𝑣 1 , 𝑣 2 , 𝑣 3 , 𝑣 4 ) = 𝛼(𝑣 1 , 𝑣 2 ) 𝜌(𝑣 3 , 𝑣 4 ). Then 𝛽 ∈ 𝑉 (4) . β€’ Define 𝛽 ∢ ( β„’ (𝑉)) π‘š β†’ 𝐅 by 𝛽(𝑇 1 , … , 𝑇 π‘š ) = tr (𝑇 1 ⋯𝑇 π‘š ). Then 𝛽 is an π‘š -linear form on β„’ (𝑉) . Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 360 Spans: True Boxes: True Text: Section 9B Alternating Multilinear Forms 347 Alternating multilinear forms, which we now define, play an important role as we head toward defining determinants. 9.27 definition: alternating forms, 𝑉 (π‘š) alt Suppose π‘š is a positive integer. β€’ An π‘š -linear form 𝛼 on 𝑉 is called alternating if 𝛼(𝑣 1 , … , 𝑣 π‘š ) = 0 whenever 𝑣 1 , … , 𝑣 π‘š is a list of vectors in 𝑉 with 𝑣 𝑗 = 𝑣 π‘˜ for some two distinct values of 𝑗 and π‘˜ in {1 , … , π‘š} . β€’ 𝑉 (π‘š) alt = {𝛼 ∈ 𝑉 (π‘š) ∢ 𝛼 is an alternating π‘š -linear form on 𝑉} . You should verify that 𝑉 (π‘š) alt is a subspace of 𝑉 (π‘š) . See Example 9.15 for examples of alternating 2 -linear forms. See Exercise 2 for an example of an alternating 3 -linear form. The next result tells us that if a linearly dependent list is input to an alternating multilinear form, then the output equals 0 . 9.28 alternating multilinear forms and linear dependence Suppose π‘š is a positive integer and 𝛼 is an alternating π‘š -linear form on 𝑉 . If 𝑣 1 , … , 𝑣 π‘š is a linearly dependent list in 𝑉 , then 𝛼(𝑣 1 , … , 𝑣 π‘š ) = 0. Proof Suppose 𝑣 1 , … , 𝑣 π‘š is a linearly dependent list in 𝑉 . By the linear depen- dence lemma (2.19), some 𝑣 π‘˜ is a linear combination of 𝑣 1 , … , 𝑣 π‘˜βˆ’1 . Thus there exist 𝑏 1 , … , 𝑏 π‘˜βˆ’1 such that 𝑣 π‘˜ = 𝑏 1 𝑣 1 + β‹― + 𝑏 π‘˜βˆ’1 𝑣 π‘˜βˆ’1 . Now 𝛼(𝑣 1 , … , 𝑣 π‘š ) = 𝛼(𝑣 1 , … , 𝑣 π‘˜βˆ’1 , π‘˜βˆ’1 βˆ‘ 𝑗=1 𝑏 𝑗 𝑣 𝑗 , 𝑣 π‘˜ + 1 , … , 𝑣 π‘š ) = π‘˜βˆ’1 βˆ‘ 𝑗=1 𝑏 𝑗 𝛼(𝑣 1 , … , 𝑣 π‘˜βˆ’1 , 𝑣 𝑗 , 𝑣 π‘˜ + 1 , … , 𝑣 π‘š ) = 0. The next result states that if π‘š > dim 𝑉 , then there are no alternating π‘š -linear forms on 𝑉 other than the function on 𝑉 π‘š that is identically 0 . 9.29 no nonzero alternating π‘š -linear forms for π‘š > dim 𝑉 Suppose π‘š > dim 𝑉 . Then 0 is the only alternating π‘š -linear form on 𝑉 . Proof Suppose that 𝛼 is an alternating π‘š -linear form on 𝑉 and 𝑣 1 , … , 𝑣 π‘š ∈ 𝑉 . Because π‘š > dim 𝑉 , this list is not linearly independent (by 2.22). 
Thus 9.28 implies that 𝛼(𝑣_1, …, 𝑣_π‘š) = 0. Hence 𝛼 is the zero function from 𝑉^π‘š to 𝐅.

Alternating Multilinear Forms and Permutations

9.30 swapping input vectors in an alternating multilinear form

Suppose π‘š is a positive integer, 𝛼 is an alternating π‘š-linear form on 𝑉, and 𝑣_1, …, 𝑣_π‘š is a list of vectors in 𝑉. Then swapping the vectors in any two slots of 𝛼(𝑣_1, …, 𝑣_π‘š) changes the value of 𝛼 by a factor of βˆ’1.

Proof  Put 𝑣_1 + 𝑣_2 in both the first two slots, getting
  0 = 𝛼(𝑣_1 + 𝑣_2, 𝑣_1 + 𝑣_2, 𝑣_3, …, 𝑣_π‘š).
Use the multilinear properties of 𝛼 to expand the right side of the equation above (as in the proof of 9.16) to get
  𝛼(𝑣_2, 𝑣_1, 𝑣_3, …, 𝑣_π‘š) = βˆ’π›Ό(𝑣_1, 𝑣_2, 𝑣_3, …, 𝑣_π‘š).
Similarly, swapping the vectors in any two slots of 𝛼(𝑣_1, …, 𝑣_π‘š) changes the value of 𝛼 by a factor of βˆ’1.

To see what can happen with multiple swaps, suppose 𝛼 is an alternating 3-linear form on 𝑉 and 𝑣_1, 𝑣_2, 𝑣_3 ∈ 𝑉. To evaluate 𝛼(𝑣_3, 𝑣_1, 𝑣_2) in terms of 𝛼(𝑣_1, 𝑣_2, 𝑣_3), start with 𝛼(𝑣_3, 𝑣_1, 𝑣_2) and swap the entries in the first and third slots, getting 𝛼(𝑣_3, 𝑣_1, 𝑣_2) = βˆ’π›Ό(𝑣_2, 𝑣_1, 𝑣_3). Now in the last expression, swap the entries in the first and second slots, getting
  𝛼(𝑣_3, 𝑣_1, 𝑣_2) = βˆ’π›Ό(𝑣_2, 𝑣_1, 𝑣_3) = 𝛼(𝑣_1, 𝑣_2, 𝑣_3).
More generally, we see that if we do an odd number of swaps, then the value of 𝛼 changes by a factor of βˆ’1, and if we do an even number of swaps, then the value of 𝛼 does not change.

To deal with arbitrary multiple swaps, we need a bit of information about permutations.

9.31 definition: permutation, perm π‘š

Suppose π‘š is a positive integer.
β€’ A permutation of (1, …, π‘š) is a list (𝑗_1, …, 𝑗_π‘š) that contains each of the numbers 1, …, π‘š exactly once.
β€’ The set of all permutations of (1, …, π‘š) is denoted by perm π‘š.

For example, (2, 3, 4, 5, 1) ∈ perm 5. You should think of an element of perm π‘š as a rearrangement of the first π‘š positive integers.

The number of swaps used to change a permutation (𝑗_1, …, 𝑗_π‘š) to the standard order (1, …, π‘š) can depend on the specific swaps selected. The following definition has the advantage of assigning a well-defined sign to every permutation.

9.32 definition: sign of a permutation

The sign of a permutation (𝑗_1, …, 𝑗_π‘š) is defined by
  sign(𝑗_1, …, 𝑗_π‘š) = (βˆ’1)^𝑁,
where 𝑁 is the number of pairs of integers (π‘˜, β„“) with 1 ≀ π‘˜ < β„“ ≀ π‘š such that π‘˜ appears after β„“ in the list (𝑗_1, …, 𝑗_π‘š).

Hence the sign of a permutation equals 1 if the natural order has been changed an even number of times and equals βˆ’1 if the natural order has been changed an odd number of times.

9.33 example: signs

β€’ The permutation (1, …, π‘š) [no changes in the natural order] has sign 1.
β€’ The only pair of integers (π‘˜, β„“) with π‘˜ < β„“ such that π‘˜ appears after β„“ in the list (2, 1, 3, 4) is (1, 2). Thus the permutation (2, 1, 3, 4) has sign βˆ’1.
β€’ In the permutation (2, 3, …, π‘š, 1), the only pairs (π‘˜, β„“) with π‘˜ < β„“ that appear with changed order are (1, 2), (1, 3), …, (1, π‘š).
Because we have π‘š βˆ’ 1 such pairs, the sign of this permutation equals (βˆ’1) π‘šβˆ’1 . 9.34 swapping two entries in a permutation Swapping two entries in a permutation multiplies the sign of the permutation by βˆ’1 . Proof Suppose we have two permutations, where the second permutation is obtained from the first by swapping two entries. The two swapped entries were in their natural order in the first permutation if and only if they are not in their natural order in the second permutation. Thus we have a net change (so far) of 1 or βˆ’1 (both odd numbers) in the number of pairs not in their natural order. Consider each entry between the two swapped entries. If an intermediate entry was originally in the natural order with respect to both swapped entries, then it is now in the natural order with respect to neither swapped entry. Similarly, if an intermediate entry was originally in the natural order with respect to neither of the swapped entries, then it is now in the natural order with respect to both swapped entries. If an intermediate entry was originally in the natural order with respect to exactly one of the swapped entries, then that is still true. Thus the net change (for each pair containing an entry between the two swapped entries) in the number of pairs not in their natural order is 2 , βˆ’2 , or 0 (all even numbers). For all other pairs of entries, there is no change in whether or not they are in their natural order. Thus the total net change in the number of pairs not in their natural order is an odd number. Hence the sign of the second permutation equals βˆ’1 times the sign of the first permutation. Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 363 Spans: True Boxes: True Text: 350 Chapter 9 Multilinear Algebra and Determinants 9.35 permutations and alternating multilinear forms Suppose π‘š is a positive integer and 𝛼 ∈ 𝑉 (π‘š) alt . Then 𝛼(𝑣 𝑗 1 , … , 𝑣 𝑗 π‘š ) = ( sign (𝑗 1 , … , 𝑗 π‘š ))𝛼(𝑣 1 , … , 𝑣 π‘š ) for every list 𝑣 1 , … , 𝑣 π‘š of vectors in 𝑉 and all (𝑗 1 , … , 𝑗 π‘š ) ∈ perm π‘š . Proof Suppose 𝑣 1 , … , 𝑣 π‘š ∈ 𝑉 and (𝑗 1 , … , 𝑗 π‘š ) ∈ perm π‘š . We can get from (𝑗 1 , … , 𝑗 π‘š ) to (1 , … , π‘š) by a series of swaps of entries in different slots. Each such swap changes the value of 𝛼 by a factor of βˆ’1 (by 9.30) and also changes the sign of the remaining permutation by a factor of βˆ’1 (by 9.34). After an appropriate number of swaps, we reach the permutation 1 , … , π‘š , which has sign 1 . Thus the value of 𝛼 changed signs an even number of times if sign (𝑗 1 , … , 𝑗 π‘š ) = 1 and an odd number of times if sign (𝑗 1 , … , 𝑗 π‘š ) = βˆ’1 , which gives the desired result. Our use of permutations now leads in a natural way to the following beautiful formula for alternating 𝑛 -linear forms on an 𝑛 -dimensional vector space. 9.36 formula for ( dim 𝑉) -linear alternating forms on 𝑉 Let 𝑛 = dim 𝑉 . Suppose 𝑒 1 , … , 𝑒 𝑛 is a basis of 𝑉 and 𝑣 1 , … , 𝑣 𝑛 ∈ 𝑉 . For each π‘˜ ∈ {1 , … , 𝑛} , let 𝑏 1 , π‘˜ , … , 𝑏 𝑛 , π‘˜ ∈ 𝐅 be such that 𝑣 π‘˜ = 𝑛 βˆ‘ 𝑗=1 𝑏 𝑗 , π‘˜ 𝑒 𝑗 . Then 𝛼(𝑣 1 , … , 𝑣 𝑛 ) = 𝛼(𝑒 1 , … , 𝑒 𝑛 ) βˆ‘ (𝑗 1 , … , 𝑗 𝑛 )∈ perm 𝑛 ( sign (𝑗 1 , … , 𝑗 𝑛 ))𝑏 𝑗 1 , 1 ⋯𝑏 𝑗 𝑛 , 𝑛 for every alternating 𝑛 -linear form 𝛼 on 𝑉 . Proof Suppose 𝛼 is an alternating 𝑛 -linear form 𝛼 on 𝑉 . 
Then

  𝛼(𝑣_1, …, 𝑣_𝑛) = 𝛼(βˆ‘_{𝑗_1=1}^{𝑛} 𝑏_{𝑗_1,1} 𝑒_{𝑗_1}, …, βˆ‘_{𝑗_𝑛=1}^{𝑛} 𝑏_{𝑗_𝑛,𝑛} 𝑒_{𝑗_𝑛})
    = βˆ‘_{𝑗_1=1}^{𝑛} β‹― βˆ‘_{𝑗_𝑛=1}^{𝑛} 𝑏_{𝑗_1,1} β‹― 𝑏_{𝑗_𝑛,𝑛} 𝛼(𝑒_{𝑗_1}, …, 𝑒_{𝑗_𝑛})
    = βˆ‘_{(𝑗_1, …, 𝑗_𝑛) ∈ perm 𝑛} 𝑏_{𝑗_1,1} β‹― 𝑏_{𝑗_𝑛,𝑛} 𝛼(𝑒_{𝑗_1}, …, 𝑒_{𝑗_𝑛})
    = 𝛼(𝑒_1, …, 𝑒_𝑛) βˆ‘_{(𝑗_1, …, 𝑗_𝑛) ∈ perm 𝑛} (sign(𝑗_1, …, 𝑗_𝑛)) 𝑏_{𝑗_1,1} β‹― 𝑏_{𝑗_𝑛,𝑛},

where the third line holds because 𝛼(𝑒_{𝑗_1}, …, 𝑒_{𝑗_𝑛}) = 0 if 𝑗_1, …, 𝑗_𝑛 are not distinct integers, and the last line holds by 9.35.

The following result will be the key to our definition of the determinant in the next section.

9.37 dim 𝑉^(dim 𝑉)_alt = 1

The vector space 𝑉^(dim 𝑉)_alt has dimension one.

Proof  Let 𝑛 = dim 𝑉. Suppose 𝛼 and 𝛼′ are alternating 𝑛-linear forms on 𝑉 with 𝛼 β‰  0. Let 𝑒_1, …, 𝑒_𝑛 ∈ 𝑉 be such that 𝛼(𝑒_1, …, 𝑒_𝑛) β‰  0. There exists 𝑐 ∈ 𝐅 such that
  𝛼′(𝑒_1, …, 𝑒_𝑛) = 𝑐𝛼(𝑒_1, …, 𝑒_𝑛).
Furthermore, 9.28 implies that 𝑒_1, …, 𝑒_𝑛 is linearly independent and thus is a basis of 𝑉.

Suppose 𝑣_1, …, 𝑣_𝑛 ∈ 𝑉. Let 𝑏_{𝑗,π‘˜} be as in 9.36 for 𝑗, π‘˜ = 1, …, 𝑛. Then

  𝛼′(𝑣_1, …, 𝑣_𝑛) = 𝛼′(𝑒_1, …, 𝑒_𝑛) βˆ‘_{(𝑗_1, …, 𝑗_𝑛) ∈ perm 𝑛} (sign(𝑗_1, …, 𝑗_𝑛)) 𝑏_{𝑗_1,1} β‹― 𝑏_{𝑗_𝑛,𝑛}
    = 𝑐𝛼(𝑒_1, …, 𝑒_𝑛) βˆ‘_{(𝑗_1, …, 𝑗_𝑛) ∈ perm 𝑛} (sign(𝑗_1, …, 𝑗_𝑛)) 𝑏_{𝑗_1,1} β‹― 𝑏_{𝑗_𝑛,𝑛}
    = 𝑐𝛼(𝑣_1, …, 𝑣_𝑛),

where the first and last lines above come from 9.36. The equation above implies that 𝛼′ = 𝑐𝛼. Thus 𝛼′, 𝛼 is not a linearly independent list, which implies that dim 𝑉^(𝑛)_alt ≀ 1.

To complete the proof, we only need to show that there exists a nonzero alternating 𝑛-linear form 𝛼 on 𝑉 (thus eliminating the possibility that dim 𝑉^(𝑛)_alt equals 0). To do this, let 𝑒_1, …, 𝑒_𝑛 be any basis of 𝑉, and let πœ‘_1, …, πœ‘_𝑛 ∈ 𝑉′ be the linear functionals on 𝑉 that allow us to express each element of 𝑉 as a linear combination of 𝑒_1, …, 𝑒_𝑛. In other words,
  𝑣 = βˆ‘_{𝑗=1}^{𝑛} πœ‘_𝑗(𝑣) 𝑒_𝑗
for every 𝑣 ∈ 𝑉 (see 3.114). Now for 𝑣_1, …, 𝑣_𝑛 ∈ 𝑉, define

9.38  𝛼(𝑣_1, …, 𝑣_𝑛) = βˆ‘_{(𝑗_1, …, 𝑗_𝑛) ∈ perm 𝑛} (sign(𝑗_1, …, 𝑗_𝑛)) πœ‘_{𝑗_1}(𝑣_1) β‹― πœ‘_{𝑗_𝑛}(𝑣_𝑛).

The verification that 𝛼 is an 𝑛-linear form on 𝑉 is straightforward. To see that 𝛼 is alternating, suppose 𝑣_1, …, 𝑣_𝑛 ∈ 𝑉 with 𝑣_1 = 𝑣_2. For each (𝑗_1, …, 𝑗_𝑛) ∈ perm 𝑛, the permutation (𝑗_2, 𝑗_1, 𝑗_3, …, 𝑗_𝑛) has the opposite sign. Because 𝑣_1 = 𝑣_2, the contributions from these two permutations to the sum in 9.38 cancel each other. Hence 𝛼(𝑣_1, 𝑣_1, 𝑣_3, …, 𝑣_𝑛) = 0. Similarly, 𝛼(𝑣_1, …, 𝑣_𝑛) = 0 if any two vectors in the list 𝑣_1, …, 𝑣_𝑛 are equal. Thus 𝛼 is alternating.

Finally, consider 9.38 with each 𝑣_π‘˜ = 𝑒_π‘˜. Because πœ‘_𝑗(𝑒_π‘˜) equals 0 if 𝑗 β‰  π‘˜ and equals 1 if 𝑗 = π‘˜, only the permutation (1, …, 𝑛) makes a nonzero contribution to the right side of 9.38 in this case, giving the equation 𝛼(𝑒_1, …, 𝑒_𝑛) = 1. Thus we have produced a nonzero alternating 𝑛-linear form 𝛼 on 𝑉, as desired.

The formula 9.38 used in the last proof to construct a nonzero alternating 𝑛-linear form came from the formula in 9.36, and that formula arose naturally from the properties of an alternating multilinear form.
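Formula 9.38 translates directly into code. The sketch below (a naive 𝑂(𝑛! Β· 𝑛) implementation written purely for illustration; the sign is computed by counting pairs out of natural order, as in 9.32) builds the alternating form on 𝐅^𝑛 whose πœ‘_𝑗 are the coordinate functionals of the standard basis, and confirms that 𝛼(𝑒_1, 𝑒_2, 𝑒_3) = 1 and that 𝛼 behaves as 9.27 and 9.30 require.

    from itertools import permutations
    import numpy as np

    def sign(perm):
        # Sign as in 9.32: (-1)^N, where N counts pairs out of natural order.
        n = len(perm)
        inversions = sum(1 for k in range(n)
                           for l in range(k + 1, n) if perm[k] > perm[l])
        return -1 if inversions % 2 else 1

    def alpha(*vs):
        # Formula 9.38, with phi_j the coordinate functionals of the standard basis.
        n = len(vs)
        return sum(sign(p) * np.prod([vs[slot][p[slot]] for slot in range(n)])
                   for p in permutations(range(n)))

    e = np.eye(3)
    v = np.array([1.0, 2.0, 3.0])
    w = np.array([4.0, 5.0, 6.0])
    assert alpha(e[0], e[1], e[2]) == 1                       # alpha(e_1, ..., e_n) = 1
    assert np.isclose(alpha(v, v, w), 0)                      # alternating (9.27)
    assert np.isclose(alpha(v, w, e[2]), -alpha(w, v, e[2]))  # a swap flips the sign (9.30)
    print("9.38 produces a nonzero alternating 3-linear form")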
Earlier we showed that the value of an alternating multilinear form applied to a linearly dependent list is 0; see 9.28. The next result provides a converse of 9.28 for alternating 𝑛-linear forms when 𝑛 = dim 𝑉. In the following result, the statement that 𝛼 is nonzero means (as usual for a function) that 𝛼 is not the function on 𝑉^𝑛 that is identically 0.

9.39 alternating (dim 𝑉)-linear forms and linear independence

Let 𝑛 = dim 𝑉. Suppose 𝛼 is a nonzero alternating 𝑛-linear form on 𝑉 and 𝑒_1, …, 𝑒_𝑛 is a list of vectors in 𝑉. Then
  𝛼(𝑒_1, …, 𝑒_𝑛) β‰  0
if and only if 𝑒_1, …, 𝑒_𝑛 is linearly independent.

Proof  First suppose 𝛼(𝑒_1, …, 𝑒_𝑛) β‰  0. Then 9.28 implies that 𝑒_1, …, 𝑒_𝑛 is linearly independent.

To prove the implication in the other direction, now suppose 𝑒_1, …, 𝑒_𝑛 is linearly independent. Because 𝑛 = dim 𝑉, this implies that 𝑒_1, …, 𝑒_𝑛 is a basis of 𝑉 (see 2.38). Because 𝛼 is not the zero 𝑛-linear form, there exist 𝑣_1, …, 𝑣_𝑛 ∈ 𝑉 such that 𝛼(𝑣_1, …, 𝑣_𝑛) β‰  0. Now 9.36 implies that 𝛼(𝑒_1, …, 𝑒_𝑛) β‰  0.

Exercises 9B

1 Suppose π‘š is a positive integer. Show that dim 𝑉^(π‘š) = (dim 𝑉)^π‘š.

2 Suppose 𝑛 β‰₯ 3 and 𝛼: 𝐅^𝑛 Γ— 𝐅^𝑛 Γ— 𝐅^𝑛 β†’ 𝐅 is defined by
  𝛼((π‘₯_1, …, π‘₯_𝑛), (𝑦_1, …, 𝑦_𝑛), (𝑧_1, …, 𝑧_𝑛)) = π‘₯_1 𝑦_2 𝑧_3 βˆ’ π‘₯_2 𝑦_1 𝑧_3 βˆ’ π‘₯_3 𝑦_2 𝑧_1 βˆ’ π‘₯_1 𝑦_3 𝑧_2 + π‘₯_3 𝑦_1 𝑧_2 + π‘₯_2 𝑦_3 𝑧_1.
Show that 𝛼 is an alternating 3-linear form on 𝐅^𝑛.

3 Suppose π‘š is a positive integer and 𝛼 is an π‘š-linear form on 𝑉 such that 𝛼(𝑣_1, …, 𝑣_π‘š) = 0 whenever 𝑣_1, …, 𝑣_π‘š is a list of vectors in 𝑉 with 𝑣_𝑗 = 𝑣_{𝑗+1} for some 𝑗 ∈ {1, …, π‘š βˆ’ 1}. Prove that 𝛼 is an alternating π‘š-linear form on 𝑉.

4 Prove or give a counterexample: If 𝛼 ∈ 𝑉^(4)_alt, then
  {(𝑣_1, 𝑣_2, 𝑣_3, 𝑣_4) ∈ 𝑉⁴ : 𝛼(𝑣_1, 𝑣_2, 𝑣_3, 𝑣_4) = 0}
is a subspace of 𝑉⁴.

5 Suppose π‘š is a positive integer and 𝛽 is an π‘š-linear form on 𝑉. Define an π‘š-linear form 𝛼 on 𝑉 by
  𝛼(𝑣_1, …, 𝑣_π‘š) = βˆ‘_{(𝑗_1, …, 𝑗_π‘š) ∈ perm π‘š} (sign(𝑗_1, …, 𝑗_π‘š)) 𝛽(𝑣_{𝑗_1}, …, 𝑣_{𝑗_π‘š})
for 𝑣_1, …, 𝑣_π‘š ∈ 𝑉. Explain why 𝛼 ∈ 𝑉^(π‘š)_alt.

6 Suppose π‘š is a positive integer and 𝛽 is an π‘š-linear form on 𝑉. Define an π‘š-linear form 𝛼 on 𝑉 by
  𝛼(𝑣_1, …, 𝑣_π‘š) = βˆ‘_{(𝑗_1, …, 𝑗_π‘š) ∈ perm π‘š} 𝛽(𝑣_{𝑗_1}, …, 𝑣_{𝑗_π‘š})
for 𝑣_1, …, 𝑣_π‘š ∈ 𝑉. Explain why
  𝛼(𝑣_{π‘˜_1}, …, 𝑣_{π‘˜_π‘š}) = 𝛼(𝑣_1, …, 𝑣_π‘š)
for all 𝑣_1, …, 𝑣_π‘š ∈ 𝑉 and all (π‘˜_1, …, π‘˜_π‘š) ∈ perm π‘š.

7 Give an example of a nonzero alternating 2-linear form 𝛼 on 𝐑³ and a linearly independent list 𝑣_1, 𝑣_2 in 𝐑³ such that 𝛼(𝑣_1, 𝑣_2) = 0.
  This exercise shows that 9.39 can fail if the hypothesis that 𝑛 = dim 𝑉 is deleted.

9C  Determinants

Defining the Determinant

The next definition will lead us to a clean, beautiful, basis-free definition of the determinant of an operator.

9.40 definition: 𝛼_𝑇

Suppose that π‘š is a positive integer and 𝑇 ∈ β„’(𝑉). For 𝛼 ∈ 𝑉^(π‘š)_alt, define 𝛼_𝑇 ∈ 𝑉^(π‘š)_alt by
  𝛼_𝑇(𝑣_1, …, 𝑣_π‘š) = 𝛼(𝑇𝑣_1, …, 𝑇𝑣_π‘š)
for each list 𝑣_1, …, 𝑣_π‘š of vectors in 𝑉.

Suppose 𝑇 ∈ β„’(𝑉).
If 𝛼 ∈ 𝑉 (π‘š) alt and 𝑣 1 , … , 𝑣 π‘š is a list of vectors in 𝑉 with 𝑣 𝑗 = 𝑣 π‘˜ for some 𝑗 β‰  π‘˜ , then 𝑇𝑣 𝑗 = 𝑇𝑣 π‘˜ , which implies that 𝛼 𝑇 (𝑣 1 , … , 𝑣 π‘š ) = 𝛼(𝑇𝑣 1 , … , 𝑇𝑣 π‘š ) = 0 . Thus the function 𝛼 ↦ 𝛼 𝑇 is a linear map of 𝑉 (π‘š) alt to itself. We know that dim 𝑉 ( dim 𝑉) alt = 1 (see 9.37). Every linear map from a one- dimensional vector space to itself is multiplication by some unique scalar. For the linear map 𝛼 ↦ 𝛼 𝑇 , we now define det 𝑇 to be that scalar. 9.41 definition: determinant of an operator, det 𝑇 Suppose 𝑇 ∈ β„’ (𝑉) . The determinant of 𝑇 , denoted by det 𝑇 , is defined to be the unique number in 𝐅 such that 𝛼 𝑇 = ( det 𝑇) 𝛼 for all 𝛼 ∈ 𝑉 ( dim 𝑉) alt . 9.42 example: determinants of operators Let 𝑛 = dim 𝑉 . β€’ If 𝐼 is the identity operator on 𝑉 , then 𝛼 𝐼 = 𝛼 for all 𝛼 ∈ 𝑉 (𝑛) alt . Thus det 𝐼 = 1 . β€’ More generally, if πœ† ∈ 𝐅 , then 𝛼 πœ†πΌ = πœ† 𝑛 𝛼 for all 𝛼 ∈ 𝑉 (𝑛) alt . Thus det (πœ†πΌ) = πœ† 𝑛 . β€’ Still more generally, if 𝑇 ∈ β„’ (𝑉) and πœ† ∈ 𝐅 , then 𝛼 πœ†π‘‡ = πœ† 𝑛 𝛼 𝑇 = πœ† 𝑛 ( det 𝑇)𝛼 for all 𝛼 ∈ 𝑉 (𝑛) alt . Thus det (πœ†π‘‡) = πœ† 𝑛 det 𝑇 . β€’ Suppose 𝑇 ∈ β„’ (𝑉) and there is a basis 𝑒 1 , … , 𝑒 𝑛 of 𝑉 consisting of eigenvectors of 𝑇 , with corresponding eigenvalues πœ† 1 , … , πœ† 𝑛 . If 𝛼 ∈ 𝑉 (𝑛) alt , then 𝛼 𝑇 (𝑒 1 , … , 𝑒 𝑛 ) = 𝛼(πœ† 1 𝑒 1 , … , πœ† 𝑛 𝑒 𝑛 ) = (πœ† 1 β‹―πœ† 𝑛 )𝛼(𝑒 1 , … , 𝑒 𝑛 ). If 𝛼 β‰  0 , then 9.39 implies 𝛼(𝑒 1 , … , 𝑒 𝑛 ) β‰  0 . Thus the equation above implies det 𝑇 = πœ† 1 β‹―πœ† 𝑛 . Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 368 Spans: True Boxes: True Text: Section 9C Determinants 355 Our next task is to define and give a formula for the determinant of a square matrix. To do this, we associate with each square matrix an operator and then define the determinant of the matrix to be the determinant of the associated operator. 9.43 definition: determinant of a matrix, det 𝐴 Suppose that 𝑛 is a positive integer and 𝐴 is an 𝑛 -by- 𝑛 square matrix with entries in 𝐅 . Let 𝑇 ∈ β„’ (𝐅 𝑛 ) be the operator whose matrix with respect to the standard basis of 𝐅 𝑛 equals 𝐴 . The determinant of 𝐴 , denoted by det 𝐴 , is defined by det 𝐴 = det 𝑇 . 9.44 example: determinants of matrices β€’ If 𝐼 is the 𝑛 -by- 𝑛 identity matrix, then the corresponding operator on 𝐅 𝑛 is the identity operator 𝐼 on 𝐅 𝑛 . Thus the first bullet point of 9.42 implies that the determinant of the identity matrix is 1 . β€’ Suppose 𝐴 is a diagonal matrix with πœ† 1 , … , πœ† 𝑛 on the diagonal. Then the corresponding operator on 𝐅 𝑛 has the standard basis of 𝐅 𝑛 as eigenvectors, with eigenvalues πœ† 1 , … , πœ† 𝑛 . Thus the last bullet point of 9.42 implies that det 𝐴 = πœ† 1 β‹―πœ† 𝑛 . For the next result, think of each list 𝑣 1 , … , 𝑣 𝑛 of 𝑛 vectors in 𝐅 𝑛 as a list of 𝑛 -by- 1 column vectors. The notation ( 𝑣 1 β‹― 𝑣 𝑛 ) then denotes the 𝑛 -by- 𝑛 square matrix whose π‘˜ th column is 𝑣 π‘˜ for each π‘˜ = 1 , … , 𝑛 . 9.45 determinant is an alternating multilinear form Suppose that 𝑛 is a positive integer. The map that takes a list 𝑣 1 , … , 𝑣 𝑛 of vectors in 𝐅 𝑛 to det ( 𝑣 1 β‹― 𝑣 𝑛 ) is an alternating 𝑛 -linear form on 𝐅 𝑛 . Proof Let 𝑒 1 , … , 𝑒 𝑛 be the standard basis of 𝐅 𝑛 and suppose 𝑣 1 , … , 𝑣 𝑛 is a list of vectors in 𝐅 𝑛 . Let 𝑇 ∈ β„’ (𝐅 𝑛 ) be the operator such that 𝑇𝑒 π‘˜ = 𝑣 π‘˜ for π‘˜ = 1 , … , 𝑛 . Thus 𝑇 is the operator whose matrix with respect to 𝑒 1 , … , 𝑒 𝑛 is ( 𝑣 1 β‹― 𝑣 𝑛 ) . 
Hence det(𝑣_1 β‹― 𝑣_𝑛) = det 𝑇, by definition of the determinant of a matrix. Let 𝛼 be an alternating 𝑛-linear form on 𝐅^𝑛 such that 𝛼(𝑒_1, …, 𝑒_𝑛) = 1. Then

  det(𝑣_1 β‹― 𝑣_𝑛) = det 𝑇
    = (det 𝑇)𝛼(𝑒_1, …, 𝑒_𝑛)
    = 𝛼(𝑇𝑒_1, …, 𝑇𝑒_𝑛)
    = 𝛼(𝑣_1, …, 𝑣_𝑛),

where the third line follows from the definition of the determinant of an operator. The equation above shows that the map that takes a list of vectors 𝑣_1, …, 𝑣_𝑛 in 𝐅^𝑛 to det(𝑣_1 β‹― 𝑣_𝑛) is the alternating 𝑛-linear form 𝛼 on 𝐅^𝑛.

The previous result has several important consequences. For example, it immediately implies that a matrix with two identical columns has determinant 0. We will come back to other consequences later, but for now we want to give a formula for the determinant of a square matrix. Recall that if 𝐴 is a matrix, then 𝐴_{𝑗,π‘˜} denotes the entry in row 𝑗, column π‘˜ of 𝐴.

9.46 formula for determinant of a matrix

Suppose that 𝑛 is a positive integer and 𝐴 is an 𝑛-by-𝑛 square matrix. Then

  det 𝐴 = βˆ‘_{(𝑗_1, …, 𝑗_𝑛) ∈ perm 𝑛} (sign(𝑗_1, …, 𝑗_𝑛)) 𝐴_{𝑗_1,1} β‹― 𝐴_{𝑗_𝑛,𝑛}.

Proof  Apply 9.36 with 𝑉 = 𝐅^𝑛 and 𝑒_1, …, 𝑒_𝑛 the standard basis of 𝐅^𝑛 and 𝛼 the alternating 𝑛-linear form on 𝐅^𝑛 that takes 𝑣_1, …, 𝑣_𝑛 to det(𝑣_1 β‹― 𝑣_𝑛) [see 9.45]. If each 𝑣_π‘˜ is the π‘˜th column of 𝐴, then each 𝑏_{𝑗,π‘˜} in 9.36 equals 𝐴_{𝑗,π‘˜}. Finally,
  𝛼(𝑒_1, …, 𝑒_𝑛) = det(𝑒_1 β‹― 𝑒_𝑛) = det 𝐼 = 1.
Thus the formula in 9.36 becomes the formula stated in this result.

9.47 example: explicit formula for determinant

β€’ If 𝐴 is a 2-by-2 matrix, then the formula in 9.46 becomes
  det 𝐴 = 𝐴_{1,1} 𝐴_{2,2} βˆ’ 𝐴_{2,1} 𝐴_{1,2}.
β€’ If 𝐴 is a 3-by-3 matrix, then the formula in 9.46 becomes
  det 𝐴 = 𝐴_{1,1} 𝐴_{2,2} 𝐴_{3,3} βˆ’ 𝐴_{2,1} 𝐴_{1,2} 𝐴_{3,3} βˆ’ 𝐴_{3,1} 𝐴_{2,2} 𝐴_{1,3} βˆ’ 𝐴_{1,1} 𝐴_{3,2} 𝐴_{2,3} + 𝐴_{3,1} 𝐴_{1,2} 𝐴_{2,3} + 𝐴_{2,1} 𝐴_{3,2} 𝐴_{1,3}.

The sum in the formula in 9.46 contains 𝑛! terms. Because 𝑛! grows rapidly as 𝑛 increases, the formula in 9.46 is not a viable method to evaluate determinants even for moderately sized 𝑛. For example, 10! is over three million, and 100! is approximately 10^158, leading to a sum that the fastest computer cannot evaluate. We will soon see some results that lead to faster evaluations of determinants than direct use of the sum in 9.46.

9.48 determinant of upper-triangular matrix

Suppose that 𝐴 is an upper-triangular matrix with πœ†_1, …, πœ†_𝑛 on the diagonal. Then
  det 𝐴 = πœ†_1 β‹― πœ†_𝑛.

Proof  If (𝑗_1, …, 𝑗_𝑛) ∈ perm 𝑛 with (𝑗_1, …, 𝑗_𝑛) β‰  (1, …, 𝑛), then 𝑗_π‘˜ > π‘˜ for some π‘˜ ∈ {1, …, 𝑛}, which implies that 𝐴_{𝑗_π‘˜,π‘˜} = 0. Thus the only permutation that can make a nonzero contribution to the sum in 9.46 is the permutation (1, …, 𝑛). Because 𝐴_{π‘˜,π‘˜} = πœ†_π‘˜ for each π‘˜ = 1, …, 𝑛, this implies that det 𝐴 = πœ†_1 β‹― πœ†_𝑛.

Properties of Determinants

Our definition of the determinant leads to the following magical proof that the determinant is multiplicative.

9.49 determinant is multiplicative

(a) Suppose 𝑆, 𝑇 ∈ β„’(𝑉). Then det(𝑆𝑇) = (det 𝑆)(det 𝑇).
(b) Suppose 𝐴 and 𝐡 are square matrices of the same size. Then det(𝐴𝐡) = (det 𝐴)(det 𝐡).

Proof
(a) Let 𝑛 = dim 𝑉.
Suppose 𝛼 ∈ 𝑉^(𝑛)_alt and 𝑣_1, …, 𝑣_𝑛 ∈ 𝑉. Then

  𝛼_{𝑆𝑇}(𝑣_1, …, 𝑣_𝑛) = 𝛼(𝑆𝑇𝑣_1, …, 𝑆𝑇𝑣_𝑛)
    = (det 𝑆)𝛼(𝑇𝑣_1, …, 𝑇𝑣_𝑛)
    = (det 𝑆)(det 𝑇)𝛼(𝑣_1, …, 𝑣_𝑛),

where the first equation follows from the definition of 𝛼_{𝑆𝑇}, the second equation follows from the definition of det 𝑆, and the third equation follows from the definition of det 𝑇. The equation above implies that det(𝑆𝑇) = (det 𝑆)(det 𝑇).

(b) Let 𝑆, 𝑇 ∈ β„’(𝐅^𝑛) be such that β„³(𝑆) = 𝐴 and β„³(𝑇) = 𝐡, where all matrices of operators in this proof are with respect to the standard basis of 𝐅^𝑛. Then β„³(𝑆𝑇) = β„³(𝑆)β„³(𝑇) = 𝐴𝐡 (see 3.43). Thus
  det(𝐴𝐡) = det(𝑆𝑇) = (det 𝑆)(det 𝑇) = (det 𝐴)(det 𝐡),
where the second equality comes from the result in (a).

The determinant of an operator determines whether the operator is invertible.

9.50 invertible ⟺ nonzero determinant

An operator 𝑇 ∈ β„’(𝑉) is invertible if and only if det 𝑇 β‰  0. Furthermore, if 𝑇 is invertible, then
  det(𝑇^{βˆ’1}) = 1/(det 𝑇).

Proof  First suppose 𝑇 is invertible. Thus 𝑇𝑇^{βˆ’1} = 𝐼. Now 9.49 implies that
  1 = det 𝐼 = det(𝑇𝑇^{βˆ’1}) = (det 𝑇)(det(𝑇^{βˆ’1})).
Hence det 𝑇 β‰  0 and det(𝑇^{βˆ’1}) is the multiplicative inverse of det 𝑇.

To prove the other direction, now suppose det 𝑇 β‰  0. Suppose 𝑣 ∈ 𝑉 and 𝑣 β‰  0. Let 𝑣, 𝑒_2, …, 𝑒_𝑛 be a basis of 𝑉 and let 𝛼 ∈ 𝑉^(𝑛)_alt be such that 𝛼 β‰  0. Then 𝛼(𝑣, 𝑒_2, …, 𝑒_𝑛) β‰  0 (by 9.39). Now
  𝛼(𝑇𝑣, 𝑇𝑒_2, …, 𝑇𝑒_𝑛) = (det 𝑇)𝛼(𝑣, 𝑒_2, …, 𝑒_𝑛) β‰  0.
Thus 𝑇𝑣 β‰  0. Hence 𝑇 is invertible.

An 𝑛-by-𝑛 matrix 𝐴 is invertible (see 3.80 for the definition of an invertible matrix) if and only if the operator on 𝐅^𝑛 associated with 𝐴 (via the standard basis of 𝐅^𝑛) is invertible. Thus the previous result shows that a square matrix 𝐴 is invertible if and only if det 𝐴 β‰  0.

9.51 eigenvalues and determinants

Suppose 𝑇 ∈ β„’(𝑉) and πœ† ∈ 𝐅. Then πœ† is an eigenvalue of 𝑇 if and only if
  det(πœ†πΌ βˆ’ 𝑇) = 0.

Proof  The number πœ† is an eigenvalue of 𝑇 if and only if 𝑇 βˆ’ πœ†πΌ is not invertible (see 5.7), which happens if and only if πœ†πΌ βˆ’ 𝑇 is not invertible, which happens if and only if det(πœ†πΌ βˆ’ 𝑇) = 0 (by 9.50).

Suppose 𝑇 ∈ β„’(𝑉) and 𝑆: π‘Š β†’ 𝑉 is an invertible linear map. To prove that det(𝑆^{βˆ’1}𝑇𝑆) = det 𝑇, we could try to use 9.49 and 9.50, writing
  det(𝑆^{βˆ’1}𝑇𝑆) = (det 𝑆^{βˆ’1})(det 𝑇)(det 𝑆) = det 𝑇.
That proof works if π‘Š = 𝑉, but if π‘Š β‰  𝑉 then it makes no sense because the determinant is defined only for linear maps from a vector space to itself, and 𝑆 maps π‘Š to 𝑉, making det 𝑆 undefined. The proof given below works around this issue and is valid when π‘Š β‰  𝑉.

9.52 determinant is a similarity invariant

Suppose 𝑇 ∈ β„’(𝑉) and 𝑆: π‘Š β†’ 𝑉 is an invertible linear map. Then
  det(𝑆^{βˆ’1}𝑇𝑆) = det 𝑇.

Proof  Let 𝑛 = dim π‘Š = dim 𝑉. Suppose 𝜏 ∈ π‘Š^(𝑛)_alt. Define 𝛼 ∈ 𝑉^(𝑛)_alt by
  𝛼(𝑣_1, …, 𝑣_𝑛) = 𝜏(𝑆^{βˆ’1}𝑣_1, …, 𝑆^{βˆ’1}𝑣_𝑛)
for 𝑣_1, …, 𝑣_𝑛 ∈ 𝑉. Suppose 𝑀_1, …, 𝑀_𝑛 ∈ π‘Š. Then

  𝜏_{𝑆^{βˆ’1}𝑇𝑆}(𝑀_1, …, 𝑀_𝑛) = 𝜏(𝑆^{βˆ’1}𝑇𝑆𝑀_1, …, 𝑆^{βˆ’1}𝑇𝑆𝑀_𝑛)
    = 𝛼(𝑇𝑆𝑀_1, …, 𝑇𝑆𝑀_𝑛)
    = 𝛼_𝑇(𝑆𝑀_1, …, 𝑆𝑀_𝑛)
    = (det 𝑇)𝛼(𝑆𝑀_1, …, 𝑆𝑀_𝑛)
    = (det 𝑇)𝜏(𝑀_1, …, 𝑀_𝑛).

The equation above and the definition of the determinant of the operator 𝑆^{βˆ’1}𝑇𝑆 imply that det(𝑆^{βˆ’1}𝑇𝑆) = det 𝑇.
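The properties just proved are easy to sanity-check numerically. The short Python sketch below (our own illustration, using NumPy's built-in determinant on random matrices; nothing in it comes from the text itself) exercises 9.49, 9.51, and 9.52.

    import numpy as np

    rng = np.random.default_rng(1)
    A = rng.standard_normal((4, 4))
    B = rng.standard_normal((4, 4))
    S = rng.standard_normal((4, 4))   # almost surely invertible

    # 9.49: the determinant is multiplicative.
    assert np.isclose(np.linalg.det(A @ B), np.linalg.det(A) * np.linalg.det(B))

    # 9.52: the determinant is a similarity invariant.
    assert np.isclose(np.linalg.det(np.linalg.inv(S) @ A @ S), np.linalg.det(A))

    # 9.51: lambda is an eigenvalue if and only if det(lambda I - A) = 0.
    lam = np.linalg.eigvals(A)[0]
    assert abs(np.linalg.det(lam * np.eye(4) - A)) < 1e-10
    print("9.49, 9.51, and 9.52 check out numerically")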
For the special case in which 𝑉 = 𝐅^𝑛 and 𝑒_1, …, 𝑒_𝑛 is the standard basis of 𝐅^𝑛, the next result is true by the definition of the determinant of a matrix. The left side of the equation in the next result does not depend on a choice of basis, which means that the right side is independent of the choice of basis.

9.53 determinant of operator equals determinant of its matrix

Suppose 𝑇 ∈ β„’(𝑉) and 𝑒_1, …, 𝑒_𝑛 is a basis of 𝑉. Then
  det 𝑇 = det β„³(𝑇, (𝑒_1, …, 𝑒_𝑛)).

Proof  Let 𝑓_1, …, 𝑓_𝑛 be the standard basis of 𝐅^𝑛. Let 𝑆: 𝐅^𝑛 β†’ 𝑉 be the linear map such that 𝑆𝑓_π‘˜ = 𝑒_π‘˜ for each π‘˜ = 1, …, 𝑛. Thus β„³(𝑆, (𝑓_1, …, 𝑓_𝑛), (𝑒_1, …, 𝑒_𝑛)) and β„³(𝑆^{βˆ’1}, (𝑒_1, …, 𝑒_𝑛), (𝑓_1, …, 𝑓_𝑛)) both equal the 𝑛-by-𝑛 identity matrix. Hence

9.54  β„³(𝑆^{βˆ’1}𝑇𝑆, (𝑓_1, …, 𝑓_𝑛)) = β„³(𝑇, (𝑒_1, …, 𝑒_𝑛)),

as follows from two applications of 3.43. Thus

  det 𝑇 = det(𝑆^{βˆ’1}𝑇𝑆)
    = det β„³(𝑆^{βˆ’1}𝑇𝑆, (𝑓_1, …, 𝑓_𝑛))
    = det β„³(𝑇, (𝑒_1, …, 𝑒_𝑛)),

where the first line comes from 9.52, the second line comes from the definition of the determinant of a matrix, and the third line follows from 9.54.

The next result gives a more intuitive way to think about determinants than the definition or the formula in 9.46. We could make the characterization in the result below the definition of the determinant of an operator on a finite-dimensional complex vector space, with the current definition then becoming a consequence of that definition.

9.55 if 𝐅 = 𝐂, then determinant equals product of eigenvalues

Suppose 𝐅 = 𝐂 and 𝑇 ∈ β„’(𝑉). Then det 𝑇 equals the product of the eigenvalues of 𝑇, with each eigenvalue included as many times as its multiplicity.

Proof  There is a basis of 𝑉 with respect to which 𝑇 has an upper-triangular matrix with the diagonal entries of the matrix consisting of the eigenvalues of 𝑇, with each eigenvalue included as many times as its multiplicity (see 8.37). Thus 9.53 and 9.48 imply that det 𝑇 equals the product of the eigenvalues of 𝑇, with each eigenvalue included as many times as its multiplicity.

As the next result shows, the determinant interacts nicely with the transpose of a square matrix, with the dual of an operator, and with the adjoint of an operator on an inner product space.

9.56 determinant of transpose, dual, or adjoint

(a) Suppose 𝐴 is a square matrix. Then det 𝐴^t = det 𝐴.
(b) Suppose 𝑇 ∈ β„’(𝑉). Then det 𝑇′ = det 𝑇.
(c) Suppose 𝑉 is an inner product space and 𝑇 ∈ β„’(𝑉). Then
  det(π‘‡βˆ—) = \overline{det 𝑇}.

Proof
(a) Let 𝑛 be a positive integer. Define 𝛼: (𝐅^𝑛)^𝑛 β†’ 𝐅 by
  𝛼(𝑣_1, …, 𝑣_𝑛) = det((𝑣_1 β‹― 𝑣_𝑛)^t)
for all 𝑣_1, …, 𝑣_𝑛 ∈ 𝐅^𝑛. The formula in 9.46 for the determinant of a matrix shows that 𝛼 is an 𝑛-linear form on 𝐅^𝑛. Suppose 𝑣_1, …, 𝑣_𝑛 ∈ 𝐅^𝑛 and 𝑣_𝑗 = 𝑣_π‘˜ for some 𝑗 β‰  π‘˜. If 𝐡 is an 𝑛-by-𝑛 matrix, then (𝑣_1 β‹― 𝑣_𝑛)^t 𝐡 cannot equal the identity matrix because row 𝑗 and row π‘˜ of (𝑣_1 β‹― 𝑣_𝑛)^t 𝐡 are equal. Thus (𝑣_1 β‹― 𝑣_𝑛)^t is not invertible, which implies that 𝛼(𝑣_1, …, 𝑣_𝑛) = 0. Hence 𝛼 is an alternating 𝑛-linear form on 𝐅^𝑛. Note that 𝛼 applied to the standard basis of 𝐅^𝑛 equals 1.
Because the vector space of alternating 𝑛 -linear forms on 𝐅 𝑛 has dimension one (by 9.37), this implies that 𝛼 is the determinant function. Thus (a) holds.

(b) The equation det 𝑇 β€² = det 𝑇 follows from (a) and 9.53 and 3.132.

(c) Pick an orthonormal basis of 𝑉 . The matrix of 𝑇 βˆ— with respect to that basis is the conjugate transpose of the matrix of 𝑇 with respect to that basis (by 7.9). Thus 9.53, 9.46, and (a) imply that det (𝑇 βˆ— ) = \overline{det 𝑇} .

9.57 helpful results in evaluating determinants

(a) If either two columns or two rows of a square matrix are equal, then the determinant of the matrix equals 0 .
(b) Suppose 𝐴 is a square matrix and 𝐡 is the matrix obtained from 𝐴 by swapping either two columns or two rows. Then det 𝐴 = βˆ’ det 𝐡 .
(c) If one column or one row of a square matrix is multiplied by a scalar, then the value of the determinant is multiplied by the same scalar.
(d) If a scalar multiple of one column of a square matrix is added to another column, then the value of the determinant is unchanged.
(e) If a scalar multiple of one row of a square matrix is added to another row, then the value of the determinant is unchanged.

Proof All the assertions in this result follow from the result that the maps

𝑣 1 , … , 𝑣 𝑛 ↦ det (𝑣 1 β‹― 𝑣 𝑛 )  and  𝑣 1 , … , 𝑣 𝑛 ↦ det ((𝑣 1 β‹― 𝑣 𝑛 )^t)

are both alternating 𝑛 -linear forms on 𝐅 𝑛 [see 9.45 and 9.56(a)]. For example, to prove (d) suppose 𝑣 1 , … , 𝑣 𝑛 ∈ 𝐅 𝑛 and 𝑐 ∈ 𝐅 . Then

det (𝑣 1 + 𝑐𝑣 2  𝑣 2  β‹―  𝑣 𝑛 ) = det (𝑣 1  𝑣 2  β‹―  𝑣 𝑛 ) + 𝑐 det (𝑣 2  𝑣 2  𝑣 3  β‹―  𝑣 𝑛 ) = det (𝑣 1  𝑣 2  β‹―  𝑣 𝑛 ),

where the first equation follows from the multilinearity property and the second equation follows from the alternating property. The equation above shows that adding a multiple of the second column to the first column does not change the value of the determinant. The same conclusion holds for any two columns. Thus (d) holds.

The proof of (e) follows from (d) and from 9.56(a). The proofs of (a), (b), and (c) use similar tools and are left to the reader.

For matrices whose entries are concrete numbers, the result above leads to a much faster way to evaluate the determinant than direct application of the formula in 9.46. Specifically, apply the Gaussian elimination procedure of swapping rows [by 9.57(b), this changes the determinant by a factor of βˆ’1 ], multiplying a row by a nonzero constant [by 9.57(c), this changes the determinant by the same constant], and adding a multiple of one row to another row [by 9.57(e), this does not change the determinant] to produce an upper-triangular matrix, whose determinant is the product of the diagonal entries (by 9.48). If your software keeps track of the number of row swaps and of the constants used when multiplying a row by a constant, then the determinant of the original matrix can be computed.

Because a number πœ† ∈ 𝐅 is an eigenvalue of an operator 𝑇 ∈ β„’ (𝑉) if and only if det (πœ†πΌ βˆ’ 𝑇) = 0 (by 9.51), you may be tempted to think that one way to find eigenvalues quickly is to choose a basis of 𝑉 , let 𝐴 = β„³ (𝑇) , evaluate det (πœ†πΌ βˆ’ 𝐴) , and then solve the equation det (πœ†πΌ βˆ’ 𝐴) = 0 for πœ† . However, that procedure is rarely efficient, except when dim 𝑉 = 2 (or when dim 𝑉 equals 3 or 4 if you are willing to use the cubic or quartic formulas).
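Here is a minimal sketch of this row-reduction method, assuming NumPy (the helper name det_by_elimination is ours, not the text's). It never rescales a row, so only the sign changes from row swaps need to be tracked:

import numpy as np

def det_by_elimination(A):
    # Reduce to upper-triangular form by row swaps [9.57(b)] and
    # row-replacement steps [9.57(e)]; then the determinant is the
    # product of the diagonal entries [9.48], times the sign from swaps.
    U = np.array(A, dtype=float)
    n = U.shape[0]
    sign = 1.0
    for k in range(n):
        p = k + np.argmax(np.abs(U[k:, k]))   # partial pivoting
        if U[p, k] == 0:
            return 0.0                        # no pivot: matrix is singular
        if p != k:
            U[[k, p]] = U[[p, k]]             # a swap flips the sign
            sign = -sign
        # subtract multiples of row k to clear column k below the pivot;
        # this leaves the determinant unchanged
        U[k + 1:] -= np.outer(U[k + 1:, k] / U[k, k], U[k])
    return sign * np.prod(np.diag(U))

A = [[2., 1., 0.], [1., 3., 4.], [0., 1., 1.]]
assert np.isclose(det_by_elimination(A), np.linalg.det(A))   # both give -3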
One problem is that this row-reduction procedure for evaluating a determinant does not work when the matrix includes a symbol (such as the πœ† in πœ†πΌ βˆ’ 𝐴 ). This problem arises because decisions need to be made in the Gaussian elimination procedure about whether certain quantities equal 0 , and those decisions become complicated in expressions involving a symbol πœ† .

Recall that an operator on a finite-dimensional inner product space is unitary if it preserves norms (see 7.51 and the paragraph following it). Every eigenvalue of a unitary operator has absolute value 1 (by 7.54). Thus the product of the eigenvalues of a unitary operator has absolute value 1 . Hence (at least in the case 𝐅 = 𝐂 ) the determinant of a unitary operator has absolute value 1 (by 9.55). The next result gives a proof that works without the assumption that 𝐅 = 𝐂 .

9.58 every unitary operator has determinant with absolute value 1

Suppose 𝑉 is an inner product space and 𝑆 ∈ β„’ (𝑉) is a unitary operator. Then | det 𝑆| = 1 .

Proof Because 𝑆 is unitary, 𝐼 = 𝑆 βˆ— 𝑆 (see 7.53). Thus

1 = det (𝑆 βˆ— 𝑆) = (det 𝑆 βˆ— )(det 𝑆) = \overline{(det 𝑆)}(det 𝑆) = | det 𝑆|Β²,

where the second equality comes from 9.49(a) and the third equality comes from 9.56(c). The equation above implies that | det 𝑆| = 1 .

The determinant of a positive operator on an inner product space meshes well with the analogy that such operators correspond to the nonnegative real numbers.

9.59 every positive operator has nonnegative determinant

Suppose 𝑉 is an inner product space and 𝑇 ∈ β„’ (𝑉) is a positive operator. Then det 𝑇 β‰₯ 0 .

Proof By the spectral theorem (7.29 or 7.31), 𝑉 has an orthonormal basis consisting of eigenvectors of 𝑇 . Thus by the last bullet point of 9.42, det 𝑇 equals a product of the eigenvalues of 𝑇 , possibly with repetitions. Each eigenvalue of 𝑇 is a nonnegative number (by 7.38). Thus we conclude that det 𝑇 β‰₯ 0 .

Suppose 𝑉 is an inner product space and 𝑇 ∈ β„’ (𝑉) . Recall that the list of nonnegative square roots of the eigenvalues of 𝑇 βˆ— 𝑇 (each included as many times as its multiplicity) is called the list of singular values of 𝑇 (see Section 7E).

9.60 | det 𝑇| = product of singular values of 𝑇

Suppose 𝑉 is an inner product space and 𝑇 ∈ β„’ (𝑉) . Then

| det 𝑇| = √(det (𝑇 βˆ— 𝑇)) = product of singular values of 𝑇.

Proof We have

| det 𝑇|Β² = \overline{(det 𝑇)}(det 𝑇) = (det (𝑇 βˆ— ))(det 𝑇) = det (𝑇 βˆ— 𝑇),

where the middle equality comes from 9.56(c) and the last equality comes from 9.49(a). Taking square roots of both sides of the equation above shows that | det 𝑇| = √(det (𝑇 βˆ— 𝑇)) .

Let 𝑠 1 , … , 𝑠 𝑛 denote the list of singular values of 𝑇 . Thus 𝑠 1 Β² , … , 𝑠 𝑛 Β² is the list of eigenvalues of 𝑇 βˆ— 𝑇 (with appropriate repetitions), corresponding to an orthonormal basis of 𝑉 consisting of eigenvectors of 𝑇 βˆ— 𝑇 . Hence the last bullet point of 9.42 implies that det (𝑇 βˆ— 𝑇) = 𝑠 1 Β² ⋯𝑠 𝑛 Β² . Thus | det 𝑇| = 𝑠 1 ⋯𝑠 𝑛 , as desired.

An operator 𝑇 on a real inner product space changes volume by a factor of the product of the singular values (by 7.111). Thus the next result follows immediately from 7.111 and 9.60.
This result explains why the absolute value of a determinant appears in the change of variables formula in multivariable calculus. 9.61 𝑇 changes volume by factor of | det 𝑇| Suppose 𝑇 ∈ β„’ (𝐑 𝑛 ) and Ξ© βŠ† 𝐑 𝑛 . Then volume 𝑇(Ξ©) = | det 𝑇|( volume Ξ©). For operators on finite-dimensional complex vector spaces, we now connect the determinant to a polynomial that we have previously seen. 9.62 if 𝐅 = 𝐂 , then characteristic polynomial of 𝑇 equals det (𝑧𝐼 βˆ’ 𝑇) Suppose 𝐅 = 𝐂 and 𝑇 ∈ β„’ (𝑉) . Let πœ† 1 , … , πœ† π‘š denote the distinct eigenvalues of 𝑇 , and let 𝑑 1 , … , 𝑑 π‘š denote their multiplicities. Then det (𝑧𝐼 βˆ’ 𝑇) = (𝑧 βˆ’ πœ† 1 ) 𝑑 1 β‹―(𝑧 βˆ’ πœ† π‘š ) 𝑑 π‘š . Proof There exists a basis of 𝑉 with respect to which 𝑇 has an upper-triangular matrix with each πœ† π‘˜ appearing on the diagonal exactly 𝑑 π‘˜ times (by 8.37). With respect to this basis, 𝑧𝐼 βˆ’ 𝑇 has an upper-triangular matrix with 𝑧 βˆ’ πœ† π‘˜ appearing on the diagonal exactly 𝑑 π‘˜ times for each π‘˜ . Thus 9.48 gives the desired equation. Suppose 𝐅 = 𝐂 and 𝑇 ∈ β„’ (𝑉) . The characteristic polynomial of 𝑇 was defined in 8.26 as the polynomial on the right side of the equation in 9.62. We did not previously define the characteristic polynomial of an operator on a finite- dimensional real vector space because such operators may have no eigenvalues, making a definition using the right side of the equation in 9.62 inappropriate. We now present a new definition of the characteristic polynomial, motivated by 9.62. This new definition is valid for both real and complex vector spaces. The equation in 9.62 shows that this new definition is equivalent to our previous definition when 𝐅 = 𝐂 (8.26). 9.63 definition: characteristic polynomial Suppose 𝑇 ∈ β„’ (𝑉) . The polynomial defined by 𝑧 ↦ det (𝑧𝐼 βˆ’ 𝑇) is called the characteristic polynomial of 𝑇 . The formula in 9.46 shows that the characteristic polynomial of an opera- tor 𝑇 ∈ β„’ (𝑉) is a monic polynomial of degree dim 𝑉 . The zeros in 𝐅 of the characteristic polynomial of 𝑇 are exactly the eigenvalues of 𝑇 (by 9.51). Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 377 Spans: True Boxes: True Text: 364 Chapter 9 Multilinear Algebra and Determinants Previously we proved the Cayley–Hamilton theorem (8.29) in the complex case. Now we can extend that result to operators on real vector spaces. 9.64 Cayley–Hamilton theorem Suppose 𝑇 ∈ β„’ (𝑉) and π‘ž is the characteristic polynomial of 𝑇 . Then π‘ž(𝑇) = 0 . Proof If 𝐅 = 𝐂 , then the equation π‘ž(𝑇) = 0 follows from 9.62 and 8.29. Now suppose 𝐅 = 𝐑 . Fix a basis of 𝑉 , and let 𝐴 be the matrix of 𝑇 with respect to this basis. Let 𝑆 be the operator on 𝐂 dim 𝑉 such that the matrix of 𝑆 ( with respect to the standard basis of 𝐂 dim 𝑉 ) is 𝐴 . For all 𝑧 ∈ 𝐑 we have π‘ž(𝑧) = det (𝑧𝐼 βˆ’ 𝑇) = det (𝑧𝐼 βˆ’ 𝐴) = det (𝑧𝐼 βˆ’ 𝑆). Thus π‘ž is the characteristic polynomial of 𝑆 . The case 𝐅 = 𝐂 (first sentence of this proof) now implies that 0 = π‘ž(𝑆) = π‘ž(𝐴) = π‘ž(𝑇) . The Cayley–Hamilton theorem (9.64) implies that the characteristic polyno- mial of an operator 𝑇 ∈ β„’ (𝑉) is a polynomial multiple of the minimal polynomial of 𝑇 (by 5.29). Thus if the degree of the minimal polynomial of 𝑇 equals dim 𝑉 , then the characteristic polynomial of 𝑇 equals the minimal polynomial of 𝑇 . This happens for a very large percentage of operators, including over 99.999% of 4 -by- 4 matrices with integer entries in [βˆ’100 , 100] (see the paragraph following 5.25). 
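The Cayley–Hamilton theorem is easy to test numerically. The sketch below is not part of the text; it assumes NumPy, whose np.poly returns the coefficients of the characteristic polynomial det(zI βˆ’ A), from the z^n term down to the constant term:

import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((4, 4))

# monic characteristic polynomial coefficients, highest degree first
q = np.poly(A)

# evaluate q at the matrix A:  q(A) = A^4 + q[1] A^3 + ... + q[4] I
qA = sum(c * np.linalg.matrix_power(A, k)
         for k, c in zip(range(len(q) - 1, -1, -1), q))

assert np.allclose(qA, 0, atol=1e-9)   # Cayley-Hamilton: q(A) = 0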
The last sentence in our next result was previously proved in the complex case (see 8.54). Now we can give a proof that works on both real and complex vector spaces. 9.65 characteristic polynomial, trace, and determinant Suppose 𝑇 ∈ β„’ (𝑉) . Let 𝑛 = dim 𝑉 . Then the characteristic polynomial of 𝑇 can be written as 𝑧 𝑛 βˆ’ ( tr 𝑇)𝑧 π‘›βˆ’1 + β‹― + (βˆ’1) 𝑛 ( det 𝑇). Proof The constant term of a polynomial function of 𝑧 is the value of the poly- nomial when 𝑧 = 0 . Thus the constant term of the characteristic polynomial of 𝑇 equals det (βˆ’π‘‡) , which equals (βˆ’1) 𝑛 det 𝑇 (by the third bullet point of 9.42). Fix a basis of 𝑉 , and let 𝐴 be the matrix of 𝑇 with respect to this basis. The matrix of 𝑧𝐼 βˆ’ 𝑇 with respect to this basis is 𝑧𝐼 βˆ’ 𝐴 . The term coming from the identity permutation {1 , … , 𝑛} in the formula 9.46 for det (𝑧𝐼 βˆ’ 𝐴) is (𝑧 βˆ’ 𝐴 1 , 1 )β‹―(𝑧 βˆ’ 𝐴 𝑛 , 𝑛 ). The coefficient of 𝑧 π‘›βˆ’1 in the expression above is βˆ’(𝐴 1 , 1 + β‹― + 𝐴 𝑛 , 𝑛 ) , which equals βˆ’ tr 𝑇 . The terms in the formula for det (𝑧𝐼 βˆ’ 𝐴) coming from other elements of perm 𝑛 contain at most π‘›βˆ’2 factors of the form π‘§βˆ’π΄ π‘˜ , π‘˜ and thus do not contribute to the coefficient of 𝑧 π‘›βˆ’1 in the characteristic polynomial of 𝑇 . Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 378 Spans: True Boxes: True Text: Section 9C Determinants 365 The next result was proved by Jacques Hadamard ( 1865–1963 ) in 1893. In the result below, think of the columns of the 𝑛 -by- 𝑛 matrix 𝐴 as ele- ments of 𝐅 𝑛 . The norms appearing below then arise from the standard inner product on 𝐅 𝑛 . Recall that the notation 𝑅 β‹… , π‘˜ in the proof below means the π‘˜ th column of the matrix 𝑅 (as was defined in 3.44). 9.66 Hadamard’s inequality Suppose 𝐴 is an 𝑛 -by- 𝑛 matrix. Let 𝑣 1 , … , 𝑣 𝑛 denote the columns of 𝐴 . Then | det 𝐴| ≀ 𝑛 ∏ π‘˜=1 ‖𝑣 π‘˜ β€–. Proof If 𝐴 is not invertible, then det 𝐴 = 0 and hence the desired inequality holds in this case. Thus assume that 𝐴 is invertible. The QR factorization (7.58) tells us that there exist a unitary matrix 𝑄 and an upper-triangular matrix 𝑅 whose diagonal contains only positive numbers such that 𝐴 = 𝑄𝑅 . We have | det 𝐴| = | det 𝑄| | det 𝑅| = | det 𝑅| = 𝑛 ∏ π‘˜=1 𝑅 π‘˜ , π‘˜ ≀ 𝑛 ∏ π‘˜=1 ‖𝑅 β‹… , π‘˜ β€– = 𝑛 ∏ π‘˜=1 ‖𝑄𝑅 β‹… , π‘˜ β€– = 𝑛 ∏ π‘˜=1 ‖𝑣 π‘˜ β€– , where the first line comes from 9.49(b), the second line comes from 9.58, the third line comes from 9.48, and the fifth line holds because 𝑄 is an isometry. To give a geometric interpretation to Hadamard’s inequality, suppose 𝐅 = 𝐑 . Let 𝑇 ∈ β„’ (𝐑 𝑛 ) be the operator such that 𝑇𝑒 π‘˜ = 𝑣 π‘˜ for each π‘˜ = 1 , … , 𝑛 , where 𝑒 1 , … , 𝑒 𝑛 is the standard basis of 𝐑 𝑛 . Then 𝑇 maps the box 𝑃(𝑒 1 , … , 𝑒 𝑛 ) onto the parallelepiped 𝑃(𝑣 1 , … , 𝑣 𝑛 ) [see 7.102 and 7.105 for a review of this notation and terminology]. Because the box 𝑃(𝑒 1 , … , 𝑒 𝑛 ) has volume 1 , this implies (by 9.61) that the parallelepiped 𝑃(𝑣 1 , … , 𝑣 𝑛 ) has volume | det 𝑇| , which equals | det 𝐴| . Thus Hadamard’s inequality above can be interpreted to say that among all paral- lelepipeds whose edges have lengths ‖𝑣 1 β€– , … , ‖𝑣 𝑛 β€– , the ones with largest volume have orthogonal edges ( and thus have volume ∏ π‘›π‘˜=1 ‖𝑣 π‘˜ β€–) . For a necessary and sufficient condition for Hadamard’s inequality to be an equality, see Exercise 18. 
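Both 9.65 and Hadamard's inequality lend themselves to quick numerical checks. A sketch assuming NumPy (not part of the text):

import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((5, 5))

# 9.65: the characteristic polynomial is
#   z^n - (tr A) z^{n-1} + ... + (-1)^n (det A)
q = np.poly(A)                         # coefficients, highest degree first
assert np.isclose(q[1], -np.trace(A))
assert np.isclose(q[-1], (-1) ** 5 * np.linalg.det(A))

# Hadamard's inequality [9.66]: |det A| <= product of the column norms
assert abs(np.linalg.det(A)) <= np.prod(np.linalg.norm(A, axis=0))

# equality when the columns are orthogonal (compare Exercise 18):
Q, _ = np.linalg.qr(A)                 # Q has orthonormal columns
D = Q * np.array([1., 2., 3., 4., 5.]) # scale columns, keeping orthogonality
assert np.isclose(abs(np.linalg.det(D)),
                  np.prod(np.linalg.norm(D, axis=0)))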
The matrix in the next result is called the Vandermonde matrix. Vandermonde matrices have important applications in polynomial interpolation, the discrete Fourier transform, and other areas of mathematics. The proof of the next result is a nice illustration of the power of switching between matrices and linear maps.

9.67 determinant of Vandermonde matrix

Suppose 𝑛 > 1 and 𝛽 1 , … , 𝛽 𝑛 ∈ 𝐅 . Then

det βŽ› 1  𝛽 1  𝛽 1 Β²  β‹―  𝛽 1 ⁿ⁻¹ ⎞
    ⎜ 1  𝛽 2  𝛽 2 ²  ⋯  𝛽 2 ⁿ⁻¹ ⎟
    ⎜          β‹±              ⎟
    ⎝ 1  𝛽 𝑛  𝛽 𝑛 Β²  β‹―  𝛽 𝑛 ⁿ⁻¹ ⎠  =  ∏_{1≀𝑗<π‘˜β‰€π‘›} (𝛽 π‘˜ βˆ’ 𝛽 𝑗 ).

Proof Let 1 , 𝑧 , … , 𝑧^{π‘›βˆ’1} be the standard basis of 𝒫 π‘›βˆ’1 (𝐅) and let 𝑒 1 , … , 𝑒 𝑛 denote the standard basis of 𝐅 𝑛 . Define a linear map 𝑆 ∢ 𝒫 π‘›βˆ’1 (𝐅) β†’ 𝐅 𝑛 by

𝑆𝑝 = (𝑝(𝛽 1 ) , … , 𝑝(𝛽 𝑛 )).

Let 𝐴 denote the Vandermonde matrix shown in the statement of this result. Note that

𝐴 = β„³ (𝑆 , (1 , 𝑧 , … , 𝑧^{π‘›βˆ’1}) , (𝑒 1 , … , 𝑒 𝑛 )).

Let 𝑇 ∢ 𝒫 π‘›βˆ’1 (𝐅) β†’ 𝒫 π‘›βˆ’1 (𝐅) be the operator on 𝒫 π‘›βˆ’1 (𝐅) such that 𝑇1 = 1 and

𝑇𝑧^π‘˜ = (𝑧 βˆ’ 𝛽 1 )(𝑧 βˆ’ 𝛽 2 )β‹―(𝑧 βˆ’ 𝛽 π‘˜ )

for π‘˜ = 1 , … , 𝑛 βˆ’ 1 . Let 𝐡 = β„³ (𝑇 , (1 , 𝑧 , … , 𝑧^{π‘›βˆ’1}) , (1 , 𝑧 , … , 𝑧^{π‘›βˆ’1})) . Then 𝐡 is an upper-triangular matrix all of whose diagonal entries equal 1 . Thus det 𝐡 = 1 (by 9.48).

Let 𝐢 = β„³ (𝑆𝑇 , (1 , 𝑧 , … , 𝑧^{π‘›βˆ’1}) , (𝑒 1 , … , 𝑒 𝑛 )) . Thus 𝐢 = 𝐴𝐡 (by 3.81), which implies that

det 𝐴 = (det 𝐴)(det 𝐡) = det 𝐢.

The definitions of 𝐢 , 𝑆 , and 𝑇 show that 𝐢 equals

βŽ› 1  0          0                        β‹―  0 ⎞
⎜ 1  𝛽 2 βˆ’ 𝛽 1   0                        β‹―  0 ⎟
⎜ 1  𝛽 3 βˆ’ 𝛽 1   (𝛽 3 βˆ’ 𝛽 1 )(𝛽 3 βˆ’ 𝛽 2 )   β‹―  0 ⎟
⎜                     β‹±                       ⎟
⎝ 1  𝛽 𝑛 βˆ’ 𝛽 1   (𝛽 𝑛 βˆ’ 𝛽 1 )(𝛽 𝑛 βˆ’ 𝛽 2 )   β‹―  (𝛽 𝑛 βˆ’ 𝛽 1 )(𝛽 𝑛 βˆ’ 𝛽 2 )β‹―(𝛽 𝑛 βˆ’ 𝛽 π‘›βˆ’1 ) ⎠.

Now

det 𝐴 = det 𝐢 = ∏_{1≀𝑗<π‘˜β‰€π‘›} (𝛽 π‘˜ βˆ’ 𝛽 𝑗 ),

where we have used 9.56(a) and 9.48.

Exercises 9C

1 Prove or give a counterexample: 𝑆 , 𝑇 ∈ β„’ (𝑉) ⟹ det (𝑆 + 𝑇) = det 𝑆 + det 𝑇 .

2 Suppose the first column of a square matrix 𝐴 consists of all zeros except possibly the first entry 𝐴 1 , 1 . Let 𝐡 be the matrix obtained from 𝐴 by deleting the first row and the first column of 𝐴 . Show that det 𝐴 = 𝐴 1 , 1 det 𝐡 .

3 Suppose 𝑇 ∈ β„’ (𝑉) is nilpotent. Prove that det (𝐼 + 𝑇) = 1 .

4 Suppose 𝑆 ∈ β„’ (𝑉) . Prove that 𝑆 is unitary if and only if | det 𝑆| = ‖𝑆‖ = 1 .

5 Suppose 𝐴 is a block upper-triangular matrix

𝐴 = βŽ› 𝐴 1     βˆ— ⎞
    ⎜    β‹±     ⎟
    ⎝ 0     𝐴 π‘š ⎠ ,

where each 𝐴 π‘˜ along the diagonal is a square matrix. Prove that det 𝐴 = (det 𝐴 1 )β‹―(det 𝐴 π‘š ).

6 Suppose 𝐴 = (𝑣 1 β‹― 𝑣 𝑛 ) is an 𝑛 -by- 𝑛 matrix, with 𝑣 π‘˜ denoting the π‘˜ th column of 𝐴 . Show that if (π‘š 1 , … , π‘š 𝑛 ) ∈ perm 𝑛 , then det (𝑣 π‘š 1 β‹― 𝑣 π‘š 𝑛 ) = (sign (π‘š 1 , … , π‘š 𝑛 )) det 𝐴.

7 Suppose 𝑇 ∈ β„’ (𝑉) is invertible. Let 𝑝 denote the characteristic polynomial of 𝑇 and let π‘ž denote the characteristic polynomial of 𝑇 βˆ’1 . Prove that π‘ž(𝑧) = (1/𝑝(0)) 𝑧^{dim 𝑉} 𝑝(1/𝑧) for all nonzero 𝑧 ∈ 𝐅 .

8 Suppose 𝑇 ∈ β„’ (𝑉) is an operator with no eigenvalues (which implies that 𝐅 = 𝐑 ). Prove that det 𝑇 > 0 .

9 Suppose that 𝑉 is a real vector space of even dimension, 𝑇 ∈ β„’ (𝑉) , and det 𝑇 < 0 . Prove that 𝑇 has at least two distinct eigenvalues.
10 Suppose 𝑉 is a real vector space of odd dimension and 𝑇 ∈ β„’ (𝑉) . Without using the minimal polynomial, prove that 𝑇 has an eigenvalue. This result was previously proved without using determinants or the characteristic polynomialβ€”see 5.34.

11 Prove or give a counterexample: If 𝐅 = 𝐑 , 𝑇 ∈ β„’ (𝑉) , and det 𝑇 > 0 , then 𝑇 has a square root. If 𝐅 = 𝐂 , 𝑇 ∈ β„’ (𝑉) , and det 𝑇 β‰  0 , then 𝑇 has a square root (see 8.41).

12 Suppose 𝑆 , 𝑇 ∈ β„’ (𝑉) and 𝑆 is invertible. Define 𝑝 ∢ 𝐅 β†’ 𝐅 by 𝑝(𝑧) = det (𝑧𝑆 βˆ’ 𝑇). Prove that 𝑝 is a polynomial of degree dim 𝑉 and that the coefficient of 𝑧^{dim 𝑉} in this polynomial is det 𝑆 .

13 Suppose 𝐅 = 𝐂 , 𝑇 ∈ β„’ (𝑉) , and 𝑛 = dim 𝑉 > 2 . Let πœ† 1 , … , πœ† 𝑛 denote the eigenvalues of 𝑇 , with each eigenvalue included as many times as its multiplicity.
(a) Find a formula for the coefficient of 𝑧^{π‘›βˆ’2} in the characteristic polynomial of 𝑇 in terms of πœ† 1 , … , πœ† 𝑛 .
(b) Find a formula for the coefficient of 𝑧 in the characteristic polynomial of 𝑇 in terms of πœ† 1 , … , πœ† 𝑛 .

14 Suppose 𝑉 is an inner product space and 𝑇 is a positive operator on 𝑉 . Prove that det βˆšπ‘‡ = √(det 𝑇).

15 Suppose 𝑉 is an inner product space and 𝑇 ∈ β„’ (𝑉) . Use the polar decomposition to give a proof that | det 𝑇| = √(det (𝑇 βˆ— 𝑇)) that is different from the proof given earlier (see 9.60).

16 Suppose 𝑇 ∈ β„’ (𝑉) . Define 𝑔 ∢ 𝐅 β†’ 𝐅 by 𝑔(π‘₯) = det (𝐼 + π‘₯𝑇) . Show that 𝑔′(0) = tr 𝑇 . Look for a clean solution to this exercise, without using the explicit but complicated formula for the determinant of a matrix.

17 Suppose π‘Ž , 𝑏 , 𝑐 are positive numbers. Find the volume of the ellipsoid

{(π‘₯ , 𝑦 , 𝑧) ∈ 𝐑³ ∢ π‘₯Β²/π‘ŽΒ² + 𝑦²/𝑏² + 𝑧²/𝑐² < 1}

by finding a set Ξ© βŠ† 𝐑³ whose volume you know and an operator 𝑇 on 𝐑³ such that 𝑇(Ξ©) equals the ellipsoid above.

18 Suppose that 𝐴 is an invertible square matrix. Prove that Hadamard's inequality (9.66) is an equality if and only if each column of 𝐴 is orthogonal to the other columns.

19 Suppose 𝑉 is an inner product space, 𝑒 1 , … , 𝑒 𝑛 is an orthonormal basis of 𝑉 , and 𝑇 ∈ β„’ (𝑉) is a positive operator.
(a) Prove that det 𝑇 ≀ ∏_{π‘˜=1}^{𝑛} βŸ¨π‘‡π‘’ π‘˜ , 𝑒 π‘˜ ⟩ .
(b) Prove that if 𝑇 is invertible, then the inequality in (a) is an equality if and only if 𝑒 π‘˜ is an eigenvector of 𝑇 for each π‘˜ = 1 , … , 𝑛 .

20 Suppose 𝐴 is an 𝑛 -by- 𝑛 matrix, and suppose 𝑐 is such that |𝐴 𝑗 , π‘˜ | ≀ 𝑐 for all 𝑗 , π‘˜ ∈ {1 , … , 𝑛} . Prove that | det 𝐴| ≀ 𝑐^𝑛 𝑛^{𝑛/2} . The formula for the determinant of a matrix (9.46) shows that | det 𝐴| ≀ 𝑐^𝑛 𝑛! . However, the estimate given by this exercise is much better. For example, if 𝑐 = 1 and 𝑛 = 100 , then 𝑐^𝑛 𝑛! β‰ˆ 10^{158} , but the estimate given by this exercise is the much smaller number 10^{100} . If 𝑛 is an integer power of 2 , then the inequality above is sharp and cannot be improved.

21 Suppose 𝑛 is a positive integer and 𝛿 ∢ 𝐂 𝑛 , 𝑛 β†’ 𝐂 is a function such that 𝛿(𝐴𝐡) = 𝛿(𝐴) β‹… 𝛿(𝐡) for all 𝐴 , 𝐡 ∈ 𝐂 𝑛 , 𝑛 and 𝛿(𝐴) equals the product of the diagonal entries of 𝐴 for each diagonal matrix 𝐴 ∈ 𝐂 𝑛 , 𝑛 . Prove that 𝛿(𝐴) = det 𝐴 for all 𝐴 ∈ 𝐂 𝑛 , 𝑛 . Recall that 𝐂 𝑛 , 𝑛 denotes the set of 𝑛 -by- 𝑛 matrices with entries in 𝐂 .
This exercise shows that the determinant is the unique function defined on square matrices that is multiplicative and has the desired behavior on diagonal matrices. This result is analogous to Exercise 10 in Section 8D, which shows that the trace is uniquely determined by its algebraic properties. I find that in my own elementary lectures, I have, for pedagogical reasons, pushed determinants more and more into the background. Too often I have had the expe- rience that, while the students acquired facility with the formulas, which are so useful in abbreviating long expressions, they often failed to gain familiarity with their meaning , and skill in manipulation prevented the student from going into all the details of the subject and so gaining a mastery. β€” Elementary Mathematics from an Advanced Standpoint: Geometry , Felix Klein Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 383 Spans: True Boxes: True Text: 370 Chapter 9 Multilinear Algebra and Determinants 9D Tensor Products Tensor Product of Two Vector Spaces The motivation for our next topic comes from wanting to form the product of a vector 𝑣 ∈ 𝑉 and a vector 𝑀 ∈ π‘Š . This product will be denoted by 𝑣 βŠ— 𝑀 , pronounced β€œ 𝑣 tensor 𝑀 ”, and will be an element of some new vector space called π‘‰βŠ— π‘Š (also pronounced β€œ 𝑉 tensor π‘Š ”). We already have a vector space 𝑉× π‘Š (see Section 3E), called the product of 𝑉 and π‘Š . However, 𝑉× π‘Š will not serve our purposes here because it does not provide a natural way to multiply an element of 𝑉 by an element of π‘Š . We would like our tensor product to satisfy some of the usual properties of multiplication. For example, we would like the distributive property to be satisfied, meaning that if 𝑣 1 , 𝑣 2 , 𝑣 ∈ 𝑉 and 𝑀 1 , 𝑀 2 , 𝑀 ∈ π‘Š , then (𝑣 1 + 𝑣 2 ) βŠ— 𝑀 = 𝑣 1 βŠ— 𝑀 + 𝑣 2 βŠ— 𝑀 and 𝑣 βŠ— (𝑀 1 + 𝑀 2 ) = 𝑣 βŠ— 𝑀 1 + 𝑣 βŠ— 𝑀 2 . To produce βŠ— in TEX, type \otimes . We would also like scalar multiplica- tion to interact well with this new multi- plication, meaning that πœ†(𝑣 βŠ— 𝑀) = (πœ†π‘£) βŠ— 𝑀 = 𝑣 βŠ— (πœ†π‘€) for all πœ† ∈ 𝐅 , 𝑣 ∈ 𝑉 , and 𝑀 ∈ π‘Š . Furthermore, it would be nice if each basis of 𝑉 when combined with each basis of π‘Š produced a basis of π‘‰βŠ— π‘Š . Specifically, if 𝑒 1 , … , 𝑒 π‘š is a basis of 𝑉 and 𝑓 1 , … , 𝑓 𝑛 is a basis of π‘Š , then we would like a list (in any order) consisting of 𝑒 𝑗 βŠ— 𝑓 π‘˜ , as 𝑗 ranges from 1 to π‘š and π‘˜ ranges from 1 to 𝑛 , to be a basis of π‘‰βŠ— π‘Š . This implies that dim (π‘‰βŠ— π‘Š) should equal ( dim 𝑉)( dim π‘Š) . Recall that dim (𝑉× π‘Š) = dim 𝑉 + dim π‘Š (see 3.92), which shows that the product 𝑉× π‘Š will not serve our purposes here. To produce a vector space whose dimension is ( dim 𝑉)( dim π‘Š) in a natural fashion from 𝑉 and π‘Š , we look at the vector space of bilinear functionals, as defined below. 9.68 definition: bilinear functional on 𝑉× π‘Š , the vector space ℬ (𝑉 , π‘Š) β€’ A bilinear functional on 𝑉 Γ— π‘Š is a function 𝛽 ∢ 𝑉 Γ— π‘Š β†’ 𝐅 such that 𝑣 ↦ 𝛽(𝑣 , 𝑀) is a linear functional on 𝑉 for each 𝑀 ∈ π‘Š and 𝑀 ↦ 𝛽(𝑣 , 𝑀) is a linear functional on π‘Š for each 𝑣 ∈ 𝑉 . β€’ The vector space of bilinear functionals on 𝑉× π‘Š is denoted by ℬ (𝑉 , π‘Š) . If π‘Š = 𝑉 , then a bilinear functional on 𝑉 Γ— π‘Š is a bilinear form; see 9.1. The operations of addition and scalar multiplication on ℬ (𝑉 , π‘Š) are defined to be the usual operations of addition and scalar multiplication of functions. 
As you can verify, these operations make ℬ (𝑉 , π‘Š) into a vector space whose additive identity is the zero function from 𝑉 Γ— π‘Š to 𝐅 . Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 384 Spans: True Boxes: True Text: Section 9D Tensor Products 371 9.69 example: bilinear functionals β€’ Suppose πœ‘ ∈ 𝑉 β€² and 𝜏 ∈ π‘Š β€² . Define 𝛽 ∢ 𝑉× π‘Š β†’ 𝐅 by 𝛽(𝑣 , 𝑀) = πœ‘(𝑣)𝜏(𝑀) . Then 𝛽 is a bilinear functional on 𝑉× π‘Š . β€’ Suppose 𝑣 ∈ 𝑉 and 𝑀 ∈ π‘Š . Define 𝛽 ∢ 𝑉 β€² Γ— π‘Š β€² β†’ 𝐅 by 𝛽(πœ‘ , 𝜏) = πœ‘(𝑣)𝜏(𝑀) . Then 𝛽 is a bilinear functional on 𝑉 β€² Γ— π‘Š β€² . β€’ Define 𝛽 ∢ 𝑉× 𝑉 β€² β†’ 𝐅 by 𝛽(𝑣 , πœ‘) = πœ‘(𝑣) . Then 𝛽 is a bilinear functional on 𝑉× 𝑉 β€² . β€’ Suppose πœ‘ ∈ 𝑉 β€² . Define 𝛽 ∢ 𝑉× β„’ (𝑉) β†’ 𝐅 by 𝛽(𝑣 , 𝑇) = πœ‘(𝑇𝑣) . Then 𝛽 is a bilinear functional on 𝑉× β„’ (𝑉) . β€’ Suppose π‘š and 𝑛 are positive integers. Define 𝛽 ∢ 𝐅 π‘š , 𝑛 ×𝐅 𝑛 , π‘š β†’ 𝐅 by 𝛽(𝐴 , 𝐡) = tr (𝐴𝐡) . Then 𝛽 is a bilinear functional on 𝐅 π‘š , 𝑛 Γ— 𝐅 𝑛 , π‘š . 9.70 dimension of the vector space of bilinear functionals dim ℬ (𝑉 , π‘Š) = ( dim 𝑉)( dim π‘Š) . Proof Let 𝑒 1 , … , 𝑒 π‘š be a basis of 𝑉 and 𝑓 1 , … , 𝑓 𝑛 be a basis of π‘Š . For a bilinear functional 𝛽 ∈ ℬ (𝑉 , π‘Š) , let β„³ (𝛽) be the π‘š -by- 𝑛 matrix whose entry in row 𝑗 , column π‘˜ is 𝛽(𝑒 𝑗 , 𝑓 π‘˜ ) . The map 𝛽 ↦ β„³ (𝛽) is a linear map of ℬ (𝑉 , π‘Š) into 𝐅 π‘š , 𝑛 . For a matrix 𝐢 ∈ 𝐅 π‘š , 𝑛 , define a bilinear functional 𝛽 𝐢 on 𝑉× π‘Š by 𝛽 𝐢 (π‘Ž 1 𝑒 1 + β‹― + π‘Ž π‘š 𝑒 π‘š , 𝑏 1 𝑓 1 + β‹― + 𝑏 𝑛 𝑓 𝑛 ) = 𝑛 βˆ‘ π‘˜=1 π‘š βˆ‘ 𝑗 = 1 𝐢 𝑗 , π‘˜ π‘Ž 𝑗 𝑏 π‘˜ for π‘Ž 1 , … , π‘Ž π‘š , 𝑏 1 , … , 𝑏 𝑛 ∈ 𝐅 . The linear map 𝛽 ↦ β„³ (𝛽) from ℬ (𝑉 , π‘Š) to 𝐅 π‘š , 𝑛 and the linear map 𝐢 ↦ 𝛽 𝐢 from 𝐅 π‘š , 𝑛 to ℬ (𝑉 , π‘Š) are inverses of each other because 𝛽 β„³ (𝛽) = 𝛽 for all 𝛽 ∈ ℬ (𝑉 , π‘Š) and β„³ (𝛽 𝐢 ) = 𝐢 for all 𝐢 ∈ 𝐅 π‘š , 𝑛 , as you should verify. Thus both maps are isomorphisms and the two spaces that they connect have the same dimension. Hence dim ℬ (𝑉 , π‘Š) = dim 𝐅 π‘š , 𝑛 = π‘šπ‘› = ( dim 𝑉)( dim π‘Š) . Several different definitions of π‘‰βŠ— π‘Š appear in the mathematical literature. These definitions are equivalent to each other, at least in the finite-dimensional context, because any two vector spaces of the same dimension are isomorphic. The result above states that ℬ (𝑉 , π‘Š) has the dimension that we seek, as do β„’ (𝑉 , π‘Š) and 𝐅 dim 𝑉 , dim π‘Š . Thus it may be tempting to define π‘‰βŠ— π‘Š to be ℬ (𝑉 , π‘Š) or β„’ (𝑉 , π‘Š) or 𝐅 dim 𝑉 , dim π‘Š . However, none of those definitions would lead to a basis-free definition of 𝑣 βŠ— 𝑀 for 𝑣 ∈ 𝑉 and 𝑀 ∈ π‘Š . The following definition, while it may seem a bit strange and abstract at first, has the huge advantage that it defines 𝑣 βŠ— 𝑀 in a basis-free fashion. We define π‘‰βŠ— π‘Š to be the vector space of bilinear functionals on 𝑉 β€² Γ— π‘Š β€² instead of the more tempting choice of the vector space of bilinear functionals on 𝑉× π‘Š . Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 385 Spans: True Boxes: True Text: 372 Chapter 9 Multilinear Algebra and Determinants 9.71 definition: tensor product, π‘‰βŠ— π‘Š , 𝑣 βŠ— 𝑀 β€’ The tensor product π‘‰βŠ— π‘Š is defined to be ℬ (𝑉 β€² , π‘Š β€² ) . β€’ For 𝑣 ∈ 𝑉 and 𝑀 ∈ π‘Š , the tensor product 𝑣 βŠ— 𝑀 is the element of π‘‰βŠ— π‘Š defined by (𝑣 βŠ— 𝑀)(πœ‘ , 𝜏) = πœ‘(𝑣)𝜏(𝑀) for all (πœ‘ , 𝜏) ∈ 𝑉 β€² Γ— π‘Š β€² . 
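In 𝐅^π‘š everything in this definition can be made concrete by identifying a functional πœ‘ ∈ (𝐅^π‘š)β€² with the vector π‘Ž for which πœ‘(𝑣) = π‘Ž β‹… 𝑣 . A sketch of definition 9.71 in those terms, assuming NumPy (the helper name tensor is ours, not the text's):

import numpy as np

def tensor(v, w):
    # v βŠ— w as a bilinear functional on V' Γ— W' (definition 9.71),
    # with a functional represented by its coefficient vector
    return lambda a, b: (a @ v) * (b @ w)

v, w = np.array([1., 2., 3.]), np.array([4., 5.])

# evaluating v βŠ— w on the dual standard bases recovers the numbers v_j w_k
e, f = np.eye(3), np.eye(2)
vals = np.array([[tensor(v, w)(e[j], f[k]) for k in range(2)]
                 for j in range(3)])
assert np.allclose(vals, np.outer(v, w))

# the desired distributive property, checked pointwise:
# (v1 + v2) βŠ— w = v1 βŠ— w + v2 βŠ— w
v1, v2 = np.array([1., 1., 0.]), np.array([0., 1., 2.])
phi, tau = np.array([1., 0., -1.]), np.array([2., 1.])
assert np.isclose(tensor(v1 + v2, w)(phi, tau),
                  tensor(v1, w)(phi, tau) + tensor(v2, w)(phi, tau))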
We can quickly prove that the definition of π‘‰βŠ—π‘Š gives it the desired dimension. 9.72 dimension of the tensor product of two vector spaces dim (π‘‰βŠ— π‘Š) = ( dim 𝑉)( dim π‘Š) . Proof Because a vector space and its dual have the same dimension (by 3.111), we have dim 𝑉 β€² = dim 𝑉 and dim π‘Š β€² = dim π‘Š . Thus 9.70 tells us that the dimension of ℬ (𝑉 β€² , π‘Š β€² ) equals ( dim 𝑉)( dim π‘Š) . To understand the definition of the tensor product 𝑣 βŠ— 𝑀 of two vectors 𝑣 ∈ 𝑉 and 𝑀 ∈ π‘Š , focus on the kind of object it is. An element of π‘‰βŠ— π‘Š is a bilinear functional on 𝑉 β€² Γ— π‘Š β€² , and in particular it is a function from 𝑉 β€² Γ— π‘Š β€² to 𝐅 . Thus for each element of 𝑉 β€² Γ— π‘Š β€² , it should produce an element of 𝐅 . The definition above has this behavior, because 𝑣 βŠ— 𝑀 applied to a typical element (πœ‘ , 𝜏) of 𝑉 β€² Γ— π‘Š β€² produces the number πœ‘(𝑣)𝜏(𝑀) . The somewhat abstract nature of 𝑣 βŠ— 𝑀 should not matter. The important point is the behavior of these objects. The next result shows that tensor products of vectors have the desired bilinearity properties. 9.73 bilinearity of tensor product Suppose 𝑣 , 𝑣 1 , 𝑣 2 ∈ 𝑉 and 𝑀 , 𝑀 1 , 𝑀 2 ∈ π‘Š and πœ† ∈ 𝐅 . Then (𝑣 1 + 𝑣 2 ) βŠ— 𝑀 = 𝑣 1 βŠ— 𝑀 + 𝑣 2 βŠ— 𝑀 and 𝑣 βŠ— (𝑀 1 + 𝑀 2 ) = 𝑣 βŠ— 𝑀 1 + 𝑣 βŠ— 𝑀 2 and πœ†(𝑣 βŠ— 𝑀) = (πœ†π‘£) βŠ— 𝑀 = 𝑣 βŠ— (πœ†π‘€). Proof Suppose (πœ‘ , 𝜏) ∈ 𝑉 β€² Γ— π‘Š β€² . Then ((𝑣 1 + 𝑣 2 ) βŠ— 𝑀)(πœ‘ , 𝜏) = πœ‘(𝑣 1 + 𝑣 2 )𝜏(𝑀) = πœ‘(𝑣 1 )𝜏(𝑀) + πœ‘(𝑣 2 )𝜏(𝑀) = (𝑣 1 βŠ— 𝑀)(πœ‘ , 𝜏) + (𝑣 2 βŠ— 𝑀)(πœ‘ , 𝜏) = (𝑣 1 βŠ— 𝑀 + 𝑣 2 βŠ— 𝑀)(πœ‘ , 𝜏). Thus (𝑣 1 + 𝑣 2 ) βŠ— 𝑀 = 𝑣 1 βŠ— 𝑀 + 𝑣 2 βŠ— 𝑀 . The other two equalities are proved similarly. Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 386 Spans: True Boxes: True Text: Section 9D Tensor Products 373 Lists are, by definition, ordered. The order matters when, for example, we form the matrix of an operator with respect to a basis. For lists in this section with two indices, such as {𝑒 𝑗 βŠ— 𝑓 π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 in the next result, the ordering does not matter and we do not specify itβ€”just choose any convenient ordering. The linear independence of elements of 𝑉 βŠ— π‘Š in (a) of the result below captures the idea that there are no relationships among vectors in π‘‰βŠ— π‘Š other than the relationships that come from bilinearity of the tensor product (see 9.73) and the relationships that may be present due to linear dependence of a list of vectors in 𝑉 or a list of vectors in π‘Š . 9.74 basis of π‘‰βŠ— π‘Š Suppose 𝑒 1 , … , 𝑒 π‘š is a list of vectors in 𝑉 and 𝑓 1 , … , 𝑓 𝑛 is a list of vectors in π‘Š . (a) If 𝑒 1 , … , 𝑒 π‘š and 𝑓 1 , … , 𝑓 𝑛 are both linearly independent lists, then {𝑒 𝑗 βŠ— 𝑓 π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 is a linearly independent list in π‘‰βŠ— π‘Š . (b) If 𝑒 1 , … , 𝑒 π‘š is a basis of 𝑉 and 𝑓 1 , … , 𝑓 𝑛 is a basis of π‘Š , then the list {𝑒 𝑗 βŠ— 𝑓 π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 is a basis of π‘‰βŠ— π‘Š . Proof To prove (a), suppose 𝑒 1 , … , 𝑒 π‘š and 𝑓 1 , … , 𝑓 𝑛 are both linearly independent lists. This linear independence and the linear map lemma (3.4) imply that there exist πœ‘ 1 , … , πœ‘ π‘š ∈ 𝑉 β€² and 𝜏 1 , … , 𝜏 𝑛 ∈ π‘Š β€² such that πœ‘ 𝑗 (𝑒 π‘˜ ) = ⎧{ ⎨ { ⎩ 1 if 𝑗 = π‘˜ , 0 if 𝑗 β‰  π‘˜ and 𝜏 𝑗 ( 𝑓 π‘˜ ) = ⎧{ ⎨ { ⎩ 1 if 𝑗 = π‘˜ , 0 if 𝑗 β‰  π‘˜ , where 𝑗 , π‘˜ ∈ {1 , … , π‘š} in the first equation and 𝑗 , π‘˜ ∈ {1 , … , 𝑛} in the second equation. 
Suppose {π‘Ž 𝑗 , π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 is a list of scalars such that 9.75 𝑛 βˆ‘ π‘˜=1 π‘š βˆ‘ 𝑗 = 1 π‘Ž 𝑗 , π‘˜ (𝑒 𝑗 βŠ— 𝑓 π‘˜ ) = 0. Note that (𝑒 𝑗 βŠ— 𝑓 π‘˜ )(πœ‘ 𝑀 , 𝜏 𝑁 ) equals 1 if 𝑗 = 𝑀 and π‘˜ = 𝑁 , and equals 0 otherwise. Thus applying both sides of 9.75 to (πœ‘ 𝑀 , 𝜏 𝑁 ) shows that π‘Ž 𝑀 , 𝑁 = 0 , proving that {𝑒 𝑗 βŠ— 𝑓 π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 is linearly independent. Now (b) follows from (a), the equation dim π‘‰βŠ— π‘Š = ( dim 𝑉)( dim π‘Š) [see 9.72], and the result that a linearly independent list of the right length is a basis (see 2.38). Every element of π‘‰βŠ— π‘Š is a finite sum of elements of the form 𝑣 βŠ— 𝑀 , where 𝑣 ∈ 𝑉 and 𝑀 ∈ π‘Š , as implied by (b) in the result above. However, if dim 𝑉 > 1 and dim π‘Š > 1 , then Exercise 4 shows that {𝑣 βŠ— 𝑀 ∢ (𝑣 , 𝑀) ∈ 𝑉× π‘Š} β‰  π‘‰βŠ— π‘Š. Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 387 Spans: True Boxes: True Text: 374 Chapter 9 Multilinear Algebra and Determinants 9.76 example: tensor product of element of 𝐅 π‘š with element of 𝐅 𝑛 Suppose π‘š and 𝑛 are positive integers. Let 𝑒 1 , … , 𝑒 π‘š denote the standard basis of 𝐅 π‘š and let 𝑓 1 , … , 𝑓 𝑛 denote the standard basis of 𝐅 𝑛 . Suppose 𝑣 = (𝑣 1 , … , 𝑣 π‘š ) ∈ 𝐅 π‘š and 𝑀 = (𝑀 1 , … , 𝑀 𝑛 ) ∈ 𝐅 𝑛 . Then 𝑣 βŠ— 𝑀 = ( π‘š βˆ‘ 𝑗 = 1 𝑣 𝑗 𝑒 𝑗 ) βŠ— ( 𝑛 βˆ‘ π‘˜=1 𝑀 π‘˜ 𝑓 π‘˜ ) = 𝑛 βˆ‘ π‘˜=1 π‘š βˆ‘ 𝑗 = 1 (𝑣 𝑗 𝑀 π‘˜ )(𝑒 𝑗 βŠ— 𝑓 π‘˜ ). Thus with respect to the basis {𝑒 𝑗 βŠ— 𝑓 π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 of 𝐅 π‘š βŠ— 𝐅 𝑛 provided by 9.74(b), the coefficients of 𝑣 βŠ— 𝑀 are the numbers {𝑣 𝑗 𝑀 π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 . If instead of writing these numbers in a list, we write them in an π‘š -by- 𝑛 matrix with 𝑣 𝑗 𝑀 π‘˜ in row 𝑗 , column π‘˜ , then we can identify 𝑣 βŠ— 𝑀 with the π‘š -by- 𝑛 matrix βŽ›βŽœβŽœβŽœβŽœβŽœβŽœβŽ 𝑣 1 𝑀 1 β‹― 𝑣 1 𝑀 𝑛 β‹± 𝑣 π‘š 𝑀 1 β‹― 𝑣 π‘š 𝑀 𝑛 ⎞⎟⎟⎟⎟⎟⎟⎠. See Exercises 5 and 6 for practice in using the identification from the example above. We now define bilinear maps, which differ from bilinear functionals in that the target space can be an arbitrary vector space rather than just the scalar field. 9.77 definition: bilinear map A bilinear map from 𝑉× π‘Š to a vector space π‘ˆ is a function Ξ“ ∢ 𝑉× π‘Š β†’ π‘ˆ such that 𝑣 ↦ Ξ“(𝑣 , 𝑀) is a linear map from 𝑉 to π‘ˆ for each 𝑀 ∈ π‘Š and 𝑀 ↦ Ξ“(𝑣 , 𝑀) is a linear map from π‘Š to π‘ˆ for each 𝑣 ∈ 𝑉 . 9.78 example: bilinear maps β€’ Every bilinear functional on 𝑉× π‘Š is a bilinear map from 𝑉× π‘Š to 𝐅 . β€’ The function Ξ“ ∢ 𝑉× π‘Š β†’ π‘‰βŠ— π‘Š defined by Ξ“(𝑣 , 𝑀) = 𝑣 βŠ— 𝑀 is a bilinear map from 𝑉× π‘Š to π‘‰βŠ— π‘Š (by 9.73). β€’ The function Ξ“ ∢ β„’ (𝑉) Γ— β„’ (𝑉) β†’ β„’ (𝑉) defined by Ξ“(𝑆 , 𝑇) = 𝑆𝑇 is a bilinear map from β„’ (𝑉) Γ— β„’ (𝑉) to β„’ (𝑉) . β€’ The function Ξ“ ∢ 𝑉× β„’ (𝑉 , π‘Š) β†’ π‘Š defined by Ξ“(𝑣 , 𝑇) = 𝑇𝑣 is a bilinear map from 𝑉× β„’ (𝑉 , π‘Š) to π‘Š . Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 388 Spans: True Boxes: True Text: Section 9D Tensor Products 375 Tensor products allow us to convert bilinear maps on π‘‰Γ—π‘Š into linear maps on π‘‰βŠ—π‘Š (and vice versa), as shown by the next result. In the mathematical literature, (a) of the result below is called the β€œuniversal property” of tensor products. 9.79 converting bilinear maps to linear maps Suppose π‘ˆ is a vector space. (a) Suppose Ξ“ ∢ 𝑉 Γ— π‘Š β†’ π‘ˆ is a bilinear map. 
Then there exists a unique linear map Μ‚Ξ“ ∢ π‘‰βŠ— π‘Š β†’ π‘ˆ such that

Μ‚Ξ“(𝑣 βŠ— 𝑀) = Ξ“(𝑣 , 𝑀)

for all (𝑣 , 𝑀) ∈ 𝑉× π‘Š .

(b) Conversely, suppose 𝑇 ∢ π‘‰βŠ— π‘Š β†’ π‘ˆ is a linear map. Then there exists a unique bilinear map 𝑇^# ∢ 𝑉× π‘Š β†’ π‘ˆ such that

𝑇^#(𝑣 , 𝑀) = 𝑇(𝑣 βŠ— 𝑀)

for all (𝑣 , 𝑀) ∈ 𝑉× π‘Š .

Proof Let 𝑒 1 , … , 𝑒 π‘š be a basis of 𝑉 and let 𝑓 1 , … , 𝑓 𝑛 be a basis of π‘Š . By the linear map lemma (3.4) and 9.74(b), there exists a unique linear map Μ‚Ξ“ ∢ π‘‰βŠ— π‘Š β†’ π‘ˆ such that

Μ‚Ξ“(𝑒 𝑗 βŠ— 𝑓 π‘˜ ) = Ξ“(𝑒 𝑗 , 𝑓 π‘˜ )

for all 𝑗 ∈ {1 , … , π‘š} and π‘˜ ∈ {1 , … , 𝑛} . Now suppose (𝑣 , 𝑀) ∈ 𝑉× π‘Š . There exist π‘Ž 1 , … , π‘Ž π‘š , 𝑏 1 , … , 𝑏 𝑛 ∈ 𝐅 such that 𝑣 = π‘Ž 1 𝑒 1 + β‹― + π‘Ž π‘š 𝑒 π‘š and 𝑀 = 𝑏 1 𝑓 1 + β‹― + 𝑏 𝑛 𝑓 𝑛 . Thus

Μ‚Ξ“(𝑣 βŠ— 𝑀) = Μ‚Ξ“( βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{π‘š} (π‘Ž 𝑗 𝑏 π‘˜ )(𝑒 𝑗 βŠ— 𝑓 π‘˜ )) = βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{π‘š} π‘Ž 𝑗 𝑏 π‘˜ Μ‚Ξ“(𝑒 𝑗 βŠ— 𝑓 π‘˜ ) = βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{π‘š} π‘Ž 𝑗 𝑏 π‘˜ Ξ“(𝑒 𝑗 , 𝑓 π‘˜ ) = Ξ“(𝑣 , 𝑀),

as desired, where the second equality holds because Μ‚Ξ“ is linear, the third holds by the definition of Μ‚Ξ“ , and the fourth holds because Ξ“ is bilinear. The uniqueness of the linear map Μ‚Ξ“ satisfying Μ‚Ξ“(𝑣 βŠ— 𝑀) = Ξ“(𝑣 , 𝑀) follows from 9.74(b), completing the proof of (a).

To prove (b), define a function 𝑇^# ∢ π‘‰Γ—π‘Š β†’ π‘ˆ by 𝑇^#(𝑣 , 𝑀) = 𝑇(π‘£βŠ—π‘€) for all (𝑣 , 𝑀) ∈ 𝑉× π‘Š . The bilinearity of the tensor product (see 9.73) and the linearity of 𝑇 imply that 𝑇^# is bilinear. Clearly the choice of 𝑇^# that satisfies the conditions is unique.

To prove 9.79(a), we could not just define Μ‚Ξ“(𝑣 βŠ— 𝑀) = Ξ“(𝑣 , 𝑀) for all 𝑣 ∈ 𝑉 and 𝑀 ∈ π‘Š (and then extend Μ‚Ξ“ linearly to all of 𝑉 βŠ— π‘Š ) because elements of π‘‰βŠ— π‘Š do not have unique representations as finite sums of elements of the form 𝑣 βŠ— 𝑀 . Our proof used a basis of 𝑉 and a basis of π‘Š to get around this problem.

Although our construction of Μ‚Ξ“ in the proof of 9.79(a) depended on a basis of 𝑉 and a basis of π‘Š , the equation Μ‚Ξ“(𝑣 βŠ— 𝑀) = Ξ“(𝑣 , 𝑀) that holds for all 𝑣 ∈ 𝑉 and 𝑀 ∈ π‘Š shows that Μ‚Ξ“ does not depend on the choice of bases for 𝑉 and π‘Š .

Tensor Product of Inner Product Spaces

The result below features three inner productsβ€”one on π‘‰βŠ— π‘Š , one on 𝑉 , and one on π‘Š , although we use the same symbol βŸ¨β‹… , β‹…βŸ© for all three inner products.

9.80 inner product on tensor product of two inner product spaces

Suppose 𝑉 and π‘Š are inner product spaces. Then there is a unique inner product on π‘‰βŠ— π‘Š such that

βŸ¨π‘£ βŠ— 𝑀 , 𝑒 βŠ— π‘₯⟩ = βŸ¨π‘£ , π‘’βŸ©βŸ¨π‘€ , π‘₯⟩

for all 𝑣 , 𝑒 ∈ 𝑉 and 𝑀 , π‘₯ ∈ π‘Š .

Proof Suppose 𝑒 1 , … , 𝑒 π‘š is an orthonormal basis of 𝑉 and 𝑓 1 , … , 𝑓 𝑛 is an orthonormal basis of π‘Š . Define an inner product on π‘‰βŠ— π‘Š by

9.81  ⟨ βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{π‘š} 𝑏 𝑗 , π‘˜ 𝑒 𝑗 βŠ— 𝑓 π‘˜ , βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{π‘š} 𝑐 𝑗 , π‘˜ 𝑒 𝑗 βŠ— 𝑓 π‘˜ ⟩ = βˆ‘_{π‘˜=1}^{𝑛} βˆ‘_{𝑗=1}^{π‘š} 𝑏 𝑗 , π‘˜ \overline{𝑐 𝑗 , π‘˜ }.

The straightforward verification that 9.81 defines an inner product on π‘‰βŠ— π‘Š is left to the reader [use 9.74(b)].

Suppose that 𝑣 , 𝑒 ∈ 𝑉 and 𝑀 , π‘₯ ∈ π‘Š . Let 𝑣 1 , … , 𝑣 π‘š ∈ 𝐅 be such that 𝑣 = 𝑣 1 𝑒 1 + β‹― + 𝑣 π‘š 𝑒 π‘š , with similar expressions for 𝑒 , 𝑀 , and π‘₯ .
Then βŸ¨π‘£ βŠ— 𝑀 , 𝑒 βŠ— π‘₯⟩ = ⟨ π‘š βˆ‘ 𝑗 = 1 𝑣 𝑗 𝑒 𝑗 βŠ— 𝑛 βˆ‘ π‘˜=1 𝑀 π‘˜ 𝑓 π‘˜ , π‘š βˆ‘ 𝑗 = 1 𝑒 𝑗 𝑒 𝑗 βŠ— 𝑛 βˆ‘ π‘˜=1 π‘₯ π‘˜ 𝑓 π‘˜ ⟩ = ⟨ 𝑛 βˆ‘ π‘˜=1 π‘š βˆ‘ 𝑗 = 1 𝑣 𝑗 𝑀 π‘˜ 𝑒 𝑗 βŠ— 𝑓 π‘˜ , 𝑛 βˆ‘ π‘˜=1 π‘š βˆ‘ 𝑗 = 1 𝑒 𝑗 π‘₯ π‘˜ 𝑒 𝑗 βŠ— 𝑓 π‘˜ ⟩ = 𝑛 βˆ‘ π‘˜=1 π‘š βˆ‘ 𝑗 = 1 𝑣 𝑗 𝑒 𝑗 𝑀 π‘˜ π‘₯ π‘˜ = ( π‘š βˆ‘ 𝑗 = 1 𝑣 𝑗 𝑒 𝑗 )( 𝑛 βˆ‘ π‘˜=1 𝑀 π‘˜ π‘₯ π‘˜ ) = βŸ¨π‘£ , π‘’βŸ©βŸ¨π‘€ , π‘₯⟩. There is only one inner product on π‘‰βŠ—π‘Š such that βŸ¨π‘£βŠ—π‘€ , π‘’βŠ—π‘₯⟩ = βŸ¨π‘£ , π‘’βŸ©βŸ¨π‘€ , π‘₯⟩ for all 𝑣 , 𝑒 ∈ 𝑉 and 𝑀 , π‘₯ ∈ π‘Š because every element of π‘‰βŠ— π‘Š can be written as a linear combination of elements of the form 𝑣 βŠ— 𝑀 [ by 9.74(b) ] . Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 390 Spans: True Boxes: True Text: Section 9D Tensor Products 377 The definition below of a natural inner product on π‘‰βŠ— π‘Š is now justified by 9.80. We could not have simply defined βŸ¨π‘£ βŠ— 𝑀 , 𝑒 βŠ— π‘₯⟩ to be βŸ¨π‘£ , π‘’βŸ©βŸ¨π‘€ , π‘₯⟩ (and then used additivity in each slot separately to extend the definition to π‘‰βŠ— π‘Š ) without some proof because elements of π‘‰βŠ— π‘Š do not have unique representations as finite sums of elements of the form 𝑣 βŠ— 𝑀 . 9.82 definition: inner product on tensor product of two inner product spaces Suppose 𝑉 and π‘Š are inner product spaces. The inner product on π‘‰βŠ— π‘Š is the unique function βŸ¨β‹… , β‹…βŸ© from (π‘‰βŠ— π‘Š) Γ— (π‘‰βŠ— π‘Š) to 𝐅 such that βŸ¨π‘£ βŠ— 𝑀 , 𝑒 βŠ— π‘₯⟩ = βŸ¨π‘£ , π‘’βŸ©βŸ¨π‘€ , π‘₯⟩ for all 𝑣 , 𝑒 ∈ 𝑉 and 𝑀 , π‘₯ ∈ π‘Š . Take 𝑒 = 𝑣 and π‘₯ = 𝑀 in the equation above and then take square roots to show that ‖𝑣 βŠ— 𝑀‖ = ‖𝑣‖ ‖𝑀‖ for all 𝑣 ∈ 𝑉 and all 𝑀 ∈ π‘Š . The construction of the inner product in the proof of 9.80 depended on an orthonormal basis 𝑒 1 , … , 𝑒 π‘š of 𝑉 and an orthonormal basis 𝑓 1 , … , 𝑓 𝑛 of π‘Š . Formula 9.81 implies that {𝑒 𝑗 βŠ— 𝑓 π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 is a doubly indexed orthonormal list in π‘‰βŠ— π‘Š and hence is an orthonormal basis of π‘‰βŠ— π‘Š [ by 9.74(b) ] . The importance of the next result arises because the orthonormal bases used there can be different from the orthonormal bases used to define the inner product in 9.80. Although the notation for the bases is the same in the proof of 9.80 and in the result below, think of them as two different sets of orthonormal bases. 9.83 orthonormal basis of π‘‰βŠ— π‘Š Suppose 𝑉 and π‘Š are inner product spaces, and 𝑒 1 , … , 𝑒 π‘š is an orthonormal basis of 𝑉 and 𝑓 1 , … , 𝑓 𝑛 is an orthonormal basis of π‘Š . Then {𝑒 𝑗 βŠ— 𝑓 π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 is an orthonormal basis of π‘‰βŠ— π‘Š . Proof We know that {𝑒 𝑗 βŠ— 𝑓 π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 is a basis of π‘‰βŠ— π‘Š [ by 9.74(b) ] . Thus we only need to verify orthonormality. To do this, suppose 𝑗 , 𝑀 ∈ {1 , … , π‘š} and π‘˜ , 𝑁 ∈ {1 , … , 𝑛} . Then βŸ¨π‘’ 𝑗 βŠ— 𝑓 π‘˜ , 𝑒 𝑁 βŠ— 𝑓 𝑀 ⟩ = βŸ¨π‘’ 𝑗 , 𝑒 𝑁 ⟩⟨ 𝑓 π‘˜ , 𝑓 𝑀 ⟩ = ⎧{ ⎨{⎩ 1 if 𝑗 = 𝑁 and π‘˜ = 𝑀 , 0 otherwise . Hence the doubly indexed list {𝑒 𝑗 βŠ— 𝑓 π‘˜ } 𝑗=1 , … , π‘š ; π‘˜=1 , … , 𝑛 is indeed an orthonormal basis of π‘‰βŠ— π‘Š . See Exercise 11 for an example of how the inner product structure on π‘‰βŠ— π‘Š interacts with operators on 𝑉 and π‘Š . 
Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 391 Spans: True Boxes: True Text: 378 Chapter 9 Multilinear Algebra and Determinants Tensor Product of Multiple Vector Spaces We have been discussing properties of the tensor product of two finite-dimensional vector spaces. Now we turn our attention to the tensor product of multiple finite- dimensional vector spaces. This generalization requires no new ideas, only some slightly more complicated notation. Readers with a good understanding of the tensor product of two vector spaces should be able to make the extension to the tensor product of more than two vector spaces. Thus in this subsection, no proofs will be provided. The definitions and the statements of results that will be provided should be enough information to enable readers to fill in the details, using what has already been learned about the tensor product of two vector spaces. We begin with the following notational assumption. 9.84 notation: 𝑉 1 , … , 𝑉 π‘š For the rest of this subsection, π‘š denotes an integer greater than 1 and 𝑉 1 , … , 𝑉 π‘š denote finite-dimensional vector spaces. The notion of an π‘š -linear functional, which we are about to define, generalizes the notion of a bilinear functional (see 9.68). Recall that the use of the word β€œfunctional” indicates that we are mapping into the scalar field 𝐅 . Recall also that the terminology β€œ π‘š -linear form” is used in the special case 𝑉 1 = β‹― = 𝑉 π‘š (see 9.25). The notation ℬ (𝑉 1 , … , 𝑉 π‘š ) generalizes our previous notation ℬ (𝑉 , π‘Š) . 9.85 definition: π‘š -linear functional, the vector space ℬ (𝑉 1 , … , 𝑉 π‘š ) β€’ An π‘š - linear functional on 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š is a function 𝛽 ∢ 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š β†’ 𝐅 that is a linear functional in each slot when the other slots are held fixed. β€’ The vector space of π‘š -linear functionals on 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š is denoted by ℬ (𝑉 1 , … , 𝑉 π‘š ) . 9.86 example: π‘š -linear functional Suppose πœ‘ π‘˜ ∈ (𝑉 π‘˜ ) β€² for each π‘˜ ∈ {1 , … , π‘š} . Define 𝛽 ∢ 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š β†’ 𝐅 by 𝛽(𝑣 1 , … , 𝑣 π‘š ) = πœ‘ 1 (𝑣 1 ) Γ— β‹― Γ— πœ‘ π‘š (𝑣 π‘š ). Then 𝛽 is an π‘š -linear functional on 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š . The next result can be proved by imitating the proof of 9.70. 9.87 dimension of the vector space of π‘š -linear functionals dim ℬ (𝑉 1 , … , 𝑉 π‘š ) = ( dim 𝑉 1 ) Γ— β‹― Γ— ( dim 𝑉 π‘š ) . Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 392 Spans: True Boxes: True Text: Section 9D Tensor Products 379 Now we can define the tensor product of multiple vector spaces and the tensor product of elements of those vector spaces. The following definition is completely analogous to our previous definition (9.71) in the case π‘š = 2 . 9.88 definition: tensor product, 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š , 𝑣 1 βŠ— β‹― βŠ— 𝑣 π‘š β€’ The tensor product 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š is defined to be ℬ (𝑉 1 β€² , … , 𝑉 π‘š β€² ) . β€’ For 𝑣 1 ∈ 𝑉 1 , … , 𝑣 π‘š ∈ 𝑉 π‘š , the tensor product 𝑣 1 βŠ— β‹― βŠ— 𝑣 π‘š is the element of 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š defined by (𝑣 1 βŠ— β‹― βŠ— 𝑣 π‘š )(πœ‘ 1 , … , πœ‘ π‘š ) = πœ‘ 1 (𝑣 1 )β‹―πœ‘ π‘š (𝑣 π‘š ) for all (πœ‘ 1 … , πœ‘ π‘š ) ∈ 𝑉 1β€² Γ— β‹― Γ— 𝑉 π‘šβ€² . The next result can be proved by following the pattern of the proof of the analogous result when π‘š = 2 (see 9.72). 9.89 dimension of the tensor product dim (𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š ) = ( dim 𝑉 1 )β‹―( dim 𝑉 π‘š ) . Our next result generalizes 9.74. 
9.90 basis of 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š

Suppose dim 𝑉 π‘˜ = 𝑛 π‘˜ and 𝑒 π‘˜1 , … , 𝑒 π‘˜π‘› π‘˜ is a basis of 𝑉 π‘˜ for π‘˜ = 1 , … , π‘š . Then

{𝑒 1𝑗 1 βŠ— β‹― βŠ— 𝑒 π‘šπ‘— π‘š } 𝑗 1 =1 , … , 𝑛 1 ; β‹― ; 𝑗 π‘š =1 , … , 𝑛 π‘š

is a basis of 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š .

Suppose π‘š = 2 and 𝑒 11 , … , 𝑒 1𝑛 1 is a basis of 𝑉 1 and 𝑒 21 , … , 𝑒 2𝑛 2 is a basis of 𝑉 2 . Then with respect to the basis {𝑒 1𝑗 1 βŠ— 𝑒 2𝑗 2 } 𝑗 1 =1 , … , 𝑛 1 ; 𝑗 2 =1 , … , 𝑛 2 in the result above, the coefficients of an element of 𝑉 1 βŠ—π‘‰ 2 can be represented by an 𝑛 1 -by- 𝑛 2 matrix that contains the coefficient of 𝑒 1𝑗 1 βŠ— 𝑒 2𝑗 2 in row 𝑗 1 , column 𝑗 2 . Thus we need a matrix, which is an array specified by two indices, to represent an element of 𝑉 1 βŠ— 𝑉 2 . If π‘š > 2 , then the result above shows that we need an array specified by π‘š indices to represent an arbitrary element of 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š . Thus tensor products may appear when we deal with objects specified by arrays with multiple indices.

The next definition generalizes the notion of a bilinear map (see 9.77). As with bilinear maps, the target space can be an arbitrary vector space.

9.91 definition: π‘š -linear map

An π‘š -linear map from 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š to a vector space π‘ˆ is a function Ξ“ ∢ 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š β†’ π‘ˆ that is a linear map in each slot when the other slots are held fixed.

The next result can be proved by following the pattern of the proof of 9.79.

9.92 converting π‘š -linear maps to linear maps

Suppose π‘ˆ is a vector space.

(a) Suppose that Ξ“ ∢ 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š β†’ π‘ˆ is an π‘š -linear map. Then there exists a unique linear map Μ‚Ξ“ ∢ 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š β†’ π‘ˆ such that

Μ‚Ξ“(𝑣 1 βŠ— β‹― βŠ— 𝑣 π‘š ) = Ξ“(𝑣 1 , … , 𝑣 π‘š )

for all (𝑣 1 , … , 𝑣 π‘š ) ∈ 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š .

(b) Conversely, suppose 𝑇 ∢ 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š β†’ π‘ˆ is a linear map. Then there exists a unique π‘š -linear map 𝑇^# ∢ 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š β†’ π‘ˆ such that

𝑇^#(𝑣 1 , … , 𝑣 π‘š ) = 𝑇(𝑣 1 βŠ— β‹― βŠ— 𝑣 π‘š )

for all (𝑣 1 , … , 𝑣 π‘š ) ∈ 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š .

See Exercises 12 and 13 for tensor products of multiple inner product spaces.

Exercises 9D

1 Suppose 𝑣 ∈ 𝑉 and 𝑀 ∈ π‘Š . Prove that 𝑣 βŠ— 𝑀 = 0 if and only if 𝑣 = 0 or 𝑀 = 0 .

2 Give an example of six distinct vectors 𝑣 1 , 𝑣 2 , 𝑣 3 , 𝑀 1 , 𝑀 2 , 𝑀 3 in 𝐑 3 such that 𝑣 1 βŠ— 𝑀 1 + 𝑣 2 βŠ— 𝑀 2 + 𝑣 3 βŠ— 𝑀 3 = 0 but none of 𝑣 1 βŠ— 𝑀 1 , 𝑣 2 βŠ— 𝑀 2 , 𝑣 3 βŠ— 𝑀 3 is a scalar multiple of another element of this list.

3 Suppose that 𝑣 1 , … , 𝑣 π‘š is a linearly independent list in 𝑉 . Suppose also that 𝑀 1 , … , 𝑀 π‘š is a list in π‘Š such that 𝑣 1 βŠ— 𝑀 1 + β‹― + 𝑣 π‘š βŠ— 𝑀 π‘š = 0. Prove that 𝑀 1 = β‹― = 𝑀 π‘š = 0 .

4 Suppose dim 𝑉 > 1 and dim π‘Š > 1 . Prove that {𝑣 βŠ— 𝑀 ∢ (𝑣 , 𝑀) ∈ 𝑉× π‘Š} is not a subspace of π‘‰βŠ— π‘Š . This exercise implies that if dim 𝑉 > 1 and dim π‘Š > 1 , then {𝑣 βŠ— 𝑀 ∢ (𝑣 , 𝑀) ∈ 𝑉× π‘Š} β‰  π‘‰βŠ— π‘Š.

5 Suppose π‘š and 𝑛 are positive integers. For 𝑣 ∈ 𝐅 π‘š and 𝑀 ∈ 𝐅 𝑛 , identify 𝑣 βŠ— 𝑀 with an π‘š -by- 𝑛 matrix as in Example 9.76.
With that identification, show that the set {𝑣 βŠ— 𝑀 ∢ 𝑣 ∈ 𝐅 π‘š and 𝑀 ∈ 𝐅 𝑛 } is the set of π‘š -by- 𝑛 matrices (with entries in 𝐅 ) that have rank at most one. 6 Suppose π‘š and 𝑛 are positive integers. Give a description, analogous to Exercise 5, of the set of π‘š -by- 𝑛 matrices (with entries in 𝐅 ) that have rank at most two. 7 Suppose dim 𝑉 > 2 and dim π‘Š > 2 . Prove that {𝑣 1 βŠ— 𝑀 1 + 𝑣 2 βŠ— 𝑀 2 ∢ 𝑣 1 , 𝑣 2 ∈ 𝑉 and 𝑀 1 , 𝑀 2 ∈ π‘Š} β‰  π‘‰βŠ— π‘Š. 8 Suppose 𝑣 1 , … , 𝑣 π‘š ∈ 𝑉 and 𝑀 1 , … , 𝑀 π‘š ∈ π‘Š are such that 𝑣 1 βŠ— 𝑀 1 + β‹― + 𝑣 π‘š βŠ— 𝑀 π‘š = 0. Suppose that π‘ˆ is a vector space and Ξ“ ∢ 𝑉× π‘Š β†’ π‘ˆ is a bilinear map. Show that Ξ“(𝑣 1 , 𝑀 1 ) + β‹― + Ξ“(𝑣 π‘š , 𝑀 π‘š ) = 0. 9 Suppose 𝑆 ∈ β„’ (𝑉) and 𝑇 ∈ β„’ (π‘Š) . Prove that there exists a unique operator on π‘‰βŠ— π‘Š that takes 𝑣 βŠ— 𝑀 to 𝑆𝑣 βŠ— 𝑇𝑀 for all 𝑣 ∈ 𝑉 and 𝑀 ∈ π‘Š . In an abuse of notation, the operator on π‘‰βŠ— π‘Š given by this exercise is often called 𝑆 βŠ— 𝑇 . 10 Suppose 𝑆 ∈ β„’ (𝑉) and 𝑇 ∈ β„’ (π‘Š) . Prove that π‘†βŠ—π‘‡ is an invertible operator on π‘‰βŠ— π‘Š if and only if both 𝑆 and 𝑇 are invertible operators. Also, prove that if both 𝑆 and 𝑇 are invertible operators, then (𝑆 βŠ— 𝑇) βˆ’1 = 𝑆 βˆ’1 βŠ— 𝑇 βˆ’1 , where we are using the notation from the comment after Exercise 9. 11 Suppose 𝑉 and π‘Š are inner product spaces. Prove that if 𝑆 ∈ β„’ (𝑉) and 𝑇 ∈ β„’ (π‘Š) , then (𝑆 βŠ— 𝑇) βˆ— = 𝑆 βˆ— βŠ— 𝑇 βˆ— , where we are using the notation from the comment after Exercise 9. 12 Suppose that 𝑉 1 , … , 𝑉 π‘š are finite-dimensional inner product spaces. Prove that there is a unique inner product on 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š such that βŸ¨π‘£ 1 βŠ— β‹― βŠ— 𝑣 π‘š , 𝑒 1 βŠ— β‹― βŠ— 𝑒 π‘š ⟩ = βŸ¨π‘£ 1 , 𝑒 1 βŸ©β‹―βŸ¨π‘£ π‘š , 𝑒 π‘š ⟩ for all (𝑣 1 , … , 𝑣 π‘š ) and (𝑒 1 , … , 𝑒 π‘š ) in 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š . Note that the equation above implies that ‖𝑣 1 βŠ— β‹― βŠ— 𝑣 π‘š β€– = ‖𝑣 1 β€– Γ— β‹― Γ— ‖𝑣 π‘š β€– for all (𝑣 1 , … , 𝑣 π‘š ) ∈ 𝑉 1 Γ— β‹― Γ— 𝑉 π‘š . Linear Algebra Done Right , fourth edition, by Sheldon Axler Annotated Entity: ID: 395 Spans: True Boxes: True Text: 382 Chapter 9 Multilinear Algebra and Determinants 13 Suppose that 𝑉 1 , … , 𝑉 π‘š are finite-dimensional inner product spaces and 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š is made into an inner product space using the inner product from Exercise 12. Suppose 𝑒 π‘˜1 , … , 𝑒 π‘˜π‘› π‘˜ is an orthonormal basis of 𝑉 π‘˜ for each π‘˜ = 1 , … , π‘š . Show that the list {𝑒 1𝑗 1 βŠ— β‹― βŠ— 𝑒 π‘šπ‘— π‘š } 𝑗 1 =1 , … , 𝑛 1 ; β‹― ; 𝑗 π‘š =1 , … , 𝑛 π‘š is an orthonormal basis of 𝑉 1 βŠ— β‹― βŠ— 𝑉 π‘š . Linear Algebra Done Right , fourth edition, by Sheldon Axler