Graduate Texts in Mathematics
Linear Algebra
Volume 23 1975
Springer
Graduate Texts in Mathematics 23
Editorial Board: F. W. Gehring P. R. Halmos (Managing Editor) C. C. Moore
Werner Greub
Linear Algebra Fourth Edition
Springer-Verlag
New York Heidelberg Berlin
Werner Greub University of Toronto Department of Mathematics Toronto M5S 1A1 Canada
Managing Editor P. R. Halmos Indiana University Department of Mathematics Swain Hall East Bloomington, Indiana 47401
Editors
F. W. Gehring, University of Michigan, Department of Mathematics, Ann Arbor, Michigan 48104
C. C. Moore, University of California at Berkeley, Department of Mathematics, Berkeley, California 94720
AMS Subject Classifications: 15-01, 15A03, 15A06, 15A18, 15A21, 16-01

Library of Congress Cataloging in Publication Data
Greub, Werner Hildbert, 1925—
Linear algebra. (Graduate texts in mathematics; v. 23)
Bibliography: p. 445
1. Algebras, Linear. I. Title. II. Series.
QA184.G7313 1974 512'.5 74-13868

All rights reserved. No part of this book may be translated or reproduced in any form without written permission from Springer-Verlag.
© 1975 by Springer-Verlag New York Inc.
Softcover reprint of the hardcover 4th edition 1975
ISBN 978-1-4684-9448-8
ISBN 978-1-4684-9446-4 (eBook)
DOI 10.1007/978-1-4684-9446-4
To Rolf Nevanlinna
Preface to the fourth edition This textbook gives a detailed and comprehensive presentation of linear algebra based on an axiomatic treatment of linear spaces. For this fourth edition some new material has been added to the text, for instance, the intrinsic treatment of the classical adjoint of a linear transformation in Chapter IV, as well as the discussion of quaternions and the classification of associative division algebras in Chapter VII. Chapters XII and
XIII have been substantially rewritten for the sake of clarity, but the contents remain basically the same as before. Finally, a number of problems covering new topics — e.g. complex structures, Cayley numbers and symplectic spaces — have been added.
I should like to thank Mr. M. L. Johnson who made many useful suggestions for the problems in the third edition. I am also grateful to my colleague S. Halperin who assisted in the revision of Chapters XII and XIII and to Mr. F. Gomez who helped to prepare the subject index. Finally, I have to express my deep gratitude to my colleague J. R. Vanstone who worked closely with me in the preparation of all the revisions and additions and who generously helped with the proof reading.
Toronto, February 1975
WERNER H. GREUB
Preface to the third edition The major change between the second and third edition is the separation
of linear and multilinear algebra into two different volumes as well as the incorporation of a great deal of new material. However, the essential
character of the book remains the same; in other words, the entire presentation continues to be based on an axiomatic treatment of vector spaces.
In this first volume the restriction to finite dimensional vector spaces has been eliminated except for those results which do not hold in the infinite dimensional case. The restriction of the coefficient field to the real and complex numbers has also been removed and except for chapters VII to XI, § 5 of chapter I and § 8, chapter IV we allow any coefficient
field of characteristic zero. In fact, many of the theorems are valid for modules over a commutative ring. Finally, a large number of problems of different degree of difficulty has been added.
Chapter I deals with the general properties of a vector space. The topology of a real vector space of finite dimension is axiomatically characterized in an additional paragraph. In chapter II the sections on exact sequences, direct decompositions and duality have been greatly expanded. Oriented vector spaces have been incorporated into chapter IV and so chapter V of the second edition has disappeared. Chapters V (algebras) and VI (gradations and homology)
are completely new and introduce the reader to the basic concepts associated with these fields. The second volume will depend heavily on some of the material developed in these two chapters.
Chapters X (Inner product spaces), XI (Linear mappings of inner product spaces), XII (Symmetric bilinear functions), XIII (Quadrics) and XIV (Unitary spaces) of the second edition have been renumbered but remain otherwise essentially unchanged. Chapter XII (Polynomial algebra) is again completely new and develops all the standard material about polynomials in one indeterminate. Most of this is applied in chapter XIII (Theory of a linear transformation). This last chapter is a very much expanded version of chapter XV of the
second edition. Of particular importance is the generalization of the
results in the second edition to vector spaces over an arbitrary coefficient field of characteristic zero. This has been accomplished without reversion
to the cumbersome calculations of the first edition. Furthermore the concept of a semisimple transformation is introduced and treated in some depth.
One additional change has been made: some of the paragraphs or sections have been starred. The rest of the book can be read without reference to this material.
Last but certainly not least, I have to express my sincerest thanks to everyone who has helped in the preparation of this edition. First of all I am particularly indebted to Mr. S. HALPERIN who made a great number of valuable suggestions for improvements. Large parts of the book, in particular chapters XII and XIII are his own work. My warm thanks also go to Mr. L. YONKER, Mr. G. PEDERZOLI and Mr. J. SCHERK who did the proofreading. Furthermore I am grateful to Mrs. V. PEDERZOLI
and to Miss M. PETTINGER for their assistance in the preparation of the
manuscript. Finally I would like to express my thanks to professor K. BLEULER for providing an agreeable milieu in which to work and to the publishers for their patience and cooperation.
Toronto, December 1966
WERNER H. GREUB
Preface to the second edition Besides the very obvious change from German to English, the second
edition of this book contains many additions as well as a great many other changes. It might even be called a new book altogether were it not
for the fact that the essential character of the book has remained the same; in other words, the entire presentation continues to be based on an axiomatic treatment of linear spaces. In this second edition, the thorough-going restriction to linear spaces of finite dimension has been removed. Another complete change is the
restriction to linear spaces with real or complex coefficients, thereby removing a number of relatively involved discussions which did not really contribute substantially to the subject. On p. 6 there is a list of those chapters in which the presentation can be transferred directly to spaces over an arbitrary coefficient field.
Chapter I deals with the general properties of a linear space. Those concepts which are only valid for finitely many dimensions are discussed in a special paragraph.
Chapter II now covers only linear transformations while the treatment of matrices has been delegated to a new chapter, chapter III. The discussion of dual spaces has been changed; dual spaces are now introduced abstractly and the connection with the space of linear functions is not established until later. Chapters IV and V, dealing with determinants and orientation respectively, do not contain substantial changes. Brief reference should
be made here to the new paragraph in chapter IV on the trace of an endomorphism — a concept which is used quite consistently throughout the book from that time on. Special emphasis is given to tensors. The original chapter on Multilinear Algebra is now spread over four chapters: Multilinear Mappings (Ch. VI), Tensor Algebra (Ch. VII), Exterior Algebra (Ch. VIII) and
Duality in Exterior Algebra (Ch. IX). The chapter on multilinear mappings consists now primarily of an introduction to the theory of the tensor-product. In chapter VII the notion of vector-valued tensors has
been introduced and used to define the contraction. Furthermore, a
treatment of the transformation of tensors under linear mappings has been added. In Chapter VIII the antisymmetry-operator is studied in greater detail and the concept of the skew-symmetric power is introduced. The dual product (Ch. IX) is generalized to mixed tensors. A special paragraph in this chapter covers the skew-symmetric powers of the unit tensor and shows their significance in the characteristic polynomial. The paragraph "Adjoint Tensors" provides a number of applications of the duality theory to certain tensors arising from an endomorphism of the underlying space.
There are no essential changes in Chapter X (Inner product spaces) except for the addition of a short new paragraph on normed linear spaces.
In the next chapter, on linear mappings of inner product spaces, the orthogonal projections (§ 3) and the skew mappings (§ 4) are discussed in greater detail. Furthermore, a paragraph on differentiable families of automorphisms has been added here. Chapter XII (Symmetric Bilinear Functions) contains a new paragraph dealing with Lorentz-transformations. Whereas the discussion of quadrics in the first edition was limited to quadrics with centers, the second edition covers this topic in full. The chapter on unitary spaces has been changed to include a more thorough-going presentation of unitary transformations of the complex plane and their relation to the algebra of quaternions. The restriction to linear spaces with complex or real coefficients has of course greatly simplified the construction of irreducible subspaces in chapter XV. Another essential simplification of this construction was achieved by the simultaneous consideration of the dual mapping. A final paragraph with applications to Lorentz-transformations has been added to this concluding chapter. Many other minor changes have been incorporated — not least of which are the many additional problems now accompanying each paragraph.
Last, but certainly not least, I have to express my sincerest thanks to everyone who has helped me in the preparation of this second edition. First of all, I am particularly indebted to CORNELIE J. RHEINBOLDT
who assisted in the entire translating and editing work and to Dr. WERNER C. RHEINBOLDT who cooperated in this task and who also made a number of valuable suggestions for improvements, especially in the chapters on linear transformations and matrices. My warm thanks
also go to Dr. H. BOLDER of the Royal Dutch/Shell Laboratory at Amsterdam for his criticism on the chapter on tensor-products and to Dr. H. H. KELLER who read the entire manuscript and offered many
important suggestions. Furthermore, I am grateful to Mr. GIORGIO PEDERZOLI who helped to read the proofs of the entire work and who collected a number of new problems and to Mr. KHADJA NESAMUDDIN KHAN for his assistance in preparing the manuscript.
Finally I would like to express my thanks to the publishers for their patience and cooperation during the preparation of this edition.
Toronto, April 1963
WERNER H. GREUB
Contents

Chapter 0. Prerequisites

Chapter I. Vector spaces
§ 1. Vector spaces
§ 2. Linear mappings
§ 3. Subspaces and factor spaces
§ 4. Dimension
§ 5. The topology of a real finite dimensional vector space

Chapter II. Linear mappings
§ 1. Basic properties
§ 2. Operations with linear mappings
§ 3. Linear isomorphisms
§ 4. Direct sum of vector spaces
§ 5. Dual vector spaces
§ 6. Finite dimensional vector spaces

Chapter III. Matrices
§ 1. Matrices and systems of linear equations
§ 2. Multiplication of matrices
§ 3. Basis transformation
§ 4. Elementary transformations

Chapter IV. Determinants
§ 1. Determinant functions
§ 2. The determinant of a linear transformation
§ 3. The determinant of a matrix
§ 4. Dual determinant functions
§ 5. The adjoint matrix
§ 6. The characteristic polynomial
§ 7. The trace
§ 8. Oriented vector spaces

Chapter V. Algebras
§ 1. Basic properties
§ 2. Ideals
§ 3. Change of coefficient field of a vector space

Chapter VI. Gradations and homology
§ 1. G-graded vector spaces
§ 2. G-graded algebras
§ 3. Differential spaces and differential algebras

Chapter VII. Inner product spaces
§ 1. The inner product
§ 2. Orthonormal bases
§ 3. Normed determinant functions
§ 4. Duality in an inner product space
§ 5. Normed vector spaces
§ 6. The algebra of quaternions

Chapter VIII. Linear mappings of inner product spaces
§ 1. The adjoint mapping
§ 2. Selfadjoint mappings
§ 3. Orthogonal projections
§ 4. Skew mappings
§ 5. Isometric mappings
§ 6. Rotations of Euclidean spaces of dimension 2, 3 and 4
§ 7. Differentiable families of linear automorphisms

Chapter IX. Symmetric bilinear functions
§ 1. Bilinear and quadratic functions
§ 2. The decomposition of E
§ 3. Pairs of symmetric bilinear functions
§ 4. Pseudo-Euclidean spaces
§ 5. Linear mappings of Pseudo-Euclidean spaces

Chapter X. Quadrics
§ 1. Affine spaces
§ 2. Quadrics in the affine space
§ 3. Affine equivalence of quadrics
§ 4. Quadrics in the Euclidean space

Chapter XI. Unitary spaces
§ 1. Hermitian functions
§ 2. Unitary spaces
§ 3. Linear mappings of unitary spaces
§ 4. Unitary mappings of the complex plane
§ 5. Application to Lorentz-transformations

Chapter XII. Polynomial algebra
§ 1. Basic properties
§ 2. Ideals and divisibility
§ 3. Factor algebras
§ 4. The structure of factor algebras

Chapter XIII. Theory of a linear transformation
§ 1. Polynomials in a linear transformation
§ 2. Generalized eigenspaces
§ 3. Cyclic spaces
§ 4. Irreducible spaces
§ 5. Application of cyclic spaces
§ 6. Nilpotent and semisimple transformations
§ 7. Applications to inner product spaces

Bibliography

Subject Index
Interdependence of Chapters (chart relating the chapters): Vector spaces; Linear mappings; Matrices; Determinants; Algebras; Gradations and homology; Inner product spaces; Linear mappings of inner product spaces; Symmetric bilinear functions; Unitary spaces; Polynomial algebra; Quadrics; Theory of a linear transformation.
Chapter 0
Prerequisites

0.1. Sets. The reader is expected to be familiar with naive set theory up to the level of the first half of [11]. In general we shall adopt the notations and definitions of that book; however, we make two exceptions. First, the word function will in this book have a very restricted meaning, and what Halmos calls a function, we shall call a mapping or a set mapping. Second, we follow Bourbaki and call mappings that are one-to-one (onto, one-to-one and onto) injective (surjective, bijective).
0.2. Topology. Except for § 5, chap. I, § 8, chap. IV and parts of chapters VII to IX we make no use at all of topology. For these parts of the book the reader should be familiar with elementary point set topology as found in the first part of [16].
0.3. Groups. A group is a set G, together with a binary law of composition
μ: G × G → G
which satisfies the following axioms (μ(x, y) will be denoted by xy):
1. Associativity: (xy)z = x(yz)
2. Identity: There exists an element e, called the identity, such that
xe = ex = x.
3. To each element x ∈ G corresponds a second element x⁻¹ such that
x x⁻¹ = x⁻¹ x = e.
The identity element of a group is uniquely determined and each element has a unique inverse. We also have the relation
(xy)⁻¹ = y⁻¹ x⁻¹.
As an example consider the set Sₙ of all permutations of the set {1, ..., n} and define the product of two permutations σ, τ by
(στ)i = σ(τi),   i = 1, ..., n.
In this way Sₙ becomes a group, called the group of permutations of n objects. The identity element of Sₙ is the identity permutation.
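As a concrete illustration of this product, let n = 3 and let σ, τ be the permutations with σ1 = 2, σ2 = 1, σ3 = 3 and τ1 = 1, τ2 = 3, τ3 = 2. Then (στ)1 = σ(τ1) = 2, (στ)2 = σ(τ2) = 3, (στ)3 = σ(τ3) = 1, while (τσ)1 = 3, (τσ)2 = 1, (τσ)3 = 2. In particular στ ≠ τσ, so the group of permutations of three objects is not commutative.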
Let G and H be two groups. Then a mapping
φ: G → H
is called a homomorphism if
φ(xy) = φ(x) φ(y),   x, y ∈ G.
A homomorphism which is injective (resp. surjective, bijective) is called a monomorphism (resp. epimorphism, isomorphism). The inverse mapping of an isomorphism is clearly again an isomorphism. A subgroup H of a group G is a subset H such that with any two elements y ∈ H and z ∈ H the product yz is contained in H and the inverse of every element of H is again in H. Then the restriction of the composition law to the subset H × H makes H into a group. A group G is called commutative or abelian if for each x, y ∈ G
xy = yx.
In an abelian group one often writes x + y instead of xy and calls x + y the sum of x and y. Then the unit element is denoted by 0. As an example consider the set of integers and define addition in the usual way.
0.4. Factor groups of commutative groups.*) Let G be a commutative group and consider a subgroup H. Then H determines an equivalence relation in G given by
x ~ x'  if and only if  x − x' ∈ H.
The corresponding equivalence classes are the sets H + x and are called the cosets of H in G. Every element x ∈ G is contained in precisely one coset x̄. The set G/H of these cosets is called the factor set of G by H and the surjective mapping
π: G → G/H
defined by
πx = x̄,   x ∈ x̄,
is called the canonical projection of G onto G/H. The set G/H can be made into a group in precisely one way such that the canonical projection becomes a homomorphism; i.e.,
π(x + y) = πx + πy.   (0.1)
To define the addition in G/H let x̄ ∈ G/H, ȳ ∈ G/H be arbitrary and choose x ∈ G and y ∈ G such that
πx = x̄  and  πy = ȳ.

*) This concept can be generalized to non-commutative groups.
Then the element π(x + y) depends only on x̄ and ȳ. In fact, if x', y' are two other elements satisfying
πx' = x̄  and  πy' = ȳ
we have
x' − x ∈ H  and  y' − y ∈ H
whence
(x' + y') − (x + y) ∈ H
and so π(x' + y') = π(x + y). Hence, it makes sense to define the sum by
x̄ + ȳ = π(x + y).
It is easy to verify that the above sum satisfies the group axioms. Relation (0.1) is an immediate consequence of the definition of the sum in G/H. Finally, since π is a surjective map, the addition in G/H is uniquely determined by (0.1). The group G/H is called the factor group of G with respect to the subgroup H. Its unit element is the set H.
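For instance, let G be the additive group of the integers and let H be the subgroup consisting of the even integers. There are precisely two cosets, H and H + 1, and in the factor group G/H we have (H + 1) + (H + 1) = H; thus G/H is a group with two elements whose unit element is H.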
0.5. Fields. A field is a set F on which two binary laws of composition, called respectively addition and multiplication, are defined such that
1. F is a commutative group with respect to the addition.
2. The set F − {0} is a commutative group with respect to the multiplication.
3. Addition and multiplication are connected by the distributive law,
x(y + z) = xy + xz.
The rational numbers ℚ, the real numbers ℝ and the complex numbers ℂ are fields with respect to the usual operations, as will be assumed without proof. A homomorphism φ: F → F' between two fields is a mapping that preserves addition and multiplication. A subset A ⊂ F of a field which is closed under addition, multiplication and the taking of inverses is called a subfield. If A is a subfield of F, F is called an extension field of A. Given a field F we define for every positive integer k the element ke (e the unit element of F) by
ke = e + ⋯ + e   (k terms).
The field F is said to have characteristic zero if ke ≠ 0 for every positive integer k. If F has characteristic zero it follows that ke ≠ k'e whenever k ≠ k'. Hence, a field of characteristic zero is an infinite set. Throughout this book it will be assumed without explicit mention that all fields are of characteristic zero.
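For instance, the field ℚ of rational numbers has characteristic zero, since here ke = k ≠ 0 for every positive integer k; the fields ℝ and ℂ are extension fields of ℚ and therefore have characteristic zero as well.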
For more details on groups and fields the reader is referred to [29].
0.6. Partial order. Let 𝒜 be a set and assume that for some pairs X, Y (X ∈ 𝒜, Y ∈ 𝒜) a relation, denoted by X ≤ Y, is defined which satisfies the following conditions:
(i) X ≤ X for every X ∈ 𝒜 (Reflexivity)
(ii) If X ≤ Y and Y ≤ X, then X = Y (Antisymmetry)
(iii) If X ≤ Y and Y ≤ Z, then X ≤ Z (Transitivity).
Then ≤ is called a partial order in 𝒜.
A homomorphism of partially ordered sets is a map φ: 𝒜 → ℬ such that φX ≤ φY whenever X ≤ Y.
Clearly a subset of a partially ordered set is again partially ordered.
Let 𝒜 be a partially ordered set and suppose A ∈ 𝒜 is an element such that the relation A ≤ X implies that A = X. Then A is called a maximal element of 𝒜. A partially ordered set need not have a maximal element. A partially ordered set is called linearly ordered or a chain if for every pair X, Y either X ≤ Y or Y ≤ X. Let 𝒜₁ be a subset of the partially ordered set 𝒜. Then an element A ∈ 𝒜 is called an upper bound for 𝒜₁ if X ≤ A for every X ∈ 𝒜₁. In this book we shall assume the following axiom:
A partially ordered set in which every chain has an upper bound contains a maximal element.
This axiom is known as Zorn's lemma, and is equivalent to the axiom of choice (cf. [11]).
0.7. Lattices. Let 𝒜 be a partially ordered set and let 𝒜₁ ⊂ 𝒜 be a subset. An element A ∈ 𝒜 is called a least upper bound (l.u.b.) for 𝒜₁ if
1) A is an upper bound for 𝒜₁.
2) If X is any upper bound, then A ≤ X.
It follows from (ii) that if a l.u.b. for 𝒜₁ exists, then it is unique. In a similar way, lower bounds and the greatest lower bound (g.l.b.) for a subset of 𝒜 are defined. A partially ordered set 𝒜 is called a lattice, if for any two elements X, Y the subset {X, Y} has a l.u.b. and a g.l.b. They are denoted by X ∨ Y and X ∧ Y. It is easily checked that any finite subset (X₁, ..., Xᵣ) of a lattice has a l.u.b. and a g.l.b. They are denoted by ⋁ᵢ Xᵢ and ⋀ᵢ Xᵢ.
As an example of a lattice, consider the collection of subsets of a given set, X, ordered by inclusion. If U, V are any two subsets, then
U ∧ V = U ∩ V  and  U ∨ V = U ∪ V.
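Another example, anticipating sec. 1.13, is the collection of subspaces of a vector space E, ordered by inclusion; for two subspaces E₁, E₂ one has E₁ ∧ E₂ = E₁ ∩ E₂ and E₁ ∨ E₂ = E₁ + E₂.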
Chapter I
Vector Spaces

§ 1. Vector spaces
1.1. Definition. A vector (linear) space, E, over the field F is a set of elements x, y, ... called vectors with the following algebraic structure:
I. E is an additive group; that is, there is a fixed mapping E × E → E denoted by
(x, y) → x + y   (1.1)
and satisfying the following axioms:
I.1. (x + y) + z = x + (y + z) (associative law)
I.2. x + y = y + x (commutative law)
I.3. There exists a zero-vector 0; i.e., a vector such that x + 0 = 0 + x = x for every x ∈ E.
I.4. To every vector x there is a vector −x such that x + (−x) = 0.
II. There is a fixed mapping F × E → E denoted by
(λ, x) → λx   (1.2)
and satisfying the axioms:
II.1. (λμ)x = λ(μx) (associative law)
II.2. (λ + μ)x = λx + μx,  λ(x + y) = λx + λy (distributive laws)
II.3. 1·x = x (1 the unit element of F).
(The reader should note that in the left hand side of the first distributive law, + denotes the addition in F while in the right hand side, + denotes the addition in E. In the sequel, the name addition and the symbol + will continue to be used for both operations, but it will always be clear from the context which one is meant.) F is called the coefficient field of the vector space E, and the elements of F are called scalars. Thus the mapping
(1.2) defines a multiplication of vectors by scalars, and so it is called scalar multiplication.
If the coefficient field F is the field ℝ of real numbers (the field ℂ of complex numbers), then E is called a real (complex) vector space. For the rest of this paragraph all vector spaces are defined over a fixed, but arbitrarily chosen field F of characteristic 0. If {x₁, ..., xₙ} is a finite family of vectors in E, the sum x₁ + ⋯ + xₙ will often be denoted by Σᵢ xᵢ.
Now we shall establish some elementary properties of vector spaces. It follows from an easy induction argument on n that the distributive laws hold for any finite number of terms,
(Σᵢ λⁱ) x = Σᵢ λⁱ x  and  λ Σᵢ xᵢ = Σᵢ λ xᵢ.
Proposition I: The equation
λx = 0
holds if and only if
λ = 0 or x = 0.
Proof: Substitution of μ = 0 in the first distributive law yields
λx = λx + 0x
whence 0x = 0. Similarly, the second distributive law shows that
λ0 = 0.
Conversely, suppose that λx = 0 and assume that λ ≠ 0. Then the associative law II.1 gives that
1·x = (λ⁻¹λ)x = λ⁻¹(λx) = λ⁻¹0 = 0
and hence axiom II.3 implies that x = 0.
The first distributive law gives for μ = −λ
λx + (−λ)x = (λ − λ)x = 0x = 0
whence
(−λ)x = −λx.
In the same way the formula λ(−x) = −λx is proved.
1.2. Examples. 1. Consider the set Fⁿ of n-tuples
x = (ξ¹, ..., ξⁿ),   ξⁱ ∈ F
and define addition and scalar multiplication by
(ξ¹, ..., ξⁿ) + (η¹, ..., ηⁿ) = (ξ¹ + η¹, ..., ξⁿ + ηⁿ)
and
λ(ξ¹, ..., ξⁿ) = (λξ¹, ..., λξⁿ).
Then the associativity and commutativity of addition follows at once from the associativity and commutativity of addition in F. The zero vector is the n-tuple (0, ..., 0) and the inverse of (ξ¹, ..., ξⁿ) is the n-tuple (−ξ¹, ..., −ξⁿ). Consequently, addition as defined above makes the set Fⁿ into an additive group. The scalar multiplication satisfies II.1, II.2, and II.3, as is equally easily checked, and so these two operations make Fⁿ into a vector space. This vector space is called the n-space over F. In particular, F is a vector space over itself in which scalar multiplication coincides with the field multiplication.
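For instance, in the 3-space F³ we have (1, 2, 0) + (0, 1, 5) = (1, 3, 5), λ(1, 2, 0) = (λ, 2λ, 0), and (−1)(1, 2, 0) = (−1, −2, 0) is the additive inverse of (1, 2, 0).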
2. Let C be the set of all continuous real-valued functions, f, in the interval I: 0 ≤ t ≤ 1,
f: I → ℝ.
If f, g are two continuous functions, then the function f + g defined by
(f + g)(t) = f(t) + g(t)
is again continuous. Moreover, for any real number λ, the function λf defined by
(λf)(t) = λ·f(t)
is continuous as well. It is clear that the mappings
(f, g) → f + g  and  (λ, f) → λf
satisfy the systems of axioms I. and II. and so C becomes a real vector space. The zero vector is the function 0 defined by
0(t) = 0,   t ∈ I
and the vector −f is the function given by
(−f)(t) = −f(t).
Instead of the continuous functions we could equally well have considered the set of k-times differentiable functions, or the set of analytic functions.
3. Let X be an arbitrary set and E be a vector space. Consider all mappings f: X → E and define the sum of two mappings f and g as the mapping
(f + g)(x) = f(x) + g(x),   x ∈ X
and the mapping λf by
(λf)(x) = λ f(x),   x ∈ X.
Under these operations the set of all mappings f: X → E becomes a vector space, which will be denoted by (X; E). The zero vector of (X; E) is the function f defined by f(x) = 0, x ∈ X.
1.3. Linear combinations. Suppose E is a vector space and x₁, ..., xᵣ are vectors in E. Then a vector x ∈ E is called a linear combination of the vectors xᵢ if it can be written in the form
x = Σᵢ λⁱ xᵢ,   λⁱ ∈ F.
More generally, let {x_α}, α ∈ A, be any family of vectors. Then a vector x ∈ E is called a linear combination of the vectors x_α if there is a family of scalars {λ^α}, only finitely many different from zero, such that
x = Σ λ^α x_α
where the summation is extended over those α for which λ^α ≠ 0. We shall simply write
x = Σ_{α∈A} λ^α x_α
and it is to be understood that only finitely many λ^α are different from zero. In particular, by setting λ^α = 0 for each α we obtain that the 0-vector is a linear combination of every family. It is clear from the definition that if x is a linear combination of the family {x_α} then x is a linear combination of a finite subfamily. Suppose now that x is a linear combination of vectors x_α, α ∈ A,
x = Σ_{α∈A} λ^α x_α
and assume further that each x_α is a linear combination of vectors y_β, β ∈ B_α,
x_α = Σ_{β∈B_α} μ_α^β y_β,   μ_α^β ∈ F.
Then the second distributive law yields
x = Σ_α λ^α x_α = Σ_α Σ_β λ^α μ_α^β y_β;
hence x is a linear combination of the vectors y_β. A subset S ⊂ E is called a system of generators for E if every vector x ∈ E is a linear combination of vectors of S. The whole space E is clearly a system of generators. Now suppose that S is a system of generators for E and that every vector of S is a linear combination of vectors of a subset T ⊂ S. Then it follows from the above discussion that T is also a system of generators for E.
1.4. Linear dependence. Let {x_α}, α ∈ A, be a given family of vectors. Then a non-trivial linear combination of the vectors x_α is a linear combination
Σ λ^α x_α
where at least one scalar λ^α is different from zero. The family {x_α} is called linearly dependent if there exists a non-trivial linear combination of the x_α which is the zero vector; that is, if there exists a system of scalars λ^α such that
Σ_α λ^α x_α = 0   (1.3)
and at least one λ^α ≠ 0. It follows from the above definition that if a subfamily of the family {x_α} is linearly dependent, then so is the full family. An equation of the form (1.3) is called a non-trivial linear relation. A family consisting of one vector x is linearly dependent if and only if x = 0. In fact, the relation
1·0 = 0
shows that the zero vector is linearly dependent. Conversely, if the vector x is linearly dependent we have that λx = 0 where λ ≠ 0. Then Proposition I implies that x = 0. It follows from the above remarks that every family containing the zero vector is linearly dependent.
1.0=0 that the zero vector is linearly dependent. Conversely, if the vector x is linearly dependent we have that 2x=O where 2+0. Then Proposition I implies that x=0. It follows from the above remarks that every family containing the zero vector is linearly dependent. shows
is linearly dependent if and Proposition II: A family of vectors is a linear combination of the vectors x2, cx+fl. only if for some fleA, Proof: Suppose that for some fleA, xp
Chapter 1. Vector spaces
10
Then setting
—1
we obtain
=
o
and hence the vectors x2 are linearly dependent.
Conversely, assume that
=
0
and that
for some fleA. Then multiplying by view of II.! and 11.2 0= +
we obtain in
2*/i
()ji)_l)2x
xp=— a*f1
Corollary: Two vectors x, y are linearly dependent if and only ify=Ax (or x==)Ly) for some AcT. 1.5. Linear independence. A family of vectors (x2)26A is called linearly
independent if it is not linearly dependent; i.e., the vectors x_α are linearly independent if and only if the equation
Σ_α λ^α x_α = 0
implies that λ^α = 0 for each α. It is clear that every subfamily of a linearly independent family of vectors is again linearly independent. If (x_α) is a linearly independent family, then for any two distinct indices α, β ∈ A we have x_α ≠ x_β, and so the map α → x_α is injective.
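For instance, the vectors x₁ = (1, 0) and x₂ = (1, 1) in F² are linearly independent: the relation λ¹x₁ + λ²x₂ = 0 reads (λ¹ + λ², λ²) = (0, 0), whence λ² = 0 and then λ¹ = 0.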
Proposition III: A family (x_α), α ∈ A, of vectors is linearly independent if and only if every vector x can be written in at most one way as a linear combination of the x_α; i.e., if and only if for each linear combination
x = Σ_α λ^α x_α   (1.4)
the scalars λ^α are uniquely determined by x.
Proof: Suppose first that the scalars λ^α in (1.4) are uniquely determined by x. Then in particular for x = 0, the only scalars λ^α such that
Σ_α λ^α x_α = 0
are the scalars λ^α = 0. Hence, the vectors x_α are linearly independent. Conversely, suppose that the x_α are linearly independent and consider the relations
x = Σ_α λ^α x_α  and  x = Σ_α μ^α x_α.
Then
Σ_α (λ^α − μ^α) x_α = 0
whence in view of the linear independence of the x_α, λ^α − μ^α = 0; i.e., λ^α = μ^α.
1.6. Basis. A family of vectors x_α, α ∈ A,
in E is called a basis of E if it is simultaneously a system of generators and linearly independent. In view of Proposition III and the definition of a system of generators, we have that (x_α) is a basis if and only if every vector x ∈ E can be written in precisely one way as
x = Σ_α ξ^α x_α.
The scalars ξ^α are called the components of x with respect to the basis (x_α).
As an example, consider the n-space, Fⁿ, over F defined in Example 1, sec. 1.2. It is easily verified that the vectors
xᵢ = (0, ..., 0, 1, 0, ..., 0)   (1 in the i-th place),   i = 1, ..., n
form a basis for Fⁿ. We shall prove that every non-trivial vector space has a basis. For
the sake of simplicity we consider first vector spaces which admit a finite system of generators.
Proposition IV: (i) Every finitely generated non-trivial vector space has a finite basis.
(ii) Suppose that S = (x₁, ..., xₙ) is a finite system of generators of E and that the subset R ⊂ S given by R = (x₁, ..., xᵣ) (r ≤ n) consists of linearly independent vectors. Then there is a basis, T, of E such that
R ⊂ T ⊂ S.
Proof: (i) Let x₁, ..., xₙ be a minimal system of generators of E. Then the vectors x₁, ..., xₙ are linearly independent. In fact, assume a relation
Σ_ν λ^ν xᵥ = 0.
If λⁱ ≠ 0 for some i, it follows that
xᵢ = − Σ_{ν≠i} (λⁱ)⁻¹ λ^ν xᵥ   (1.5)
and so the vectors xᵥ (ν ≠ i) generate E. This contradicts the minimality of n.
(ii) We proceed by induction on n (n ≥ r). If n = r then there is nothing to prove. Assume now that the assertion is correct for n − 1. Consider the vector space, F, generated by the vectors x₁, ..., xₙ₋₁. Then by induction, F has a basis of the form
x₁, ..., xᵣ, y₁, ..., yₛ   where yⱼ ∈ S   (j = 1, ..., s).
Now consider the vector xₙ. If the vectors x₁, ..., xᵣ, y₁, ..., yₛ, xₙ are linearly independent, then they form a basis of E which has the desired property. Otherwise there is a non-trivial relation
Σ_ρ λ^ρ x_ρ + Σ_j μ^j y_j + ν xₙ = 0.
Since the vectors x₁, ..., xᵣ, y₁, ..., yₛ are linearly independent, it follows that ν ≠ 0. Thus
xₙ = − ν⁻¹ (Σ_ρ λ^ρ x_ρ + Σ_j μ^j y_j).
Hence the vectors x₁, ..., xᵣ, y₁, ..., yₛ generate E. Since they are linearly independent, they form a basis.
Now consider the general case.
Theorem I: Let E be a non-trivial vector space. Suppose S is a system of generators and that R is a family of linearly independent vectors in E such that R ⊂ S. Then there exists a basis, T, of E such that R ⊂ T ⊂ S.
Proof: Consider the collection 𝒜(R, S) of all subsets, X, of E such that
1) R ⊂ X ⊂ S
2) the vectors of X are linearly independent.
A partial order is defined in 𝒜(R, S) by inclusion (cf. sec. 0.6). We show that every chain, {Xᵢ}, in 𝒜(R, S) has an upper bound A. In fact, set A = ⋃ᵢ Xᵢ. We have to show that A ∈ 𝒜(R, S). Clearly, R ⊂ A ⊂ S. Now assume that
Σ_ν λ^ν xᵥ = 0,   xᵥ ∈ A.   (1.6)
Then, for each ν, xᵥ ∈ Xᵢ for some i. Since {Xᵢ} is a chain, we may assume that
xᵥ ∈ Xᵢ   (ν = 1, ..., n)   (1.7)
for one fixed i. Since the vectors of Xᵢ are linearly independent it follows that λ^ν = 0 (ν = 1, ..., n), whence A ∈ 𝒜(R, S). Now Zorn's lemma (cf. sec. 0.6) implies that there is a maximal element, T, in 𝒜(R, S). Then R ⊂ T ⊂ S and the vectors of T are linearly independent. To show that T is a system of generators, let x ∈ E be arbitrary. Then the vectors of T ∪ {x} are linearly dependent, because otherwise it would follow that {x} ∪ T ∈ 𝒜(R, S), which contradicts the maximality of T. Hence there is a non-trivial relation
λx + Σ_ν λ^ν xᵥ = 0,   xᵥ ∈ T.
Since the vectors of T are linearly independent, it follows that λ ≠ 0, whence
x = − Σ_ν λ⁻¹ λ^ν xᵥ.
Corollary I: Every system of generators of E contains a basis. In particular, every non-trivial vector space has a basis.
Corollary II: Every family of linearly independent vectors of E can be extended to a basis. 1.7. The free vector space over a set. Let X be an arbitrary set and such that f(x)+0 only for finitely many XEX. consider all maps f:
Denote the set of these maps by C(X). Then, if fe C(X), ge C(X) are scalars, 2f+ is again contained in C(X). As in example 3, sec. 1.2, we make C(X) into a vector space. For any aeX denote by the map given by and ).,
x=a
51
Then the vectors (aEX) form a basis of C(X). In fact, let fe C(X) be the (finitely many) distinct points be given and let a1 a, such that [(a) 0. Then we have
f where
= [(a)
i=1
(1 =
1
n)
Chapter I. Vector spaces
14
and so the element a relation
(aEX) generate C(X). On the other hand, assume
= 0. j=1
Then we have for each 1(1= ... ii) 1
0
1
=
... a). This shows that the vectors
(aeX) are
linearly independent and hence they form a basis of C(X). Now consider the inclusion map X—*C(X) given by
ix(a)=Ja
aEX.
This map clearly defines a bijection between X and the basis vectors
of C(X). If we identify each element aeX with the corresponding map then X becomes a basis of C(X). C(X) is called the free vector space over X or the vector space generated by X.
Problems 1. Show that axiom 11.3 can be replaced by the following one: The equation 2x=O holds only if 2=0 or x=0. 2. Given a system of linearly independent vectors (x1, ..., xe,), prove that the system (x1, .. i4j with arbitrary 2 is again linearly independent. 3. Show that the set of all solutions of the homogeneous linear differential equation d2y dt2
+
dy + qy = 0 dt
p and q are fixed functions of t, is a vector space. 4. Which of the following sets of functions are linearly dependent in the vector space of Example 2? where
a)f1=3t; f2=t+5; f3=2t2; b)f1=(t+1)2;f2=t2—1; f3=2t2+2t—3 f2=et; f3=e_t c) f1=1; d)f1=t2; e)
f1=1—t;
f2=t;
f3=1
f2=t(1—t); f3=1—t2.
§ 1. Vector spaces
5. Let E be a real linear space. Consider the set E x E of ordered pairs
(x, y) with xeE and yeE. Show that the set Ex E becomes a complex vector space under the operations: (x1,y1) + (x2,y2) =
(x1
+ x2,y1 + Y2)
and + i fJ)(x, y) =
x—
,8
y, y + J3 x)
/3 real numbers).
6. Which of the following sets of vectors in (a generating set, a basis)?
linearly independent,
a) (1, 1, 1, 1), (1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1) b) (1, 0, 0, 0), (2, 0, 0, 0) c) (17, 39, 25, 10), (13, 12, 99, 4), (16, 1,0,0) 0, 0, d) (1, 1, 0, 0), (0, 0, 1, 1), (0, 4, 4, 1),
Extend the linearly independent sets to bases. 7. Are the vectors x1 (1, 0, 1); x2 = (i, 1, 0), x3 = (1, 2, 1 +1) linearly independent in C3? Express x = (1, 2, 3) and y = (1, 1, 1) as linear combibinations of x1, x2, x3. is defined by a map 8. Recall that an n-tuple given by
(i=1...n). Show that the vector spaces C{l...n} and F" are equal. Show further that the basis J defined in sec. 1.7 coincides with the basis x defined in sec. 1 .6.
9. Let S be any set and consider the set of maps
f: S
F"
such thatf(x)=0 for all but finitely many xeS. in a manner similar to that of sec. 1.7, make this set into a vector space (denoted by C(S, F')). Construct a basis for this vector space. be a basis for a vector space E and consider a vector 10. Let
a= that for some fleA. again a basis for E. Suppose
Show that the vectors
form
Chapter I. Vector spaces
11. Prove the following exchange theorem of Steinitz: Let be a a system of linearly independent vectors. basis of E and Then it is possible to exchange certain p of the vectors by the vectors a such that the new system is again a basis of F. Hint: Use problem 10. 12. Consider the set of polynomial functionsf:
f (x) =
xi.
Make this set into a vector space as in Example 3, and construct a natural basis. § 2.
Linear mappings
In this paragraph, all vector spaces are defined over a fixed but arbitrarily chosen field F of characteristic zero.
1.8. Definition. Suppose that E and F are vector spaces, and let (p:
be a set mapping. Then 'p will be called a linear mapping if
'p(x+y)='px+'py x,yeE
(1.8)
and
)LEF,xEE
= 2çox
(1.9)
(Recall that condition (1.8) states that 'p is a homomorphism between abelian groups). If F=F then 'p is called a linear function in E. Conditions (1.8) and (1.9) are clearly equivalent to the condition =
'p
'p
and so a linear mapping is a mapping which preserves linear combinations. From (1.8) we obtain that for every linear mapping, 'p, 'p 0
whence 'p (0)
=
'p (0
+0) =
'p (0)
+ 'p (0)
0. Suppose now that
=0 is a linear relation among the vectors 'p x1
=
'p
)!
Then we have
=
'p0
=0
§ 2. Linear mappings
whence
= 0.
17
(1.11)
Conversely, assume that ço:E—*F is a set map such that (1.11) holds whenever (1.10) holds. Then for any x, yeE and )LEF set
u=x+y and v=2x. Since
u—x—y=0 and v—2x=0
it follows that x
(x + y) —
—
y= 0
and ç (2 x) —2 (p x =
0
and hence p is a linear mapping. This shows that linear mappings are precisely the set mappings which preserve linear relations. In particular, it follows that if x1 . ..v,. are linearly dependent, then so If X1 .• . are linearly independent, it does not, are the vectors çox1 however, follow that the vectors cOXi...cOXr are linearly independent. In fact, the zero mapping defined by (px=0,xEE,is clearly a linear mapping which maps every family of vectors into the linearly dependent set (0). A bijective linear mapping (p: E3F is called a linear isomorphism and will be denoted by p: Given a linear isomorphism (p: the set mapping (p 1: E+—F. It is easy to verify that p again satisfies the .
. . .
conditions (1.8) and (1.9) and so it is a linear mapping. p' is bijective and hence a linear isomorphism. It is called the inverse isomorphism of (p. Two vector spaces E and F are called isomorphic if there exists a linear
isomorphism from E onto F. is called a linear transformation of E. A A linear mapping (p bijective linear transformation will be called a linear automorphism of E. 1.9. Examples: 1. Let E=F" and define (p:E—+E by =
+
satisfies the conditions (1.8) and (1.9) and hence it is a linear transformation of E. 2. Given a set S and a vector space E consider the vector space (S; E) be the mapping given by defined in Example 3, sec. 1.2. Let (S;
Then
(pf=f(a) fe(S;E) where aES is a fixed element. Then (p is 2
Greub. Linear Atgebra
a linear mapping.
Chapter
18
1.
Vector spaces
where )LeT is a be the mapping defined by 3. Let fixed element. Then p is a linear transformation. In particular, the ideaWv map i:E—+E, ix=x, is a linear transformation. 1.10. Composition. Let p: E—*F and i/i: F—*G be two linear mappings. Then the composition of ço and /i0ço:E—*G
is defined by XEE. The relation
x) =
(/9
ço x1)
k/i
= p is a linear mapping of E into G. k/Jo p will often be denoted simply by k/1p. If p is a linear transformation in E, then we denote (/9 o (p by (p2. More generally, the linear transformation p ... p is denoted by
shows that k/Jo
i. A linear transformation, p, satisfying (p2 = i is called an involution in E. 1.11. Generators and basis. S—*F Proposition I: Suppose S is a system of generators for E and is a set map (Fa second vector space). Then Po can be extended in at most one way to a linear mapping We extend the definition to the case k=O by setting (p°=
F.
(p: E
A necessary and sufficient condition for the existence of such an extension is that 0
whenever
= 0.
Proof If (p X1
is
an extension of
we
have for each finite set of vectors
ES
=
=
Since the set S generates E it follows from this relation that is (/9 uniquely determined by (po Moreover, if
XES
§ 2. Linear mappings
it follows that =
=
=
cOO
=
0
and so condition (1.12) is necessary. Conversely, assume that (1.12) is satisfied. Then define
by (1.13)
To prove that
is
a well defined map assume that
Then =o
—
whence in view of (1.12)
0
—
and so
= The linearity of p follows immediately from the definition, and it is clear that p extends cOo.
Proposition II: Let be a basis of E and cOo: be a set can be extended in a unique way to a linear mapping map. Then (p:
F.
Proof. The uniqueness follows from proposition I. To prove the existence of p consider a relation = 0. Since the vectors
are linearly independent it follows that each
whence
= 0.
Now proposition I shows that
can be extended to a linear mapping
Corollary: Let S be a linearly independent subset of E and be a set map. Then cOo can be extended to a linear mapping
Chapter I. Vector spaces
20
Proof. Let T be a basis of F containing S (cf. sec. 1.6). Extend c°o in an T—+F. Then k/I0 may be extended to a linear arbitrary way to a set map mapping t/i:E—+F and it is clear that k/i extends
Now let be a surjective linear map, and suppose that S is a system of generators for E. Then the set = is a system of generators for F. In fact, since çü is surjective, every vector yEF can be written as = x
for some xeE. Since S generates E there are vectors such that x=
and scalars
whence
y=
=
This shows that every vector yEF is a linear combination of vectors in
'p (5) and hence 'p (5) is a system of generators for(5). 'p Next, suppose that p: E—÷ F is injective and that S is a linearly independent subset of E. Then 'p (5) is a linearly independent subset of F. in
fact the relation implies that
Since 'p
is
= 0,
= 0.
injective we obtain
=
0
whence, in view of the linear independence of the vectors 'p (5) is a linearly independent set. In particular, if çø: E—+F is a linear isomorphism and for E, then is a basis for F.
A! =0. Hence
is a basis
Proposition III: Let ço:E—+Fbe a linear mapping and be a basis of E. Then 'p is a linear isomorphism if and only if the vectors = form a basis for F. Proof. If 'p is a linear isomorphism then the vectors form a linearly independent system of generators for F. Hence they are a basis. Converse-
§ 2. Linear mappings
ly, assume that the vectors
21
form a basis of F. Then we have for every
yEF y= and so
is
=
surjective.
Now assume that jf Xci.
Then it follows that o=
—
= Since the vectors each and so
—
are linearly independent, we obtain that
for
1?
It follows that 'p is injective, and hence a linear isomorphism.
Problems 1. Consider the vector space of all real valued continuous functions defined in the interval a t b. Show that the mapping 'p given by (p:X(t)—* is
tX(t)
linear.
2. Which of the following mappings of F4 into itself are linear transformations?
-
a) b)
+
c)
+
+
Let E be a vector space over F, and letf1 . given by E. Show that the mapping p: 3.
.
be linear functions in
= (ft(x), ...,fr(X)) is
linear.
4. Suppose
is a linear map, and write 'p x = (f1 (x), .. , fr (x)).
Show
that the
are linear functions in E.
Chapter I. Vector spaces
22
5. The unirersal property of C(X). Let X be any set and consider the free vector space, C(X). generated by X (cf. sec. 1 .7).
(i) Show that if j: then there is a unique linear map p:
X into a vector space F such that (p0
where X—*C(X) is the inclusion map. (ii) Let Y be a set map. Show that there is a unique linear C(X)—*C(Y) such that the diagram map
tX1.
C(X)f'4 C(Y)
is a second set map prove the composition
commutes. If [3: formula
(fib
=
(iii) Let E be a vector space. Forget the linear structure of E and form
the space C(E). Show that there is a unique linear map mE:
C(E)—÷E
such that mE ° 1E
(iv) Let E and F be vector spaces and let E—*F be a map between the underlying sets. Show that is a linear map if and only if ltFO(p*_ (POThE (v) Denote by N(E) the subspace of C(E) generated by the elements of the form a, beE, /IEF — (cf.
part (iii)). Show that
kermE = N(E).
6.Let v=O
be a fixed polynomial and letf be any linear function in a vector space E. Define a function P(f): E-÷F by
P(f)x =
(x)".
Find necessary and sufficient conditions on P that P(f) function. § 3.
be
again a linear
Subspaces and factor spaces
In this paragraph, all vector spaces are defined over a fixed, but arbitrarily chosen field F of characteristic 0.
§ 3. Subspaces
and factor spaces
23
1.12. Subspaces. Let E be a vector space over the field F. A non-empty subset, E1, of E is called a subspace if for each x, yEE1 and every scalar
x+yEE1
(1.14)
and
Equivalently, a subspace is a subset of E such that Ax + /IyEE1
whenever x, yeE1. In particular, the whole space E and the subset (0) consisting of the zero vector only are subspaces. Every subspace E1 E contains the zero vector. In fact, if x1 e E1 is an arbitrary vector we have that 0= x1 — x1 E E1. A subspace E1 of E inherits the structure of a vector space from E. Now consider the injective map i: E1 —* E defined by
ix=x,
xeE1.
In view of the definition of the linear operations in E1 i is a linear mapping, called the canonical injection of E1 into E. Since i is injective it follows from (sec. 1.11) that a family of vectors in E1 is linearly independent (dependent) if and only if it is linearly independent (dependent) in E. Next let S be any non-empty subset of E and denote by E5 the set of linear combinations of vectors in S. Then any linear combination of vecis a linear combination of vectors in S (cf. sec. 1.3) and hence tors in Thus is a subspace of E, called the subspace generated it belongs to by 5, or the linear closure of S. In particular, if the set S is Clearly, S is a system of generators for linearly independent, then S is a basis of We notice that = S if and only if S is a subspace itself. 1.13. Intersections and sums. Let E1 and E2 be subspaces of E and consider the intersection E1 n E2 of the sets E1 and E2. Then E1 fl E2 is again a subspace of E. In fact, since OeE1 and OeE2 we have OEE1 fl E2 and so E1 fl E2 is not empty. Moreover, it is clear that the set E1 n E2 satisfies again conditions (1.14) and (1.15) and so it is a subspace of E. E1 n E2 is called the intersection of the subspaces E1 and E2. Clearly, E1 fl E2 is a subspace of E1 and a subspace of E2. The sum of two subspaces E1 and E2 is defined as the set of all vectors of the form x1eE1,x2EE2 (1.16) x=x1 +x2,
Chapter 1. Vector spaces
24
and is denoted by E1 +122. Again it is easy to verify that E1 +E2 is a subspace of E. Clearly E1 + E2 contains E1 and E2 as subspaces. A vector x of E1 + E2 can generally be written in several ways in the form (1.16). Given two such decompositions
x=x1+x2 and it follows that x1
—
=
x'2 —
x2.
Hence, the vector z = x1
—
fl E2. Conversely, let x=x1 +x2, x1 eE1, x2eE2 be a decomposition of x and z be an arbitrary vector of E1 fl E2. Then the vectors
is contained in the intersection
=
— zEE1
=
and
x2
+ zEE2
form again a decomposition of x. It follows from this remark that the decomposition (1.16) of a vector xeE1 +E2 is uniquely determined if and only if E1 n E2 0. In this case E1 + E2 is called the (internal) direct sum of E1 and 122 and is denoted by Now let S1 and be systems of generators for E1 and E2. Then clearly is a system of generators for E1 +E2. If T1 and T2 are respectively S1 U
bases for E1 and E2 and the sum is direct, E1 fl 122=0, then T1 U T2 is a To prove that the set T1 U T2 is linearly independent, basis for suppose that = 0, + Then
=
=
fl
—
0
whence and
Now the x1 are linearly independent, and so )J=O. Similarly it follows
that Suppose that
E=E1$E2
(1.17)
is a decomposition of E as a direct sum of subspaces and let F be an arbitrary subspace of E. Then it is not in general true that
F=F
n E1
Fn
E2
§ 3. Subspaces
and factor spaces
the example below will show. However, if E1 In fact, it is clear that
as
25
then (1.18) holds. (1.19)
On the other hand, it'
y=x1 +x2
x1eE1,x2eE2
is the decomposition of any vector yEF, then
x1eE1=Ffl E1, x2=y—x1eFfl E2. It
follows that
FcFfl
E2.
(1.20)
The relations (1.19) and (1.20) imply (1.18). Example 1: Let E be a vector space with a basis e1, e2. Define E1, E2 and F as the subspaces generated by e1, e2 and e1 + e2 respectively. Then
E =E1 while on the other hand
Fn
E1
=F
n E2
= 0.
Hence
F+Fn
E2.
1.14. Arbitrary families of subspaces. Next consider an arbitrary family of subspaces E, Then the intersection flEa is again a subspace of E. The sum EE1 is defined as the set of all vectors which can be written as finite sums,
(1.21)
and is a subspace of E as well. If for every
then each vector of the sum
can be uniquely represented in the form
(1.21). In this case the space
is called the (internal) direct sum of the
and is denoted by
subspaces
If
is a system of generators for
then the set
If the sum of the
is direct and
generators for then
is a basis for
is a system of
is a basis of
Chapter 1. Vector spaces
26
Example 2. Let by Then
be a basis of E and E2 be the subspace generated
E= Suppose (1.22)
is a direct sum of subspaces. Then we have the canonical injections We define the canonical projections
determined by
= where
It is clear that the are surjective linear mappings. Moreover, it is easily verified that the following relations hold: c'
XEE.
1.15. Complementary subspaces. An important property of vector spaces is given in the
Proposition I: If E1 is a subspace of E, then there exists a second subspace E2 such that E = E1 E2. E2 is called a complementary subspace for E1 in E. Proof We may assume that E1 E and E1 (0) since the proposition is trivial in these cases. Let (xx) be a basis of E1 and extend it with vectors to form a basis of E (cf. Corollary TI to Theorem I, sec. 1.6). Let E2 Then be the subspace of E generated by the vectors
E=
E1
E2.
In fact, since (xx) U (yp) is a system of generators for E, we have that
E=E1+E2.
(1.23)
On the other hand, if XEE1 fl E2, then we may write
x=
x=
Yfi II
and factor spaces
§ 3. Subspaces
27
whence
= 0.
— 13
Now since the set (x2) U
(y13) is
linearly independent, we obtain and
whence x=0. It follows that E1 fl E2=O and so the decomposition (1.23) is direct. As an immediate consequence of the proposition we have Corollary I. Let E1 be a subspace of E and : E1 —* F a linear mapping (Fa second vector space). Then ço1 may be extended (in several ways) to a linear map (p: E—*F.
Proof Let E2 be a complementary subspace for E1 in E,
E=E1Ej3E2
(1.24)
and define p by çox =
where
x—y+z is the decomposition of x determined by (1.24). Then 'p
=
+
'p
=
+
=
= and so q, is linear. It is trivial that 'p extends 'pr.
As a special example we have:
Corollary II: Let E1 be a subspace of E. Then there exists a surjective linear map —*
E1
such that
çox=x
xEE1.
Chapter I. Vector spaces
28
Proof Simply extend the identity map i : E1 —÷E1 to a linear map (p:E-*E1. 1.16.
Factor spaces. Suppose E1 is a subspace of the vector space E.
Two vectors XE E and x' e E are called equivalent mod E1 if x' — XE E1. it
is easy to verify that this relation is reflexive, symmetric and transitive and hence is indeed an equivalence relation. (The equivalence classes are the cosets of the additive subgroup E1 in E(cf. sec. 0.4)). Let E/E1 denote the set of the equivalence classes so obtained and let be
the set mapping given by
7rXX, where
XEE
is the equivalence class containing x. Clearly m is a surjective
map.
Proposition II: There exists precisely one linear structure in E/E1 such that m is a linear mapping. Proof: Assume that E/E1 is made into a vector space such that m is a linear mapping. Then the equations
m(x + y) =
mx
+ my
and m (2 x) =
2
mx
show that the linear operations in E/E1 are uniquely determined by the linear operations in E. it remains to be shown that a linear structure can be defined in E/E1 such that m becomes a linear mapping. Let i and 9 be two arbitrary elements of E/E1 and choose vectors XEE and yEE such that
my=9. Then the class m (x + y) depends only on
and 9. Assume for instance that
x'EE is another vector such that mx'=i. Then mx' = mx and hence we may write
x'=x+z,
zEE1.
it follows that
x' + y = (x + y) + z whence
m(x' + y)= m(x + y).
§ 3. Subspaces and factor spaces
We now define the sum of the elements where
29
and 9eE/E1 by
and 9=7ry.
(1.25)
It is easy to verify that E/E1 becomes an abelian group under this operation and that the class = E1 is the zero-element. Now let be an arbitrary element and 2eF be a scalar. Choose xeE such that Then a similar argument shows that the class ir(Ax) depends only on (and not on the choice of the vector x). We now define the scalar multiplication in E/E1 by where
(1.26)
Again it is easy to verify that the multiplication satisfies axioms 11.1—11.3
and so E/E1 is made into a vector space. It follows immediately from (1.25) and (1.26) that
ir(x+y)=itx+iry
x,yeE
= 2itx i.e., iv is a linear mapping.
The vector space E/E1 obtained in this way is called the factor space of E with respect to the subspace E1. The linear mapping iv is called the canonical projection of E onto E1. If E1 = E, then the factor space reduces to the vector 0. On the other hand, if E1 =0, two vectors xeE and yeE are equivalent mod E1 if and only if y = x. Thus the elements of E/(0) are the singleton sets {x} where x is any element of E, and iv is the linear isomorphism Consequently we identify E and E/(0). 1.17. Linear dependence mod a subspace.. Let E1 be a subspace of E, and suppose that (x2) is a family of vectors in E. Then the will be called linearly dependent mod E1 if there are scalars not all zero, such that
If the
are not linearly dependent mod E1 they will be called linearly
independent mod E1. Now consider the canonical projection
It follows immediately from the definition that the vectors dependent (independent) mod E1 if and only if the vectors dependent (independent) in E/E1.
are linearly are linearly
Chapter 1. Vector spaces
1.18. Basis of a factor space. Suppose that U (:fl) is a basis of E form a basis of E1. Then the vectors 7r:p form a such that the vectors basis of E/E1. To prove this let E2 be the subspace of E generated by the vectors zp . Then
(1.27)
Now consider the linear mapping
defined by
goz=rrz Then
ZEE2.
is surjective. In fact, let
be an arbitrary vector. Since
it:E—*E/E1 is surjective we can write
x—irx,
xEE.
In view of (1.27) the vector x can be decomposed in the form
x=y+z
yEE1,ZEE2.
(1.28)
Equation (1.28) yields
= irx = icy + Trz = and so çø is surjective. To show that çø is injective assume that
çoz=çoz'
=
z,z'eE2.
Then —
z) = p(z' — z) = 0
and hence z'—zeE1. On the other hand we have that z'—zeE2 and thus
z'—zeE1 fl is a linear isomorphism and now PropoIt follows that sition Ill of sec. 1.11 shows that the vectors irzp form a basis of E/E1.
Problems 1. Let be an arbitrary vector in F3. Which of the following subsets are subspaces? a) all vectors with b) all vectors with =0 c) all vectors with d) all vectors with = F,,, Fd generated by the sets of problem 1, 2. Find the subspaces and construct bases for these subspaces.
§ 3. Subspaces and factor spaces
31
3. Construct bases for the factor spaces determined by the subspaces of problem 2.
4. Find complementary spaces for the subspaces of problem 2, and construct bases for these complementary spaces. Show that there exists more than one complementary space for each given subspace. 5. Show that
a) 13=Fa+Fb b)
13=Fb+FC
c) 13=Fa+Fc Find the intersections Fa fl Fb, Fb fl
Fa n
and decide in which cases
the sums above are direct.
6. Let S be an arbitrary subset of E and let be its linear closure. is the intersection of all subspaces of E containing S. Show that 7. Assume a direct composition Show that in each class of E with respect to E1 (i.e. in each coset there is exactly one vector of E2. 8. Let E be a plane and let E1 be a straight line through the origin. What is the geometrical meaning of the equivalence classes with respect to E1. Give a geometrical interpretation of the fact that x
y
x' + i.
Suppose S is a set of linearly independent vectors in E, and suppose T is a basis of E. Prove that there is a subset of T which, together with S, is again a basis of E. and E_ defined 10. Let w be an involution in E. Show that the sets 9.
by
= {xeE; wx = x}, are
E_ = {xeE; wx = — x}
subspaces of E and that
E=
E..
11. Let E1, E2 be subspaces of E. Show that E1 + E2 is the linear closure of E1 ∪ E2. Prove that

E1 + E2 = E1 ∪ E2

if and only if E1 ⊃ E2 or E2 ⊃ E1.
12. Find subspaces E1, E2, E3 of Γ³ such that
i) E_i ∩ E_j = 0 (i ≠ j)
ii) E1 + E2 + E3 = Γ³
iii) the sum in ii) is not direct.
§ 4.
Dimension
In this paragraph all vector spaces are defined over a fixed, but arbitrarily chosen field Γ of characteristic 0.
1.19. Finitely generated vector spaces. Suppose E is a finitely generated vector space, and consider a surjective linear mapping φ: E → F. Then F is finitely generated as well. In fact, if x1, ..., xm is a system of generators for E, then the vectors φx1, ..., φxm generate F. In particular, the factor space of a finitely generated space with respect to any subspace is finitely generated. Now consider a subspace E1 of E. In view of Cor. II to Proposition I, sec. 1.15 there exists a surjective linear mapping φ: E → E1. It follows that E1 is finitely generated.
1.20. Dimension. Recall that every system of generators of a nontrivial vector space contains a basis. It follows that a finitely generated non-trivial vector space has a finite basis. In the following it will be shown that in this case every basis of E consists of the same number of vectors.
This number will be called the dimension of E and will be denoted by dim E. E will be called a finite-dimensional vector space. We extend the definition to the case E=(0) by assigning the dimension 0 to the space (0). If E does not have finite dimension it will be called an infinite-dimensional vector space.
Proposition I: Suppose a vector space E has a basis of n vectors. Then every family of (n + 1) vectors is linearly dependent. Consequently, n is the maximum number of linearly independent vectors in E and hence every basis of E consists of n vectors.
Proof: We proceed by induction on n. Consider first the case n = 1 and let a be a basis vector of E. Then if x ≠ 0 and y ≠ 0 are two arbitrary vectors we have that

x = λa and y = μa,   λ ≠ 0, μ ≠ 0,

whence

μx − λy = 0.

Thus the vectors x and y are linearly dependent. Now assume by induction that the proposition holds for every vector space having a basis of n − 1 vectors. Let a_ν (ν = 1...n) be a basis of E and x1, ..., xₙ₊₁ a family of n + 1 vectors. We may assume that xₙ₊₁ ≠ 0 because
otherwise it would follow immediately that the vectors x1, ..., xₙ₊₁ were linearly dependent. Consider the factor space E1 = E/(xₙ₊₁), where (xₙ₊₁) denotes the subspace generated by xₙ₊₁, and the canonical projection

π: E → E/(xₙ₊₁).

Since the system πa1, ..., πaₙ generates E1 it contains a basis of E1 (cf. Cor. I to Theorem I, sec. 1.6). On the other hand the equation

xₙ₊₁ = Σ_ν λ^ν a_ν

implies that

Σ_ν λ^ν πa_ν = 0

and so the vectors (πa1, ..., πaₙ) are linearly dependent. It follows that E1 has a basis consisting of less than n vectors. Hence, by the induction hypothesis, the vectors πx1, ..., πxₙ are linearly dependent. Consequently, there exists a non-trivial relation

Σ_ν μ^ν πx_ν = 0

and so

Σ_ν μ^ν x_ν = λ xₙ₊₁.

This formula shows that the vectors x1, ..., xₙ₊₁ are linearly dependent and closes the induction.
Example: Since the space Γⁿ (cf. Example 1, sec. 1.2) has a basis of n vectors it follows that

dim Γⁿ = n.
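Readers who wish to experiment may check Proposition I numerically. The following Python sketch (an illustration only, not part of the original text; it uses numpy and works over the real field) verifies that n + 1 vectors in Rⁿ always have rank at most n and are therefore linearly dependent.

    import numpy as np

    n = 4
    rng = np.random.default_rng(0)
    vectors = rng.integers(-5, 6, size=(n + 1, n))   # n + 1 vectors in R^n, one per row
    rank = np.linalg.matrix_rank(vectors)
    print(rank)                                      # at most n
    assert rank <= n                                 # so the n + 1 rows are linearly dependent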
Proposition II: Two finite dimensional vector spaces E and F are isomorphic if and only if they have the same dimension.
Proof: Let φ: E → F be an isomorphism. Then it follows from Proposition III, sec. 1.11 that φ maps a basis of E injectively onto a basis of F and so dim E = dim F. Conversely, assume that dim E = dim F = n and let x_ν and y_ν (ν = 1...n) be bases of E and F respectively. According to Proposition II, sec. 1.11 there exists a linear mapping φ: E → F such that

φx_ν = y_ν   (ν = 1...n).

Then φ maps the basis x_ν onto the basis y_ν and hence it is a linear isomorphism by Proposition III, sec. 1.11.
1.21. Subspaces and factor spaces. Let E1 be a subspace of the n-dimensional vector space E. Then E1 is finitely generated and so it has finite dimension m. Let x1, ..., xm be a basis of E1. Then the vectors x1, ..., xm are linearly independent in E and so Cor. II to Theorem I, sec. 1.6 implies that the vectors may be extended to a basis of E. Hence

dim E1 ≤ dim E.   (1.29)

If equality holds, then the vectors x1, ..., xm form a basis of E and it follows that E1 = E. Now it will be shown that

dim E = dim E1 + dim E/E1.   (1.30)

If E1 = (0) or E1 = E, (1.30) is trivial and so we may assume that E1 is a proper non-trivial subspace of E,

0 < dim E1 < dim E.

The scalar ⟨x*, x⟩ is called the scalar product of x* and x, and the bilinear function ⟨ , ⟩ is called a scalar product between E* and E.
2.22. Examples.
1. Let E = E* = Γ and define a mapping ⟨ , ⟩ by

⟨λ, μ⟩ = λμ,   λ, μ ∈ Γ.

Clearly ⟨ , ⟩ is a non-degenerate bilinear function, and hence Γ can be regarded as a self-dual space.
2. Let E = E* = Γⁿ and consider the bilinear mapping ⟨ , ⟩ defined by

⟨x*, x⟩ = Σ_{ν=1}^{n} ξ*_ν ξ^ν
where

x = (ξ¹, ..., ξⁿ) and x* = (ξ*₁, ..., ξ*ₙ).

It is easy to verify that the bilinear mapping is non-degenerate and hence Γⁿ is dual to itself.
3. Let E be any vector space and E* = L(E) the space of linear functions in E. Define a bilinear mapping ⟨ , ⟩ by

⟨f, x⟩ = f(x),   f ∈ L(E), x ∈ E.

Since f(x) = 0 for each x ∈ E if and only if f = 0, it follows that N_{L(E)} = 0. On the other hand, let a ∈ E be a non-zero vector and E1 be the one-dimensional subspace of E generated by a. Then a linear function g is defined in E1 by g(x) = λ where x = λa. In view of sec. 1.15, g can be extended to a linear function f in E. Then

⟨f, a⟩ = f(a) = g(a) = 1 ≠ 0.

It follows that N_E = 0 and hence the bilinear function ⟨ , ⟩ is non-degenerate.
This example is of particular importance because of the following
Proposition I: Let E*, E be a pair of vector spaces which are dual with respect to a scalar product ⟨ , ⟩. Then an injective linear map Φ: E* → L(E) is defined by

(Φx*)(x) = ⟨x*, x⟩,   x* ∈ E*, x ∈ E.   (2.39)

Proof: Fix a vector a* ∈ E*. Then a linear function f_{a*} is defined in E by

f_{a*}(x) = ⟨a*, x⟩,   x ∈ E.   (2.40)

Since ⟨ , ⟩ is bilinear, f_{a*} depends linearly on a*. Now define Φ by setting

Φa* = f_{a*}.   (2.41)

To show that Φ is injective, assume that Φ(a*) = 0 for some a* ∈ E*. Then ⟨a*, x⟩ = 0 for every x ∈ E. Since ⟨ , ⟩ is non-degenerate, it follows that a* = 0.
Note: It will be shown in sec. 2.33 that Φ is surjective (and hence a linear isomorphism) if E has finite dimension.
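As a concrete illustration of Proposition I (a hypothetical numerical sketch using numpy, not part of the original text), take E = E* = R³ with the scalar product of Example 2; the map Φ then sends a coefficient vector a* to the linear function x → ⟨a*, x⟩.

    import numpy as np

    def Phi(a_star):
        """Return the linear function f_{a*}(x) = <a*, x> determined by a*."""
        return lambda x: float(np.dot(a_star, x))

    a_star = np.array([1.0, -2.0, 0.5])
    f = Phi(a_star)
    x = np.array([2.0, 1.0, 4.0])
    y = np.array([0.0, 3.0, -1.0])
    # f is linear, and a non-zero a* gives a non-zero function (injectivity of Phi)
    assert np.isclose(f(2 * x + y), 2 * f(x) + f(y))
    assert any(f(e) != 0 for e in np.eye(3))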
2.23. Orthogonal complements. Two vectors x* ∈ E* and x ∈ E are called orthogonal if ⟨x*, x⟩ = 0. Now let E1 be a subspace of E. Then the vectors of E* which are orthogonal to E1 form a subspace E1^⊥ of E*. E1^⊥ is called the orthogonal complement of E1. In the same way every subspace E1* ⊂ E* determines an orthogonal complement (E1*)^⊥ ⊂ E. The fact that the bilinear function ⟨ , ⟩ is non-degenerate can now be expressed by the relations

E^⊥ = 0 and (E*)^⊥ = 0.

It follows immediately from the definition that for every subspace E1 ⊂ E

E1 ⊂ (E1^⊥)^⊥.   (2.42)

Suppose next that E*, E are a pair of dual spaces and that F is a subspace of E. Then a scalar product is induced in the pair E*/F^⊥, F by

⟨x̄*, y⟩ = ⟨x*, y⟩,   y ∈ F,

where x* is a representative of the class x̄*. In fact, let Φ be the restriction of the scalar product to E* × F. Then the nullspaces of Φ are given by N_{E*} = F^⊥ and N_F = 0. Now our result follows immediately from sec. 2.21.
More generally, suppose F ⊂ E and H* ⊂ E* are any subspaces. Then a scalar product in the pair H*/H* ∩ F^⊥, F/F ∩ (H*)^⊥ is determined by

⟨x̄*, ȳ⟩ = ⟨x*, y⟩,

as a similar argument will show.
2.24. Dual mappings. Suppose that E, E* and F, F* are two pairs of dual spaces and φ: E → F, φ*: F* → E* are linear mappings. The mappings φ and φ* are called dual if

⟨φ*y*, x⟩ = ⟨y*, φx⟩,   x ∈ E, y* ∈ F*.

To a given linear mapping φ there exists at most one dual mapping. In fact, if φ1* and φ2* are dual to φ we have that

⟨φ1*y*, x⟩ = ⟨y*, φx⟩ and ⟨φ2*y*, x⟩ = ⟨y*, φx⟩

whence

⟨φ1*y* − φ2*y*, x⟩ = 0,   x ∈ E, y* ∈ F*.

This implies, in view of the duality of E and E*, that φ1*y* = φ2*y* for every y* ∈ F*, whence φ1* = φ2*.
As an example of dual mappings consider the dual pairs E*, E and E*/E1^⊥, E1 where E1 is a subspace of E (cf. sec. 2.23) and let π be the canonical projection of E* onto E*/E1^⊥,

π: E* → E*/E1^⊥.

Then the canonical injection

i: E1 → E

is dual to π. In fact, if x ∈ E1 and y* ∈ E* are arbitrary, we have

⟨y*, ix⟩ = ⟨y*, x⟩ = ⟨πy*, x⟩

and thus π = i*.
2.25. Operations with dual mappings. Assume that E*, E and F*, F are two pairs of dual vector spaces. Assume further that φ: E → F and ψ: E → F are linear mappings and that there exist dual mappings φ*: F* → E* and ψ*: F* → E*. Then there are mappings dual to φ + ψ and λφ, and these dual mappings are given by

(φ + ψ)* = φ* + ψ*   (2.43)

and

(λφ)* = λφ*.   (2.44)

(2.43) follows from the relation

⟨(φ* + ψ*)y*, x⟩ = ⟨φ*y*, x⟩ + ⟨ψ*y*, x⟩ = ⟨y*, φx⟩ + ⟨y*, ψx⟩ = ⟨y*, (φ + ψ)x⟩

and (2.44) is proved in the same way. Now let G, G* be a third pair of dual spaces and let χ: F → G, χ*: G* → F* be a pair of dual mappings. Then the dual mapping of χ ∘ φ exists, and is given by

(χ ∘ φ)* = φ* ∘ χ*.

In fact, if z* ∈ G* and x ∈ E are arbitrary vectors we have that

⟨z*, (χ ∘ φ)x⟩ = ⟨χ*z*, φx⟩ = ⟨φ*χ*z*, x⟩ = ⟨(φ* ∘ χ*)z*, x⟩.
For the identity map ι: E → E we have clearly

ι* = ι.

Now assume that φ: E → F has a left inverse φ1: F → E,

φ1 ∘ φ = ι_E,   (2.45)

and that the dual mappings φ* and φ1* exist. Then we obtain from (2.45) that

φ* ∘ φ1* = (ι_E)* = ι_{E*}.   (2.46)

In view of sec. 2.11 the relations (2.45) and (2.46) are equivalent to

φ injective, φ1 surjective

and

φ1* injective, φ* surjective.

In particular, if φ and φ1 are inverse linear isomorphisms, then so are φ* and φ1*.
2.26. Kernel and image space. Let φ: E → F and φ*: F* → E* be a pair of dual mappings. In this section we shall study the relations between the subspaces
ker φ ⊂ E,   Im φ ⊂ F

and

ker φ* ⊂ F*,   Im φ* ⊂ E*.

First we establish the formulae

ker φ* = (Im φ)^⊥   (2.47)

and

ker φ = (Im φ*)^⊥.   (2.48)

In fact, for any two vectors y* ∈ ker φ*, φx ∈ Im φ we have

⟨y*, φx⟩ = ⟨φ*y*, x⟩ = 0

and hence the subspaces ker φ* and Im φ are orthogonal, ker φ* ⊂ (Im φ)^⊥. Now let y* ∈ (Im φ)^⊥ be any vector. Then for every x ∈ E
Moreover, the restriction of the scalar product to E1*, E1 is again non-degenerate. Proposition V has the following converse:
Proposition VI: Let E1 ⊂ E be any subspace, and let E1* ⊂ L(E) be a subspace dual to E1 such that

((E1*)^⊥)^⊥ = E1*.

Then

E = E1 ⊕ (E1*)^⊥   (2.56)

and

L(E) = E1* ⊕ E1^⊥.   (2.57)

Proof: We have

(E1 + (E1*)^⊥)^⊥ = E1^⊥ ∩ ((E1*)^⊥)^⊥ = E1^⊥ ∩ E1* = 0

whence

E = 0^⊥ = E1 + (E1*)^⊥.   (2.58)

On the other hand, since E1 and E1* are dual, it follows that

E1 ∩ (E1*)^⊥ = 0

which together with (2.58) proves (2.56). (2.57) follows from Proposition V and (2.56).
Problems
1. Given two pairs of dual spaces E*, E and F*, F prove that the spaces E* ⊕ F* and E ⊕ F are dual with respect to the bilinear function

⟨(x*, y*), (x, y)⟩ = ⟨x*, x⟩ + ⟨y*, y⟩.

2. Consider two subspaces E1 and E2 of E. Establish the relation

(E1 + E2)^⊥ = E1^⊥ ∩ E2^⊥.

3. Let E be a vector space and consider the mapping λ: E → L(L(E)) defined by

(λa)f = f(a),   a ∈ E, f ∈ L(E).

Prove that λ is injective. Show that λ is surjective if and only if dim E < ∞.
⟨x*, φx⟩ = Φ(x*, x),   x* ∈ E*, x ∈ E,

and so a mapping φ: E → E is defined. The linearity of φ follows immediately from the bilinearity of Φ and ⟨ , ⟩. Suppose now that φ1, φ2 are two linear transformations of E such that

Φ(x*, x) = ⟨x*, φ1x⟩ and Φ(x*, x) = ⟨x*, φ2x⟩.

Then we have that

⟨x*, φ1x − φ2x⟩ = 0

whence φ1 = φ2. Proposition III establishes a canonical linear isomorphism between the spaces B(E*, E) and L(E; E). Here B(E*, E) is the space of bilinear functions Φ: E* × E → Γ with addition and scalar multiplication defined by

(Φ1 + Φ2)(x*, x) = Φ1(x*, x) + Φ2(x*, x)

and

(λΦ)(x*, x) = λ Φ(x*, x).
2.34. The rank of a linear mapping. Let φ: E → F be a linear mapping of finite dimensional vector spaces. Then ker φ ⊂ E and Im φ ⊂ F have finite dimension as well. We define the rank of φ as the dimension of Im φ,

r(φ) = dim Im φ.

In view of the induced linear isomorphism E/ker φ ≅ Im φ we have at once

r(φ) + dim ker φ = dim E.   (2.64)

φ is called regular if it is injective. (2.64) implies that φ is regular if and only if r(φ) = dim E.
In the special case dim E = dim F (and hence in particular in the case of a linear transformation) we have that φ is regular if and only if it is surjective.
2.35. Dual mappings. Let E*, E and F*, F be dual pairs and φ: E → F be a linear mapping. Since E* is canonically isomorphic to the space L(E) there exists a dual mapping φ*: F* → E*. Hence we have the relations (cf. sec. 2.28)

Im φ = (ker φ*)^⊥

and

Im φ* = (ker φ)^⊥.

The first relation states that the equation φx = y has a solution for a given y ∈ F if and only if y satisfies the condition

⟨x*, y⟩ = 0 for every x* ∈ ker φ*.

The second relation implies that dual mappings have the same rank. In fact, from (2.63) we have that

dim Im φ* = dim (ker φ)^⊥ = dim E − dim ker φ = dim Im φ

whence

r(φ*) = r(φ).   (2.65)
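Both facts, r(φ) + dim ker φ = dim E and r(φ*) = r(φ), are easy to verify numerically for a matrix and its transpose; the following lines are an illustrative Python sketch (using numpy, not part of the original text).

    import numpy as np

    A = np.array([[1.0, 2.0, 3.0],
                  [2.0, 4.0, 6.0],
                  [1.0, 0.0, 1.0]])        # a 3 x 3 matrix of rank 2
    n = A.shape[1]
    r = np.linalg.matrix_rank(A)
    dim_kernel = n - r                     # rank-nullity: r(phi) + dim ker phi = dim E
    assert r + dim_kernel == n
    assert np.linalg.matrix_rank(A.T) == r # the dual mapping has the same rank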
Problems (In problems 1—10 it will be assumed that all vector spaces have finite dimension).
1. Let E, E* be a pair of dual vector spaces, and let E1, E2 be subspaces of E. Prove that

(E1 ∩ E2)^⊥ = E1^⊥ + E2^⊥.

Hint: Use problem 2, § 5.
2. Given subspaces U ⊂ E and V* ⊂ E* prove that

dim(U^⊥ ∩ V*) + dim U = dim(U ∩ (V*)^⊥) + dim V*.

3. Let E, E* be a pair of non-trivial dual vector spaces and let φ: E → E* be a linear mapping such that φ ∘ τ = (τ*)⁻¹ ∘ φ for every linear automorphism τ of E. Prove that φ = 0. Conclude that there exists no linear mapping which transforms every basis of E into its dual basis.
4. Given a pair of dual bases x*^ν, x_ν (ν = 1...n) of E*, E show that the bases (x*¹ + x*², x*², ..., x*ⁿ) and (x1, x2 − x1, x3, ..., xn) are again dual.
5. Let E, F, G be three vector spaces. Given two linear mappings φ: E → F and ψ: F → G prove that

r(ψ ∘ φ) ≤ r(φ) and r(ψ ∘ φ) ≤ r(ψ).

6. Let E be a vector space of dimension n and consider a system of n² linear transformations φ_ij such that

φ_ij ∘ φ_kl = δ_jk φ_il   (i, j = 1...n).

a) Show that every φ_ij has rank 1.
b) If ψ_ij is a second system with the same property, prove that there exists a linear automorphism τ of E such that

ψ_ij = τ ∘ φ_ij ∘ τ⁻¹.
7. Given two linear mappings φ: E → F and ψ: E → F prove that

r(φ + ψ) ≤ r(φ) + r(ψ).

8. Show that the dimensions of the spaces M'(φ), M_l(φ) in problem 3, § 2 are given by

dim M'(φ) = (dim F − r(φ)) dim E and dim M_l(φ) = dim ker φ · dim F.

9. Show that the mapping φ → φ* defines a linear isomorphism,

Φ: L(E; F) ≅ L(F*; E*).

10. Prove that Φ M'(φ) = M_l(φ*) and Φ M_l(φ) = M'(φ*) where the notation is defined in problems 8 and 9. Hence obtain the formula

r(φ) = r(φ*).

11. Let φ: E → F be a linear mapping (E, F possibly of infinite dimension). Prove that Im φ has finite dimension if and only if ker φ has finite codimension (recall that the codimension of a subspace is the dimension
of the corresponding factor space), and that in this case

codim ker φ = dim Im φ.

12. Let E and F be vector spaces of finite dimension and consider a bilinear function Φ in E × F. Prove that

dim E − dim N_E = dim F − dim N_F

where N_E and N_F denote the null spaces of Φ. Conclude that if dim E = dim F, then N_E = 0 if and only if N_F = 0.
Chapter III
Matrices
In this chapter all vector spaces will be defined over a fixed, but arbitrarily chosen field Γ of characteristic 0.
§ 1. Matrices and systems of linear equations
3.1. Definition. A rectangular array

A = (α_ν^μ)   (ν = 1...n; μ = 1...m)   (3.1)

of nm scalars is called a matrix of n rows and m columns or, in short, an n × m-matrix. The scalars α_ν^μ are called the entries or the elements of the matrix A. The rows

a_ν = (α_ν^1, ..., α_ν^m)   (ν = 1...n)

can be considered as vectors of the space Γ^m and therefore are called the row-vectors of A. Similarly, the columns

b^μ = (α_1^μ, ..., α_n^μ)   (μ = 1...m)

are considered as vectors of the space Γ^n and are called the column-vectors of A.
Interchanging rows and columns we obtain from A the transposed matrix

A* = (α^μ_ν).   (3.2)

In the following, matrices will rarely be written down explicitly as in (3.1) but rather be abbreviated in the form A = (α_ν^μ). This notation has the disadvantage of not identifying which index counts the rows and which the columns. It has to be mentioned in this connection that it would be very undesirable, as we shall see, to agree once and for all to always let
the subscript count the rows, etc. If the above abbreviation is used, it will be stated explicitly which index indicates the rows.
3.2. The matrix of a linear mapping. Consider two linear spaces E and F of dimensions n and m and a linear mapping φ: E → F. With the aid of bases x_ν (ν = 1...n) and y_μ (μ = 1...m) in E and in F respectively, every vector φx_ν can be written as a linear combination of the vectors y_μ,

φx_ν = Σ_μ α_ν^μ y_μ   (ν = 1...n).   (3.3)

In this way, the mapping φ determines an n × m-matrix (α_ν^μ), where ν counts the rows and μ counts the columns. This matrix will be denoted by M(φ; x_ν, y_μ), or simply by M(φ) if no ambiguity is possible. Conversely, every n × m-matrix (α_ν^μ) determines a linear mapping φ: E → F by the equations (3.3). Thus, the operator

M: φ → M(φ)

defines a one-to-one correspondence between all linear mappings φ: E → F and all n × m-matrices.
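The correspondence φ → M(φ) of (3.3) can be realized directly on a computer. The sketch below (an illustration with ad hoc names, not part of the original text) builds the matrix of a mapping R³ → R² from the images of the basis vectors, written as rows, and checks on an example that composition of mappings corresponds to the matrix product in reversed order, anticipating the product formula of § 2.

    import numpy as np

    def matrix_of(phi, dim_E):
        """Rows are the images of the basis vectors of E, as in (3.3)."""
        return np.array([phi(e) for e in np.eye(dim_E)])

    phi = lambda x: np.array([x[0] + x[1], 2 * x[2]])          # R^3 -> R^2
    psi = lambda y: np.array([y[1], y[0], y[0] - y[1]])        # R^2 -> R^3

    M_phi = matrix_of(phi, 3)                                  # 3 x 2
    M_psi = matrix_of(psi, 2)                                  # 2 x 3
    M_comp = matrix_of(lambda x: psi(phi(x)), 3)               # matrix of psi o phi

    # with this row convention, M(psi o phi) = M(phi) M(psi), i.e. reversed order
    assert np.allclose(M_comp, M_phi @ M_psi)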
3.3. The matrix of the dual mapping. Let E* and F* be dual spaces of E and F, respectively, and φ: E → F, φ*: F* → E* a pair of dual mappings. Consider two pairs of dual bases x*^ν, x_ν (ν = 1...n) and y*^μ, y_μ (μ = 1...m) of E*, E and F*, F, respectively. We shall show that the two corresponding matrices M(φ) and M(φ*) (relative to these bases) are transposed, i.e., that

M(φ*) = M(φ)*.   (3.4)

The matrices M(φ) and M(φ*) are defined by the representations

φx_ν = Σ_μ α_ν^μ y_μ   (3.5)

and

φ*y*^μ = Σ_ν β^μ_ν x*^ν.   (3.6)

Note here that the subscript ν indicates in the first formula the rows of the matrix (α_ν^μ) and in the second the columns of the matrix (β^μ_ν). Substituting (3.5) and (3.6) in the relation

⟨y*^μ, φx_ν⟩ = ⟨φ*y*^μ, x_ν⟩

we obtain

⟨y*^μ, φx_ν⟩ = Σ_κ α_ν^κ ⟨y*^μ, y_κ⟩ = α_ν^μ   (3.7)

and

⟨φ*y*^μ, x_ν⟩ = Σ_κ β^μ_κ ⟨x*^κ, x_ν⟩ = β^μ_ν.   (3.8)

The relations (3.6), (3.7) and (3.8) then yield

β^μ_ν = α_ν^μ.

Observing, as stated before, that the subscript ν indicates rows of (α_ν^μ) and columns of (β^μ_ν), we obtain the desired equation (3.4).
3.4. Rank of a matrix. Consider an n × m-matrix A. Denote by r1 and by r2 the maximal number of linearly independent row-vectors and column-vectors, respectively. It will be shown that r1 = r2. To prove this let E and F be two linear spaces of dimensions n and m. Choose a basis x_ν (ν = 1...n) and y_μ (μ = 1...m) in E and in F and define the linear mapping φ: E → F by

φx_ν = Σ_μ α_ν^μ y_μ.

Besides φ, consider the isomorphism β: F → Γ^m defined by

βy = (η¹, ..., η^m) where y = Σ_μ η^μ y_μ.

Then β ∘ φ is a linear mapping of E into Γ^m. From the definition of β it follows that β ∘ φ maps x_ν into the ν-th row-vector,

(β ∘ φ) x_ν = a_ν.

Consequently, the rank of β ∘ φ is equal to the maximal number r1 of
=
(p = 1... in)
(3.9)
Chapter III. Matrices
of in equations with ii unknowns is called inhomogeneous if at least one is different from zero. Otherwise it is called homogeneous. From the results of Chapter II it is easy to obtain theorems about the existence and uniqueness of solutions of the system (3.9). Let E and F be two linear spaces of dimensions ii and in. Choose a basis (v = I .n) of .
E as well as a basis
(p =
1
. . .
in)
.
of F and define the linear mapping
F by
(p:
= Consider two vectors (3.10)
and
(3.11)
Then (3.12)
Comparing the representations (3.9) and (3. 12) we see that the system (3.9) is equivalent to the vector-equation X
= j.
Consequently, the system (3.9) has a solution if and only if the vector y
is contained in the image-space Im tp. Moreover, this solution is uniquely determined if and only if the kernel of ço consists only of the zero-vector. 3.6. The homogeneous system. Consider the homogeneous system =
0
(p
1... in).
(3.13)
is a From the foregoing discussion it is immediately clear that solution of this system if and only if the vector x defined by (3.10) is contained in the kernel ker p of the linear mapping p. In sec. 2.34 we have shown that the dimension of ker 'p equals n—r where r denotes the rank of 'p. Since the rank of 'p is equal to the rank of the matrix we therefore obtain the following theorem:
A homogeneous system of m equations with n unknowns whose coefficient-
matrix is of rank r has exactly n — r linearly independent solutions. In the special case that the number in of equations is less than the number n of unknowns we have ii— r n — in 1. Hence the theorem asserts that the system (3. 13) always has non-trivial solutions if ni is less than n.
§ I. Matrices and systems of linear equations
87
3.7. The alternative-theorem. Let us assume that the number of equations is equal to the number of unknowns,
=
= 1... n).
(3.14)
Besides (3.14) consider the so-called "corresponding" homogeneous system =0 (3.15) (ii = 1... n).
The mapping introduced in sec. 3.5 is now a linear mapping of the ndimensional space E into a space of the same dimension. Hence we may apply the result of sec. 2.34 and obtain the following alternative-theorem: If the homogeneous system possesses only the trivial solution (0.. .0), the inhomogeneous system has a solution
every choice of the right-
..
hand side. If the homogeneous system has non-trivial solutions, then the inhoinogeneous one is not so/cable for eterv choice of the ;7v(v = . ii). From the last statement of section 3.5 it follows immediately that in the first case the solution of (3.14) is uniquely determined while in the second case the system (3.14) has — if it is solvable at all — infinitely many solutions. 1
.
3.8. The main-theorem. We now proceed to the general case of an arbitrary system
=
(1u
= 1 ... m)
(3.16)
of m linear equations in n unknowns. As stated before, this system has a solution if and only if the vector
y= is
contained in the image-space Im ço. In sec. 2.35 it has been shown that
the space Im p is the orthogonal complement of the kernel of the dual mapping In other words, the system (3.16) is solvable if and only if the right-hand side (p=l...m) satisfies the conditions (3.17)
for all solutions
=I
.. ni)
of the system
(v=1...n). We formulate this result in the following
(3.18)
Chapter III. Matrices
Main-theorem: An inhoniogeneous system of n equations in in unknowns has a solution if and only if every solution 1. in) of the transposed homogeneous system (3.18) satisfies the orthogonality-relation (3.17). . .
Problems 1. Find the matrices corresponding to the following mappings:
a) px=O. b) çox=x. c) px=2x. d)
x=
where
(v =
1,
..., n) is a given basis and in n is a
given number and
2. Consider a system of two equations in n unknowns
Find the solutions of the corresponding transposed homogeneous system.
3. Prove the following statement: The general solution of the inhomogeneous system is equal to the sum of any particular solution of this system and the general solution of the corresponding homogeneous system. 4. Let and be two bases of E and A be the matrix of the basistransformation Define the automorphism c' of E by Prove that A is the matrix of as well with respect to the basis as with respect to the basis 5. Show that a necessary and sufficient condition for the n x n-matrix A= to have rank 1 is that there exist elements and ..., j31 •.., f3fl such that f32 = (v = 1,2, ...,n; = 1,2, ...,n). if A +0, show that the elements cc and 13P are uniquely determined up to = 1. constant factors ). and p respectively, where as 6. Given a basis a linear space E, define the mapping p : p
=
Find the matrix of the dual mapping relative to the dual basis. 7. Verify that the system of three equations:
+ + = 3, —11—4' = 4, + 3ij + 34' = 1
§ 2. Multiplication of matrices
89
has no solution. Find a solution of the transposed homogeneous system which is not orthogonal to the vector (3, 4, 1). Replace the number 1 on the right-hand side of the third equation in such a way that the resulting system is solvable. 8. Let an inhomogeneous system of linear equations be given,
(ii=1,...,in). augmented matrix of the system is defined as the in x (n+ 1)-matrix ..., çi"). Prove that obtained from the matrix by adding the column the above system has a solution if and only if the augmented matrix has the same rank as the matrix (cd). The
§ 2.
Multiplication of matrices
3.9. The linear space of the n x m matrices. Consider the space L(E; F) of all n x rn-matrices. of all linear mappings ço:E—÷F and the set Once bases have been chosen in E and in F there is a 1-1 correspondence between the mappings p: E—÷ F and the n x rn-matrices defined by (3.19)
This correspondence suggests defining a linear structure in the set MnXrn such that the mapping (3.19) becomes an isomorphism. We define the sum of two n x rn-matrices and
A= as
B
the n x rn-matrix
A+B=
+
and the product of a scalar 2 and a matrix A as the matrix 2A
is immediately apparent that with these operations the set is a linear space. The zero-vector in this linear space is the matrix which has only zero-entries. Furthermore, it follows from the above definitions that It
i.e., that the mapping (3.19) defines an isomorphism between L (E; F) and the space X
Chapter III. Matrices
3.10. Product of matrices. Assume that and
are linear mappings between three linear spaces E, F, G of dimensions ii, in and I, respectively. Then /i is a linear mapping of E into G. Select in) and a basis (v I .. ii), (p = .. .1) in each of the three spaces. Then the mappings and i/i determine two matrices and by the relations = 1
. . .
1
and
These two equations yield = Consequently, the matrix of the mapping
ço
relative to the bases x%, and
is given by
(3.20)
The n x 1-matrix (3.20) is called the product of the ii x rn-matrix A = and the rn x /-matrix B= and is denoted by A B. It follows immediately from this definition that
= M(tp)M(t/i).
(3.21)
Note that the matrix M(ψ ∘ φ) of the product-mapping ψ ∘ φ is the product of the matrices M(φ) and M(ψ) in reversed order of the factors. It follows immediately from (3.21) and the formulas of sec. 2.16 that matrix-multiplication has the following properties:
A(λB1 + μB2) = λAB1 + μAB2,
(AB)C = A(BC),
(AB)* = B*A*.
3.11. Automorphisms and regular matrices. An n × n-matrix A is called regular if it has the maximal rank n. Let φ be an automorphism of the n-dimensional linear space E and A = M(φ) the corresponding n × n-matrix relative to a basis x_ν (ν = 1...n). By the result of section 3.4 the rank of φ is equal to the rank of the matrix A. Consequently, the matrix A is regular. Conversely, every linear transformation φ: E → E having a regular matrix is an automorphism.
.
§ 2. Multiplication of matrices
To every regular matrix A
A' such that
there
exists an in verse matrix, i.e., a matrix
AA' =
=
.1,
where J denotes the unit matrix whose entries are In fact, let cp be the E automorphism of E such that M((p)=A and let be the inverse automorphism. Then = 1
whence
M(p)M(q,)' =
o(p)
= M(i)
J
and
M(ço')M(p)= These
M(t)= J.
equations show that the matrix
A' = is the inverse of the matrix A.
Problems 1. Verify the following properties: a) b)
(L4)*=2A*.
c) 2. A square-matrix is called upper (lower) triangular if all the elements below (above) the main diagonal are zero. Prove that sum and product of triangular matrices are again triangular.
3. Let p be linear transformation such that 'p2 = 'p. Show that there exists a basis in which 'p is represented by a matrix of the form: O...O
1 1
.
rn
O...O
1
o
0
n—rn 0
o n
Chapter III. Matrices
92
4. Denote by A13 the matrix having the entry I at the place (i,j) and zero elsewhere. Verify the formula =
Prove that the matrices form a basis of the space M"
X
Basis-transformation
§ 3.
3.12. Definition. Consider two bases and E. Then every vector (v = 1. ii) can be written as
l...n) of the space
. .
=
(3.22)
Similarly, (3.23)
The two n x n-matrices defined by (3.22) and (3.23) are inverse to each other. In fact, combining (3.22) and (3.23) we obtain — xi,— A
This is equivalent to
5:A = 0
— A
p
and hence it implies that
In a similar way the relations
are proved. Thus, any two bases of E are connected by a pair of inverse matrices. Conversely, given a basis (v = 1. n) and a regular n x n-matrix another basis can be obtained by = . .
To show that the vectors
are linearly independent, assume that
= 0. Then
=0
§ 3. Basis-transformation
and
93
hence, in view of the linear independence of the vectors
Multiplication with the inverse matrix
yields
(K=1...n). 3.13. Transformation of the dual basis. Let E* be a dual space of E, the dual of the basis (v I.. .n). Then the dual basis of and = where
is
(3.24)
a regular n x n-matrix. Relations (3.23) and (3.24) yield
;>.
x%) =
(3.25)
Now xi,> =
and
=
Substituting this in (3.25) we obtain I-'v
This shows that the matrix of the basis-transformation is the inverse of the matrix of the transformation The two basis-transformations and = (3.26) = are called contragradient to each other. The relations (3.26) permit the derivation of the transformation-law
for the components of a vector xeE under the basis-transformation Decomposing x relative to the bases
and
we obtain
and
From the two above equations we obtain in view of (3.26). =
=
(3.27)
Comparing (3.27) with the second equation (3.26) we see that the components of a vector are transformed exactly in the same way as the vectors of the dual basis.
Chapter III. Matrices
94
3.14. The transformation of the matrix of a linear mapping. In this section it will be investigated how the matrix of a linear mapping (p: E—÷ F is changed under a basis-transformation in E as well as in F. Let M(p; xi,, and M (p; be the ii x in-matrices of çø relative = = to the bases xv,, (v = .. in), respectively. Then and .1?, p = 1
.
1
(v=1...n).
and
Introducing the matrices A=
and
of the basis-transformations we then have the relations
-
(3.28)
B= and their inverse matrices,
and
=
=
=
=
(3.29) YK.
Equations (3.28) and (3.29) yield
and we obtain the following relation between the matrices =
and (n): (3.30)
Using capital letters for the matrices we can write the transformation formula (3.30) in the form = A M (p;
M ((p;
It shows that all possible matrices of the mapping φ are obtained from a particular matrix by left-multiplication with a regular n × n-matrix and right-multiplication with a regular m × m-matrix.
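This transformation rule is easy to check numerically; the following Python sketch (an illustration with arbitrarily chosen matrices, not part of the original text) uses the row convention of sec. 3.2 and verifies that both pairs of bases give the same image vector.

    import numpy as np

    M = np.array([[1., 0.],
                  [2., 1.],
                  [0., 3.]])                 # matrix of phi: R^3 -> R^2, rows = images of basis vectors
    A = np.array([[1., 1., 0.],
                  [0., 1., 0.],
                  [2., 0., 1.]])             # regular basis change in E
    B = np.array([[1., 1.],
                  [0., 1.]])                 # regular basis change in F

    M_new = A @ M @ np.linalg.inv(B)         # matrix of phi relative to the new bases

    xi_old = np.array([1., -2., 3.])         # coordinates of a vector in the old basis of E
    xi_new = xi_old @ np.linalg.inv(A)       # the same vector in the new basis
    # the image of the vector must come out the same in either pair of bases
    assert np.allclose(xi_old @ M, (xi_new @ M_new) @ B)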
Problems 1. Let f be a function defined in the set of all ii x n-matrices such that
f(TAT1)=f(A) for every regular matrix T. Define the function F in the space L (E; E) by
F (q) =
f (M (ço;
xv))
§ 4. Elementary transformations
95
where E is an n-dimensional linear space and (v = 1.. . n) is a basis of E. Prove that the function F does not depend on the choice of the basis is a linear transformation E—+E having the same 2. Assume that matrix relative to every basis (v = 1. n). Prove that p = 2i where 2 is a scalar. 3. Given the basis transformation . .
= X2
—
x2
—
x3
— X2
X3 = 2x2 + x3
find all the vectors which have the same components with respect to the bases x1, and
1,
§ 4.
2, 3).
Elementary transformations
3.15. Definition. Consider a linear mapping p: Then there exists a basis ..., n)ofE and a basis ..., n) ofFsuch that the corresponding matrix of 'p has the following normal-form:
r (3.31)
b
.
.
where r is the rank of 'p. In fact, let a basis of E such form a basis of the kernel. Then the vectors that the vectors ar+ bQ 'p aQ = 1, ..., r) are linearly independent and hence this system can be extended to a basis (b1, ..., bm) of F. It follows from the construction and that the matrix of 'p has the form (3.31). of the bases 1, ..., n) and 1, ..., m) be two arbitrary bases of Now let E and F. It will be shown that the corresponding matrix M('p; can be converted into the normal-form (3.31) by a number of elementary basis-transformations. These transformations are:
Chapter III. Matrices
(1.1.) Interchange of two vectors and x1(i+j). (1.2.) Interchange of two vectors Yk and Yi (k 1). (11.1.) Adding to a vector an arbitrary multiple of a vector (11.2.) Adding to a vector Yk an arbitrary multiple of a vector Yi (1+ k). it is easy to see that the four above transformations have the following effect on the matrix M((p): (1.1.) Interchange of the rows I andj. (1.2.) Interchange of the columns k and 1. (11.1.) Replacement of the row-vector by (11.2.) Replacement of the column-vector bk by It remains to be shown that every n x rn-matrix can be converted into the normal form (3.31) by a sequence of these elementary matrix-transformations and the operations be the given ii x rn-matrix. 3.16. Reduction to the normal-form. Let otherwise the It is no restriction to assume that at least one matrix is already in the normal-form. By the operations (1.1.) and (1.2.) this element can be moved to the place (1, 1). Then 0 and it is no restriction to assume that = 1. Now, by adding proper multiples of the first row to the other rows we can obtain a matrix whose first column consists of zeros except for Next, by adding certain multiples of the first column to the other columns this matrix can be converted into the form
o...o
1
0*
*
(3.32)
0* if all the elements
*
(v = 2.. .n, R = 2.. rn) are zero, (3.32) is the normal-
form. Otherwise there is an element This can be moved to the place (2,2) by the operations (1.1. and (1.2.). Hereby the first row and the first column are not changed. Dividing the second row by and applying the operations (11.1.) and (11.2.) we can obtain a matrix of the form 1
0
0
1
0 *
00*
*
In this way the original matrix is ultimately converted into the form (3.31.).
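The reduction just described can be mechanized directly. The following Python sketch (illustrative only, not part of the original text) applies elementary row and column operations of the types described above to bring a matrix into the normal form (3.31); the rank appears as the size of the identity block.

    import numpy as np

    def normal_form(A, tol=1e-12):
        """Convert A to the normal form (3.31) by elementary row and column operations."""
        A = A.astype(float).copy()
        n, m = A.shape
        r = 0
        while r < min(n, m):
            rows, cols = np.nonzero(np.abs(A[r:, r:]) > tol)
            if rows.size == 0:
                break
            i, j = rows[0] + r, cols[0] + r
            A[[r, i]] = A[[i, r]]              # interchange of two rows
            A[:, [r, j]] = A[:, [j, r]]        # interchange of two columns
            A[r] /= A[r, r]                    # normalize the pivot
            for i in range(n):
                if i != r:
                    A[i] -= A[i, r] * A[r]             # clear the pivot column
            for j in range(m):
                if j != r:
                    A[:, j] -= A[r, j] * A[:, r]       # clear the pivot row
            r += 1
        return A, r

    A = np.array([[0., 2., 4.],
                  [1., 1., 1.]])
    N, r = normal_form(A)
    print(N)                                   # identity block of size r in the upper left corner
    assert r == np.linalg.matrix_rank(A)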
3.17. The Gaussian elimination. The technique described in sec. 3.16 can be used to solve a system of linear equations by successive elimination. Let (3.33)
be a system of in linear equations in n unknowns. Before starting the elimination we perform the following reductions: If all coefficients in a certain row, say in the i-th row, are zero, consider the corresponding number on the right hand-side. If the i-th equation contains a contradiction and the system (3.33) has no solution. If the i-th equation is an identity and can be omitted. Hence, we can assume that at least one coefficient in every equation is different from zero. Rearranging the unknowns we can achieve that 40. Multiplying the first equation by — and adding it to the p-th equation we obtain a system of the form 1
+
= (3.34)
which is equivalent to the system (3.33).
Now apply the above reduction to the (rn—. 1) last equations of the system (3.34). If one of these equations contains a contradiction, the system (3.34) has no solutions. Then the equivalent system (3.33) does not have from the a solution either. Otherwise eliminate the next unknown, say reduced system.
Continue this process until either a contradiction arises at a certain step or until no equations are left after the reduction. In the first case, (3.33) does not have a solution. In the second case we finally obtain a triangular system 132+0 (3.35)
which is equivalent to the original system*).
*) If no equations are left after the reduction, then every n-tupie solution of (3.33). 7
Greub. Linear Algebra
...
is a
ChaI)ter I II. \4atrices
The system (3.35) can be solved in a step by step manner beginning
with (3.36)
1
—
Inserting (3.36) into the first (r— 1) equations we can reduce the system to a triangular one of r— equations. Continuing this way we finally obtain the solution of (3.33) in the form 1
(v=1...r)
JLr+ 1
In) are arbitrary parameters.
where the
Problems 1. Two n x ni-matrices C and C' are called equivalent if there exists a regular n x n-matrix A and a regular in x rn-matrix B such that C' = A CB. Prove that two matrices are equivalent if and only if they have the same rank. 2. Apply the Gauss elimination to the following systems: a) —
= 2.
+
b)
2,71+ 172+4173+ 3,71+4,72+ 2,7' + + 5,73 c)
+
2c'+c2
—
= 1,
= 3.
Chapter IV
Determinants In this chapter, except for the last paragraph, all vector spaces will be defined over afixed but arbitrarily chosen field F of characteristic 0.
§ 1. Determinant functions 4.1. Even and odd permutations. Let X be an arbitrary set and denote Let for each 1 by X" the set of ordered p-tuples (x1 Y
be a map from
to a second set, Y. Then every permutation
a new map
acP: X"-* Y
defined by
= It follows immediately from the definitions that (4.1)
and (4.2)
where i is the identity permutation. Now let X = Y= ZL and define cli by cP(x1
=
fl
—
•
x1 E ZL.
in.
Proposition II: Assume that E is an n-dimensional vector space and let P be a skew symmetric n-linear map from E to a vector space F. Then P is completely determined by its value on a basis of E. In particular, if P vanishes on a basis, then = 0. a, of E and write Proof: In fact, choose a basis a1
x)=
(,.= 1 ...
n).
Then çJ5
(x1
.. .
v,,) =
=
1)(a,
"P
1)
=
1)
.
a,,).
Since the first factor does not depend on P, the proposition follows. 4.3. Determinant functions. A determinant function in an n-dimensional vector space E is a skew symmetric n-linear function from E to F. In every n-dimensional vector space E (n 1) there are determinant functions which are not identically zero. In fact, choose a basis, f of the space L(E) and define the n-linear function P by
=
P(x1. ....
...
E E.
is the dual basis in E,
Then. if a1
(4.3)
Now set 4
Then A is a skew symmetric. Moreover, relation (4.3) yields A(a1
and so A
0.
=
Proposition III: Let E be an n-dimensional vector space and fix a nonzero determinant function 4 in E. Then every skew symmetric n-linear map ço from E to a vector space F determines a unique vector b E F such that = zl(x1
(p(x1
b.
(4.4)
of E so that zl (a1 = Then the n-linear map i/i: given by
Proof: Choose a basis a1
b=
p(a1
a,1).
= 4(x1
.
1
and set (4.5)
b
agrees with on this basis.Thus, by Proposition II. ,/i = p, i.e., = 4 Clearly, the vector b is uniquely determined by relation (4.4).
.
b.
Corollary: Let zl be a non-zero determinant function in E. Then every determinant function is a scalar multiple of 4.
Proposition IV: Let 4 be a determinant function in E (dim E=n). Then, the following identity holds: XEE, (4.6)
If the vectors x1 are linearly dependent a simple calculation shows that the left hand side of the above equation is zero. On the other hand, by sec. 4.2, 4(x1 Thus we may assume x,, form a basis of E. Then, writing
that the vectors x1
x we
have x1
=
(—
1
4 (xi,, x1
=4(x1
x1,
(4.7)
xv). x.
Problem Let E*, E be a pair of dual spaces and 4+0 be a determinant function in E. Define the function 4* of n vectors in E* as follows: Ifthevectorsx*v(v = .n)arelinearlydependent,then4* (x* 1•• .x*'l)= 0. 1
. .
Chapter I V. Detcrtiiinants
104
Ifthe vectors
(v 1 where
=
.n)are linearly independent, then zi * (x*
1
1
n) is the dual basis. Prove that /1* is a deter-
I
minant function in E*. § 2.
The determinant of a linear transformation
4.4. Definition. Let p be a linear transformation of the n-dimensional linear space E. To define the determinant of (p choose a non-trivial determinant function z1. Then the function defined by /14,
=
(x1 ...
A
x1
...
obviously is again a determinant function. Hence, by the uniquenesstheorem of section 4.3, /14, =
LI,
is a scalar. This scalar does not depend on the choice of A. In fact, if A' is another non-trivial determinant function, then zI'=2A and where
consequently Thus, the scalar
A ,,
=
2
=
is uniquely determined by the transformation p. It is
called the determinant of (p and it will be denoted by det (p. So we have the
following equation of definition: /14,
where A is an arbitrary non-trivial determinant function. In a less condensed form this equation reads = det (p A (x1 ...
A (p X1 ... p
(4.8)
In particular, if (p = 2i, then /14,
=
and hence
det(2i) = It follows from the above equation that the determinant of the identitymap is 1 and the determinant of the zero-map is zero. 4.5. Properties of the determinant. A linear transformation is regular if and only if its determinant is different from zero. To prove this, select a basis (v = 1.. .n) of E. Then A (p e1 ...
p
= det (p A (e1 ... en).
(4.9)
§ 2. The determinant of a linear transformation
If p is regular, the vectors p (v =
1.
. .
105
are linearly independent; hence
n)
(4.10)
Relations (4.9) and (4.10) imply that
+ 0.
det
+ 0. Then it follows from (4.9) that
Conversely, assume that det
Hence the vectors p (v = 1. . n) are linearly independent and Consider two linear transformations and i,Ii of E. Then
is regular.
.
det
det
In particular, if is a linear automorphism and p1 is the inverse automorphism, we obtain
detq1 det(p = deti =
1.
4.6. The classical adjoint. Let E be an n-dimensional vector space and let zi +0 be a determinant function in E. Let çoeL(E; E). Then an n-linear map E)
is given by
v,)x =
px1
j=
E.
.
1
It is easy to check that Ji is skew symmetric. Thus, by Proposition 11,
there is a unique linear transformation, ad(p), of E such that cP(x1
= z1(x1
ad(p)
E;
E
E.
This equation shows that the element ad(p)EL(E; E) is independent of the choice of A and hence it is uniquely determined by 'p. It is called the classical adjoint of'p.
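In terms of matrices the classical adjoint is the transposed matrix of cofactors (the adjugate). The following Python sketch (an illustration, not part of the original text; it relies on numpy determinants) computes it from this description and verifies the relations of Proposition V below.

    import numpy as np

    def classical_adjoint(A):
        """Adjugate of A: signed cofactors, transposed."""
        n = A.shape[0]
        adj = np.empty_like(A, dtype=float)
        for i in range(n):
            for j in range(n):
                minor = np.delete(np.delete(A, i, axis=0), j, axis=1)
                adj[j, i] = (-1) ** (i + j) * np.linalg.det(minor)   # note the transposition
        return adj

    A = np.array([[2., 1., 0.],
                  [0., 1., -1.],
                  [3., 0., 1.]])
    ad = classical_adjoint(A)
    d = np.linalg.det(A)
    assert np.allclose(ad @ A, d * np.eye(3))   # ad(phi) o phi = det(phi) * identity
    assert np.allclose(A @ ad, d * np.eye(3))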
Proposition V: The classical adjoint satisfies the relations

ad(φ) ∘ φ = ι · det φ and φ ∘ ad(φ) = ι · det φ.
. det(p.
Proof: Replacing x by pX in (4.14) we obtain x,) x1 = 4(x1
x,
IP
,...,x,) ad
x).
j=1
Now observe that, in view of the definition of the determinant and
Proposition IV.
P1
=det(p j= 1
=detq)
x1EE, XEE.
x
.
Composing these two relations yields
4(x1 ,...,x,) ad (p((px) = det
.
zl(x1 ,...,x,) x .
and so the first relation of the proposition follows. The second relation is established by applying p to (4.12) and using the identity in Proposition IV, with x replaced by (i= ... a). 1
Corollary: If φ is a linear transformation with det φ ≠ 0, then φ has an inverse and the inverse is given by

φ⁻¹ = (det φ)⁻¹ · ad(φ).
det go
Thus a linear transformation of E is a linear isomorphism if and only if its determinant is non-zero. 4.7. Stable subspaces. Let go: E—*E be a linear transformation and assume that E is the direct sum of two stable subspaces,
E=
E1
$E2.
Then linear transformations and
go2:E2—*E2
§ 2. The determinant of a linear transformation
107
are induced by (p. It will be shown that
det(p = detp1detq2.
Define the transformations
and
E—÷E
by
mE1 i
Then (P
and so det(p =
Hence it is sufficient to prove that det
=
det(p1
and
detk//2 = dettp2.
0 be a determinant function in E and h1 ... the function /11, defined by
Let zl
...
be a
basis of E2. Then
p=dimE,
... hqh
is a non-trivial determinant function in E1. Hence
= detp1A1(x1
A1 (Pi x1 ...
On the other hand we obtain from (4.14) ... xi,, b1 ... A1(x1 ... xi,). A (x1
=
bq)
These relations yield det(p1
The second formula (4.13) is proved in the same way.
Problems 1. Consider the linear transformation p:
(v=1...n) where
(v =
1.
.
.n)
is a basis of E. Show that
(4.13)
defined by
(4.14)
I \'.
Deicriiii
2. Let be a linear transformation and assume that E1 is a and stable subspace. Consider the induced transformations Prove that
detp = 3. Let x:E-+F be a linear isomorphism and p be a linear transformation of E. Prove that = detp. 4. Let E be a vector space of dimension ii and consider the space L(E; E) of linear transformations. a) Assume that F is a function in L (E; F) satisfying
F can be written in the form
F(p)= f(detq) wheref:F—+F is a mapping such that
=f(2)f(p). b) Suppose that F satisfies the additional condition that Then, if E is a real vector space,
= detp or F(p) = and if E is a complex vector space F(ço) = detçc.
Hint for part a): Let (i== 1.. .n) be a basis for E and define the transformations and by
v+i v=t and
v=i
.
Show first that =I and that F(ço1)
is
independent of i.
i,j=1...n
§ 3. The determinant of a matrix
4. Let E be a vector space with a countable basis and assume that a function F is given in L (E; E) which satisfies the conditions of problem 3a). Prove that
peL(E;E).
Hint: Construct an injective mapping that
such
and a surjective mapping i/i
=0.
5. Let w be a linear transformation of E such that w2 = i. Show that where r is the rank of the map i — w. — 6. Let j be a linear transformation of E such that = — i. Show that then the dimension of E must be even. Prove that det j = 1. 7. Prove the following properties of the classical adjoint: det w = (
1
(i)
(ii) det (iii) If p has rank n—i, then Im ad((p)=ker cm (iv) If p has rank n—2, then ad(p)=0. (v) 8.
ad(ad p)=(det If n=2 show that
ad(p)=i . trp—p. § 3.
The determinant of a matrix
4.8. Definition. Let ço be a linear transformation of E and sponding matrix relative to a basis (v = 1. . n). Then .
=
'p
Substituting
in (4.8) we obtain A ('p
= det 'p A (e1 ...
... 'p
The left-hand side of this equation can be written as A ('p e1, ... 'p
= =
We thus obtain
A
ep,
r(1)
... c(n)
.A(e ...
en).
the corre-
1 1()
This formula shows how the determinant of φ is expressed in terms of the corresponding matrix. We now define the determinant of an n × n-matrix A by

det A = Σ_σ ε_σ α_1^{σ(1)} ⋯ α_n^{σ(n)},   (4.16)

the sum being extended over all permutations σ of (1, ..., n). Then equation (4.15) can be written as

det φ = det A.
(4.17)
Now let A and B be two n x n-matrices. Then det (A B) = det A det B.
(4.18)
In fact, let E be an n-dimensional vector space and define the linear transformations 'p and of E such that (with respect to a given basis)
M(φ) = A and M(ψ) = B. Then

AB = M(φ) M(ψ) = M(ψ ∘ φ),

whence

det(AB) = det(ψ ∘ φ) = det ψ · det φ = det B · det A = det A · det B.
= det 'p det
Formula (4.18) yields for two inverse matrices
detAdet(A1) = detJ =
1
(J unit-matrix)
showing that
det(A⁻¹) = (det A)⁻¹.
Finally note that if an (n × n)-matrix A has the block form

A = ( A1  0 )
    ( 0   A2 )

then

det A = det A1 · det A2
(4.19)
as follows from sec. 4.7. 4.9. The determinant considered as a function of the rows. If the rows
of the matrix A are considered as vectors of the space = . .. the determinant det A appears as a function of the n vectors (v = n). To investigate this function define a linear transformation 'p of by 1
(v=1...n)
. . .
§ 3. The determinant of a matrix where the vectors
111
are the n-tuples
(v=1...n). Now let A be the deterThen A is the matrix of relative to the basis minant function in r which assumes the value one at the basis (v = 1.. .n),
A
A
A
det
A (a1 ... an).
A
(4.20)
This formula shows that the determinant of A considered as a function of the row-vectors has the following properties: 1. The determinant is linear with respect to every row-vector. 2. If two row-vectors are interchanged the determinant changes the sign.
3. The determinant does not change if to a row-vector a multiple of another row-vector is added. 4. The determinant is different from zero if and only if the row-vectors are linearly independent. An argument similar to the one above shows that
where the bV are the column-vectors of A. It follows that the properties 1—4 remain true if the determinant of A is considered as a function of the column-vectors.
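Formula (4.16) can be evaluated literally for small matrices. The following Python sketch (illustrative only and exponentially slow, not part of the original text) sums over all permutations with their signs and compares the result with numpy's determinant.

    import numpy as np
    from itertools import permutations

    def det_by_definition(A):
        """Sum over all permutations, each weighted by its sign, as in (4.16)."""
        n = A.shape[0]
        total = 0.0
        for perm in permutations(range(n)):
            inversions = sum(1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j])
            sign = -1.0 if inversions % 2 else 1.0
            prod = 1.0
            for row, col in enumerate(perm):
                prod *= A[row, col]
            total += sign * prod
        return total

    A = np.array([[1., 2., 0.],
                  [3., -1., 4.],
                  [0., 2., 2.]])
    assert np.isclose(det_by_definition(A), np.linalg.det(A))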
Problems 1. Let A = (cd) be a matrix such that
= 0 if v 0. However,
if det p
0
if p(n — p) is even if p(n—p)is odd.
(4.71)
is positive with respect to the original orienSince the basis of tation, relation (4.71) shows that the induced orientation coincides with the original orientation if and only if p(n—p) is even. 4.30. Example. Consider a 2-dimensional linear space 12. Given a basis (e1, e2) we choose the orientation of E in which the basis e1, e2 is positive. Then the determinant function A, defined by
zl(e1,e2) =
1
1,2) generrepresents this orientation. Now consider the subspace 1,2) with the orientation defined by Then E1 induces in ated by 122 the given orientation, but E2 induces in E1 the inverse orientation.
§ 8. Oriented vector spaces
135
In fact, defining the determinant-functions A1 and A1 (x)
A (e2,
x)
xe E1,
A2 (x) = A
and
A2 in E1 and in E2 by (e1,
x)
x e E2
we find that
4.31.
A2(e2) = A(e1, e2) =
1
Intersections. Let E1
and E2 be
and
A1 (e1) =
A(e2,
e1) = —
two subspaces of E
E = E1 + E2
1.
such
that (4.72)
and assume that orientations are given in E1, E2
and E. It
will be shown
that then an orientation is induced in the intersection E12=E1 n
E2.
Setting
dimE1 = p,
dimE2 = q,
dimE12 = r
we obtain from (4.72) and (1.32) that
r= p + q — n. Now consider the isomorphisms ço:E/E1 and Vi:E/E2
E1/E12.
Since orientations are induced in E/E1 and E/E2 these isomorphisms determine orientations in E2/E12 and in E1/E12 respectively. Now choose two positive bases +1• a,, and hr . bq in E1/E12 and E2/E12 respecand bJEE2 be vectors such that tively and let .
and
where it1 and
it2
denote the canonical projections and
ir2:E2—+E2/E12.
Now define the function A12 by A12 (z1 ... Zr) = A (Z1 ... Zr, ar+ 1
a,,, br+ 1 .. bq).
(4.73)
In a similar way as in sec. 4.30 it is shown that the orientation defined in depends only on the orientations of E1, and E (and not on E12 by the choice of the vectors and ba). Hence an orientation is induced in E12.
Chapter IV. Determinants
Interchanging E1 and E2 in (4.73) we obtain
(:i
•..
(:i
=A
r,
br+ 1
bq,
1
ar).
(4.74)
Hence it follows that =
(q-r)4
1)(P_r)
(n-q)4
= (_
(475)
Now consider the special case of a direct decomposition. Then p+q=n and E12=(0). The function /112 reduces to the scalar (4.76)
Moreover the number—2 depends
It follows from (4.76) that
only on the orientations of E1, E2 and E. It is called the intersection number
of the oriented subspaces E1 and E2. From (4.75) we obtain the relation =(
= 1.. ii) be two bases of E. Basis deformation. Let and is called deformable into the basis if there exist n Then the basis continuous mappings 4.32.
t t1
t
satisfying the conditions 1. (t0) = and (t1) = (t) (v = 2. The vectors 1
The
.
.11) are
linearly independent for every fixed t.
deformability of two bases is obviously an equivalence relation.
Hence, the set of all bases of E is decomposed into classes of deformable bases. We shall now prove that there are precisely two such classes. This is a consequence of the following Theorem: Two bases and (v = 1.. ./1) are deformable into each other if and only if the linear transformation p: E defined by 'p = has positive determinant.
Proof: Let 4+0 be an arbitrary determinant function. Then formula lii) are 4.17 together with the observation that the components continuous functions of shows that the mapping Ex x E—*R defined by A is continuous. into the is a deformation of the basis Now assume that basis Consider the real valued function (t) ...
§ 8. Oriented vector spaces
137
The continuity of the function 4 and the mappings
the function Ji
is
implies that
continuous. Furthermore,
ck(t)+0 the vectors (t) (v = I .n) are linearly independent. Thus the function cP assumes the same sign at t=t0 and at t=t1. But because
. .
= 4(pa1 ...
= A(b1
= det(pA(a1 ...
=
whence
detço >0 and so the first part of the theorem is proved. 4.33. Conversely, assume that the linear transformation a deformation assume first that the vector n-tuple (a1 ...
...
(4.77)
1). Then consider the de-
is linearly independent for every 1(1 composition = By the above assumption the vectors (a1..
are linearly independ-
ent, whence /3" + 0. Define the number c,, by
1+1 if if
L—1
It will be shown that the n mappings
Ix,(t)=a.(v=1...n—1) = define
(1 —
+
(0O.
(4.79)
and by the result of sec. 4.32 Relations (4.78), and (4.79) imply that > 0.
det But whence
+1. Thus, the number of equal to — is even. Rearranging the vectors (v = 1.. n) we can achieve that 1
.
5—i
vl+i
(v=1...2p)
(v=2p+1...n).
§ 8. Oriented vector spaces
139
Then a deformation b1
...
e,,
-÷ (b1 ...
is defined by the mappings
+
=— =—
(v — —
1
—
(v=2p+1...n) 4.34. The case remains to be considered that not all the vector n-tuples (4.77) are linearly independent. Let A + 0 be a determinant function. The linear independence of the vectors (v = 1 . n) implies that . .
Since zl is a continuous function, there exists a spherical neighbourhood that Ua
(v= 1...n).
if XvEUa
Choose a vector a'1 e Uaj which is not contained in the (n — 1)-dimensional subspace generated by the vectors (b2. . Then the vectors (a'1, b2.. are linearly independent. Next, choose a vector a'2 E Ua2 which is not con-
tained in the (n— 1)-dimensional subspace generated by the vectors (a'1, b2. Then the vectors (a'1, a'2, b3. are linearly independent. Going on this way we finally obtain a system of n vectors (v = 1.. n) .
.
.
such that every n-tuple ...
(a'1
is
linearly independent. Since
Hence the vectors
(1= 1 ... ;i)
it follows that
(v = 1. . n) form a basis of E. The n mappings .
define a deformation (4.80)
In fact,
(t)(0 t
1) is contained in Uav whence A
(t) ...
(t)) + 0
(0
t
This implies the linear independence of the vectors
1).
(t)(v = 1.. .n).
140
Chapter IV. Determinants
By the result of sec. 4.33 there exists a deformation ...
... ba).
(4.81)
The two deformations (4.80) and (4.81) yield a deformation (a1 ...
... ba).
This completes the proof of the theorem in sec. 4.32. 4.35. Basis deformation in an oriented linear space. If an orientation is given in the linear space E, the theorem of sec. 4.32 can be formulated as and (v = I n) can be deformed into each other follows: Two bases if and only if they are both positive or both negative with respect to the given orientation. In fact, the linear transformation . .
p:
.
(v = I ... a)
—*
has positive determinant if and only if the bases and b%, (v = I .. are both positive or both negative. Thus the two classes of deformable bases consist of all positive bases and all negative bases. 4.36. Complex vector spaces. The existence of two orientations in a
real linear space is based upon the fact that every real number 2t0 is either positive or negative. Therefore it is not possible to distinguish two orientations of a complex linear space. In this context the question arises whether any two bases of a complex linear space can be deformed into each other. It will be shown that this is indeed always possible. Consider two bases and (v = a) of the complex linear space E. As in sec. 4.33 we can assume that the vector n-tuples 1
(a1...
. .
...
are linearly independent for every 1(1 1). It follows from the above assumption that the coefficient in the decomposition = is
different from zero. The complex number
Now choose a positive continuous function
r(0) =
1,
r(I) = r
can
be written as
1) such that (4.82)
§ 8. Oriented vector spaces
and a continuous function
1) such that
(t)(O t
(4.83)
Define mappings
(t),
(0
t 1)
(v =
=
by 1 ...
n — 1) )
and (t)
=t
Then the vectors (t) (v = fact, assume a relation
1 ..
(4.84)
1.
t
flV
+ r(t) .
n)
are linearly independent for every t. In
0. Then
+
r (t)
+
=
0
whence
(v==1...n—1) and = 0.
1, the last equation implies that )J' = 0. r (t) + 0 for 0 t (n— 1) equations reduce to in— 1). It follows from (4.84), (4.82) and (4.83) that
Since
Hence the
first
=
and
=
Thus the mappings (4.84) define a deformation (a1 ...
—÷
Continuing this way we obtain after 1...n). into the basis
(a1 n
...
ba).
steps a deformation of the basis
Problems 1. Let E be an oriented n-dimensional linear space and
(v = 1.. .11) be
the subspace generated by the vectors a positive basis; denote by is positive with respect Prove that the basis (x1 to
the orientation induced in
by the vector
l)i —
1
xL.
Chapter IV. Determinants
142
2. Let E be an oriented vector space of dimension 2 and let a1, a2 be two linearly independent vectors. Consider the 1-dimensional subspaces E1 and E2 generated by a1 and a2 and define orientations in E. such that the bases a are positive (i= 1,2). Show that the intersection number of E1 and E2 is + 1 if and only if the basis a1, a2 of E is positive.
3. Let E be a vector space of dimension 4 and assume that (v
= ... 1
4)
is a basis of E. Consider the following quadruples of vectors:
I. e1+e2,e1+e2+e3,e1+e2+e3+e4,e1—e2+e4 II. e1+2e3, e2+e4, e2—e1+e4, e2
III.
e2+e4, e3+e2, e2—e1
1V.
e1+e2—e3, e2—e4, e3+e2, e2—e1
V. e1—3e3, e2+e4, e2—e1—e4, e2. a) Verify that each quadruple is a basis of E and decide for each pair of bases if they determine the same orientation of E. b) If for any pair of bases, the two bases determine the same orientation, construct an explicit deformation. c) Consider E as a subspace of a 5-dimensional vector space E and assume that (v = 1, . .5) is a basis of E. Extend each of the bases above to a basis of which determines the same orientation as the basis (v= 1, ..., 5). Construct the corresponding deformations explicitly. 4. Let E be an oriented vector space and let E1, E2 be two oriented subspaces such that E== E1 + E2. Consider the intersection E1 fl E2 together with the induced orientation. Given a positive basis (c1, ..., Cr) of extend it to a positive basis (c1, ..., Cr, ar+ E1 fl a positive basis (C1, ..., Cr, br+ i, ..., bq) of E2. Prove that then (C1, ..., Cr, ar+ 1' ..., ap, br+ i, ..., bq) is a positive basis of E. .
5. Linear isotopices. Let E be an n-dimensional real vector space. will be called linearly Two linear automorphisms p: and i/i: (I the closed unit isotopic if there is a continuous map 1: I x and such that for each interval) such that x)=p(x), x) is a linear automorphism. tEl the map given by (i) Show that two automorphisms of E are linearly isotopic if and only if their determinants have the same sign. (ii) Let j: E—+E be a linear map such that = — Show that j is linearly isotopic to the identity map and conclude that detj= 1. 6. Complex structures. A complex structure in real vector space E is a linear transformation j: E—+E satisfying j2 = — 1.
§8. Oriented vector spaces
143
(i) Show that a complex structure exists in an n-dimensional vector space if and only if ii is even. (ii) Let j be a complex structure in E where dimE=2n. Let 4 be a determinant function. Show that for (v= 1 ... 11) either 4(x1
or 4(x1 ,...,x,1,jx1 The natural orientation of (E, J) is
defined to be the orientation re-
presented by a non-zero determinant function satisfying the first of these.
Conclude that the underlying real vector space of a complex space carries a natural orientation. (iii) Let be a vector space with complex structure and consider the complex structure (E, —j). Show that the natural orientations of and (E, —j) coincide if and only if n is even (dimE=2n).
Chapter V
Algebras

In paragraphs one and two all vector spaces are defined over a fixed, but arbitrarily chosen field Γ of characteristic 0.
§ 1. Basic properties

5.1. Definition: An algebra, A, is a vector space together with a mapping A × A → A such that the conditions (M1) and (M2) below both hold. The image of two vectors x ∈ A, y ∈ A under this mapping is called the product of x and y and will be denoted by xy. The mapping A × A → A is required to satisfy:

(M1)  (λx₁ + μx₂) y = λ(x₁ y) + μ(x₂ y)
(M2)  x(λy₁ + μy₂) = λ(x y₁) + μ(x y₂).
As an immediate consequence of the definition we have that
0·x = x·0 = 0.

Suppose B is a second algebra. Then a linear mapping φ: A → B is called a homomorphism (of algebras) if φ preserves products; i.e.,

φ(xy) = φx · φy.    (5.1)

A homomorphism that is injective (resp. surjective, bijective) is called a monomorphism (resp. epimorphism, isomorphism). If B = A, φ is called an endomorphism.

Note: To distinguish between mappings of vector spaces and mappings of algebras, we reserve the word linear mapping for a mapping between vector spaces satisfying (1.8), (1.9), and homomorphism for a linear mapping between algebras which satisfies (5.1).

Let A be a given algebra and let U, V be two subsets of A. We denote by U·V the set of linear combinations of products u·v, u ∈ U, v ∈ V.
Every vector a ∈ A induces a linear mapping μ(a): A → A defined by

μ(a) x = a x.    (5.2)

μ(a) is called the multiplication operator determined by a.
An algebra A is called associative if

x(yz) = (xy)z,   x, y, z ∈ A,

and commutative if

xy = yx,   x, y ∈ A.

From every algebra A we can obtain a second algebra A^opp by defining

(x y)^opp = y x.

A^opp is called the algebra opposite to A. It is clear that if A is associative then so is A^opp. If A is commutative we have A^opp = A.

If A is an associative algebra, a subset S ⊂ A is called a system of generators of A if each vector x ∈ A is a linear combination of products of elements in S,

x = Σ_(ν) λ^{ν1…νp} x_{ν1} ⋯ x_{νp},   x_{νi} ∈ S,  λ^{ν1…νp} ∈ Γ.
A unit element (or identity) in an algebra is an element e such that for every x

x e = e x = x.    (5.3)

If A has a unit element, then it is unique. In fact, if e and e' are unit elements, we obtain from (5.3) e = e e' = e'.

Let A be an algebra with unit element e_A and φ be an epimorphism of A onto a second algebra B. Then e_B = φ e_A is the unit element of B. In fact, if y ∈ B is arbitrary, there exists an element x ∈ A such that y = φx. This gives

y e_B = φx · φe_A = φ(x e_A) = φ x = y.

In the same way it is shown that e_B y = y.

An algebra with unit element is called a division algebra if to every element a ≠ 0 there is an element a⁻¹ such that a a⁻¹ = a⁻¹ a = e.
5.2. Examples: 1. Consider the space L(E; E) of all linear transformations of a vector space E. Define the product of two transformations by

ψ φ = ψ ∘ φ.

The relations (2.17) imply that the mapping (φ, ψ) → ψφ satisfies (M1) and (M2) and hence L(E; E) is made into an algebra. L(E; E) together with this multiplication is called the algebra of linear transformations of E and is denoted by A(E; E). The identity transformation ι acts as unit element in A(E; E). It follows from (2.14) that the algebra A(E; E) is associative.
However, it is not commutative if dim E ≥ 2. In fact, write E = (x₁) ⊕ (x₂) ⊕ F, where (x₁) and (x₂) are the one-dimensional subspaces generated by two linearly independent vectors x₁ and x₂, and F is a complementary subspace. Define linear transformations φ and ψ by

φ x₁ = 0,  φ x₂ = x₁,  φ y = 0, y ∈ F

and

ψ x₁ = x₂,  ψ x₂ = 0,  ψ y = 0, y ∈ F.

Then

(φ ψ) x₂ = φ 0 = 0   while   (ψ φ) x₂ = ψ x₁ = x₂,

whence φ ψ ≠ ψ φ.
Suppose now that A is an associative algebra and consider the linear mapping μ defined by

μ(a) x = a x.    (5.4)

Then we have that

μ(a b) x = a b x = μ(a) μ(b) x

whence

μ(a b) = μ(a) μ(b).

This relation shows that μ is a homomorphism of A into A(A; A).
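As a simple illustration (an algebra chosen ad hoc for this purpose, not one of the examples below): let A be the two-dimensional associative commutative algebra with basis e, n, unit element e and n² = 0. For x = αe + βn one finds
\[
\mu(\alpha e + \beta n)\,e = \alpha e + \beta n,\qquad
\mu(\alpha e + \beta n)\,n = \alpha n,
\]
so that, writing the images of the basis vectors as columns,
\[
M\bigl(\mu(\alpha e + \beta n)\bigr) = \begin{pmatrix}\alpha & 0\\ \beta & \alpha\end{pmatrix},
\qquad
M\bigl(\mu(x)\bigr)\,M\bigl(\mu(y)\bigr) = M\bigl(\mu(x y)\bigr),\quad x, y \in A,
\]
in accordance with (5.4).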
Example 2: Let M^{n×n} be the vector space of (n × n)-matrices for a given integer n and define the product of two (n × n)-matrices by formula (3.20). Then it follows from the results of sec. 3.10 that M^{n×n} is made into an associative algebra under this multiplication with the unit matrix as unit element.

Now consider a vector space E of dimension n with a distinguished basis e_ν (ν = 1, ..., n). Then every linear transformation φ: E → E determines a matrix M(φ). The correspondence φ → M(φ) determines a linear isomorphism of A(E; E) onto M^{n×n}. In view of sec. 3.10 we have that

M(ψ φ) = M(φ) M(ψ);

i.e., M is an isomorphism of the algebra A(E; E) onto the opposite algebra (M^{n×n})^opp.

Example 3: Suppose Γ₁ ⊂ Γ is a subfield. Then Γ is an algebra over Γ₁. We show first that Γ is a vector space over Γ₁. In fact, consider the mapping Γ₁ × Γ → Γ defined by
(λ, x) → λ x,   λ ∈ Γ₁, x ∈ Γ.

It satisfies the relations

(λ + μ) x = λ x + μ x
λ(x + y) = λ x + λ y
(λ μ) x = λ(μ x)
1 x = x

where λ, μ ∈ Γ₁ and x, y ∈ Γ. Thus Γ is a vector space over Γ₁. Now define a multiplication in Γ by (x, y) → x y (field multiplication). Then (M1) and (M2) follow from the distributive laws for field multiplication. Hence Γ is an associative commutative algebra over Γ₁ with 1 as unit element.
Example 4: Let C^r be the vector space of functions of a real variable t which have derivatives up to order r. Defining the product by

(f g)(t) = f(t) g(t)

we obtain an associative and commutative algebra in which the function f(t) = 1 acts as unit element.

5.3. Subalgebras and ideals. A subalgebra, A₁, of an algebra A is a linear subspace which is closed under the multiplication in A; that is, if x and y are arbitrary elements of A₁, then xy ∈ A₁. Thus A₁ inherits the structure of an algebra from A. It is clear that a subalgebra of an associative (commutative) algebra is itself associative (commutative).

Let S be a subset of A, and suppose that A is associative. Then the subspace B ⊂ A generated (linearly) by elements of the form

s₁ ⋯ s_r,   s_i ∈ S,

is clearly a subalgebra of A, called the subalgebra generated by S. It is easily verified that B is the intersection of all subalgebras of A containing S.
A right (left) ideal in an algebra A is a subspace I such that for every x ∈ I and every y ∈ A, xy ∈ I (yx ∈ I). A subspace that is both a right and a left ideal is called a two-sided ideal, or simply an ideal in A. Clearly, every right (left) ideal is a subalgebra.

As an example of an ideal, consider the subspace A² (linearly generated by the products xy). A² is clearly an ideal and is called the derived algebra.

The ideal I generated by a set S is the intersection of all ideals containing S. If A is associative, I is the subspace of A generated (linearly) by elements of the form

s, a s, s a,   s ∈ S, a ∈ A.

In particular, every single element a generates an ideal I_a; I_a is called the principal ideal generated by a.
Example 5: Suppose A is an algebra with unit element e, and let φ: Γ → A be the linear mapping defined by

φ λ = λ e.

Considering Γ as an algebra over itself we have that

φ(λ μ) = (λ μ) e = (λ e)(μ e) = φ λ · φ μ

whence φ is a homomorphism. Moreover, if λ e = 0 then λ = 0. It follows that φ is a monomorphism. Consequently we may identify Γ with its image under φ. Then Γ becomes a subalgebra of A and scalar multiplication coincides with algebra multiplication. In fact, if λ is any scalar, then

λ a = (λ e) a.

Example 6: Given an element a of an associative algebra consider the set, N_a, of all elements x ∈ A such that a x = 0. If x ∈ N_a then we have for every y ∈ A

a(x y) = (a x) y = 0

and so x y ∈ N_a. This shows that N_a is a right ideal in A. It is called the right annihilator of a. Similarly the left annihilator of a is defined.

5.4. Factor algebras. Let A be an algebra and B be an arbitrary subspace of A. Consider the canonical projection

π: A → A/B.
It will be shown that A/B admits a multiplication such that π is a homomorphism if and only if B is an ideal in A.

Assume first that there exists such a multiplication in A/B. Then for every x ∈ A, y ∈ B, we have

π(x y) = π x · π y = π x · 0 = 0;

it follows that x y ∈ B, and so B must be an ideal.

Conversely, assume B is an ideal. Then define the multiplication in A/B by

x̄ ȳ = π(x y)    (5.5)

where x and y are any representatives of x̄ and ȳ respectively. It has to be shown that the above product does not depend on the choice of x and y. Let x' and y' be two other elements such that π x' = x̄ and π y' = ȳ. Then

x' − x ∈ B  and  y' − y ∈ B.

Hence we can write

x' = x + b,  b ∈ B,  and  y' = y + c,  c ∈ B.

It follows that

x' y' − x y = b y + x c + b c ∈ B

and so

π(x' y') = π(x y).

The multiplication in A/B clearly satisfies (M1) and (M2) as follows from the linearity of π. Finally, rewriting (5.5) in the form

π(x y) = π x · π y

we see that π is a homomorphism and that the multiplication in A/B is uniquely determined by the requirement that π be a homomorphism. The vector space A/B together with the multiplication (5.5) is called the factor algebra of A with respect to the ideal B. It is clear that if A is associative (commutative) then so is A/B. If A has a unit element e then ē = π e is the unit element of the algebra A/B.
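As a simple illustration (added here for concreteness, using the algebra of Example 4): let A be the algebra of C¹-functions f: ℝ → ℝ and let
\[
B = \{\, f \in A : f(0) = 0 \,\}.
\]
B is an ideal, since (fg)(0) = f(0)g(0) = 0 whenever f(0) = 0. In the factor algebra, π f · π g = π(f g), and the assignment π f ↦ f(0) is well defined (the value at 0 does not depend on the representative) and is an isomorphism of A/B onto ℝ, regarded as an algebra over itself.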
5.5. Homomorphisms. Suppose A and B are algebras and φ: A → B is a homomorphism. Then the kernel of φ is an ideal in A. In fact, if x ∈ ker φ and y ∈ A are arbitrary we have that

φ(x y) = φ x · φ y = 0 · φ y = 0

whence x y ∈ ker φ. In the same way it follows that y x ∈ ker φ. Next consider the subspace Im φ ⊂ B. Since for every two elements x, y ∈ A

φ x · φ y = φ(x y) ∈ Im φ,

it follows that Im φ is a subalgebra of B.
Now let φ̄: A/ker φ → B be the induced injective linear mapping. Then we have the commutative diagram φ = φ̄ ∘ π, where π: A → A/ker φ is the canonical projection, and since π is a homomorphism it follows that

φ̄(x̄ ȳ) = φ(x y) = φ x · φ y = φ̄ x̄ · φ̄ ȳ.

This relation shows that φ̄ is a homomorphism and hence a monomorphism. In particular, the induced mapping

φ̄: A/ker φ → Im φ

is an isomorphism.

Finally, assume that C is a third algebra, and let ψ: B → C be a homomorphism. Then the composition ψ ∘ φ: A → C is again a homomorphism. In fact, we have

(ψ ∘ φ)(x y) = ψ(φ x · φ y) = ψ φ x · ψ φ y = (ψ ∘ φ) x · (ψ ∘ φ) y.
Let φ: A → B be any homomorphism of associative algebras and S be a system of generators for A. Then φ determines a set map φ₀: S → B given by

φ₀ x = φ x,   x ∈ S.

The homomorphism φ is completely determined by φ₀. In fact, if

x = Σ_(ν) λ^{ν1…νp} x_{ν1} ⋯ x_{νp},   x_{νi} ∈ S,  λ^{ν1…νp} ∈ Γ,

is an arbitrary element, we have that

φ x = Σ_(ν) λ^{ν1…νp} φ₀ x_{ν1} ⋯ φ₀ x_{νp}.
Proposition I: Let φ₀: S → B be an arbitrary set map. Then φ₀ can be extended to a homomorphism φ: A → B if and only if

Σ_(ν) λ^{ν1…νp} φ₀ x_{ν1} ⋯ φ₀ x_{νp} = 0  whenever  Σ_(ν) λ^{ν1…νp} x_{ν1} ⋯ x_{νp} = 0.    (5.6)

Proof: It is clear that the above condition is necessary. Conversely, assume that (5.6) is satisfied. Then define a mapping φ: A → B by

φ(Σ_(ν) λ^{ν1…νp} x_{ν1} ⋯ x_{νp}) = Σ_(ν) λ^{ν1…νp} φ₀ x_{ν1} ⋯ φ₀ x_{νp}.    (5.7)

To show that φ is, in fact, well defined we notice that if

Σ_(ν) λ^{ν1…νp} x_{ν1} ⋯ x_{νp} = Σ_(μ) κ^{μ1…μq} y_{μ1} ⋯ y_{μq}

then

Σ_(ν) λ^{ν1…νp} x_{ν1} ⋯ x_{νp} − Σ_(μ) κ^{μ1…μq} y_{μ1} ⋯ y_{μq} = 0.

In view of (5.6)

Σ_(ν) λ^{ν1…νp} φ₀ x_{ν1} ⋯ φ₀ x_{νp} − Σ_(μ) κ^{μ1…μq} φ₀ y_{μ1} ⋯ φ₀ y_{μq} = 0,

and so the two representations yield the same value for φ. It follows from (5.7) that

φ x = φ₀ x,  x ∈ S,
φ(λx + μy) = λ φx + μ φy,
and
φ(x y) = φ x · φ y,
and hence φ is a homomorphism.

Now suppose that (e_α) is a basis for A and let φ: A → B be a linear map such that

φ(e_α e_β) = φ(e_α) φ(e_β)  for each α, β.

Then φ is a homomorphism, as follows from the relation

φ(x y) = φ((Σ_α ξ^α e_α)(Σ_β η^β e_β)) = Σ_{α,β} ξ^α η^β φ(e_α e_β)
       = Σ_{α,β} ξ^α η^β φ(e_α) φ(e_β) = (Σ_α ξ^α φ(e_α))(Σ_β η^β φ(e_β)) = φ(x) φ(y).
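To illustrate the role of condition (5.6) in a trivially small case (chosen only for this purpose): take A = Γ, regarded as an algebra over itself as in Example 3, and S = {1}. The relation 1·1 − 1 = 0 holds in A, so (5.6) requires
\[
\varphi_0(1)\,\varphi_0(1) - \varphi_0(1) = 0 .
\]
Hence a set map φ₀: 1 ↦ b ∈ B extends to a homomorphism φ(λ·1) = λ b if and only if b² = b; for instance b = e_B always works when B has a unit element.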
5.6. Derivations. A linear mapping θ: A → A of an algebra into itself is called a derivation if

θ(x y) = θx · y + x · θy,   x, y ∈ A.    (5.8)

As an example let A be the algebra of C^r-functions f: ℝ → ℝ and define the mapping θ by θ: f → f', where f' denotes the derivative of f. Then the elementary rules of calculus imply that θ is a derivation.

If A has a unit element e it follows from (5.8) that

θ e = θe · e + e · θe = θe + θe,

whence θe = 0. A derivation is completely determined by its action on a system of generators of A, as follows from an argument similar to that used to prove the same result for homomorphisms. Moreover, if θ: A → A is a linear map such that

θ(e_α e_β) = θe_α · e_β + e_α · θe_β,

where (e_α) is a basis for A, then θ is a derivation in A.

For every derivation θ we have the Leibniz formula

θⁿ(x y) = Σ_{r=0}^{n} \binom{n}{r} θ^r x · θ^{n−r} y.    (5.9)

In fact, for n = 1, (5.9) coincides with (5.8). Suppose now by induction that (5.9) holds for some n. Then

θ^{n+1}(x y) = θ(θⁿ(x y)) = Σ_{r=0}^{n} \binom{n}{r} (θ^{r+1} x · θ^{n−r} y + θ^r x · θ^{n−r+1} y)
 = x · θ^{n+1} y + Σ_{r=1}^{n} (\binom{n}{r−1} + \binom{n}{r}) θ^r x · θ^{n+1−r} y + θ^{n+1} x · y
 = Σ_{r=0}^{n+1} \binom{n+1}{r} θ^r x · θ^{n+1−r} y,

and so the induction is closed.
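For instance, for n = 2 and the derivation θ: f → f' of the example above, formula (5.9) reduces to the familiar rule
\[
\theta^2(f g) = \sum_{r=0}^{2}\binom{2}{r}\,\theta^r f\cdot\theta^{2-r} g
             = f''g + 2 f' g' + f g''.
\]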
The image of a derivation θ in A is of course a subspace of A, but it is in general not a subalgebra. Similarly, the kernel is a subalgebra, but it is not, in general, an ideal. To see that ker θ is a subalgebra, we notice that for any two elements x, y ∈ ker θ

θ(x y) = θx · y + x · θy = 0,

whence x y ∈ ker θ.

It follows immediately from (5.8) that a linear combination of derivations θ_i: A → A is again a derivation in A. But the product of two derivations θ₁, θ₂ satisfies

(θ₁ θ₂)(x y) = θ₁(θ₂ x · y + x · θ₂ y) = θ₁θ₂ x · y + θ₂ x · θ₁ y + θ₁ x · θ₂ y + x · θ₁θ₂ y    (5.10)

and so is, in general, not a derivation. However, the commutator [θ₁, θ₂] = θ₁θ₂ − θ₂θ₁ is again a derivation, as follows at once from (5.10).

5.7. φ-derivations. Let A and B be algebras and φ: A → B be a fixed homomorphism. Then a linear mapping θ: A → B is called a φ-derivation if
θ(x y) = θx · φy + φx · θy,   x, y ∈ A.

In particular, all derivations in A are ι-derivations, where ι: A → A denotes the identity map.

As an example of a φ-derivation, let A be the algebra of C^∞-functions f: ℝ → ℝ and let B = ℝ. Define the homomorphism φ to be the evaluation homomorphism φ: f → f(0) and the mapping θ by

θ: f → f'(0).

Then it follows that

θ(f g) = (f g)'(0) = f'(0) g(0) + f(0) g'(0) = θf · φg + φf · θg,

and so θ is a φ-derivation.

More generally, if θ_A is any derivation in A, then φ ∘ θ_A is a φ-derivation. In fact,

φ θ_A(x y) = φ(θ_A x · y + x · θ_A y) = φ θ_A x · φy + φx · φ θ_A y.

Similarly, if θ_B is a derivation in B, then θ_B ∘ φ is a φ-derivation.
5.8. Antiderivations. Recall that an involution in a linear space is a linear transformation whose square is the identity. Similarly we define an involution ω in an algebra A to be an endomorphism of A whose square is the identity map. Clearly the identity map of A is an involution. If A has a unit element e it follows from sec. 5.1 that ωe = e.

Now let ω be a fixed involution in A. A linear transformation Ω: A → A will be called an antiderivation with respect to ω if it satisfies the relation

Ω(x y) = Ωx · y + ωx · Ωy.    (5.11)

In particular, a derivation is an antiderivation with respect to the involution ι. As in the case of a derivation it is easy to show that an antiderivation is determined by its action on a system of generators for A and that ker Ω is a subalgebra of A. Moreover, if A has a unit element e, then Ωe = 0. It also follows easily that any linear combination of antiderivations with respect to a fixed involution ω is again an antiderivation with respect to ω.

Suppose next that Ω₁ and Ω₂ are antiderivations in A with respect to the involutions ω₁ and ω₂, and assume that ω₁ ∘ ω₂ = ω₂ ∘ ω₁. Then ω₁ ∘ ω₂ is again an involution. The relations

(Ω₁Ω₂)(xy) = Ω₁(Ω₂x · y + ω₂x · Ω₂y) = Ω₁Ω₂x · y + ω₁Ω₂x · Ω₁y + Ω₁ω₂x · Ω₂y + ω₁ω₂x · Ω₁Ω₂y

and

(Ω₂Ω₁)(xy) = Ω₂(Ω₁x · y + ω₁x · Ω₁y) = Ω₂Ω₁x · y + ω₂Ω₁x · Ω₂y + Ω₂ω₁x · Ω₁y + ω₂ω₁x · Ω₂Ω₁y

yield

(Ω₁Ω₂ ± Ω₂Ω₁)(xy) = (Ω₁Ω₂ ± Ω₂Ω₁)x · y + (ω₁Ω₂ ± Ω₂ω₁)x · Ω₁y + (Ω₁ω₂ ± ω₂Ω₁)x · Ω₂y + ω₁ω₂x · (Ω₁Ω₂ ± Ω₂Ω₁)y.    (5.12)

Now consider the following special cases:

1. ω₁Ω₂ = Ω₂ω₁ and ω₂Ω₁ = Ω₁ω₂ (this is trivially true if ω₁ = ±ι and ω₂ = ±ι). Then the relation (5.12) shows that Ω₁Ω₂ − Ω₂Ω₁ is an antiderivation with respect to the involution ω₁ω₂. In particular, if Ω is an antiderivation with respect to ω and θ is a derivation such that θω = ωθ, then θΩ − Ωθ is again an antiderivation with respect to ω.

2. ω₁Ω₂ = −Ω₂ω₁ and ω₂Ω₁ = −Ω₁ω₂. Then Ω₁Ω₂ + Ω₂Ω₁ is an antiderivation with respect to the involution ω₁ω₂.
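A minimal example of an antiderivation (constructed ad hoc, not one of the text's examples): let A be the two-dimensional algebra with basis e, n, unit element e and n² = 0, let ω be the involution given by ωe = e, ωn = −n, and define Ω by Ωe = 0, Ωn = e. Then
\[
\Omega(n\cdot n) = \Omega(0) = 0 = \Omega n\cdot n + \omega n\cdot\Omega n = e\,n - n\,e,
\qquad
\Omega(e\,n) = \Omega n = e = \Omega e\cdot n + \omega e\cdot\Omega n,
\]
so Ω is an antiderivation with respect to ω; it is not a derivation, since a derivation would have to satisfy Ω(n·n) = 2 n Ωn = 2n ≠ 0.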
Now let Ω₁ and Ω₂ be two antiderivations with respect to the same involution ω such that

ω Ω_i = − Ω_i ω   (i = 1, 2).

Then it follows that Ω₁Ω₂ + Ω₂Ω₁ is a derivation. In particular, if Ω is any antiderivation such that ωΩ = −Ωω, then Ω² is a derivation.

Finally, let B be a second algebra, and let φ: A → B be a homomorphism. Assume that ω_A is an involution in A. Then a φ-antiderivation with respect to ω_A is a linear mapping Ω: A → B satisfying

Ω(x y) = Ωx · φy + φ ω_A x · Ωy.    (5.13)

If ω_B is an involution in B such that ω_B ∘ φ = φ ∘ ω_A, then equation (5.13) can be rewritten in the form

Ω(x y) = Ωx · φy + ω_B φx · Ωy.
Problems

1. Let A be an arbitrary algebra and consider the set C(A) of elements a ∈ A that commute with every element in A. Show that C(A) is a subspace of A. If A is associative, prove that C(A) is a subalgebra of A. C(A) is called the centre of A.

2. If A is any algebra and θ is a derivation in A, prove that C(A) and the derived algebra are stable under θ.

3. Construct an explicit example to prove that the sum of two endomorphisms is in general not an endomorphism.

4. Suppose φ: A → B is a homomorphism of algebras and let λ ≠ 0, 1 be an arbitrarily chosen scalar. Prove that λφ is a homomorphism if and only if the derived algebra is contained in ker φ.

5. Let C¹ and C denote respectively the algebras of continuously differentiable and continuous functions f: ℝ → ℝ (cf. Example 4). Consider the linear mapping d: C¹ → C given by d f = f', where f' is the derivative of f.
a) Prove that d is an ι-derivation, where ι: C¹ → C denotes the canonical injection.
b) Show that d is surjective and construct a right inverse for d.
c) Prove that d cannot be extended to a derivation in the algebra C.

6. Suppose A is an associative commutative algebra and θ is a derivation in A. Prove that

θ(x^p) = p x^{p−1} θ(x).

7. Suppose that θ is a derivation in an associative commutative algebra A with identity e, and assume that x ∈ A is invertible; i.e., there exists an element x⁻¹ such that x x⁻¹ = e. Prove that x^p (p ≥ 1) is invertible and that

(x^p)⁻¹ = (x⁻¹)^p.

Denoting the inverse of x^p by x^{−p}, show that for every derivation θ

θ(x^{−p}) = − p x^{−p−1} θ(x).

8. Let L be an algebra in which the product of two elements x, y is denoted by [x, y]. Assume that

[x, y] + [y, x] = 0   (skew symmetry)

and

[[x, y], z] + [[y, z], x] + [[z, x], y] = 0   (Jacobi identity).
9. Let A be an associative algebra with product xy. Show that the multiplication (x, y)—÷[x, y] where
[x,y]
xy — yx
makes A into a Lie algebra. 10. Let A be any algebra and consider the space D (A) of derivations in A. Define a multiplication in D(A) by setting [01,02] = a)
0102 —0201.
Prove that D (A) is a Lie algebra.
b) Assume that A is a Lie algebra itself and consider the mapping given by Show that 'p is a homomorphism of Lie algebras. Determine the kernel of p.
§ 1. Basic properties
11. If L is a Lie algebra and I is an ideal in A, prove that the algebra L/I is again a Lie algebra. 12. Let E be a finite dimensional vector space. Show that the mapping 1: A (E; E) —÷ A (E*; E*)OPP
is an isomorphism of algebras. given by 13. Let A be any algebra with identity and consider the multiplication
operator
ji: A
A (A; A).
Show that p is a monomorphism. If A = L (E; E) show that by a suitable restriction of p a monomorphism
(i= 1
E be an n-dimensional vector space. Show that each basis n) of L(E; E) such that n) of E determines a basis 1
(i) QijOlk = (ii)
i.
Conversely, given n2 linear transformations of E satisfying i) and ii), prove that they form a basis of L(E; E) and are induced by a basis of E. Show that two bases e1 and of E determine the same basis of L (E; E) if and only if e = A 2 eF. 15. Define an equivalence relation in the set of all linearly independent E), in the following way: n2-tuples (p1...pn2), ...
('P1 ...
if and only if there exists an element =
such that (v
= ... 1
,12)
Prove that 2EV
only if 2=1. 16. Prove that the bases of L(E; E) defined in problem 14 form an equivalence class under the equivalence relation of problem 15. Use this to show that every non-zero endomorph ism cP: A (E; E)—*A (E; E) is an inner automorphism; i.e., there exists a fixed linear automorphism of E such that 1
17. Let A be an associative algebra, and let L denote the corresponding
Chapter V. Algebras
158
is Lie algebra (cf. problem 9). Show that a linear mapping derivation in A only if it is a derivation in L. 18. Let E be a finite dimensional vector space and consider the mapping E)—*A(E; E) defined by
= — Prove that is a derivation. Conversely, prove that every derivation in A (E; E) is of this form. Hint. Use problem 14. § 2.
Ideals
5.9. The lattice of ideals. Let A be an algebra, and consider the set 𝓘 of ideals in A. We order this set by inclusion; i.e., if I₁ and I₂ are ideals in A, then we write I₁ ≤ I₂ if and only if I₁ ⊂ I₂. The relation ≤ is clearly a partial order in 𝓘 (cf. sec. 0.6). Now let I₁ and I₂ be ideals in A. Then it is easily checked that I₁ + I₂ and I₁ ∩ I₂ are again ideals, and are in fact the least upper bound and the greatest lower bound of I₁ and I₂. Hence, the relation ≤ induces in 𝓘 the structure of a lattice.

5.10. Nilpotent ideals. Let A be an associative algebra. Then an element a ∈ A will be called nilpotent if for some k,

a^k = 0.    (5.14)

The least k for which (5.14) holds is called the degree of nilpotency of a. An ideal I will be called nilpotent if for some k,

I^k = 0.    (5.15)

The least k for which (5.15) holds is called the degree of nilpotency of I and will be denoted by deg I.

5.11.* Radicals. Let A be an associative commutative algebra. Then the nilpotent elements of A form an ideal. In fact, if x and y are nilpotent of degree p and q respectively, we have that

(x + y)^{p+q} = Σ_{i=0}^{p+q} \binom{p+q}{i} x^i y^{p+q−i} = 0

and

(x y)^p = x^p y^p = 0.
The ideal consisting of the nilpotent elements is called the radical of A and will be denoted by rad A. (The definition of radical can be generalized to the non-commutative case; the theory is then much more difficult and belongs to the theory of rings and algebras. The reader is referred to [14]).
It is clear that rad (rad A) = rad A.
The factor algebra A/rad A contains no non-zero nilpotent elements. To prove this assume that x̄ ∈ A/rad A is an element such that x̄^k = 0 for some k. Then x^k ∈ rad A and hence the definition of rad A yields an l such that x^{kl} = (x^k)^l = 0. It follows that x ∈ rad A, whence x̄ = 0. The above result can be expressed by the formula

rad(A/rad A) = 0.
Now assume that the algebra A has dimension n. Then rad A is a nilpotent ideal, and

deg(rad A) ≤ dim(rad A) + 1 ≤ n + 1.    (5.16)

For the proof, we choose a basis e₁, ..., e_r of rad A. Then each e_i is nilpotent. Let k = max_i (deg e_i), and consider the ideal (rad A)^{rk}. An arbitrary element in this ideal is a sum of elements of the form
k
so
...
=0. This shows that
=0 and so rad A is nilpotent. Now let s be the degree of nilpotency of rad A, and suppose that for some m dim (radA)m+l,
m 0 is a constant. 4. Prove that every normal non-selfadjoint transformation of a plane is homothetic. 5. Let be a mapping of the inner product space E into itself such that (pO=0 and çox — (P11 = lx — x. VEE.
Prove that
is then linear.
there exists a continuous 6. Prove that to every proper rotation family (p(0t I) of rotations such that and (P1 = 7. Let be a linear automorphism of an n-dimensional real linear space E. Show that an inner product can be defined in E such that p becomes a rotation if and only if the following conditions are fulfilled:
(i) The space E
can
be decomposed into stable planes and stable
straight lines. (ii) Every stable straight line remains pointwise fixed or is reflected at the point 0.
(iii) In every irreducible invariant plane a linear automorphism induced such that and tn/il 2. Consider a proper rotation t which commutes with all proper rotations. Prove that = i where c = if ii is odd and = ± if n is even. 1
§ 6.
1
Rotations of Euclidean spaces of dimension 2,3 and 4
8.21. Proper rotations of the plane. Let E be an oriented Euclidean plane and let zl denote the normed positive determinant function in E. is determined by the equation Then a linear transformation j: zi (x, i') = (j
i)
x, v E.
Chapter VIII. Linear mappings of inner product spaces
238
The transformation / has the following properties:
I) (jx,v)+(x,jv)=O (jx,jv)=(x,v)
2)
3)
4)
detj=1.
In fact, 1) follows directly from the definition. To verify 2) observe that the identity (7.24), sec. 7.13, implies that (x, y)2 + (/x,v)2 y
= IxV
x we obtain, in view of 1),
(jx, jx)2 = j is injective (as follows from the definition) we obtain
= imply that
Now the relations .1= —.1 and
=—
Finally, we
obtain from 1), 2), 3) and from the definition of /
4(jx,iv) = (/2 x, iv) = — (x,jy) = (jx, v) = zl(x,
y)
whence
det j =
1.
The transformation / is called the canonical complex structure of the oriented Euclidean plane E. Next, let p be any proper rotation of E. Fix a non-zero vector x and denote by 0 the oriented angle between x and p x (cf. sec. 7.1 3). It is determined mod 2 it by the equation
px = x cos 0 +j(x) sin 0.
(8.39)
We shall show that cos 0 = tr 'p
and
sin 0 = 4 tr(jo p).
(8.40)
In fact, by definition of the trace we have
zl(p x, y) + 4(x, py) = 4(x, y). tr 'p. Inserting (8.39) into (8.41) and using properties 2) and 3) of j
(8.41)
we
obtain
2cos0 .A(x, v)=zl(x,y) . trqi and so the first relation (8.40) follows. The second relation is proved in a similar way.
§ 6. Rotations of Euclidean spaces of dimension 2, 3 and 4
239
Relations (8.40) show that the angle (9 is independent of x. It is called the rotation angle of and is denoted by e((p). Now (8.39) reads
cosO((p)+j(x) sinê((p)
XEE.
In particular, we have
O(i)=0,
/ix=x
&(—i)=m, cos
sin
a second proper rotation with rotation angle shows that is
a
simple calculation
= x cos(&(p)+ (9(h)) +j(x) sin((9(p)+ Thus any two proper rotations commute and the rotation angle for their
product is given by (p) = (9((p) +
(mod 2it).
(8.42)
Finally observe that if e1, e2 is a positive orthonormal basis of E then we have the relations (p e1 e1 cos O(p) + e2 sin O((p) (p e2 = — e1 sin O(p) + e2 cos O(p).
Remark: If E is a non-oriented plane we can still assign a rotation angle to a proper rotation (p. It is defined by the equation
cos 9(p) = tr p
(0
it).
(8.43)
Observe that in this case (9(p) is always between 0 and it whereas in the oriented case 9(p) can be normalized between —it and it. 8.22. Proper rotations of 3-space. Consider a proper rotation p of a 3-dimensional inner product space E. As has been shown in sec. 8.19,
there exists a 1-dimensional subspace E1 of E whose vectors remain fixed. If p is different from the identity-map there are no other invariant vectors
(an invariant vector is a vector xeE such that (px=x). In fact, assume that a and b are two linearly independent invariant vectors. Let c(c+0) be a vector which is orthogonal to a and to b. Then (pc—2c where 2= ± 1. Now the equation det (p= implies that 2= + 1 showing that (p is the identity. In the following discussion it is assumed that +1. Then the invariant vectors generate a i-dimensional subspace E1 called the axis of (p. 1
240
Chapter VIII. Linear mappings of inner product spaces
To determine the axis of a given rotation (p consider the skew mapping (8.44)
and introduce an orientation in E. Then
can
t/ix=u xx
be written in the form
UEE
(8.45)
(cf. problem 2, § 4). The vector u which is uniquely determined by (p iS
called the rotation-vector. The rotation vector is contained in the axis of (p. In fact, let a+O be a vector on the axis. Then equations (8.45) and (8.44) yield (8.46)
showing that u is a multiple of a. Hence (8.45) can be used to find the rotation axis provided that the rotation vector is different from zero. This exceptional case occurs if and only if = i.e. if and only if Then (p has the eigenvalues 1, —1 and — 1. In other words, is (p = (p a reflection at the rotation axis. 8.23. The rotation angle. Consider the plane F which is orthogonal to E1. Then transforms F into itself and the induced rotation is again proper. Denote by 0 the rotation angle of (p1 (cf. remark at the end of see 8.21). Then, in view of (8.43),
cosO =
Observing that trço =
+1
trço1
we obtain the formula cosO = 3-(trp
1).
To find a formula for sin 0 consider the orientation of Fwhicl' is induced by E and by the vector u (cf. sec. 4.29)*). This orientation is represented by the normed determinant function
A1(y,z)= lul
where A is the normed determinant function representing the orientation
of E. Then formula (7.25) yields
sinO=A1(y,(py)='A(u,y,(py) lul
*) It is assumed that u
0.
(8.47)
§ 6. Rotations of Euclidean spaces of dimension 2. 3 and 4
241
where y is an arbitrary unit vector of F. Now
A(u,y,çoy)= = 4(u,co' y,y)= — A(u,y,ço1y) and hence equation (8.47) can be written as (8.48) lul
By adding formulae (8.47) and (8.48) we obtain (8.49)
lxi
21u1
Inserting the expression (8.45) in (8.49), we thus obtain
x y)=
sin0 = lul Since
x lul
(8.50)
y is a unit-vector orcnogonal to u, it follows that iu
xyl=iuilyl=lui
and hence (8.50) yields the formula
sin0=iui. This equation shows that sin 0 is positive and hence that 0 1 and define polynomials by —
g
are relatively prime, and hence there
exist polynomials h1 such that (13.19)
On the other hand, it follows from Corollary II, Proposition II, sec. 13.2 that
K(g) =
j*i
and so, in particular, =
0
xe
j*i
Now let xeE be an arbitrary vector, and let x1EE1
(13.20)
392
Chapter XIII. Theory of a linear transformation
be the decomposition of x determined by (13.18). Then (13.19) and (13.20) yield the relation x=
=
x =
h (go) g1 (go)
h. (go) g. ((p) x1
whence x1
=
I = 1,
...,r.
(13.21)
It follows at once from (13.20) and (13.21) that
=
I
=
1,
...,r
which completes the proof. 13.6. Arbitrary stable subspaces. Let Fc: E be any stable subspace. Then
F=
(13.22)
n
where the are the generalized eigenspaces of go. In fact, since the projection operators are polynomials in go, it follows that F is stable under each 7C1.
Now we have for each xeF that X
=1X=
X
and fl
It follows that
E1,
whence E1.
inclusion in the other direction is obvious, (13.22) is established. 13.7. The Fitting decomposition. Suppose F0 is the generalized eigenspace of go corresponding to the irreducible polynomial t (if t does not divide then of course F0=0). Let F1 be the direct sum of the remaining generalized eigenspaces. The decomposition Since
E=
F0
F1
is called the Fitting decomposition of E with respect to go. F0 and F1 are called respectively the Fitting-null component and the Fitting-one component of E.
§ 2. Generalized eigenspaces
393
Clearly F0 and F1 are stable subspaces. Moreover it follows from the and F1, then definitions thatif 'p is ni/potent; i.e.,
forsomel>O
while Pi is a linear isomorphism. Finally, we remark that the corresponding projection operators are polynomials in (p, since they are sums of the projection operators defined in sec. 13.5. 13.8. Dual mappings. Let E* be a space dual to E and let E*
çp* E*
be the linear transformation dual to 'p. Then if f is any polynomial, it follows from sec. 2.25 that This
=
implies that f(p*) = 0 if and only if f('p) 0. In particular, the
minimum polynomials of 'p and
coincide.
Now suppose that F is any stable subspace of E. Then F' is stable under co*. In fact, if yEF and y*EF' are arbitrarily chosen, we have =
=
'p
0
whence 'p*y*EFI. This proves that F' is stable. Thus 'p* induces a linear transformation ((p*)I.
E*/FI
E*/FI.
On the other hand, let and ('p*)1 are be the restriction of 'p to F. It will now be shown that dual with respect to the induced scalar product between F and E*/FI is a representative of (cf. sec. 2.24). In fact, if yEF is any vector and an arbitrary vektor e E*/FI, then EJ)l
j*i
defined by
i=1,...,r.
Then (13.24)
are stable under
Then, as shown above, the
and (13.25)
It will now be shown that (13.26)
Let y*EF,* be arbitrarily chosen. Then for each
=
=
we have
= 0.
In view of the duality between EL and F1*, this implies thatf/'1(cp*)y* =0;
y*EE* This establishes (13.26). Now a comparison of the decompositions (13.23) and (13.25) yields (13.24).
Problems 1. Show that the minimum polynomial of p is completely reducible (i.e. all prime factors are of degree 1) if and only if every stable subspace contains an eigenvector.
2. Suppose that the minimum polynomial p of ço is completely reducible. Construct a basis of E with respect to which the matrix of ço is lower triangular; i.e., the matrix has the form 0
*
Hint: Use problem 1.
3. Let E be an n-dimensional real vector space and p: E—*E be a regular linear transformation. Show that çø can be written p=c01p2 where every eigenvalue of p1 is positive and every eigenvalue of .p2 is negative.
4. Use problem 3 to derive a simple proof of the basis deformation theorem of sec. 4.32. 5. Let ço:E-*E be a linear transformation and consider the subspaces F0 and F1 defined by F0 =
and
F1 =
fl
a) Show that F0 = U ker
ço1.
1
b)
Show that
c) Prove that F0 and F1 are stable under
and that the restrictions are respectively nilpotent and regular. and cond) Prove that c) characterizes the decomposition
(p0:F0-3F0 and ço1
clude that F0 and F1 are respectively the Fitting null and the Fitting 1-component of E.
6. Consider the linear transformations of problem 7, § 1. For each transformation a) Construct the decomposition of into the generalized eigenspaces. b) Determine the eigenspaces. c) Calculate explicitly polynomials g, such that the g.(p) are the pro-
jection operators in E corresponding to the generalized eigenspaces. Verify by explicit consideration of the vectors that the (ço) are in fact the projection operators. d) Determine the Fitting decomposition of E. 7. Let E= be the decomposition of E into generalized eigenspaces
of p, and let
be the corresponding projection operators. Show that
there exist unique polynomials
(p) = Conclude that the polynomials
such that and
deg
deg
depend only on p. E*
8. Let E* be dual to E and
E* into generalized eigenspaces of co* prove that
=
are the corresponding projection operators and the g, are defined in problem 7. Use this result to show that and are dual and to obtain formula
where the
(13.24).
9. Let FcE be stable under 'p and consider the induced mappings E/F—+ E/F. Let E= be the decomposition of F into
c°F :F—÷F and
be the canonical injection and generalized eigenspaces of 'p. Let Q:E—*E/Fbe the canonical projection. a) Show that the decomposition of F into generalized eigenspaces is given by
F=
where
F
fl
Show that the decomposition of E/F into generalized eigenspaces of is given by E/F = (ElF)1 where (E/F), = (E1). b)
Conclude that
determines a linear isomorphism (E/F),.
E1/F,
c) If denote the projection operators in E, F and E/F associated with the decompositions, prove that the diagrams
and F
are commutative, and that
are the unique linear mappings for
which this is the case. Conclude that if g are the polynomials of problem 7,then F g1((p) and = —
10. Suppose that the minimum polynomial ji of çü is completely reducible. a) By considering first the case p = (t —
dimE.
degp
b) With the aid of a) prove that of p. § 3.
prove that
x the characteristic polynomial
Cyclic spaces
13.9. The map 0a• Fix a vector aEE and consider the linear map ar,:
given by a
fer[t].
Let K1, denote the kernel of ar,. It follows from the definition that feK,, Ka, where p is if and only if aEK(J). K,, is an ideal in T[t]. Clearly the minimum polynomial of p. Proposition 1: There exists a rector a E E such that K,, = I,, Proof: Consider first the case that p is of the form
p=
k 1.
/ irreducible.
Then there is a vector tIE E such that (13.27)
0.
Suppose now that hEKa and let g be the greatest common divisor of h and p. Since aeK(fl, Corollary Ito Proposition I, sec. 13.2 yields
ueK(g).
(13.28)
Since g, p it follows that g =1' where / k. Hence 1 (p) a = () and relation (13.27) implies that /=k. Thus K(g)=K0. Now it follows from (13.28) that
In the general case decompose p in the form =
J irreducible
...
and let (i= 1 i) be be the corresponding decomposition of E. Let is the induced transformation. Then the minimum polynomial of given by (cf. sec. 13.4) = r) (1= 1
Thus, by the first part of the proof, there are vectors K,, =
I,,
(i =
I
such that
. . .
Now set
(1(l1 Assume that jEK11. Then f(p) a=0 whence
= 0.
Since f(p)
it follows that
f(p) whence fe Ka (i =
1
=0
(1 =
I
... r). Thus •f C
and so tel0. This shows that
•'' fl
=
III
whence K(,=I,L.
13.10. Cyclic spaces. The vector space E is called cyclic (with respect to the linear transformation φ) if there exists a vector a ∈ E such that the map θ_a is surjective. Every such vector is called a generator of E. If a is a generator of E, then K_a = I_μ. In fact, let f ∈ K_a and let x ∈ E. Then,
for some geT[t], x=g(p)a. It follows that
=f(p)g(p)a = whence
f(p)a = g(p)O = 0
and so tel1.
Proposition II: If E is cyclic, then
degp = dimE.
Proof Let a be a generator of E. Then, since Ka =
0a induces an
isomorphism E.
It follows that (cf. Proposition I, sec. 12.11)
= degp.
dim
Proposition III: Let a be a generator of E and let deg p = m. Then the vectors
a,p(a)
form a basis of E.
Proof. Let / he the largest integer such that the vectors a, p(a)
(13.29)
are linearly independent. Then these vectors generate E. In fact, every
vector xeE can be written in the form x=f(p)a where I =
is a
polynomial. It follows that i—i
k
X
=
(a) = V=
p' (a). 1)
1)
Thus the vectors (13.29) form a basis of E. Now Proposition II implies that I=ni. Proposition IV: The space E is cyclic if and only if there exists a basis (v=O... n—i) of E such that =
(v = 0 ... ii —2).
Proof: If E is cyclic with generator a set a=a and apply Proposiu—I) is a basis satisfying tion III. Conversely. assume that
the conditions above. Let v= e I'[t] by
be an arbitrary vector and define V
Then
=x as is easily checked and so E is cyclic. (v = 0 ii — I be a basis as in the Proposition CoioI/urr: Let above. Then the minimum polynomial of is given by )
= where the
are
—
determined by
Proof: It is easily checked that p(p)=O and so p is a multiple of the minimum polynomial of p. On the other hand. by Proposition II. the minimum polynomial of has degree ii and thus it must coincide with p. 13.11. Cyclic subspaces. A stable subspace is called a crc/ic siihspacc if it is cyclic with respect to the induced transformation. Every vector UEE determines a cyclic subspace. namely the subspace = Im Proposition V: There exists a cyclic subspace whose dimension is equal to degp. Proof: In view of Proposition I there is a vector u e F such that ker = Then induces an isomorphism E11.
It follows that
dimE, = diml'[t]
'R
= degp.
Throic;,, I: The degree of the minimum polynomial satisfies
degp dimE. Equality holds if and only if E is cyclic. Moreover, if F is any cyclic subspace of E. then
dimF degp.
Proof: Proposition V implies that deg p dimE. If E is cyclic. equality
holds (cf. Proposition III). Conversely. assume that degp=dimE. By Proposition V there exists a cyclic subspace with dim F=degp. It follows that F=E and so E is cyclic.
Finally, let F ⊂ E be any cyclic subspace and let ν denote the minimum polynomial of the induced transformation. Then, as we have seen above, dim F = deg ν. But ν divides μ (cf. sec. 13.2) and so we obtain dim F ≤ deg μ.
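For instance (an illustrative special case, not part of the text): let E have a basis a₀, a₁, a₂ and define φ by φa₀ = a₁, φa₁ = a₂, φa₂ = a₀. Then E is cyclic with generator a₀, and
\[
\mu(t) = t^3 - 1,\qquad \dim E = 3 = \deg\mu,
\]
in agreement with the theorem. The subspace generated by a₀ + a₁ + a₂ is stable and cyclic, and the minimum polynomial of the induced transformation is ν(t) = t − 1, a proper divisor of μ, with dim F = 1 ≤ deg μ.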
Corollary: Let F E be any cyclic subspace, and let v denote the minimum polynomial of the linear transformation induced in F by (p. Then
v=u if and only if
dimF = degp. Proof: From the theorem we have
dimF =
degv
while according to sec. 13.2 v divides p. Hence v=p if and only if deg v=deg ,i; i.e., if and only if
dimF = degp. 13.12. Decomposition of E into cyclic subspaces. Theorem II: There exists a decomposition of E into a direct sum of cyclic subspaces. Proof: The theorem is an immediate consequence (with the aid of an induction argument on the dimension of E) of the following lemma.
E such that
Lemma I. Let dimEa = degp.
Then there is a complementary stable subspace, Fc E, E = Ea
F.
Proof: Let Ea
denote the restriction of (p to Ea, and let (p*:E*
E*
the linear transformation in E* dual to (p. Then (cf. sec. 13.8) stable under (p*, and the induced linear transformation be
*
*
J_
*
(pa:E lEa *—E lEa 26
is
is dual to
with respect to the induced scalar product between Ea and
The corollary to Theorem I, sec. 13.11 implies that the minimum polynomial Of 'Pa is again p. Hence (cf. sec. 13.8) the minimum polynomial
are dual, so that
of p is p. But Ea and
= dimEa = degp.
dim
is cyclic with respect to çü.
Thus Theorem I, sec. 13.11 implies that Now let
be the projection and choose u*eE* so that the element generates E*/E±. Then, by Proposition III, the vectors (p = 0... in — 1) form a basis of Hence the vectors
(p* V
(p = I ... in — I) are linearly independent.
Now consider the cyclic subspace Since I) it follows that On the other hand. Theorem I. sec. 13.11. implies that dim Hence dim
= (p = 0... is injective. Thus 0
Finally, since 7r* TE to
dim
+ dim
=
in —
= m. I
)
it follows that the restriction of But
in + (ii — in)
=
ii
(n =dim E)
and thus we have the direct decomposition E* =
of E* into stable subspaces. Taking orthogonal complements we obtain the direct decomposition
E=
E into stable subspaces which completes the proof.
§ 4.
Irreducible spaces
13.13. Definition. A vector space E is called
!I1(Iecoin/)os(Ib/e or
ii'i'cthicible with respect to a linear transformation 'P. if it can not be expressed as a direct sum of two proper stable subspaces. A stable
subspace F E is called irreducible if it is irreducible with respect to the linear transformation induced by (p. Proposition I: E is the direct sum of iireducihle subspaces. Proof: Let
E into stable subspaces such that s is maximized
(this is clearly possible. since for all decompositions we have sdim E). Then the spaces are irreducible. In fact, assume that for some i, =
is a decomposition of
dim
Ft"
> 0, dim
>0
into stable subspaces. Then
E=
F Fe" j*i is a decomposition of E into (s+ I) stable subspaces, which contradicts the maximality of s. An irreducible space E is always cyclic. In fact, by Theorem II, sec. 13.12, E is the direct sum of cyclic subspaces.
E=
If E is indecomposable. it follows that 1=and so E is cyclic. On the other hand, a cyclic space is not necessarily indecomposable. In fact, let E be a 2-dimensional vector space with basis a. h and define p: E—+E by setting (pa=h and (ph =0. Then (p is cyclic (cf. Proposition IV, sec. 13.10). On the other hand, E is the direct sum of the stable subspaces generated by a + b and a — b. 1
Theorem I: E is irreducible if and only if i)
p=fk,
f irreducible;
ii) E is cyclic.
Proof. Suppose E is irreducible. Then, by the remark above. E is E into the cyclic. Moreover, if generalized eigenspaces (cf. sec. 13.4), it follows that r= 1 and so
(f irreducible). Conversely, suppose that (i) and (ii) hold. Let E=
E1
E2
be any decomposition of E into stable subspaces. Denote by and c02 the linear transformations induced in F1 and E, by ço, and let and 112 be the minimum polynomials of and q2. Then sec. 13.2 implies that Hence, we obtain from (i) that p and 111 =
k2 k.
=
(13.33)
Without loss of generality we may assume that k1 k2. Then xEE1
or
XEE7
and so
0.
It follows that
whence k1 k. In view of (13.33) we obtain k1 =k
i.e., It
= Iti•
Now Theorem I, sec. 13.10 yields that
dimE1 degp.
(13.34)
On the other hand, since E is cyclic, the same Theorem implies that
dimE = degjt.
(13.35)
Relations (13.34) and (13.35) give that
dimE = dimE1. Thus E= F1 and F2 = 0. It follows that E is irreducible.
Corollary I. Any decomposition of F into a direct sum of irreducible subspaces is simultaneously a decomposition into cyclic subspaces.
Then a stable subspace Corollary II. Suppose that of F is cyclic if and only if it is irreducible. 13.14. The Jordan canonical matrix. Suppose that F irreducible with respect to p. Then it follows from Theorem I sec. 13.13 that F is cyclic and that the minimum polynomial of ço has the form
p_fk
kl
(13.36)
where f is an irreducible polynomial. Let e be a generator of E and
consider the vectors (13.37)
it will be shown that these form a basis of E. Since
dimE =
deg,u
= pk
it is sufficient to show that the vectors (13.37) generate E. Let Fc:E be the subspace generated by the vectors (13.37). Writing
we obtain that
i==1,...,k j=1,...,p—1 e
= f(çp)1_i ço"e =
p—i —
i1,...,k1 These
equations show that the subspace F is stable under 'p. Moreover,
e = a11 e F. On the other hand, since E is cyclic, E is the smallest stable
subspace containing e. It follows that F—E. Now consider the basis ...
a11,
...;akl ...
of E. The matrix of 'p relative to this basis has the form 0 1
(13.38)
0
are all equal, and given by
where the submatrices
01 0
0 1
0•.
j=1,...,k. 0
1
The matrix (13.38) is called a Jordan canonical matrix of the irreducible transformation p. Next let ço be an arbitrary linear transformation. In view of sec. 13.12 there exists a decomposition of E into irreducible subspaces. Choose a basis in every subspace relative to which the induced transformation has the Jordan canonical form. Combining these bases we obtain a basis of E. In this basis the matrix of p consists of submatrices of the form (13.38)
following each other along the main diagonal. This matrix is called a Jordan canonical matrix of çø. 13.15. Completely reducible minimum polynomials. Suppose now that E is an irreducible space and that the minimum polynomial is completely
reducible (p= 1); i.e. that
p = (t
—
2)k.
It follows that the
are (1 x 1)-matrices given by Jordan canonical matrix of ço is given by
21
Hence the
0
2 1. (13.39)
..1 0
In particular, if E is a complex vector space (or more generally a vector space over an algebraically closed field) which is irreducible with respect to 'p, then the Jordan canonical matrix of çü has the form (13.39). 13.16. Real vector spaces. Next, let E be a real vector space which is irreducible with respect to ço. Then the polynomialf in (13.36) has one of the two forms f=t—2 2ER or
In the first case the Jordan canonical matrix of p has the form (13.39). In the second case the are 2 x 2-matrices given by 0
Hence
1
the Jordan canonical matrix of 0
has the form
1
-fl
1
- -
1
H 13.17. The number of irreducible subspaces. It is clear from the construction in sec. 13.12 that a vector space can be decomposed in several ways into irreducible subspaces. However, the number of irreducible subspaces of any given dimension is uniquely determined by as will be shown in this section. Consider first the case that the minimum polynomial of 'p has the form
p=f
degf=p
wheref is irreducible. Assume that a decomposition of E into irreducible subspaces is given. The dimension of every such subspace is of the form pK(l as follows from sec. 13.12. Denote by FK the direct sum of the irreducible subspaces of dimension pic and denote by NK the number of the subspaces Then we have
E=
FK.
Comparing the dimensions in (13.40) we obtain the equation = dim E=n.
(13.40)
Now consider the transformation
it follows from (13.40) that
Since the subspaces FK are stable under
(13.41)
By the definition of
we have FK
dim
=
whence
=
Since the dimension of each follows that
decreases by p under k/I (cf. sec. 1 3. 14) it
dimk//Fh =
—
(13.42)
1)NK.
Equations (13.41) and (13.42) yield (r(k/I) = rank t/i)
K=2
Repeating the above argument we obtain the equations j=1,...,k.
Kj+
1
Replacing] byj+ I and]—! respectively we find that
(K—j—l)NK=p (13.43)
and —j + 1)NK =
=
—j)NK +
pINK.
(13.44)
Adding (13.43) and (13.44) we obtain
+
= 2p
(K
=2p Kj+
1
—j)NK +
+
+
whence
=
+
j
—
= 1,...,k.
This equation shows that the numbers N1 are uniquely determined by the
(j= l,...,k).
ranks of the transformations In particular.
1.(i/1k_l)>
Nk=
I
if]> k. Thus the degree of is pk where k is the largest integer k and such that Nk>O. In the general case consider the decomposition of E into the generalized eigenspaces E= a a direct sum of irreducible subspaces. Then E every irreducible subspace F, is contained in some E (cf. Theorem I, sec. 13.13). Hence the decomposition determines a decomposition of
each E, into irreducible subspaces. Moreover, it is clear that these subspaces are irreducible with respect to the induced transformation E. Hence the number of irreducible subspaces in E1 of a given dimension is determined by and thus by p. It follows that the number of spaces F, of a given dimension depends only on (p. 13.18. Conjugate linear transformations. Let E and F be u-dimensional vector spaces. Two linear transformations p: E—*E and i/i: F—÷F are called conjugate if there is a linear isomorphism such that (pt: E1
= Proposition II: Two linear transformations (p and u/i
are
conjugate if
and only if they satisfy the following conditions: (i)
The minimum polynomials of (p and
factors /1
u/i
have
the same prime
Ir
(ii)
Proof: If (p and u/i
1=1 ... are
I.
conjugate the conditions above are clearly
satisfied.
To prove the converse we distinguish two cases.
Casc I: The minimum polynomials of p and /i are of the form Pq)
.1
=
and
f irreducible.
.11.
Decompose E and F into the irreducible subspaces I
i=-1 j=1
The numbers =
.\ (/1)
F=>i=1 j=1
and N1(l/J) are given by
-
+
p=degf
and
= (cf.
-
+
sec. 13.17). Thus (ii) implies that
il. Since
i>k and
I>!
it follows that k=l. In view of Theorem I, sec. 13.13, the spaces El and F/ are cyclic. Thus Lemma I below yields an isomorphism such that El
= These
isomorphisms determine an isomorphism 1/I
Case II: and ized eigenspaces
=
0
such that
0
are arbitrary. Decompose E and F into the general-
and set
(i= I ... i). Then
restricts to a linear isomorphism in the subspace
Moreover, for
411
is zero in E,. Thus dimE —
=
= 1 ...
dimF —
=
=
Similarly.
... r.
i
Now (ii) implies that
1=1 ...r. Let and denote the respectively. The minimum polynomials of
restrictions of 'p and are of the form
and
i=l ... r and Since
+ dimE —
and
= r(f
j
+ dim F — dim
1
—
it follows from (ii) that l
(j(y) = Thus the pair
satisfies
...
the hypotheses of the theorem and so we
are reduced to case I.
Lemma I: Let p: E E and /i: F —* F (dim E = dim F = ii) be linear transformations having the same minimum polynomial. Then, if E and F are cyclic, 'p and are conjugate. Proof: Choose generators a and b of E and F. Then the vectors 01='pi(a) and
ii— I) forma basis ofEand F respectively.
Now set a,='p(a) and h= t/i'(h). Then, for some and
a,, = 1=
h=
()
h1. 1=
()
Moreover, the minimum polynomials of 'p and =
t'
and
—
1=)
=
t'
are given by —
1=0
(cf. the corollary of Proposition IV, sec. 1 3.10). Since 1 = 0 ...
=
it follows that
I).
n
by setting
define an isomorphism =
=
/ = 0 ...
h1
n —
Then we have
= (a,) = These
= h,. j=()
j=()
equations imply that
j=0...ii— I. This shows that Anticipating the result of sec. 13.20 we also have Corollary I: Two linear transformations are conjugate if and only if they have the same minimum polynomial and satisfy condition (ii) of Proposition II.
Corollary II: Two linear transformations are conjugate if and only if they have the same characteristic polynomial (cf. sec. 4.19) and satisfy (ii).
Proof: Let J be the common characteristic polynomial of p and /i. Write
=
. .
=
...
.
(f irreducible).
and = .1111
Set
the generalized eigenspaces for direct decompositions
and
fr
13(1=1 ... r) denote 1= 1 ... r; let and t/i respectively. Then we have the
it follows that
Since
413
(1=1 ... r). Similarly.
whence
(i=l...r). Thus we may assume that (g irreducible). Then (kni). ,=g'
I
is
of the form f=g" and
the corollary
follows immediately from the proposition.
Corollaiv III: Every linear transformation is conjugate to its dual.
Problems 1.
(i)
Let P: A(E; E) be a non-zero algebra endomorphism. Show that for any projection operator it.
r(P(m))=i(rr), (ii) Use (i) to prove that 2. Show that every non-zero endomorphism of the algebra A (E; E) is an inner automorphism. Hint: Use Problem I and Proposition II. sec. 13.18. 3. Construct a decomposition of into irreducible subspaces with respect to the linear transformations of problem 7, § 1. Hence obtain the Jordan canonical matrices. 4. Which of the linear transformations of problem 7, § make into a cyclic space? For each such transformation find a generator. For which of these transformations is irreducible? 5. Let E1 cE be a stable subspace under p and consider the induced mappings p1: E1 E1 and E/E1 E/E1. Let p, 1u1, ji be the corresponding minimum polynomials. a) Prove that E is cyclic if and only if 1
i) E1 is cyclic ii) E/E1 is cyclic
iii) p=u1ü. In particular conclude that every subspace of a cyclic space is again cyclic.
b) Construct examples in which conditions i), ii), iii) respectively fail, while the remaining two continue to hold. Hint: Use problem 5, § 1. 6. Let j=1 are
E into subspaces and suppose linear transformations with minimum polynomials
Define a linear transformation
of E by
a) Prove that E is cyclic if and only if each relatively prime.
is cyclic and the
are
b) Conclude that if E is cyclic, then each F3 is a sum of generalized eigenspaces for çø.
c) Prove that if E is cyclic and
a generates Eifand only if generates 1. ..s). 7. Suppose Fc: E is stable under p and let (pF:F-+F (minimum polynomial jib.) and (minimum polynomial ji) be the induced transformations. Show that E is irreducible if and only if i) E/F is irreducible. ii) F is irreducible. iii) wheref is an irreducible polynomial. 8. Suppose that Eis irreducible with respect to çø. Letf (f irreducible)
be the minimum polynomial of ço. a) Prove that the k subspaces K(f) stable subspaces of E b) Conclude that lmf(co)K =
K(fk_l) are the only non-trivial
K
k.
9. Find necessary and sufficient conditions that E have no nontrivial stable subspaces. 10. Let be a differential operator in E.
a) Show that in any decomposition of E into irreducible subspaces, each subspace has dimension I or 2. b) Let N1 be the number of/-dimensional irreducible subspaces in the above decomposition (j= 1, 2). Show that
N1-f-2N,=dimE and N1=dimH(E). c) Using part b) prove that two differential operators in E are conjugate if and only if the corresponding homology spaces coincide. Hint: Use Proposition II, sec. 13.18.
11. Show that two linear transformations of a 3-dimensional vector space are conjugate if and only if they have the same minimum polynomial.
12. Let be a linear transformation. Show that there exists a (not necessarily unique) multiplication in E such that i) E is an associative commutative algebra ii) E contains a subalgebra A isomorphic to F((p)
iii) If &F(p)4A is the isomorphism, then
13. Let F be irreducible (and hence cyclic) with respect to p. Show that the set S of generators of the cyclic space E is not a subspace. Construct a subspace F such that S is in I — correspondence with the non-zero elements of E/F. 1
14. Let
be
a linear transformation of a real vector space having can not be written in the
distinct eigenvalues, all negative. Show that form § 5. Applications
of cyclic spaces
In this paragraph we shall apply the theory developed in the preceding paragraph to obtain three important, independent theorems. 13.19. Generalized eigenspaces. Direct sums of the generalized eigenspaces of 'p are characterized by the following
Theorem I: Let (13.45)
be any decomposition of E into a direct sum of stable subspaces. Then the following three conditions are equivalent: (i) Each is a direct sum of some of the generalized eigenspaces E, of 'p. (ii) The projection operators
in E associated with the decomposition
(13.45) are polynomials in 'p. (iii) Every stable subspace UcE satisfies U=
U n
Proof: Suppose that (i) holds. Then the projection operators are sums of the projection operators associated with the decomposition of F into generalized eigenspaces, and so it follows from sec. 13.5 that they are polynomials in 'p. Thus (i) implies (ii).
Now suppose that (ii) holds, and let Uc E be any stable subspace. Then i, we have
since
Uc>o1U. Since Uis stable underQ1 it follows that Q1Uc: Un U c:
U
fl
whence
F1.
The inclusion in the other direction is obvious. Thus (ii) implies (iii). Finally, suppose that (iii) holds. To show that (i) must also hold we first prove
Lemma I: Suppose that (iii) holds, and let
E into generalized eigenspaces. Then to every i, such that (1= 1, ...,r) there corresponds precisely one integer], (1 E1 n
Proof of lemma I: Suppose first that s). Then from (iii) we obtain every ./(1
E. =
E, n
=0 for a fixed I and for
fl
=
0
j= 1
which is clearly false (cf. sec. 13.4). Hence there is at least one] such that
To prove that there is at most one] (for any fixed 1) such that E1 n we shall assume that for some f,]1,]2, fl
+0
and E1 n
0,
+0
and derive a contradiction. Without loss of generality we may assume that
E1flF1+0 and E1flF2+0. Choose two non-zero vectors
y1eE1flF1 and y2EE1flF2. Then since Yi' y2eE1 we have that =
J irreducible, is the decomposition of p. Let and '2 the least integers such that =0 and =0. We may assume that = '2 we simply replace Yt by the vector where ji
. .
be
Then
f
= 0 and
11
+ 0 for k
k
whence k=l== 1. Hencef(p)=O. is simultaneously nilpotent and semisimple, then Corollary I: If (p = 0.
The major result on semisimple transformations obtained in this section is the following criterion:
Theorem I: A linear transformation is semisimple if and only if its minimum polynomial is the product of relatively prime irreducible polynomials (or equivalently, if the polynomials μ and μ' are relatively prime).
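For instance (a standard special case, stated here only as an illustration): let E be a two-dimensional real vector space with basis e₁, e₂ and consider
\[
\varphi e_1 = e_2,\ \varphi e_2 = -e_1:\quad \mu_\varphi = t^2 + 1\ \ (\text{irreducible over } \mathbb{R}),
\qquad
\psi e_1 = 0,\ \psi e_2 = e_1:\quad \mu_\psi = t^2 .
\]
By the theorem, φ is semisimple while ψ is not; in fact the only proper stable subspace of ψ, the one generated by e₁, admits no stable complementary subspace.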
Remark: Theorem I shows that a linear transformation φ is semisimple if and only if it is a semisimple element of the algebra Γ(φ) (cf. sec. 12.17).
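To illustrate the criterion, consider the transformations of the real plane given, with respect to a basis, by the matrices
( 0  −1 )           ( 0  1 )
( 1   0 )    and    ( 0  0 ).
The first has minimum polynomial t² + 1, which is irreducible over ℝ; accordingly it is semisimple (it leaves no line stable, so the only stable subspaces are 0 and E, and each has a stable complement). The second has minimum polynomial t², which is not a product of distinct irreducible factors; correspondingly, the only stable line is its kernel, the kernel admits no stable complement, and the transformation is not semisimple.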
Proof: Suppose φ is semisimple. Consider the decomposition
μ = f₁^{k₁} ⋯ f_r^{k_r},    f_i irreducible and relatively prime,
of the minimum polynomial, and set
f = f₁ ⋯ f_r.
Then f(φ) is nilpotent (f^k(φ) = 0 for k = max kᵢ), and hence, by Proposition I of this section, f(φ) = 0. It follows that μ divides f; since f divides μ by definition, we have
μ = f₁ ⋯ f_r.
This proves the only if part of the theorem.
To prove the second part of the theorem we consider first the special case that the minimum polynomial, μ, of φ is irreducible. To show that φ is semisimple consider the subalgebra, Γ(φ), of A(E; E) generated by φ and ι. Since μ is irreducible, Γ(φ) is a field (cf. sec. 12.14). Γ(φ) contains Γ and hence it is an extension field of Γ, and E may be considered as a vector space over Γ(φ) (cf. § 3, Chap. V). Since a subspace of the Γ-vector space E is stable under φ if and only if it is stable under every transformation of Γ(φ), it follows that the stable subspaces of E are precisely the Γ(φ)-subspaces of the space E. Since every subspace of a vector space has a complementary subspace, it follows that φ is semisimple.
Now consider the general case
μ = f₁ ⋯ f_r,    f_i irreducible and relatively prime.
Then we have the decomposition
E = E₁ ⊕ ⋯ ⊕ E_r
of E into generalized eigenspaces. Since the minimum polynomial of the induced transformation φ_i is precisely f_i (cf. sec. 13.4), it follows from the above result that φ_i is semisimple. Now let F ⊂ E be a stable subspace. Then we have, in view of sec. 13.6,
F = Σ_i (F ∩ E_i).
Each F ∩ E_i is a stable subspace of E_i, and hence there exists a stable complementary subspace H_i,
E_i = (F ∩ E_i) ⊕ H_i.
These equations yield
E = F ⊕ H,    H = H₁ ⊕ ⋯ ⊕ H_r.
Since H is a stable subspace of E it follows that φ is semisimple.
Corollary I: Let φ be any linear transformation and assume that
E = F₁ ⊕ ⋯ ⊕ F_s is a decomposition of E into stable subspaces such that the induced transformations φ_j are semisimple. Then φ is semisimple.
Proof: Let μ_j be the minimum polynomial of the induced transformation φ_j. Since φ_j is semisimple, each μ_j is a product of relatively prime irreducible polynomials. Hence the least common multiple, f, of the μ_j is again a product of such polynomials. But f(φ) annihilates E, and hence the minimum polynomial, μ, of φ divides f. It follows that μ is a product of relatively prime irreducible polynomials. Now Theorem I implies that φ is semisimple.
Proposition II: Let Δ ⊂ Γ be a subfield, and assume that E, considered as a Δ-vector space, has finite dimension. Then every (Γ-linear) transformation, φ, of E which is semisimple as a Δ-linear transformation is semisimple considered as a Γ-linear transformation.
Proof: Let μ_Δ be the minimum polynomial of φ considered as a Δ-linear transformation. It follows from Theorem I that μ_Δ and μ_Δ′ are relatively prime. Hence there are polynomials q, r ∈ Δ[t] such that
q μ_Δ + r μ_Δ′ = 1.    (13.55)
On the other hand, every polynomial in Δ[t] may be considered as a polynomial in Γ[t]. Since μ_Δ(φ) = 0 we have μ_Γ | μ_Δ, where μ_Γ denotes the minimum polynomial of the (Γ-linear) transformation φ. Hence we may write μ_Δ = μ_Γ h for some h ∈ Γ[t], and so
μ_Δ′ = μ_Γ′ h + μ_Γ h′.    (13.56)
Combining (13.55) and (13.56) we obtain
q h μ_Γ + h′ r μ_Γ + h r μ_Γ′ = 1,
whence
(q h + h′ r) μ_Γ + (h r) μ_Γ′ = 1.
This relation shows that the polynomials μ_Γ and μ_Γ′ are relatively prime. Now Theorem I implies that the Γ-linear transformation φ is semisimple.
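For example, let E be a one-dimensional vector space over Γ = ℂ and let Δ = ℝ. The transformation φ: z ↦ iz is ℂ-linear; considered as a Δ-linear transformation of the two-dimensional real space E it has minimum polynomial μ_Δ = t² + 1, which is irreducible over ℝ, and so φ is semisimple as a Δ-linear transformation. Proposition II then asserts that φ is semisimple as a Γ-linear transformation as well (indeed μ_Γ = t − i).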
Theorem II: Every linear transformation φ can be written in the form
φ = φ_S + φ_N,
where φ_S is semisimple, φ_N is nilpotent, and φ_S φ_N = φ_N φ_S. The minimum polynomials of φ_S and φ_N are given by
μ_S = f₁ ⋯ f_r    and    μ_N = t^k,    k = max(k₁, ..., k_r).
Moreover, if φ = ψ_S + ψ_N is any decomposition of φ into a semisimple and a nilpotent transformation such that ψ_S ψ_N = ψ_N ψ_S, then ψ_S = φ_S and ψ_N = φ_N.
Proof: For the existence apply Theorem I and Theorem IV, sec. 12.16, with A = Γ(φ).
To prove the uniqueness, let
φ = ψ_S + ψ_N
be any decomposition of φ into a semisimple and a nilpotent transformation such that ψ_S commutes with ψ_N. Then the subalgebra of A(E; E) generated by ψ_S, ψ_N and ι is commutative and contains φ. Now apply the uniqueness part of Theorem IV, sec. 12.16.
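As a concrete illustration, the transformation φ of Γ² with matrix
( λ  1 )
( 0  λ )
has minimum polynomial (t − λ)². Its semisimple part is φ_S = λι, and its nilpotent part φ_N has the matrix with a single 1 in the upper right corner; the two commute, φ_N² = 0, and μ_S = t − λ, μ_N = t², in accordance with k = max kᵢ = 2. By the uniqueness assertion this is the only decomposition of φ into commuting semisimple and nilpotent summands.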
13.25. The Jordan normal form of a semisimple transformation. Suppose that E is irreducible with respect to a semisimple transformation φ. Then it follows from sec. 13.13 and Theorem I, sec. 13.24, that the minimum polynomial of φ has the form
μ = f,
where f is irreducible. Hence the Jordan canonical matrix of φ has the form (cf. sec. 13.15)

( 0  0  ⋯  0  −α₀      )
( 1  0  ⋯  0  −α₁      )
( 0  1  ⋯  0  −α₂      )    (13.57)
( ⋮  ⋮      ⋮   ⋮       )
( 0  0  ⋯  1  −α_{p−1} )

where f = t^p + α_{p−1} t^{p−1} + ⋯ + α₀ and p = deg μ.
It follows that the Jordan canonical matrix of an arbitrary semisimple transformation consists of submatrices of the form (13.57) following each other along the main diagonal. Now consider the special case that E is irreducible with respect to a semisimple transformation whose minimum polynomial is completely reducible. Then we have that p = 1, and hence E has dimension 1. It follows that if φ is a semisimple transformation with completely reducible minimum polynomial, then E is the direct sum of stable subspaces of dimension 1; i.e., E has a basis of eigenvectors. The matrix of φ with respect to this basis is of the form
( λ₁            )
(    λ₂         )    (13.58)
(       ⋱       )
(          λ_n  )
where the λ_ν are the (not necessarily distinct) eigenvalues of φ. A linear transformation with a matrix of the form (13.58) is called diagonalizable. Thus semisimple linear transformations with completely reducible minimum polynomial are diagonalizable. Finally, let φ be a semisimple transformation of a real vector space E.
Then a similar argument shows that E is the direct sum of irreducible subspaces of dimension 1 or 2.
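For example, the rotation of the Euclidean plane through an angle θ with 0 < θ < π, given by the matrix
( cos θ  −sin θ )
( sin θ   cos θ ),
has minimum polynomial t² − 2 cos θ · t + 1, which is irreducible over ℝ; it is therefore semisimple, and the plane itself is an irreducible subspace of dimension 2. A transformation of ℝ³ which rotates a plane in this way and multiplies a complementary line by a scalar is again semisimple, and its decomposition consists of one irreducible subspace of dimension 2 and one of dimension 1.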
13.26.* The commutant of a semisimple transformation.
Theorem III: The commutant C(φ) of a semisimple transformation φ is a direct sum of ideals (in the algebra C(φ)), each of which is isomorphic to the full algebra of transformations of a vector space over an extension field of Γ.
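As an illustration, let φ be a diagonalizable transformation of a 3-dimensional space with exactly two eigenvalues λ₁ and λ₂, of multiplicities 2 and 1 and with eigenspaces E₁ and E₂. A transformation commutes with φ if and only if it leaves E₁ and E₂ stable, and hence
C(φ) ≅ A(E₁; E₁) ⊕ A(E₂; E₂),
a direct sum of two ideals, namely the full algebras of transformations of E₁ and of E₂ (the extension fields of Theorem III reduce here to Γ itself, since the irreducible factors t − λ₁ and t − λ₂ of μ are linear).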
Proof: Let
E = E₁ ⊕ ⋯ ⊕ E_r    (13.59)
be the decomposition of E into the generalized eigenspaces. It follows from sec. 13.21 that the eigenspaces E_i are stable under every transformation ψ ∈ C(φ). Now let C_j be the subspace of C(φ) consisting of all transformations ψ such that ψ E_k = 0 for k ≠ j. Since the E_k are stable under each element of C(φ), it follows that C_j is an ideal in the algebra C(φ). As an immediate consequence of the definition, we have
C_j ∩ Σ_{k≠j} C_k = 0,    j = 1, ..., r.    (13.60)
Now let ψ ∈ C(φ) be arbitrary and consider the projection operators π_j associated with the decomposition (13.59). Then
ψ = Σ_j ψ_j,    (13.61)
where
ψ_j = π_j ∘ ψ.    (13.62)
It follows from (13.62) that ψ_j ∈ C_j. Hence formulae (13.60) and (13.61) imply that
C(φ) = C₁ ⊕ ⋯ ⊕ C_r.
It is clear that C_j is isomorphic to the commutant of the transformation φ_j induced in E_j by φ, where the isomorphism is obtained by restricting a transformation in C_j to E_j.
Now consider the transformations φ_j induced by φ. Since the minimum polynomial of φ_j is irreducible, it follows from chap. V, § 3 that E_j may be considered as a vector space over the extension field Γ(φ_j), and we obtain that the commutant of φ_j is the full algebra of Γ(φ_j)-linear transformations of E_j.
13.27. Semisimple sets of linear transformations. A set {φ_α} of linear transformations of E will be called semisimple if to every subspace F₁ ⊂ E which is stable under each φ_α there exists a complementary subspace F₂ which is stable under each φ_α.
Suppose now that {φ_α} is any set of linear transformations and let A be the subalgebra of A(E; E) generated by the φ_α. Then clearly a subspace F ⊂ E is stable under each φ_α if and only if it is stable under each ψ ∈ A. In particular, the set {φ_α} is semisimple if and only if the algebra A is semisimple.
Theorem IV: Let {φ_α} be a set of commuting semisimple transformations. Then {φ_α} is a semisimple set.
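As a simple illustration, let φ and ψ be the transformations of a 3-dimensional space which, with respect to a basis e₁, e₂, e₃, have the matrices diag(1, 1, 2) and diag(3, 4, 4). They commute and are semisimple, and the lines Γe₁, Γe₂, Γe₃ are stable under both. If F₁ is stable under φ and ψ, then applying φ − 2ι and ψ − 3ι to an arbitrary vector of F₁ shows that F₁ is the sum of those lines Γe_ν which it contains; the sum of the remaining lines is then a complementary subspace stable under both transformations.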
Proof: We first consider the case of a finite set φ₁, ..., φ_s of transformations and proceed by induction on s. If s = 1 the theorem is trivial. Suppose now that it holds for s − 1, and assume for the moment that the minimum polynomial of φ₁ is irreducible. Then E may be considered as a Γ(φ₁)-vector space. Since the φ_i (i = 2, ..., s) commute with φ₁, they may be considered as Γ(φ₁)-linear transformations (cf. Chap. V, § 3). Moreover, Proposition II, sec. 13.24, implies that the φ_i, considered as Γ(φ₁)-linear transformations, are again semisimple. Now let F₁ ⊂ E be any subspace stable under the φ_i (i = 1, ..., s). Then, since F₁ is stable under φ₁, it is a Γ(φ₁)-subspace of E. Hence, by the induction hypothesis, there exists a Γ(φ₁)-subspace F₂ of E which is stable under the φ_i and such that
E = F₁ ⊕ F₂.
Since F₂ is a Γ(φ₁)-subspace, it is also stable under φ₁, and so it is a stable subspace complementary to F₁.
Now let the minimum polynomial μ₁ of φ₁ be arbitrary. Since φ₁ is semisimple, we have
μ₁ = f₁ ⋯ f_r,    f_i irreducible and relatively prime.
Let
E = E₁ ⊕ ⋯ ⊕ E_r
be the decomposition of E into the generalized eigenspaces of φ₁. Now assume that F₁ ⊂ E is a subspace stable under each φ_i (i = 1, ..., s). According to sec. 13.21, each E_j is stable under each φ_i. It follows that the subspaces F₁ ∩ E_j are also stable under every φ_i. Moreover, the restrictions of the φ_i to each E_j are again semisimple (cf. sec. 13.24) and, in particular, the restriction of φ₁ to E_j has as minimum polynomial the irreducible polynomial f_j. Thus it follows that the restrictions of the φ_i to E_j form a semisimple set, and hence there exist subspaces P_j which are stable under each φ_i and which satisfy
E_j = (F₁ ∩ E_j) ⊕ P_j,    j = 1, ..., r.
Setting
F₂ = P₁ ⊕ ⋯ ⊕ P_r,
we have that F₂ is stable under each φ_i and that
E = F₁ ⊕ F₂.
This closes the induction, and completes the proof for the case that the φ_α are a finite set. If the set is infinite, consider the subalgebra A ⊂ A(E; E) generated by the φ_α. Then A is a commutative algebra, and hence every subset of A consists of commuting transformations. In view of the discussion at the beginning of this section it is sufficient to construct a semisimple system of generators for A. But A is finite dimensional and so has a finite system of generators. Hence the theorem is reduced to the case of a finite set.
Theorem IV has the following converse:
Theorem V: Suppose A ⊂ A(E; E) is a commutative semisimple set. Then, for each φ ∈ A, φ is a semisimple transformation.
Proof: Let φ ∈ A be arbitrary and consider the decomposition
μ = f₁^{k₁} ⋯ f_r^{k_r},    f_i irreducible and relatively prime,
of its minimum polynomial μ. Define a polynomial, g, by
g = f₁ ⋯ f_r.
Since the set A is commutative, K(g) is stable under every ψ ∈ A. Hence
there exists a subspace E₁ ⊂ E which is stable under every ψ ∈ A such that
E = K(g) ⊕ E₁.    (13.63)
Now let
h = f₁^{k₁−1} ⋯ f_r^{k_r−1}.
Then we have
h(φ)E ⊂ K(g).
On the other hand, since E₁ is stable under h(φ),
h(φ)E₁ ⊂ E₁,
whence
h(φ)E₁ ⊂ K(g) ∩ E₁ = 0.
It follows that E₁ ⊂ K(h). Now consider the polynomial
p = f₁^{l₁} ⋯ f_r^{l_r},    where l_i = max(k_i − 1, 1).
Since l_i ≥ k_i − 1, it follows that
p(φ)x = 0,    x ∈ E₁,    (13.64)
and from l_i ≥ 1 we obtain
p(φ)x = 0,    x ∈ K(g).    (13.65)
In view of (13.63), (13.64) and (13.65) imply that p(φ) = 0, and so μ divides p. Now it follows that
max(k_i − 1, 1) ≥ k_i,    i = 1, ..., r,
whence
k_i = 1,    i = 1, ..., r.
Hence φ is semisimple.
Corollary: If φ and ψ are two commuting semisimple transformations, then φ + ψ and ψ∘φ are again semisimple.
Proof: Consider the subalgebra A ⊂ A(E; E) generated by φ and ψ. Then Theorem IV implies that A is a semisimple set. Hence it follows from Theorem V that φ + ψ and ψ∘φ are semisimple.
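The hypothesis that φ and ψ commute cannot be dropped. Consider, for instance, the two transformations of a 2-dimensional space given by the matrices
( 1   1 )           ( −1  0 )
( 0  −1 )    and    (  0  1 ).
Each has the two distinct eigenvalues 1 and −1 and is therefore semisimple, but the two do not commute, and their sum, with matrix
( 0  1 )
( 0  0 ),
is a non-zero nilpotent transformation and hence (cf. Corollary I) not semisimple.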
Problems
1. Let φ be nilpotent, and let N_j be the number of subspaces of dimension j in a decomposition of E into irreducible subspaces. Prove that dim ker φ = Σ_j N_j.
2. Let φ be nilpotent of degree k in a 6-dimensional vector space E. For each k (1 ≤ k ≤ 6) determine the possible ranks of φ, and show that k and r(φ) determine the numbers N_j (cf. problem 1) explicitly. Conclude that two nilpotent transformations φ and ψ of E are conjugate if and only if they have the same degree and the same rank.
3. Suppose φ is nilpotent and let φ*: E* ← E* be the dual mapping. Assume that E is cyclic with respect to φ and that φ is of degree k. Let a be a generator of E. Prove that E* is cyclic with respect to φ* and that φ* is of degree k. Let a* ∈ E* be any vector. Show that ⟨a*, φ^{k−1}a⟩ ≠ 0 if and only if a* is a generator of E*.
4. Prove that a linear transformation φ with minimum polynomial μ is diagonalizable if and only if i) μ is completely reducible and ii) φ is semisimple. Show that i) and ii) are equivalent to
μ = (t − λ₁) ⋯ (t − λ_r),
where the λ_i are distinct scalars.
5. a) Prove that two commuting diagonalizable transformations are simultaneously diagonalizable; i.e., there exists a basis of E with respect to which both matrices are diagonal.
b) Use a) to prove that if φ and ψ are commuting semisimple transformations of a complex space, then φ + ψ and ψ∘φ are again semisimple.
6. Let φ be a linear transformation of a complex space E. Let E = E₁ ⊕ ⋯ ⊕ E_r be the decomposition of E into generalized eigenspaces, and let π_i be the corresponding projection operators. Assume that the minimum polynomial of the induced transformation φ_i is (t − λ_i)^{k_i}. Prove that the semisimple part of φ is given by
φ_S = Σ_i λ_i π_i.
7. Let E be a complex vector space and φ be a linear transformation with eigenvalues λ_ν (ν = 1, ..., n), not necessarily distinct (n = dim E). Given an arbitrary polynomial f, prove directly that the linear transformation f(φ) has the eigenvalues f(λ_ν) (ν = 1, ..., n).
8. Give an example of a semisimple set of linear transformations which contains transformations that are not semisimple.
9. Let A be an algebra of commuting linear transformations in a complex space E.
a) Construct a decomposition E = E₁ ⊕ ⋯ ⊕ E_r such that, for any φ ∈ A, each E_i is stable under φ and the minimum polynomial of the induced transformation is of the form (t − λ_i(φ))^{k_i}.
b) Show that the mapping A → ℂ given by φ ↦ λ_i(φ), φ ∈ A, is a linear function in A. Prove that λ_i preserves products and so it is a homomorphism.
c) Show that the nilpotent transformations in A form an ideal which is precisely rad A (cf. chap. V, § 2). Consider the subspace T of L(A) generated by the λ_i. Prove that rad A = T^⊥.
d) Prove that the semisimple transformations in A form a subalgebra, A_S. Consider the linear functions in A_S obtained by restricting the λ_i to A_S. Show that they generate (linearly) the dual space L(A_S). Prove that the mapping T → L(A_S) is a linear isomorphism.
10. Assume that E is a complex vector space. Prove that every commutative algebra of semisimple transformations is contained in an n-dimensional commutative algebra of semisimple transformations.
11. Calculate the semisimple and nilpotent parts of the linear transformations of problem 7, § 1.

§ 7. Applications to inner product spaces
In this concluding paragraph we shall apply our general decomposition theorems to inner product spaces. Decompositions of an inner product space into irreducible subspaces with respect to selfadjoint mappings, skew mappings and isometries have already been constructed in chap. VIII. Generalizing these results we shall now construct a decomposition for a normal transformation. Since a complex linear space is fully reducible with respect to a normal endomorphism (cf. sec. 11.10) we can restrict ourselves to real inner product spaces.
13.28. Normal transformations. Let E be an inner product space and φ: E → E be a normal transformation (cf. sec. 8.5). It is clear that every polynomial in φ is again normal. Moreover, since the rank of φ^k (k = 2, 3, ...) is equal to the rank of φ, it follows that φ is nilpotent only if φ = 0. Now consider the decomposition of the minimum polynomial into its prime factors,
μ = f₁^{k₁} ⋯ f_r^{k_r},    (13.66)
and the corresponding decomposition of E into the generalized eigenspaces,
E = E₁ ⊕ ⋯ ⊕ E_r.    (13.67)
Since the projection operators π_i associated with the decomposition (13.67) are polynomials in φ, they are normal. On the other hand, π_i² = π_i, and so it follows from sec. 8.11 that the π_i are selfadjoint. Now let x_i ∈ E_i and x_j ∈ E_j be arbitrary. Then
(x_i, x_j) = (π_i x_i, π_j x_j) = (x_i, π_i π_j x_j) = 0,    i ≠ j;
i.e., the decomposition (13.67) is orthogonal.
Now consider the induced transformations φ_i. It follows from sec. 8.5 that the φ_i are again normal, and hence so are the transformations f_i(φ_i). On the other hand, f_i(φ_i) is nilpotent. It follows that f_i(φ_i) = 0, and hence all the exponents in (13.66) are equal to 1. Now Theorem I of sec. 13.24 implies that a normal transformation is semisimple.
Theorem I: Let E be an inner product space. Then a linear transformation φ is normal if and only if
i) the generalized eigenspaces are mutually orthogonal, and
ii) the restrictions φ_i are homothetic (cf. sec. 8.19).
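For instance, the transformation of the Euclidean plane with matrix
( a  −b )
( b   a ),    b ≠ 0,
commutes with its adjoint (the transposed matrix), and hence is normal. Its minimum polynomial t² − 2a t + (a² + b²) is irreducible over ℝ, so the plane itself is the only generalized eigenspace, and the transformation is the composition of a rotation with the positive scale factor √(a² + b²).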
Proof: Let φ be a normal transformation. It has been shown already that the spaces E_i are mutually orthogonal. Now consider the minimum polynomial f_i of the induced transformation φ_i. Since f_i is irreducible over ℝ, it follows that either
f_i = t − λ_i    (13.68)
or
f_i = t² + α_i t + β_i,    α_i² − 4β_i < 0.    (13.69)
In the first case we have that φ_i = λ_i ι. Now consider the case (13.69). Then