Kenji Nakahira
Quantum Information Science Research Center,
Quantum ICT Research Institute, Tamagawa University
6-1-1 Tamagawa-gakuen, Machida, Tokyo 194-8610 Japan
E-mail: [email protected]
arXiv:2207.04377v1 [eess.SP] 10 Jul 2022

Abstract—We propose a diagrammatic notation for matrix differentiation. Our new notation enables us to derive formulas for matrix differentiation more easily than the usual matrix (or index) notation. We demonstrate the effectiveness of our notation through several examples.

I. Introduction

Matrix differentiation (or matrix calculus) is widely accepted as an essential tool in various fields, including estimation theory, signal processing, and machine learning. Matrix differentiation provides a convenient way to collect the derivative of each component of the dependent variable with respect to each component of the independent variable, where the dependent and independent variables can each be a scalar, a vector, or a matrix. However, the usual matrix (or index) notation often suffers from cumbersome calculations and difficulty in the intuitive interpretation of the final results. It is known that diagrammatic representations using string diagrams can be successfully applied in linear algebra (see [1] and references therein). In this paper, we provide a simple diagrammatic approach to deriving useful formulas for matrix differentiation.

Here we mention some related work. Ref. [2] presents a way of graphically representing the del operator (i.e., $\nabla$), in which calculations are limited to the case of three-dimensional Euclidean space. Ref. [3] presents a diagrammatic notation for manipulating tensor derivatives with respect to a single parameter. We adopt a notation similar to those given in these references.

II. Definition of matrix differentiation

Let $\mathbb{R}$ be the set of all real numbers and $\mathbb{R}^{m\times n}$ be the set of all $m\times n$ real matrices. Also, let $\{|i\rangle\}_{i=1}^{m}$ denote the standard basis of $\mathbb{R}^{m}$. We are concerned only with finite-dimensional real Hilbert spaces. Given a map $f$ from $\mathbb{R}^{m\times n}$ to $\mathbb{R}$ and a matrix $X \in \mathbb{R}^{m\times n}$ of independent variables, we denote by $\frac{\partial}{\partial X} f(X)$ the $m\times n$ real matrix whose $(i,j)$-th component is $\frac{\partial}{\partial X_{i,j}} f(X)$, where $X_{i,j} := \langle i|X|j\rangle$ is the $(i,j)$-th component of $X$. We have

$$\frac{\partial}{\partial X} f(X) = \sum_{i=1}^{m}\sum_{j=1}^{n} |i\rangle\langle j|\, \frac{\partial}{\partial X_{i,j}} f(X) = \begin{pmatrix} \frac{\partial}{\partial X_{1,1}} f(X) & \frac{\partial}{\partial X_{1,2}} f(X) & \cdots & \frac{\partial}{\partial X_{1,n}} f(X) \\ \frac{\partial}{\partial X_{2,1}} f(X) & \frac{\partial}{\partial X_{2,2}} f(X) & \cdots & \frac{\partial}{\partial X_{2,n}} f(X) \\ \vdots & \vdots & \ddots & \vdots \\ \frac{\partial}{\partial X_{m,1}} f(X) & \frac{\partial}{\partial X_{m,2}} f(X) & \cdots & \frac{\partial}{\partial X_{m,n}} f(X) \end{pmatrix}. \quad (1)$$

In the special case of $n = 1$, $X$ is a column vector, which is denoted by $|x\rangle$. In this case, we have

$$\frac{\partial}{\partial |x\rangle} f(|x\rangle) = \sum_{i=1}^{m} |i\rangle\, \frac{\partial}{\partial x_{i}} f(|x\rangle) = \begin{pmatrix} \frac{\partial}{\partial x_{1}} f(|x\rangle) \\ \frac{\partial}{\partial x_{2}} f(|x\rangle) \\ \vdots \\ \frac{\partial}{\partial x_{m}} f(|x\rangle) \end{pmatrix}, \quad (2)$$

where $x_{i} := \langle i|x\rangle$.

A similar notation is used when $f$ is a map from $\mathbb{R}^{m\times n}$ to $\mathbb{R}^{m'\times n'}$. For such $f$, $\frac{\partial}{\partial X} f(X)$ is an $m \times n \times m' \times n'$ fourth-order tensor with components $\{\frac{\partial}{\partial X_{i,j}} \langle i'|f(X)|j'\rangle\}_{i,j,i',j'}$. This can be written as the following $mm' \times nn'$ matrix:

$$\frac{\partial}{\partial X} f(X) = \begin{pmatrix} \frac{\partial}{\partial X_{1,1}} f(X) & \frac{\partial}{\partial X_{1,2}} f(X) & \cdots & \frac{\partial}{\partial X_{1,n}} f(X) \\ \frac{\partial}{\partial X_{2,1}} f(X) & \frac{\partial}{\partial X_{2,2}} f(X) & \cdots & \frac{\partial}{\partial X_{2,n}} f(X) \\ \vdots & \vdots & \ddots & \vdots \\ \frac{\partial}{\partial X_{m,1}} f(X) & \frac{\partial}{\partial X_{m,2}} f(X) & \cdots & \frac{\partial}{\partial X_{m,n}} f(X) \end{pmatrix}, \quad (3)$$

where, for each $i$ and $j$, $\frac{\partial}{\partial X_{i,j}} f(X)$ is the $m' \times n'$ matrix whose $(i',j')$-th component is $\frac{\partial}{\partial X_{i,j}} \langle i'|f(X)|j'\rangle$.

III. Diagrammatic notation

In diagrammatic terms, a matrix is represented as a box with an input wire at the bottom and an output wire at the top. Column vectors, row vectors, and scalars are regarded as special cases of matrices. For example, $A \in \mathbb{R}^{m\times n}$, $|x\rangle \in \mathbb{R}^{m} := \mathbb{R}^{m\times 1}$, $\langle y| \in \mathbb{R}^{m*} := \mathbb{R}^{1\times m}$, and $p \in \mathbb{R}$ are diagrammatically depicted as

[diagram]. (4)

The Hilbert space $\mathbb{R}^{m}$ is represented by the wire with label $m$, while the Hilbert space $\mathbb{R}$ is represented by
‘no wire’. For a scalar, the box will be omitted. Matrix multiplication and tensor products are represented as the sequential and parallel compositions, respectively. The identity matrix $1 \in \mathbb{R}^{m\times m}$ is depicted as

[diagram]. (5)

We often use a special column vector $|\cup_{n}\rangle \in \mathbb{R}^{n} \otimes \mathbb{R}^{n}$, called a cup, and a special row vector $\langle\cap_{n}| \in \mathbb{R}^{n*} \otimes \mathbb{R}^{n*}$, called a cap. The cup $|\cup_{n}\rangle$ is depicted as

[diagram]. (6)

The cap $\langle\cap_{n}|$ is the transpose of $|\cup_{n}\rangle$, which is depicted as

[diagram]. (7)

We have that, for any $X \in \mathbb{R}^{m\times n}$,

[diagram], (8)

[diagram], (9)

and the same argument works for the right equality. Equation (8) implies that the transpose acts diagrammatically by rotating boxes $180^{\circ}$. Substituting $X = 1$ into Eq. (8) yields

[diagram]. (10)

The trace of $X \in \mathbb{R}^{m\times m}$ satisfies $\mathrm{Tr}\, X = \langle\cap| X \otimes 1 |\cup\rangle$, i.e.,

[diagram]. (11)

We also use the swap matrix $\times_{n,m}$, depicted by

[diagram], (12)

and the matrix called a “spider”, depicted by

[diagram]. (13)

For details regarding the properties of these matrices, see, e.g., Ref. [1].

IV. Diagrammatic notation for matrix differentiation

We write $\frac{\partial}{\partial X} f(X)$ with a map $f : \mathbb{R}^{m\times n} \to \mathbb{R}^{m'\times n'}$ as

[diagram]. (15)

A. Derivatives of A and X

For any matrix $A$ that is independent of $X$, $\frac{\partial}{\partial X} A = 0$, i.e.,

[diagram]. (16)

The derivative of $X$ itself is depicted as

[diagram]. (17)

B. Rules for sums and products

The following sum rule holds:

$$\frac{\partial}{\partial X}[f(X) + g(X)] = \frac{\partial}{\partial X} f(X) + \frac{\partial}{\partial X} g(X), \quad (18)$$

which is diagrammatically represented as

[diagram]. (19)

As for matrix multiplication and tensor products, we have

$$\frac{\partial}{\partial X}[f(X)\, g(X)] = \left[\frac{\partial}{\partial X} f(X)\right] g(X) + f(X) \left[\frac{\partial}{\partial X} g(X)\right], \quad (20)$$

$$\frac{\partial}{\partial X}[f(X) \otimes h(X)] = \left[\frac{\partial}{\partial X} f(X)\right] \otimes h(X) + f(X) \otimes \left[\frac{\partial}{\partial X} h(X)\right], \quad (21)$$

which are depicted as

[diagram]. (22)

C. Chain rule

For a map $Y$ from $\mathbb{R}^{m\times n}$ to $\mathbb{R}^{k\times l}$, the chain rule gives

$$\frac{\partial}{\partial X_{i,j}} f[Y(X)] = \sum_{i'=1}^{k} \sum_{j'=1}^{l} \frac{\partial f[Y(X)]}{\partial Y_{i',j'}}\, \frac{\partial Y_{i',j'}}{\partial X_{i,j}}, \quad (23)$$

where $Y_{i',j'} := \langle i'|Y(X)|j'\rangle$. Thus, $\frac{\partial}{\partial X} f[Y(X)]$ can be diagrammatically represented by

[diagram]. (24)

All the formulas presented in this paper can be obtained using the above-mentioned equations. It is noteworthy that this paper is focused on matrix differentiation, but our notation can easily be extended to the case of higher-order tensors.

V. Other basic formulas

We derive several basic formulas.

A. Derivatives of matrix multiplication and tensor products

We immediately obtain

[diagram]. (25)
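The sum, product, and chain rules above can be sanity-checked numerically. Below is a minimal NumPy sketch using central finite differences; the test maps $f(X) = \mathrm{Tr}(AX)$, $g(X) = \mathrm{Tr}(BX)$, and $Y(X) = A_2 X B_2$ are illustration choices of ours, not taken from the paper.

```python
import numpy as np

# Finite-difference check of the product rule (20) and chain rule (23)
# for scalar-valued maps; the test functions are arbitrary choices.
rng = np.random.default_rng(0)
m, n, eps = 3, 4, 1e-6
A = rng.standard_normal((n, m))
B = rng.standard_normal((n, m))
X = rng.standard_normal((m, n))

def grad(h, X):
    """(i,j)-th entry is the central-difference estimate of dh/dX[i,j]."""
    G = np.zeros_like(X)
    for i in range(X.shape[0]):
        for j in range(X.shape[1]):
            E = np.zeros_like(X); E[i, j] = eps
            G[i, j] = (h(X + E) - h(X - E)) / (2 * eps)
    return G

f = lambda X: np.trace(A @ X)   # df/dX = A.T
g = lambda X: np.trace(B @ X)   # dg/dX = B.T

# Product rule (20): d(f g)/dX = (df/dX) g + f (dg/dX)
lhs1 = grad(lambda Y: f(Y) * g(Y), X)
rhs1 = grad(f, X) * g(X) + f(X) * grad(g, X)

# Chain rule (23): h(X) = Tr(Y Y^T) with Y(X) = A2 @ X @ B2
# assembles to dh/dX = A2^T (2 Y) B2^T.
A2 = rng.standard_normal((3, m))
B2 = rng.standard_normal((n, 2))
Y = lambda X: A2 @ X @ B2
h = lambda X: np.trace(Y(X) @ Y(X).T)
lhs2 = grad(h, X)
rhs2 = A2.T @ (2 * Y(X)) @ B2.T

print(np.allclose(lhs1, rhs1, atol=1e-4), np.allclose(lhs2, rhs2, atol=1e-4))  # True True
```

Both test functions are quadratic in $X$, so the central differences agree with the analytic gradients up to floating-point roundoff.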
B. Derivative of $X^{T}$

Since $X^{T}$ is represented by

[diagram], (26)

we have

[diagram; the steps follow from Eqs. (25) and (26)].
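Although the diagrams themselves are not reproduced here, the flattening convention of Eq. (3) can be checked numerically for $f(X) = X^{T}$: the resulting $mm' \times nn'$ matrix has the swap-type structure $\sum_{i,j} E_{ij} \otimes E_{ji}$. The construction below is our illustration of that convention, not the paper's diagram.

```python
import numpy as np

# For f(X) = X^T (so m' = n, n' = m), build the m m' x n n' matrix of
# Eq. (3) by finite differences and compare it with the swap-type matrix
# sum_{i,j} E_ij (x) E_ji.
m, n, eps = 3, 4, 1e-6
rng = np.random.default_rng(1)
X = rng.standard_normal((m, n))
f = lambda X: X.T
mp, nq = n, m   # output dimensions m', n'

D = np.zeros((m * mp, n * nq))
S = np.zeros((m * mp, n * nq))
for i in range(m):
    for j in range(n):
        E = np.zeros((m, n)); E[i, j] = eps
        # (i,j)-th block is the m' x n' matrix df/dX[i,j]
        D[i*mp:(i+1)*mp, j*nq:(j+1)*nq] = (f(X + E) - f(X - E)) / (2 * eps)
        Eij = np.zeros((m, n)); Eij[i, j] = 1
        Eji = np.zeros((n, m)); Eji[j, i] = 1
        S += np.kron(Eij, Eji)
print(np.allclose(D, S))  # True
```

Since the transpose is linear in $X$, the finite differences are exact here up to roundoff.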
[diagrams for Eqs. (29)–(34); the steps cite Eqs. (17), (10), and (22)]

3) Other important examples:

[diagram]
A. Derivatives with respect to column vectors

1) $\frac{\partial}{\partial |x\rangle} \langle a|x\rangle = |a\rangle$:

Substituting $n = 1$ into Eq. (17) gives

[diagram]. (37)

Note that $\langle a|^{T} = |a\rangle$ holds since $|a\rangle$ is a real column vector.

2) $\frac{\partial}{\partial X} \mathrm{Tr}(AX) = A^{T}$:

[diagram; the steps follow from Eqs. (17), (8), and (31)]. Thus, we have

[diagram]. (38)

¹The second line follows from substituting $u := \| |x\rangle - |b\rangle \|_{2}^{2}$ into

$$\frac{\partial}{\partial |x\rangle} \sqrt{u} = \frac{\partial u}{\partial |x\rangle} \cdot \frac{\partial \sqrt{u}}{\partial u} = \frac{\partial u}{\partial |x\rangle} \cdot \frac{1}{2\sqrt{u}}, \quad (35)$$

which is immediately obtained by the chain rule.
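Examples 1) and 2) admit quick finite-difference checks. This is a sketch of ours; the random vectors and matrices are arbitrary.

```python
import numpy as np

# Finite-difference checks of examples 1) and 2).
rng = np.random.default_rng(2)
m, n, eps = 4, 3, 1e-6

# 1) d/d|x> <a|x> = |a>
a = rng.standard_normal(m)
x = rng.standard_normal(m)
g1 = np.array([(a @ (x + eps*e) - a @ (x - eps*e)) / (2*eps) for e in np.eye(m)])

# 2) d/dX Tr(AX) = A^T
A = rng.standard_normal((n, m))
X = rng.standard_normal((m, n))
G2 = np.zeros_like(X)
for i in range(m):
    for j in range(n):
        E = np.zeros((m, n)); E[i, j] = eps
        G2[i, j] = (np.trace(A @ (X + E)) - np.trace(A @ (X - E))) / (2*eps)

print(np.allclose(g1, a, atol=1e-6), np.allclose(G2, A.T, atol=1e-6))  # True True
```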
3) $\frac{\partial}{\partial X} \mathrm{Tr}(XX^{T}) = 2X$:

[diagram; the steps follow from Eqs. (8) and (24)]. (39)

4) $\frac{\partial}{\partial X} \mathrm{Tr}(AXBX) = A^{T} X^{T} B^{T} + B^{T} X^{T} A^{T}$:

[diagram; the steps follow from Eqs. (17) and (8)]. (40)

5) $\frac{\partial}{\partial X} X^{-1} = -(1 \otimes X^{-1})\, |\cup\rangle\langle\cap|\, (1 \otimes X^{-1})$:

Letting $Z := \frac{\partial}{\partial X} X^{-1}$ and differentiating $X^{-1} = X^{-1} X X^{-1}$ with respect to $X$ gives $Z = Z + X^{-1} \left[\frac{\partial}{\partial X} X\right] X^{-1} + Z$ (with the middle term suitably tensored), so that $Z = -X^{-1} \left[\frac{\partial}{\partial X} X\right] X^{-1}$. Thus, we have

[diagram; the steps follow from Eqs. (30) and (17)]. (41)

6) $\frac{\partial}{\partial X} \mathrm{Tr}[(X + A)^{-1}] = -[(X + A)^{-2}]^{T}$:

[diagram]. (42)

7) $\frac{\partial}{\partial X} \mathrm{Tr}(A \circ X) = A \circ 1$, where $\circ$ denotes the Hadamard product:

[diagram]. (43)
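The scalar-valued formulas above, together with the Hessian identity of example 8) below, can be verified by finite differences. Example 5) yields a fourth-order tensor and is skipped here. The random matrices and the diagonal shift (added only to keep $X + A$ well-conditioned) are illustration choices of ours.

```python
import numpy as np

# Finite-difference checks of examples 3), 4), 6), 7), and the Hessian of 8).
rng = np.random.default_rng(3)
n, eps = 4, 1e-5
A = rng.standard_normal((n, n))
B = rng.standard_normal((n, n))
X = rng.standard_normal((n, n)) + 10*np.eye(n)   # keep X + A well-conditioned

def grad(h, X):
    G = np.zeros_like(X)
    for i in range(n):
        for j in range(n):
            E = np.zeros_like(X); E[i, j] = eps
            G[i, j] = (h(X + E) - h(X - E)) / (2*eps)
    return G

inv = np.linalg.inv
ok = [
    np.allclose(grad(lambda Y: np.trace(Y @ Y.T), X), 2*X, atol=1e-4),         # 3)
    np.allclose(grad(lambda Y: np.trace(A @ Y @ B @ Y), X),
                A.T @ X.T @ B.T + B.T @ X.T @ A.T, atol=1e-4),                 # 4)
    np.allclose(grad(lambda Y: np.trace(inv(Y + A)), X),
                -(inv(X + A) @ inv(X + A)).T, atol=1e-4),                      # 6)
    np.allclose(grad(lambda Y: np.trace(A * Y), X), A * np.eye(n), atol=1e-4), # 7)
]

# 8) Hessian of <x|A|x> + <b|x> is A + A^T (second-order central differences)
b = rng.standard_normal(n)
x = rng.standard_normal(n)
q = lambda v: v @ A @ v + b @ v
h = 1e-3
I = np.eye(n)
H = np.zeros((n, n))
for i in range(n):
    for j in range(n):
        H[i, j] = (q(x + h*I[i] + h*I[j]) - q(x + h*I[i] - h*I[j])
                   - q(x - h*I[i] + h*I[j]) + q(x - h*I[i] - h*I[j])) / (4*h*h)
ok.append(np.allclose(H, A + A.T, atol=1e-5))
print(all(ok))  # True
```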
8) $\frac{\partial^{2}}{\partial |x\rangle\, \partial \langle x|} (\langle x|A|x\rangle + \langle b|x\rangle) = A + A^{T}$:

[diagram; the steps follow from Eqs. (34), (32), (31), and (10)]. (44)

This formula shows that the Hessian matrix of the quadratic function $\langle x|A|x\rangle + \langle b|x\rangle + c$ with $A \in \mathbb{R}^{m\times m}$, $|b\rangle \in \mathbb{R}^{m}$, and $c \in \mathbb{R}$ is $A + A^{T}$.

9) Other important examples:

[diagram]

References

[3] A. Toumi, R. Yeung, and G. de Felice, “Diagrammatic differentiation for quantum machine learning,” arXiv preprint arXiv:2103.07960, 2021.