Math - Linear Algebra
Linear Algebra is the branch of mathematics that studies vector spaces and linear transformations between vector spaces, such as rotating a shape, scaling it up or down, translating it (ie. moving it), etc.
Machine Learning relies heavily on Linear Algebra, so it is essential to understand what vectors and matrices are, what operations you can perform with them, and how they can be useful.
Before we start, let's ensure that this notebook works well in both Python 2 and 3:
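A minimal compatibility cell might look like this (the exact imports are an assumption, since the original cell is not shown):

```python
# Assumed Python 2/3 compatibility imports
from __future__ import division, print_function, unicode_literals
```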
Vectors
Definition
A vector is a quantity defined by a magnitude and a direction. For example, a rocket's velocity is a 3-dimensional vector: its magnitude is the speed of the rocket, and its direction is (hopefully) up. A vector can be represented by an array of numbers called scalars. Each scalar corresponds to the magnitude of the vector with regards to each dimension.
For example, say the rocket is going up at a slight angle: it has a vertical speed of 5,000 m/s, and also a slight speed towards the East at 10 m/s, and a slight speed towards the North at 50 m/s. The rocket's velocity may be represented by the following vector:
$$\textbf{velocity} = \begin{pmatrix} 10 \\ 50 \\ 5000 \end{pmatrix}$$
Note: by convention vectors are generally presented in the form of columns. Also, vector names are generally lowercase to distinguish them from matrices (which we will discuss below), and in bold (when possible) to distinguish them from simple scalar values.
A list of N numbers may also represent the coordinates of a point in an N-dimensional space, so it is quite frequent to represent vectors as simple points instead of arrows. A vector with 1 element may be represented as an arrow or a point on an axis, a vector with 2 elements is an arrow or a point on a plane, a vector with 3 elements is an arrow or point in space, and a vector with N elements is an arrow or a point in an N-dimensional space… which most people find hard to imagine.
Purpose
Vectors have many purposes in Machine Learning, most notably to represent observations and predictions. For example, say we built a Machine Learning system to classify videos into 3 categories (good, spam, clickbait) based on what we know about them. For each video, we would have a vector representing what we know about it, such as:
$$\textbf{video} = \begin{pmatrix} 10.5 \\ 5.2 \\ 3.25 \\ 7.0 \end{pmatrix}$$
This vector could represent a video that lasts 10.5 minutes, but only 5.2% of viewers watch it for more than a minute, it gets 3.25 views per day on average, and it was flagged 7 times as spam. As you can see, each axis may have a different meaning.
Based on this vector our Machine Learning system may predict that there is an 80% probability that it is a spam video, 18% that it is clickbait, and 2% that it is a good video. This could be represented as the following vector:
$$\textbf{class\_probabilities} = \begin{pmatrix} 0.80 \\ 0.18 \\ 0.02 \end{pmatrix}$$
Since we plan to do quite a lot of scientific calculations, it is much better to use NumPy's ndarray, which provides a lot of convenient and optimized implementations of essential mathematical operations on vectors (for more details about NumPy, check out the NumPy tutorial). For example:
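For instance, the video feature vector above could be created like this (a sketch; the variable name `video` is an assumption):

```python
import numpy as np

# Feature vector: [duration (min), % watching > 1 min, views/day, spam flags]
video = np.array([10.5, 5.2, 3.25, 7.0])
video
```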
array([ 10.5 ,  5.2 ,  3.25,  7.  ])
The size of a vector can be obtained using the size attribute:
The $i^{th}$ element of a vector $\textbf{v}$ is noted $\textbf{v}_i$.
Note that indices in mathematics generally start at 1, but in programming they usually start at 0. So to access $\textbf{video}_3$ programmatically, we need to use index 2:
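A quick sketch of both points, assuming the `video` array defined above:

```python
video.size   # 4: the number of elements in the vector
video[2]     # 3.25: the 3rd element (indices start at 0 in Python)
```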
Plotting vectors
To plot vectors we will use matplotlib, so let's start by importing it (for details about matplotlib, check the matplotlib tutorial):
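Something along these lines, with two example 2D vectors (the values of `u` and `v` are assumptions, chosen to be consistent with the norms and angles computed later in this notebook):

```python
import matplotlib.pyplot as plt

# Two example 2D vectors
u = np.array([2, 5])
v = np.array([3, 1])
```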
These vectors each have 2 elements, so they can easily be represented graphically on a 2D graph, for example as points:
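For example, a quick sketch using plt.scatter:

```python
x_coords, y_coords = zip(u, v)   # first the x coordinates, then the y coordinates
plt.scatter(x_coords, y_coords, color=["r", "b"])
plt.axis([0, 9, 0, 6])
plt.grid()
plt.show()
```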
Vectors can also be represented as arrows. Let's create a small convenience function to draw nice arrows:
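One possible implementation (a sketch, not necessarily the original helper) based on pyplot's arrow function:

```python
def plot_vector2d(vector2d, origin=[0, 0], **options):
    """Draw a 2D vector as an arrow starting at `origin`."""
    return plt.arrow(origin[0], origin[1], vector2d[0], vector2d[1],
                     head_width=0.2, head_length=0.3,
                     length_includes_head=True, **options)
```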
Now let's draw the vectors u and v as arrows:
Now let's define two 3D vectors and plot them using matplotlib's Axes3D:
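A sketch of how this can be done; the values of `a` and `b` are assumptions:

```python
from mpl_toolkits.mplot3d import Axes3D  # registers the "3d" projection

a = np.array([1, 2, 8])
b = np.array([5, 6, 3])

fig = plt.figure()
ax = fig.add_subplot(111, projection="3d")
ax.scatter(a[0], a[1], a[2], color="r")
ax.scatter(b[0], b[1], b[2], color="b")
plt.show()
```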
It is a bit hard to visualize exactly where in space these two points are, so let's add vertical lines. We'll create a small convenience function to plot a list of 3d vectors with vertical lines attached:
Norm
The norm of a vector $\textbf{u}$, noted $\left \Vert \textbf{u} \right \|$, is a measure of its length (its magnitude). The most common norm, and the one we will use here, is the Euclidean norm, defined as:

$\left \Vert \textbf{u} \right \| = \sqrt{\sum_i{\textbf{u}_i}^2}$

We could implement this easily in pure Python, using the formula above:
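A pure-Python sketch:

```python
def vector_norm(vector):
    """Euclidean norm: square root of the sum of the squared elements."""
    return sum(x ** 2 for x in vector) ** 0.5

print("||", u, "|| =", vector_norm(u))
```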
||[2 5] || = 5.385164807134504
However, it is much more efficient to use NumPy's norm function, available in the linalg (Linear Algebra) module:
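For example:

```python
import numpy.linalg as LA

LA.norm(u)   # 5.385164807134504
```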
Let's plot a little diagram to confirm that the length of vector $\textbf{u}$ is indeed about 5.4:
Looks about right!
Let's look at what vector addition looks like graphically:
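A sketch, computing the elementwise sum and drawing the three arrows with the helper defined above:

```python
print("u + v =", u + v)       # elementwise sum: array([5, 6])

plot_vector2d(u, color="r")
plot_vector2d(v, color="b")
plot_vector2d(u + v, color="g")
plt.axis([0, 9, 0, 7])
plt.grid()
plt.show()
```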
Vector addition is commutative, meaning that $\textbf{u} + \textbf{v} = \textbf{v} + \textbf{u}$.
Vector addition is also associative, meaning that $\textbf{u} + (\textbf{v} + \textbf{w}) = (\textbf{u} + \textbf{v}) + \textbf{w}$.
If you have a shape defined by a number of points (vectors), and you add a vector $\textbf{v}$ to all of these points, then the whole shape gets shifted by $\textbf{v}$. This is called a geometric translation.
Finally, subtracting a vector is like adding the opposite vector.
1.5 * [2 5] =
array([ 3. , 7.5])
Graphically, scalar multiplication results in changing the scale of a figure, hence the name scalar. The distance from the origin (the point at coordinates equal to zero) is also multiplied by the scalar. For example, let's scale up by a factor of k = 2.5:
As you might guess, dividing a vector by a scalar is equivalent to multiplying by the reciprocal of that scalar: $\dfrac{\textbf{u}}{\lambda} = \dfrac{1}{\lambda} \times \textbf{u}$
Scalar multiplication is commutative: $\lambda \times \textbf{u} = \textbf{u} \times \lambda$.
It is also associative: $\lambda_1 \times (\lambda_2 \times \textbf{u}) = (\lambda_1 \times \lambda_2) \times \textbf{u}$.
Finally, it is distributive over addition of vectors: $\lambda \times (\textbf{u} + \textbf{v}) = \lambda \times \textbf{u} + \lambda \times \textbf{v}$.
Zero, unit and normalized vectors
- A **zero-vector** is a vector full of 0s.
- A **unit vector** is a vector with a norm equal to 1.
- The **normalized vector** of a non-null vector $\textbf{u}$, noted $\hat{\textbf{u}}$, is the unit vector that points in the same direction as $\textbf{u}$. It is equal to: $\hat{\textbf{u}} = \dfrac{\textbf{u}}{\left \Vert \textbf{u} \right \|}$
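In NumPy, normalizing a vector might look like this (a sketch):

```python
u_normalized = u / LA.norm(u)   # divide each element by the vector's norm
u_normalized                    # its norm is now 1
```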
Dot product
Definition
The dot product (also called scalar product or inner product in the context of the Euclidean space) of two vectors $\textbf{u}$ and $\textbf{v}$ is noted $\textbf{u} \cdot \textbf{v}$ and it is defined as:

$\textbf{u} \cdot \textbf{v} = \left \Vert \textbf{u} \right \| \times \left \Vert \textbf{v} \right \| \times \cos(\theta)$

where $\theta$ is the angle between $\textbf{u}$ and $\textbf{v}$.

Another way to calculate the dot product is:

$\textbf{u} \cdot \textbf{v} = \sum_i{\textbf{u}_i \times \textbf{v}_i}$
In python
The dot product is pretty simple to implement:
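For example, a pure-Python sketch:

```python
def dot_product(v1, v2):
    """Sum of the elementwise products of the two vectors."""
    return sum(x * y for x, y in zip(v1, v2))

dot_product(u, v)   # 2*3 + 5*1 = 11
```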
But a much more efficient implementation is provided by NumPy with the dot function:
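For example, using the vectors defined above:

```python
np.dot(u, v)   # 11
```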
Equivalently, you can use the dot method of ndarrays:
Caution: the * operator will perform an elementwise multiplication, NOT a dot product:
Main properties
- The dot product is commutative: $\textbf{u} \cdot \textbf{v} = \textbf{v} \cdot \textbf{u}$.
- The dot product is only defined between two vectors, not between a scalar and a vector. This means that we cannot chain dot products: for example, the expression $\textbf{u} \cdot \textbf{v} \cdot \textbf{w}$ is not defined since $\textbf{u} \cdot \textbf{v}$ is a scalar and $\textbf{w}$ is a vector.
- This also means that the dot product is NOT associative: $(\textbf{u} \cdot \textbf{v}) \cdot \textbf{w} \ne \textbf{u} \cdot (\textbf{v} \cdot \textbf{w})$ since neither is defined.
- However, the dot product is associative with regards to scalar multiplication: $\lambda \times (\textbf{u} \cdot \textbf{v}) = (\lambda \times \textbf{u}) \cdot \textbf{v}$
- Finally, the dot product is distributive over addition of vectors: $\textbf{u} \cdot (\textbf{v} + \textbf{w}) = \textbf{u} \cdot \textbf{v} + \textbf{u} \cdot \textbf{w}$.
Calculating the angle between vectors
One of the many uses of the dot product is to calculate the angle between two non-zero vectors. Looking at the dot product definition, we can deduce the following formula:

$\theta = \arccos{\left(\dfrac{\textbf{u} \cdot \textbf{v}}{\left \Vert \textbf{u} \right \| \times \left \Vert \textbf{v} \right \|}\right)}$

Note that if $\textbf{u} \cdot \textbf{v} = 0$, it follows that $\theta = \dfrac{\pi}{2}$: in other words, if the dot product of two non-null vectors is zero, it means that they are orthogonal.
Let's use this formula to calculate the angle between $\textbf{u}$ and $\textbf{v}$ (in radians):
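A sketch of the computation, clipping cos θ into [-1, 1] as discussed in the note below:

```python
def vector_angle(u, v):
    cos_theta = u.dot(v) / LA.norm(u) / LA.norm(v)
    return np.arccos(np.clip(cos_theta, -1, 1))   # clip avoids arccos domain errors

theta = vector_angle(u, v)
print("angle =", theta, "radians")
```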
angle = 0.868539395286 radians
Note: due to small floating point errors, cos_theta may be very slightly outside the [-1, 1] interval, which would make arccos fail. This is why we clipped the value within the range, using NumPy's clip function.
Matrices
A matrix is a rectangular array of scalars (ie. any number: integer, real or complex) arranged in rows and columns, for example:

$\begin{bmatrix} 10 & 20 & 30 \\ 40 & 50 & 60 \end{bmatrix}$
You can also think of a matrix as a list of vectors: the previous matrix contains either 2 horizontal 3D vectors or 3 vertical 2D vectors.
Matrices are convenient and very efficient to run operations on many vectors at a time. We will also see that they are great at representing and performing linear transformations such as rotations, translations and scaling.
A much more efficient way is to use the NumPy library which provides optimized implementations of many matrix operations:
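For example, the 2×3 matrix above could be created like this (the variable name `A` is an assumption, but it matches how the matrix is referred to later):

```python
A = np.array([
    [10, 20, 30],
    [40, 50, 60]
])
A
```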
By convention matrices generally have uppercase names, such as $A$.
In the rest of this tutorial, we will assume that we are using NumPy arrays (type ndarray) to represent matrices.
Size
The size of a matrix is defined by its number of rows and number of columns, and it is noted $rows \times columns$. For example, the matrix above is a $2 \times 3$ matrix.
To get a matrix's size in NumPy:
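Using the shape attribute, for example:

```python
A.shape   # (2, 3): 2 rows, 3 columns
```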
Caution: the size attribute represents the number of elements in the ndarray, not the matrix's size:
Element indexing
The number located in the $i^{th}$ row and $j^{th}$ column of a matrix $X$ is sometimes noted $X_{i,j}$ or $X_{ij}$, but there is no fully standard notation.
However, in this notebook we will use the $X_{i,j}$ notation.
The $i^{th}$ row vector of $X$ is noted $X_{i,*}$. Similarly, the $j^{th}$ column vector is noted $X_{*,j}$. Both can be accessed easily in NumPy (keeping in mind that indices start at 0).
Note that the result is actually a one-dimensional NumPy array: there is no such thing as a vertical or horizontal one-dimensional array. If you need to actually represent a row vector as a one-row matrix (ie. a 2D NumPy array), or a column vector as a one-column matrix, then you need to use a slice instead of an integer when accessing the row or column, for example:
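A sketch using the matrix A defined above:

```python
A[1, :]     # 2nd row as a 1D array:         array([40, 50, 60])
A[1:2, :]   # 2nd row as a one-row matrix:   array([[40, 50, 60]])
A[:, 1:2]   # 2nd column as a one-column matrix
```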
An upper triangular matrix is a special kind of square matrix where all the elements below the main diagonal (top-left to bottom-right) are zero, for example:

$\begin{bmatrix} 4 & 9 & 2 \\ 0 & 5 & 3 \\ 0 & 0 & 6 \end{bmatrix}$
Similarly, a lower triangular matrix is a square matrix where all elements above the main diagonal are zero, for example:

$\begin{bmatrix} 1 & 0 & 0 \\ 7 & 2 & 0 \\ 4 & 8 & 3 \end{bmatrix}$
A triangular matrix is one that is either lower triangular or upper triangular.
A matrix that is both upper and lower triangular is called a diagonal matrix: all its elements are zero except those on the main diagonal, for example:

$\begin{bmatrix} 5 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 7 \end{bmatrix}$
You can construct a diagonal matrix using NumPy's diag function:
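For example:

```python
np.diag([4, 5, 6])   # 3x3 matrix with 4, 5, 6 on the main diagonal, 0 elsewhere
```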
If you pass a matrix to the diag function, it will happily extract the diagonal values:
Finally, the identity matrix of size $n$, noted $I_n$, is a diagonal matrix of size $n \times n$ with 1's on the main diagonal and 0's everywhere else.
NumPy's eye function returns the identity matrix of the desired size:
The identity matrix is often noted simply $I$ (without specifying its size) when its size is unambiguous given the context.
Adding matrices
If two matrices have the same size, they can be added together: addition is performed elementwise, so the result is a matrix of the same size where each element is the sum of the corresponding elements.
For example, let's create a second $2 \times 3$ matrix and add it to $A$:
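A sketch, with an assumed matrix B of the same size:

```python
B = np.array([
    [1, 2, 3],
    [4, 5, 6]
])
A + B   # elementwise sum: array([[11, 22, 33], [44, 55, 66]])
```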
Addition is commutative, meaning that $A + B = B + A$.
It is also associative, meaning that $A + (B + C) = (A + B) + C$.
Scalar multiplication
A matrix $M$ can be multiplied by a scalar $\lambda$. The result is a matrix of the same size where every element of $M$ is multiplied by $\lambda$.
A more concise way of writing this is: $(\lambda M)_{i,j} = \lambda (M)_{i,j}$
In NumPy, simply use the * operator to multiply a matrix by a scalar. For example:
Scalar multiplication is also defined on the right hand side, and gives the same result:
This makes scalar multiplication commutative.
It is also associative, meaning that $\alpha (\beta M) = (\alpha \times \beta) M$.
Finally, it is distributive over addition of matrices, meaning that $\lambda (Q + R) = \lambda Q + \lambda R$.
Matrix multiplication
So far, matrix operations have been rather intuitive. But multiplying matrices is a bit more involved.
A matrix $Q$ of size $m \times n$ can be multiplied by a matrix $R$ of size $n \times q$. The result $P$ is an $m \times q$ matrix where each element is computed as a sum of products:

$P_{i,j} = \sum_{k=1}^{n}{Q_{i,k} \times R_{k,j}}$

The element at position $i,j$ in the resulting matrix is the sum of the products of the elements in row $i$ of $Q$ by the elements in column $j$ of $R$.

You may notice that each element $P_{i,j}$ is the dot product of the row vector $Q_{i,*}$ and the column vector $R_{*,j}$.

So we can rewrite $P$ more concisely as: $P_{i,j} = Q_{i,*} \cdot R_{*,j}$
Let's multiply two matrices in NumPy, using ndarray's dot method:
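A sketch: the values of `D` below are assumptions, chosen to be consistent with the (truncated) output shown underneath:

```python
D = np.array([
    [ 2,  3,  5,  7],
    [11, 13, 17, 19],
    [23, 29, 31, 37]
])
E = A.dot(D)   # (2x3) · (3x4) -> a 2x4 matrix
E
```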
array([[ 930, 1160, 1320, 1560],
Let's check this result by looking at one element, just to be sure: each element of the result should equal the dot product of the corresponding row of A and column of D.
Looks good! You can check the other elements until you get used to the algorithm.
We multiplied a $2 \times 3$ matrix by a $3 \times 4$ matrix, so the result is a $2 \times 4$ matrix. The first matrix's number of columns has to be equal to the second matrix's number of rows; if we try to multiply the matrices in the other order, NumPy complains:

ValueError: shapes (3,4) and (2,3) not aligned: 4 (dim 1) != 2 (dim 0)
This illustrates the fact that matrix multiplication is NOT commutative: in general $Q \cdot R \ne R \cdot Q$.
In fact, the product in the reverse order is not even necessarily defined, as we just saw.
On the other hand, matrix multiplication is associative, meaning that $Q \cdot (R \cdot S) = (Q \cdot R) \cdot S$.
array([[21640, 28390, 27320, 31140, 13570],
array([[21640, 28390, 27320, 31140, 13570],
It is also distributive over addition of matrices, meaning that $Q \cdot (R + S) = Q \cdot R + Q \cdot S$.
array([[1023, 1276, 1452, 1716],
array([[1023, 1276, 1452, 1716],
The product of an $m \times n$ matrix $M$ by the identity matrix (of matching size) results in the same matrix $M$: $M \cdot I_n = I_m \cdot M = M$
This is generally written more concisely (since the size of the identity matrices is unambiguous given the context): $M \cdot I = I \cdot M = M$
For example:
Caution: NumPy's * operator performs elementwise multiplication, NOT a matrix multiplication:
The @ infix operator
Python 3.5 introduced the @ infix operator for matrix multiplication, and NumPy 1.10 added support for it. If you are using Python 3.5+ and NumPy 1.10+, you can simply write A @ D instead of A.dot(D), making your code much more readable (but less portable). This operator also works for vector dot products.
Note: Q @ R is actually equivalent to Q.__matmul__(R), which is implemented by NumPy as np.matmul(Q, R), not as Q.dot(R). The main difference is that matmul does not support scalar multiplication, while dot does, so you can write Q.dot(3), which is equivalent to Q * 3, but you cannot write Q @ 3 (more details).
Matrix transpose
The transpose of a matrix $M$ is a matrix noted $M^T$ such that the $i^{th}$ row in $M^T$ is equal to the $i^{th}$ column in $M$. In other words, $(M^T)_{i,j} = M_{j,i}$.
Obviously, if $M$ is an $m \times n$ matrix, then $M^T$ is an $n \times m$ matrix.
Note: there are a few other notations, such as $M^t$, $M′$, or ${}^t{M}$.
In NumPy, a matrix's transpose can be obtained simply using the T attribute:
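For example:

```python
A.T   # a 3x2 matrix: the rows of A become columns
```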
As you might expect, transposing a matrix twice returns the original matrix: $(M^T)^T = M$
Transposition is distributive over addition of matrices, meaning that $(Q + R)^T = Q^T + R^T$.
Moreover, $(Q \cdot R)^T = R^T \cdot Q^T$ (note the reversed order).
A symmetric matrix $M$ is defined as a matrix that is equal to its transpose: $M^T = M$.
The product of a matrix by its transpose is always a symmetric matrix, for example:
Converting 1D arrays to 2D arrays in NumPy
As we mentioned earlier, in NumPy (as opposed to Matlab, for example), 1D really means 1D: there is no such thing as a vertical 1D array or a horizontal 1D array. So you should not be surprised to see that transposing a 1D array does not do anything:
We want to convert $\textbf{u}$ into a row vector (a 2D array with a single row) before transposing it. There are a few ways to do this:
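A few equivalent sketches (assuming `u` is the 1D array defined earlier):

```python
u_row = np.array([u])   # wrap u in an extra pair of brackets -> shape (1, 2)
u[np.newaxis, :]        # explicit: add a new (vertical) axis, keep the data horizontal
u[np.newaxis]           # equivalent, a little less explicit
u[None]                 # shortest, but unclear (np.newaxis is just None)
```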
Notice the extra square brackets: this is a 2D array with just one row (ie. a 1x2 matrix). In other words it really is a row vector.
This is quite explicit: we are asking for a new vertical axis, keeping the existing data as the horizontal axis.
This is equivalent, but a little less explicit.
This is the shortest version, but you probably want to avoid it because it is unclear. The reason it works is that np.newaxis is actually equal to None, so this is equivalent to the previous version.
Ok, now let's transpose our row vector:
Great! We now have a nice column vector.
Rather than creating a row vector then transposing it, it is also possible to convert a 1D array directly into a column vector:
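For example:

```python
u[:, np.newaxis]   # shape (2, 1): a column vector
```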
Plotting a matrix
We have already seen that vectors can be represented as points or arrows in N-dimensional space. Is there a good graphical representation of matrices? Well, you can simply see a matrix as a list of vectors, so plotting a matrix results in many points or arrows. For example, let's create a $2 \times 4$ matrix P, holding four 2D column vectors, and plot them as points:
Of course we could also have stored the same 4 vectors as row vectors instead of column vectors, resulting in a $4 \times 2$ matrix (the transpose of P).
Since the vectors are ordered, you can see the matrix as a path and represent it with connected dots:
Or you can represent it as a polygon: matplotlib's Polygon class expects an $n \times 2$ array (one row per point) rather than a $2 \times n$ array, so we need to give it the transpose of P:
Geometric applications of matrix operations
We saw earlier that vector addition results in a geometric translation, vector multiplication by a scalar results in rescaling (zooming in or out, centered on the origin), and vector dot product results in projecting a vector onto another vector, rescaling and measuring the resulting coordinate.
Similarly, matrix operations have very useful geometric applications.
If we add a matrix full of identical vectors, we get a simple geometric translation:
Although matrices can only be added together if they have the same size, NumPy allows adding a row vector or a column vector to a matrix: this is called broadcasting and is explained in further details in the NumPy tutorial. We could have obtained the same result as above with:
array([[ 2.5,  3.5,  0.5,  4.1],
Scalar multiplication
Multiplying a matrix by a scalar results in all its vectors being multiplied by that scalar, so unsurprisingly, the geometric result is a rescaling of the entire figure. For example, let's rescale our polygon by a factor of 60% (zooming out, centered on the origin):
Matrix multiplication – Projection onto an axis
Matrix multiplication is more complex to visualize, but it is also the most powerful tool in the box.
Let's start simple, by defining a $1 \times 2$ matrix $U = \begin{bmatrix} 1 & 0 \end{bmatrix}$: this matrix represents the unit vector pointing along the horizontal axis.
Now let's look at the dot product $U \cdot P$:
array([[ 3. ,  4. ,  1. ,  4.6]])
These are the horizontal coordinates of the vectors in $P$: in other words, we just projected $P$ onto the horizontal axis.
We can actually project onto any other axis by just replacing $U$ with the unit vector pointing along that axis.
Good! Remember that the dot product of a unit vector and a matrix basically performs a projection on an axis and gives us the coordinates of the resulting points on that axis.
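The (truncated) output below suggests a matrix whose rows are unit vectors at 30° and 120°. A sketch of how such a matrix could be built (the name `V` is an assumption):

```python
angle30 = 30 * np.pi / 180     # 30° in radians
angle120 = 120 * np.pi / 180   # 120° in radians
V = np.array([
    [np.cos(angle30),  np.sin(angle30)],    # unit vector at 30°
    [np.cos(angle120), np.sin(angle120)]    # unit vector at 120°
])
V
```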
array([[ 0.8660254,  0.5      ],
Let's look at the product of this matrix and $P$:
array([[ 2.69807621,  5.21410162,  1.8660254 ,  4.23371686],
The first row is equal to the coordinates of the projection of $P$ onto the 30° axis, and the second row is the projection of $P$ onto the perpendicular axis at 120°.
Matrix $V$ therefore performs a rotation: it gives the coordinates of the points of $P$ in a coordinate system rotated by 30°.
Matrix multiplication – Other linear transformations
More generally, any linear transformation that maps n-dimensional vectors to m-dimensional vectors can be represented as an $m \times n$ matrix: applying the transformation to a vector $\textbf{u}$ amounts to computing the dot product of that matrix with $\textbf{u}$, and if we store several vectors as the columns of a matrix, the dot product applies the transformation to all of these vectors at once.
To summarize, the matrix on the left hand side of a dot product specifies what linear transformation to apply to the right hand side vectors. We have already shown that this can be used to perform projections and rotations, but any other linear transformation is possible. For example, here is a transformation known as a shear mapping:
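A typical 2D shear matrix looks like this (the name F_shear matches later references in this notebook, but the shear factor 1.5 is an assumption):

```python
F_shear = np.array([
    [1, 1.5],
    [0, 1]
])
```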
Let's look at how this transformation affects the unit square:
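A sketch, defining the unit square as four 2D column vectors and applying the shear matrix to all of them at once:

```python
Square = np.array([
    [0, 0, 1, 1],   # x coordinates of the 4 corners
    [0, 1, 1, 0]    # y coordinates of the 4 corners
])
F_shear.dot(Square)   # the sheared square
```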
Now let's look at a squeeze mapping:
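A squeeze mapping stretches one axis and shrinks the other by the same factor; for example, with a factor of 1.4 (consistent with the eigenvalues discussed later):

```python
F_squeeze = np.array([
    [1.4, 0],
    [0, 1 / 1.4]
])
```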
The effect on the unit square is:
Let's show a last one: reflection through the horizontal axis:
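Reflection through the horizontal axis simply negates the y coordinates (the name F_reflect matches the determinant example later on):

```python
F_reflect = np.array([
    [1,  0],
    [0, -1]
])
```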
Matrix inverse
Now that we understand that a matrix can represent any linear transformation, a natural question is: can we find a transformation matrix that reverses the effect of a given transformation matrix $F$? The answer is yes (provided the transformation loses no information): that matrix is called the inverse of $F$, noted $F^{-1}$.
For example, the rotation, the shear mapping and the squeeze mapping above all have inverse transformations. Let's demonstrate this on the shear mapping:
We applied a shear mapping, then applied a second transformation to the result and got the original figure back: that second matrix is the inverse of the shear matrix.
We defined the inverse matrix by hand this time, but NumPy also provides an inv function (in the linalg module) to compute a matrix's inverse, so we could have written instead:
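For example:

```python
F_inv_shear = LA.inv(F_shear)   # numerical inverse of the shear matrix
F_inv_shear
```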
Only square matrices can be inverted. This makes sense when you think about it: if you have a transformation that reduces the number of dimensions, then some information is lost and there is no way that you can get it back. For example, say you use a $2 \times 3$ matrix to project a 3D object onto a plane:
Looking at this image, it is impossible to tell whether this is the projection of a cube or the projection of a narrow rectangular object. Some information has been lost in the projection.
Even square transformation matrices can lose information. For example, consider this transformation matrix:
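A sketch of such a matrix (a projection onto the horizontal axis: it keeps x and zeroes out y; the name F_project is an assumption):

```python
F_project = np.array([
    [1, 0],
    [0, 0]
])
F_project.dot(Square)   # all points end up on the horizontal axis
```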
This transformation matrix performs a projection onto the horizontal axis. Our polygon gets entirely flattened out, so some information is entirely lost and it is impossible to go back to the original polygon using a linear transformation. In other words, this matrix has no inverse: such a square matrix is called a singular matrix.
Here is another example of a singular matrix. This one performs a projection onto the axis at a 30° angle above the horizontal axis:
But this time, due to floating point rounding errors, NumPy manages to calculate an inverse (notice how large the elements are, though):
As you might expect, the dot product of a matrix by its inverse results in the identity matrix: $M \cdot M^{-1} = M^{-1} \cdot M = I$
This makes sense since doing a linear transformation followed by the inverse transformation results in no change at all.
Another way to express this is that the inverse of the inverse of a matrix is the matrix itself: $((M)^{-1})^{-1} = M$
Also, the inverse of scaling a matrix by a factor of $\lambda$ is scaling by a factor of $\dfrac{1}{\lambda}$: $(\lambda \times M)^{-1} = \dfrac{1}{\lambda} \times M^{-1}$
Once you understand the geometric interpretation of matrices as linear transformations, most of these properties seem fairly intuitive.
A matrix that is its own inverse is called an involution. The simplest examples are reflection matrices, or a rotation by 180°, but there are also more complex involutions, for example imagine a transformation that squeezes horizontally, then reflects over the vertical axis and finally rotates by 90° clockwise. Pick up a napkin and try doing that twice: you will end up in the original position. Here is the corresponding involutory matrix:
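One possible involutory matrix of that kind (the exact values are an assumption); note that multiplying it by itself gives the identity matrix:

```python
F_involution = np.array([
    [0,    -2],
    [-1/2,  0]
])
F_involution.dot(F_involution)   # -> the 2x2 identity matrix
```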
Finally, a square matrix $H$ whose inverse is its own transpose is an orthogonal matrix: $H^{-1} = H^T$. Therefore: $H \cdot H^T = H^T \cdot H = I$
It corresponds to a transformation that preserves distances, such as rotations and reflections, and combinations of these, but not rescaling, shearing or squeezing. Let's check that the reflection matrix we defined above is indeed orthogonal:
Determinant
The determinant of a square matrix $M$, noted $\det(M)$ or $|M|$, is a value that can be computed recursively, for example by expanding along the first column:

$\det(M) = M_{1,1} \times \det(M^{(1,1)}) - M_{2,1} \times \det(M^{(2,1)}) + M_{3,1} \times \det(M^{(3,1)}) - \dots$

- Where $M^{(i,j)}$ is the matrix $M$ without row $i$ and column $j$.
For example, let's calculate the determinant of the following $3 \times 3$ matrix:
Using the method above, we get:
Now we need to compute the determinant of each of these smaller $2 \times 2$ matrices:
Now we can calculate the final result:
To get the determinant of a matrix, you can call NumPy's det function in the numpy.linalg module:
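For example, with an arbitrary 3×3 matrix (not necessarily the one used above):

```python
M = np.array([
    [1, 2, 3],
    [4, 5, 6],
    [7, 8, 0]
])
LA.det(M)   # 27.0
```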
One of the main uses of the determinant is to determine whether a square matrix can be inverted or not: if the determinant is equal to 0, then the matrix cannot be inverted (it is a singular matrix), and if the determinant is not 0, then it can be inverted.
For example, let's compute the determinants of the projection matrices and the shear matrix we defined earlier:
That's right, the first projection matrix has a determinant of 0: it is singular, as we saw earlier.
This determinant is suspiciously close to 0: it really should be 0, but it's not due to tiny floating point errors. The matrix is actually singular.
Perfect! This matrix can be inverted as we saw earlier. Wow, math really works!
The determinant can also be used to measure how much a linear transformation affects surface areas: for example, the projection matrices above flatten everything onto a line, reducing surface areas to zero, which is why their determinant is 0.
We rescaled the polygon by a factor of 1/2 on both the vertical and horizontal axes, so the surface area of the resulting polygon is 1/4 of the original surface area, and indeed that is the determinant of the scaling matrix.
Correct!
The determinant can actually be negative, when the transformation results in a "flipped over" version of the original polygon (eg. a left hand glove becomes a right hand glove). For example, the determinant of the F_reflect matrix is -1 because the surface area is preserved but the polygon gets flipped over:
Composing linear transformations
Several linear transformations can be chained simply by performing multiple dot products in a row. For example, to perform a squeeze mapping followed by a shear mapping, just write:
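A sketch using the matrices defined above (remember that the right-most matrix is applied first):

```python
F_shear.dot(F_squeeze.dot(Square))   # squeeze the unit square first, then shear it
```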
Since the dot product is associative, the following code is equivalent:
Note that the order of the transformations is the reverse of the dot product order.
If we are going to perform this composition of linear transformations more than once, we might as well save the composition matrix like this:
From now on we can perform both transformations in just one dot product, which can lead to a very significant performance boost.
What if you want to perform the inverse of this double transformation? Well, if you squeezed and then you sheared, and you want to undo what you have done, it should be obvious that you should unshear first and then unsqueeze. In more mathematical terms, given two invertible (aka nonsingular) matrices $Q$ and $R$: $(Q \cdot R)^{-1} = R^{-1} \cdot Q^{-1}$
And in NumPy:
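A sketch checking that the two expressions give the same matrix:

```python
left = LA.inv(F_shear.dot(F_squeeze))            # (Q·R)^-1
right = LA.inv(F_squeeze).dot(LA.inv(F_shear))   # R^-1 · Q^-1
np.allclose(left, right)                         # True
```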
Singular Value Decomposition
It turns out that any $m \times n$ matrix $M$ can be decomposed into the dot product of three simple matrices:

- a rotation matrix $U$ (an $m \times m$ orthogonal matrix)
- a scaling & projecting matrix $\Sigma$ (an $m \times n$ diagonal matrix)
- and another rotation matrix $V^T$ (an $n \times n$ orthogonal matrix)

$M = U \cdot \Sigma \cdot V^T$
For example, let's decompose the shear transformation:
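Using NumPy's svd function (a sketch; note that it returns Σ as a 1D array of singular values, as discussed next):

```python
U, S_diag, V_T = LA.svd(F_shear)   # S_diag holds the singular values
S_diag
```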
Note that this is just a 1D array containing the diagonal values of Σ. To get the actual matrix Σ, we can use NumPy's diag function:
Now let's check that $U \cdot \Sigma \cdot V^T$ is indeed equal to F_shear:
It worked like a charm. Let's apply these transformations one by one (in reverse order) on the unit square to understand what's going on. First, let's apply the first rotation $V^T$:
Now let's rescale along the vertical and horizontal axes using $\Sigma$:
Finally, we apply the second rotation $U$:
And we can see that the result is indeed a shear mapping of the original unit square.
Eigenvectors and eigenvalues
An eigenvector of a square matrix $M$ (also called a characteristic vector) is a non-zero vector whose direction is unchanged by the linear transformation associated with $M$: it only gets scaled. More formally, it is any vector $\textbf{v}$ such that:

$M \cdot \textbf{v} = \lambda \times \textbf{v}$

where $\lambda$ is a scalar value called the eigenvalue associated with the vector $\textbf{v}$.
For example, any horizontal vector remains horizontal after applying the shear mapping (as you can see on the image above), so it is an eigenvector of the shear mapping matrix.
If we look at the squeeze mapping, we find that any horizontal or vertical vector keeps its direction (although its length changes), so all horizontal and vertical vectors are eigenvectors of the squeeze mapping matrix.
However, rotation matrices have no (real) eigenvectors at all, except if the rotation angle is 0° or 180°, in which case all non-zero vectors are eigenvectors.
NumPy's eig function returns the list of unit eigenvectors and their corresponding eigenvalues for any square matrix. Let's look at the eigenvectors and eigenvalues of the squeeze mapping matrix:
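A sketch (eig returns the eigenvalues and a matrix whose columns are the corresponding unit eigenvectors):

```python
eigenvalues, eigenvectors = LA.eig(F_squeeze)
eigenvalues    # array([ 1.4       ,  0.71428571])
eigenvectors   # columns are the unit eigenvectors
```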
Indeed the horizontal vectors are stretched by a factor of 1.4, and the vertical vectors are shrunk by a factor of 1/1.4 = 0.714…, so far so good. Let's look at the shear mapping matrix:
Wait, what!? We expected just one unit eigenvector, not two. The second vector is almost exactly the opposite of the first one: it lies on the same horizontal line, and the tiny difference is just due to floating point errors, so there really is only one eigenvector direction.
Trace
The trace of a square matrix $M$, noted $tr(M)$, is the sum of the values on its main diagonal: $tr(M) = \sum_i{M_{i,i}}$
The trace does not have a simple geometric interpretation (in general), but it has a number of properties that make it useful in many areas:
- …
It does, however, have a useful geometric interpretation in the case of projection matrices (such as the ones we saw above): it corresponds to the number of dimensions of the space onto which we project.
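In NumPy, the trace can be computed with np.trace (or the trace method), for example:

```python
np.trace(F_project)   # 1: this projection maps the plane onto a 1-dimensional line
```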
What next?
This concludes this introduction to Linear Algebra. Although these basics cover most of what you will need to know for Machine Learning, if you wish to go deeper into this topic, there are many options available: Linear Algebra books, Khan Academy lessons, or just Wikipedia pages.