.. _sec_linear-algebra: Linear Algebra ============== Now that you can store and manipulate data, let us briefly review the subset of basic linear algebra that you will need to understand and implement most of models covered in this book. Below, we introduce the basic mathematical objects, arithmetic, and operations in linear algebra, expressing each of them through mathematical notation and the corresponding implementation in code. Scalars ------- If you never studied linear algebra or machine learning, then your past experience with math probably consisted of thinking about one number at a time. And, if you ever balanced a checkbook or even paid for dinner at a restaurant then you already know how to do basic things like adding and multiplying pairs of numbers. For example, the temperature in Palo Alto is :math:`52` degrees Fahrenheit. Formally, we call values consisting of just one numerical quantity *scalars*. If you wanted to convert this value to Celsius (the metric system's more sensible temperature scale), you would evaluate the expression :math:`c = \frac{5}{9}(f - 32)`, setting :math:`f` to :math:`52`. In this equation, each of the terms---\ :math:`5`, :math:`9`, and :math:`32`---are scalar values. The placeholders :math:`c` and :math:`f` are called *variables* and they represent unknown scalar values. In this book, we adopt the mathematical notation where scalar variables are denoted by ordinary lower-cased letters (e.g., :math:`x`, :math:`y`, and :math:`z`). We denote the space of all (continuous) *real-valued* scalars by :math:`\mathbb{R}`. For expedience, we will punt on rigorous definitions of what precisely *space* is, but just remember for now that the expression :math:`x \in \mathbb{R}` is a formal way to say that :math:`x` is a real-valued scalar. The symbol :math:`\in` can be pronounced "in" and simply denotes membership in a set. Analogously, we could write :math:`x, y \in \{0, 1\}` to state that :math:`x` and :math:`y` are numbers whose value can only be :math:`0` or :math:`1`. A scalar is represented by a tensor with just one element. In the next snippet, we instantiate two scalars and perform some familiar arithmetic operations with them, namely addition, multiplication, division, and exponentiation. .. raw:: html

.. raw:: html

.. raw:: latex \diilbookstyleinputcell .. code:: python from mxnet import np, npx npx.set_np() x = np.array(3.0) y = np.array(2.0) x + y, x * y, x / y, x ** y .. raw:: latex \diilbookstyleoutputcell .. parsed-literal:: :class: output (array(5.), array(6.), array(1.5), array(9.)) .. raw:: html

.. raw:: html

.. raw:: latex \diilbookstyleinputcell .. code:: python import torch x = torch.tensor(3.0) y = torch.tensor(2.0) x + y, x * y, x / y, x**y .. raw:: latex \diilbookstyleoutputcell .. parsed-literal:: :class: output (tensor(5.), tensor(6.), tensor(1.5000), tensor(9.)) .. raw:: html

.. raw:: html

.. raw:: latex \diilbookstyleinputcell .. code:: python import tensorflow as tf x = tf.constant(3.0) y = tf.constant(2.0) x + y, x * y, x / y, x**y .. raw:: latex \diilbookstyleoutputcell .. parsed-literal:: :class: output (, , , ) .. raw:: html

.. raw:: html

Vectors ------- You can think of a vector as simply a list of scalar values. We call these values the *elements* (*entries* or *components*) of the vector. When our vectors represent examples from our dataset, their values hold some real-world significance. For example, if we were training a model to predict the risk that a loan defaults, we might associate each applicant with a vector whose components correspond to their income, length of employment, number of previous defaults, and other factors. If we were studying the risk of heart attacks hospital patients potentially face, we might represent each patient by a vector whose components capture their most recent vital signs, cholesterol levels, minutes of exercise per day, etc. In math notation, we will usually denote vectors as bold-faced, lower-cased letters (e.g., :math:`\mathbf{x}`, :math:`\mathbf{y}`, and :math:`\mathbf{z})`. We work with vectors via one-dimensional tensors. In general tensors can have arbitrary lengths, subject to the memory limits of your machine. .. raw:: html