This week, you’ll extend linear regression to handle multiple input features. You’ll also learn some methods for improving your model’s training and performance, such as vectorization, feature scaling, feature engineering and polynomial regression. At the end of the week, you’ll get to practice implementing linear regression in code.
C1_W2_M1 Multiple Linear Regression
C1_W2_M1_1 Multiple features
- $\vec{x}^{(i)}$ = vector of the 4 features for the $i^{th}$ row
- e.g. $\vec{x}^{(2)} = [1416 \;\; 3 \;\; 2 \;\; 40]$
- In this example (price in $1000s), the house price changes by:
  - 0.1 per square foot
  - 4 per bedroom
  - 10 per floor
  - -2 per year of age
  - plus a base price of 80
- i.e. $f_{\vec{w},b}(\vec{x}) = 0.1 x_1 + 4 x_2 + 10 x_3 + (-2) x_4 + 80$
- We can simplify the model by writing it as a dot product: $f_{\vec{w},b}(\vec{x}) = \vec{w} \cdot \vec{x} + b$
- From linear algebra, $\vec{w}$ here is a row vector as opposed to a column vector
- This is multiple linear regression
- Not multivariate regression (that term refers to something else)
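A minimal sketch of the model above using the example coefficients; the variable names are my own, not from the course:

```python
import numpy as np

# Example parameters from the notes above (price in $1000s)
w = np.array([0.1, 4.0, 10.0, -2.0])  # per sq ft, per bedroom, per floor, per year of age
b = 80.0                              # base price

def predict(x, w, b):
    """Multiple linear regression prediction: f(x) = w . x + b."""
    return np.dot(w, x) + b

x = np.array([1416, 3, 2, 40])        # size, bedrooms, floors, age
print(predict(x, w, b))               # predicted price in $1000s
```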
Quiz
In the training set below (see slide: C1W2_M1_1 Multiple features), what is $x_{1}^{(4)}$?
Ans
852
C1_W2_M1_2 Vectorization part 1
Learning to write vectorized code allows you to take advantage of modern numerical linear algebra libraries, as well as GPU hardware.
- A vector can be represented in Python as `np.array([1.0, 2.5, -3.3])`
- If $n$ is large, writing out every term $w_1 x_1 + w_2 x_2 + \dots + w_n x_n$ by hand is inefficient
- A for loop is more concise, but still not efficient
- `np.dot(w, x) + b` is the most efficient, using vectorization (see the comparison sketch below)
- Vectorization has 2 benefits: the code is more concise and it runs more efficiently
- `np.dot` can use parallel hardware
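A minimal sketch contrasting the three approaches described above (explicit sum, for loop, `np.dot`); the variable names and values are illustrative, not from the lab:

```python
import numpy as np

n = 4
w = np.array([0.1, 4.0, 10.0, -2.0])
x = np.array([1416.0, 3.0, 2.0, 40.0])
b = 80.0

# 1) Writing out every term by hand -- impractical when n is large
f = w[0]*x[0] + w[1]*x[1] + w[2]*x[2] + w[3]*x[3] + b

# 2) A for loop -- more concise, but still runs sequentially
f_loop = b
for j in range(n):
    f_loop += w[j] * x[j]

# 3) Vectorized -- concise, and lets NumPy use optimized / parallel hardware
f_vec = np.dot(w, x) + b

print(f, f_loop, f_vec)  # all three give the same prediction
```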
C1_W2_M1_3 Vectorization part 2
How does a vectorized algorithm work?
- Without vectorization, the calculations run sequentially, one index at a time
- `np.dot` works in multiple steps:
  - get the values of the vectors `w` and `x`
  - in parallel, multiply each pair `w[i] * x[i]`, then add up the products
- (A rough timing comparison is sketched after this list)
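A rough timing sketch (my own, not from the lab) comparing a Python loop with `np.dot` on a large vector; exact numbers will vary by machine:

```python
import time
import numpy as np

# Two large random vectors (size chosen arbitrarily for illustration)
rng = np.random.default_rng(seed=0)
w = rng.random(10_000_000)
x = rng.random(10_000_000)

# Sequential dot product with an explicit loop
start = time.time()
total = 0.0
for i in range(len(w)):
    total += w[i] * x[i]
loop_time = time.time() - start

# Vectorized dot product
start = time.time()
total_vec = np.dot(w, x)
vec_time = time.time() - start

print(f"loop: {loop_time:.3f}s, np.dot: {vec_time:.3f}s")
```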
C1_W2_Lab01: Python Numpy Vectorization
- Coursera
- Local
- $a \cdot b$ returns a scalar
- e.g. $[1, 2, 3, 4] \cdot [-1, 4, 3, 2] = 24$
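A quick check of the dot-product example above (my own snippet, not part of the lab):

```python
import numpy as np

a = np.array([1, 2, 3, 4])
b = np.array([-1, 4, 3, 2])
print(np.dot(a, b))  # 24, a single scalar: (1)(-1) + (2)(4) + (3)(3) + (4)(2)
```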
C1_W2_M1_4 Gradient descent for multiple linear regression
- $\vec{w}$ and $\vec{x}$ are now vectors
- We have to update all the parameters simultaneously, $w_1, \dots, w_n$ as well as $b$ (a vectorized sketch follows):
  - $w_j := w_j - \alpha \frac{1}{m} \sum_{i=1}^{m} \left( f_{\vec{w},b}(\vec{x}^{(i)}) - y^{(i)} \right) x_j^{(i)}$ for $j = 1, \dots, n$
  - $b := b - \alpha \frac{1}{m} \sum_{i=1}^{m} \left( f_{\vec{w},b}(\vec{x}^{(i)}) - y^{(i)} \right)$
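A minimal sketch (my own, with assumed variable names) of one vectorized gradient-descent step implementing the simultaneous update above:

```python
import numpy as np

def gradient_descent_step(X, y, w, b, alpha):
    """One simultaneous update of w (length-n vector) and b.

    X: (m, n) matrix of training examples, y: (m,) targets.
    """
    m = X.shape[0]
    err = X @ w + b - y          # prediction error for every example, shape (m,)
    dj_dw = (X.T @ err) / m      # gradient w.r.t. each w_j, shape (n,)
    dj_db = err.sum() / m        # gradient w.r.t. b
    # Update all parameters simultaneously
    w = w - alpha * dj_dw
    b = b - alpha * dj_db
    return w, b

# Example usage with the 4-example training set from the notes
# (feature scaling would let us use a much larger learning rate):
X = np.array([[2104, 5, 1, 45],
              [1416, 3, 2, 40],
              [1534, 3, 2, 30],
              [ 852, 2, 1, 36]], dtype=float)
y = np.array([460, 232, 315, 178], dtype=float)
w, b = np.zeros(4), 0.0
for _ in range(1000):
    w, b = gradient_descent_step(X, y, w, b, alpha=1e-7)
```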
C1_W2_Lab02: Multiple linear regression
- [Optional Lab: Multiple linear regression | Coursera](https://www.coursera.org/learn/machine-learning/ungradedLab/7GEJh/optional-lab-multiple-linear-regression/lab)
- Local
Quiz: Multiple linear regression
- In the training set below, what is $x_4^{(3)}$?
| Size (sq ft) | Rooms | Floors | Age (years) | Price ($1000s) |
|---|---|---|---|---|
| 2104 | 5 | 1 | 45 | 460 |
| 1416 | 3 | 2 | 40 | 232 |
| 1534 | 3 | 2 | 30 | 315 |
| 852 | 2 | 1 | 36 | 178 |
- Which of the following are potential benefits of vectorization?
- True/False: to make gradient descent converge about twice as fast, a technique that almost always works is to double the learning rate $\alpha$
Ans
30, 4, F
C1_W2_M2 Gradient Descent in Practice
C1_W2_M2_01 Feature scaling part 1
- Use Feature Scaling to enable gradient descent to run faster
- When we scatterplot size vs. bedrooms, we see that size has a much larger range of values than bedrooms
- When we contour-plot the cost, we see an oval: a small change in $w_{size}$ causes a large change in the cost, while even a large change in $w_{bedrooms}$ causes only a small change
- Since the contours are tall & skinny, gradient descent may end up bouncing back and forth for a long time before reaching the minimum
- A technique to fix this is to scale the features so the contour plot becomes more circular
We can speed up gradient descent by scaling our features
C1_W2_M2_02 Feature scaling part 2
- One option: scale by dividing by the maximum, $x_j := x_j / \max(x_j)$
- Z-score normalization: subtract the mean and divide by the standard deviation, $x_j := (x_j - \mu_j) / \sigma_j$ (based on the normal / Gaussian distribution)
- When feature scaling we want each feature to range somewhere around $-1 \leq x_j \leq 1$
- But the range is OK if it's relatively close to that
- Rescale if the range is too large or too small
- (A z-score normalization sketch follows this list)
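A minimal sketch (my own helper, not the lab's) of z-score normalization applied column-by-column to a feature matrix:

```python
import numpy as np

def zscore_normalize(X):
    """Return X with each column (feature) scaled to mean 0 and std 1."""
    mu = X.mean(axis=0)       # per-feature mean, shape (n,)
    sigma = X.std(axis=0)     # per-feature standard deviation, shape (n,)
    return (X - mu) / sigma, mu, sigma

# Example with the housing features from the notes
X = np.array([[2104, 5, 1, 45],
              [1416, 3, 2, 40],
              [1534, 3, 2, 30],
              [ 852, 2, 1, 36]], dtype=float)
X_norm, mu, sigma = zscore_normalize(X)
print(X_norm.min(), X_norm.max())  # every feature is now centered near 0, roughly within [-1.5, 1.7]
```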
Quiz:
Which of the following is a valid step used during feature scaling? (see bedrooms vs size scatterplot)
Ans
2
C1_W2_M2_03 Checking gradient descent for convergence
- Want to minimize cost function $\min\limits_{\vec{w}, b} J(\vec{w}, b)$
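A minimal sketch (my own, building on the hypothetical `gradient_descent_step` above) of one common way to check convergence: record $J(\vec{w}, b)$ every iteration and confirm it keeps decreasing:

```python
import numpy as np

def compute_cost(X, y, w, b):
    """Mean squared error cost J(w, b) = (1/2m) * sum((X w + b - y)^2)."""
    m = X.shape[0]
    err = X @ w + b - y
    return (err @ err) / (2 * m)

def converged(cost_history, epsilon=1e-3):
    """Declare convergence when the cost decreases by less than epsilon in one iteration."""
    return len(cost_history) >= 2 and (cost_history[-2] - cost_history[-1]) < epsilon

# Usage: inside the gradient-descent loop, append compute_cost(X, y, w, b) to a list each
# iteration, plot it against the iteration number (the learning curve), and stop once
# converged(...) returns True.  If the cost ever increases, the learning rate is likely too large.
```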
C1_W2_M2_04 Choosing the learning rate
C1_W2_M2_05 Optional Lab: Feature scaling and learning rate
C1_W2_M2_06 Feature engineering
C1_W2_M2_07 Polynomial regression
C1_W2_M2_08 Optional lab: Feature engineering and Polynomial regression
C1_W2_M2_09 Optional lab: Linear regression with scikit-learn
C1_W2_M2_10 Practice quiz: Gradient descent in practice
C1_W2_M2_11 Week 2 practice lab: Linear regression