Coursera: Machine Learning (Week 3) [Assignment Solution] - Andrew NG

▸ Implement logistic regression and apply it to two different datasets.

I have recently completed the Machine Learning course from Coursera by Andrew NG.

While doing the course, we have to go through various quizzes and assignments.

Here, I am sharing my solutions for the weekly assignments throughout the course.

These solutions are for reference only.

It is recommended that you solve the assignments honestly by yourself; only then does completing the course make sense.
But in case you get stuck somewhere in between, feel free to refer to the solutions provided here.

NOTE:

Don't just copy-paste the code for the sake of completion.
Even if you copy the code, make sure you understand it first.

Click here to check out the week-2 assignment solutions. Scroll down for the week-3 assignment solutions.

In this exercise, you will implement logistic regression and apply it to two different datasets. Before starting on the programming exercise, we strongly recommend watching the video lectures and completing the review questions for the associated topics.


It consists of the following files:
  • ex2.m - Octave/MATLAB script that steps you through the exercise
  • ex2_reg.m - Octave/MATLAB script for the later parts of the exercise
  • ex2data1.txt - Training set for the first half of the exercise
  • ex2data2.txt - Training set for the second half of the exercise
  • submit.m - Submission script that sends your solutions to our servers
  • mapFeature.m - Function to generate polynomial features
  • plotDecisionBoundary.m - Function to plot classifier's decision boundary
  • [*] plotData.m - Function to plot 2D classification data
  • [*] sigmoid.m - Sigmoid Function
  • [*] costFunction.m - Logistic Regression Cost Function
  • [*] predict.m - Logistic Regression Prediction Function
  • [*] costFunctionReg.m - Regularized Logistic Regression Cost
* indicates files you will need to complete





plotData.m :

function plotData(X, y)
  %PLOTDATA Plots the data points X and y into a new figure 
  %   PLOTDATA(x,y) plots the data points with + for the positive examples
  %   and o for the negative examples. X is assumed to be a Mx2 matrix.
  
  % ====================== YOUR CODE HERE ======================
  % Instructions: Plot the positive and negative examples on a
  %               2D plot, using the option 'k+' for the positive
  %               examples and 'ko' for the negative examples.
  %
  
  % Separating positive and negative examples
  pos = find(y==1); %index of positive results
  neg = find(y==0); %index of negative results
  
  % Create New Figure
  figure;
  
  %Plotting Positive Results on 
  %    X_axis: Exam1 Score =  X(pos,1)
  %    Y_axis: Exam2 Score =  X(pos,2)
  plot(X(pos,1),X(pos,2),'g+');
  
  % Keep the above plot so the negative examples are drawn on the same figure.
  hold on;  
  
  %Plotting Negative Results on 
  %    X_axis: Exam1 Score =  X(neg,1)
  %    Y_axis: Exam2 Score =  X(neg,2)
  plot(X(neg,1),X(neg,2),'ro');
  
  % =========================================================================
  
  hold off;
end

sigmoid.m :

function g = sigmoid(z)
  %SIGMOID Compute sigmoid function
  %   g = SIGMOID(z) computes the sigmoid of z.
  
  % You need to return the following variables correctly 
  g = zeros(size(z));
  
  % ====================== YOUR CODE HERE ======================
  % Instructions: Compute the sigmoid of each value of z (z can be a matrix,
  %               vector or scalar).
  g = 1./(1+exp(-z));
  
  % =============================================================
end
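
A quick sanity check you can run at the Octave/MATLAB prompt (illustrative only, not part of the graded code):

% Sanity check for the vectorized sigmoid
sigmoid(0)              % expected: 0.5
sigmoid([-10 0 10])     % expected: values close to [0 0.5 1]
sigmoid(eye(2))         % works element-wise on matrices as well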





costFunction.m :

function [J, grad] = costFunction(theta, X, y)
  %COSTFUNCTION Compute cost and gradient for logistic regression
  %   J = COSTFUNCTION(theta, X, y) computes the cost of using theta as the
  %   parameter for logistic regression and the gradient of the cost
  %   w.r.t. to the parameters.
  
  % Initialize some useful values
  m = length(y); % number of training examples
  
  % You need to return the following variables correctly 
  J = 0;
  grad = zeros(size(theta));
  
  % ====================== YOUR CODE HERE ======================
  % Instructions: Compute the cost of a particular choice of theta.
  %               You should set J to the cost.
  %               Compute the partial derivatives and set grad to the partial
  %               derivatives of the cost w.r.t. each parameter in theta
  %
  % Note: grad should have the same dimensions as theta
  %
  %DIMENSIONS: 
  %   theta = (n+1) x 1
  %   X     = m x (n+1)
  %   y     = m x 1
  %   grad  = (n+1) x 1
  %   J     = Scalar
  
  z = X * theta;      % m x 1
  h_x = sigmoid(z);   % m x 1 
  
  J = (1/m)*sum((-y.*log(h_x))-((1-y).*log(1-h_x))); % scalar
  
  grad = (1/m)* (X'*(h_x-y));     % (n+1) x 1
  
  % =============================================================
  
end
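
For context, the provided (already complete) ex2.m script minimizes this cost with Octave/MATLAB's built-in fminunc. The sketch below mirrors how the exercise script calls it and is shown for reference only; nothing here needs to be submitted:

% How ex2.m uses costFunction with fminunc (reference sketch)
options = optimset('GradObj', 'on', 'MaxIter', 400);    % we supply the gradient ourselves
initial_theta = zeros(size(X, 2), 1);                   % (n+1) x 1 vector of zeros
[theta, cost] = fminunc(@(t)(costFunction(t, X, y)), initial_theta, options);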




predict.m :

function p = predict(theta, X)
  %PREDICT Predict whether the label is 0 or 1 using learned logistic 
  %regression parameters theta
  %   p = PREDICT(theta, X) computes the predictions for X using a 
  %   threshold at 0.5 (i.e., if sigmoid(theta'*x) >= 0.5, predict 1)
  
  m = size(X, 1); % Number of training examples
  
  % You need to return the following variables correctly
  p = zeros(m, 1);
  
  % ====================== YOUR CODE HERE ======================
  % Instructions: Complete the following code to make predictions using
  %               your learned logistic regression parameters. 
  %               You should set p to a vector of 0's and 1's
  %
  % DIMENSIONS:
  %   X     = m x (n+1)
  %   theta = (n+1) x 1
  
  h_x = sigmoid(X*theta);   % m x 1
  p = (h_x >= 0.5);         % vector of 0's and 1's
  
  % Alternative one-liner:
  % p = double(sigmoid(X * theta) >= 0.5);
  % =========================================================================
end
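
Once predict.m is complete, ex2.m reports the training accuracy by comparing the predictions with the labels. A minimal sketch of that check (for reference; the exercise script already does this) is:

% Training-set accuracy from the predictions
p = predict(theta, X);                                  % m x 1 vector of 0's and 1's
fprintf('Train Accuracy: %f\n', mean(double(p == y)) * 100);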





costFunctionReg.m :

function [J, grad] = costFunctionReg(theta, X, y, lambda)
  %COSTFUNCTIONREG Compute cost and gradient for logistic regression with regularization
  %   J = COSTFUNCTIONREG(theta, X, y, lambda) computes the cost of using
  %   theta as the parameter for regularized logistic regression and the
  %   gradient of the cost w.r.t. to the parameters. 
  
  % Initialize some useful values
  m = length(y); % number of training examples
  
  % You need to return the following variables correctly 
  J = 0;
  grad = zeros(size(theta));
  
  % ====================== YOUR CODE HERE ======================
  % Instructions: Compute the cost of a particular choice of theta.
  %               You should set J to the cost.
  %               Compute the partial derivatives and set grad to the partial
  %               derivatives of the cost w.r.t. each parameter in theta
  
  %DIMENSIONS: 
  %   theta = (n+1) x 1
  %   X     = m x (n+1)
  %   y     = m x 1
  %   grad  = (n+1) x 1
  %   J     = Scalar
  
  z = X * theta;      % m x 1
  h_x = sigmoid(z);  % m x 1 
  
  reg_term = (lambda/(2*m)) * sum(theta(2:end).^2);
  
  J = (1/m)*sum((-y.*log(h_x))-((1-y).*log(1-h_x))) + reg_term; % scalar
  
  grad(1) = (1/m)* (X(:,1)'*(h_x-y));                                  % 1 x 1
  grad(2:end) = (1/m)* (X(:,2:end)'*(h_x-y))+(lambda/m)*theta(2:end);  % n x 1
  
  % =============================================================
end
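
If you prefer a single vectorized line for the regularized gradient instead of the grad(1) / grad(2:end) split above, an equivalent alternative (a sketch, not the graded solution) is to zero out the first entry of theta in the penalty:

% Alternative, equivalent regularized gradient: theta_0 is excluded from the penalty
theta_reg = [0; theta(2:end)];                          % copy of theta with theta_0 set to 0
grad = (1/m) * (X' * (h_x - y)) + (lambda/m) * theta_reg;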

I tried to provide optimized solutions, like vectorized implementations, for each assignment. If you think more optimization can be done, feel free to suggest corrections/improvements.

--------------------------------------------------------------------------------
Click here to see solutions for all Machine Learning Coursera Assignments.
&
Click here to see more codes for Raspberry Pi 3 and similar Family.
&
Click here to see more codes for NodeMCU ESP8266 and similar Family.
&
Click here to see more codes for Arduino Mega (ATMega 2560) and similar Family.

Feel free to ask your doubts in the comment section. I will try my best to resolve them.
If you find this helpful by any means, then like, comment, and share the post.
This is the simplest way to encourage me to keep doing such work.

Thanks and Regards,
-Akshay P. Daga





62 Comments

  1. How could you do this? Please explain it to me...

    Replies
    1. What explanation do you want?
      Please be more specific.

  2. How can I download these files?

    Replies
    1. You can copy the code from the code sections above.

  3. Hi Akshay, please may I have these files as well:

    ex2.m
    ex2 reg.m
    ex2data1.txt
    ex2data2.txt
    submit.m
    mapFeature.m
    plotDecisionBoundary.m




    Replies
    1. You can get those files from Coursera assignments. I don't have those with me now.

  4. can you please tell me what you did by this

    grad = (1/m)* (X'*(h_x-y));

    Replies
    1. It's the simplified (vectorized) form of the derivative term ∂J(θ)/∂θ_j, which we call the gradient. Check the formula once and you will understand it.

  5. This means: take the transpose of the feature matrix X (i.e. X') and multiply it by the difference of the matrices h_x and y, i.e. the vector of sigmoid outputs and the label vector y. Finally, multiply the result by 1/m, where m is the number of training examples.

    This is the vectorized implementation, which would be much lengthier to implement using loops.

  6. Hi, can you please explain the predict function?

  7. In this gradient descent the number of iterations is not specified, so how is the gradient descent working? Can someone please explain?

  8. I used the exact code at the end but I'm still getting 65/100 not able to figure out the reason

  9. Hi!! Why didn't you use the sum() function for grad, even though the formula contains it?

    Replies
    1. sum() corresponds to the summation in the formula.
      But here, while coding the grad computation, grad = (1/m)* (X'*(h_x-y));
      we are doing a matrix multiplication, which itself consists of a "sum of products". So there is no need for an external sum() function; see the small sketch below.
      Please try to do it on paper by yourself and you will get a clear idea.
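      
      For example, this illustrative loop version (just a sketch) computes the same grad as the vectorized line, because the matrix product already performs the sum over the m training examples:
      
      % Illustrative loop version; same result as grad = (1/m)*(X'*(h_x-y))
      grad_loop = zeros(size(theta));
      for j = 1:length(theta)
        for i = 1:m
          grad_loop(j) = grad_loop(j) + (h_x(i) - y(i)) * X(i,j);
        end
      end
      grad_loop = grad_loop / m;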
      Thanks

  10. We have learned that z = theta transpose x, so why are we using Z = X multiplied by theta in the above code?

    Replies
    1. When we calculate z (small z) for a single sample, it is z = theta' * x (small x).
      But when you do the same computation for all the samples at once, we call it Z (capital Z):
      Z = X * theta (capital X).
      
      Try to do it with pen and paper and you will get a clear understanding; a tiny numeric check is sketched below.
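      
      % Tiny numeric check (illustrative values only):
      % the per-sample theta'*x values are exactly the entries of X*theta
      theta = [1; 2; 3];
      X = [1 0.5 0.2;       % each row is one example; the first column is the intercept term 1
           1 1.0 0.4];
      X * theta             % gives [2.6; 4.2]
      theta' * X(1,:)'      % gives 2.6, the first entry of X*theta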

  11. Hi, thanks for your help Mr. Akshay. I had one doubt about the predict.m function:
    I tried coding for predict function in the following way:

    h_x = sigmoid(X*theta);
    if (0<=h_x<0.5)
    p=0;
    elseif (0.5<=h_x<=1)
    p=1;
    endif

    I know I did it in a long way, but the accuracy I am getting is 60.00 while your code gave me an accuracy of 89.00. Can you please help me understand what's wrong with this and what the exact difference is between your code and mine?

    Replies
    1. p is a vector with dimensions m x 1.
      Solution:
      You can put your code in a "for" loop, check the value of each element of h_x, and set the corresponding element of p accordingly; a minimal sketch of that fix is shown below.
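      
      % Loop-based predict (sketch); the vectorized p = (h_x >= 0.5) does the same thing
      h_x = sigmoid(X * theta);
      p = zeros(size(h_x));
      for i = 1:length(h_x)
        if h_x(i) >= 0.5
          p(i) = 1;
        else
          p(i) = 0;
        end
      end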

      It will work.

  12. hey bro it says z not defined why???

    Replies
    1. Hi, I think you are doing this assignment in Octave and that's why you are facing this issue.

      Chethan Bhandarkar has provided a solution for it. Please check it out: https://www.apdaga.com/2018/06/coursera-machine-learning-week-2.html?showComment=1563986935868#c4682866656714070064

      Thanks

  13. I have copied the exact code for plotData.m, and all the other programs worked very well, but I am still getting 70/100. Can you tell me what the problem is?

  14. Can you tell me how I can run the "ex2" script in the console?

  15. Hi, I want to clarify a few things with you.
    I have read that in regression there are a few important points which have not been covered in Andrew Ng's regression topic: how to find how significant your variable is, and the significance of the p-value and R^2 (R-squared) values. I would like to know more about them. Kindly share some sources.

  16. Hi, regarding the line reg_term = (lambda/(2*m)) * sum(theta(2:end).^2); in the costFunctionReg function,
    
    can you explain more about the part theta(2:end)? What does it mean and how did you deduce it?

  17. sir,please explain me predict.m function
    I used
    for i=1:size(X,1)
    if sigmoid(X*theta)>=0.5
    p=sigmoid(X*theta);

    end
    as well as,
    h_x = sigmoid(X*theta);
    for i=1:size(X,1)

    if (0<=h_x<0.5)
    p=0;
    elseif (0.5<=h_x<=1)
    p=1;
    end
    but i am getting 40 accuracy it is working only with your code.why sir?

  18. Hi there,
    I am trying the the same code as yours of sigmoid() function but each time it is getting an error saying that

    'z' undefined near line 6 column 18
    error: called from
    sigmoid at line 6 column 5

    what to do please help me out..

  19. Hello Akshay,
    It'd be great if you could kindly share the code for "fminunc" in this week's files (wherever needed), because I don't understand that particular function well, and I couldn't find its solution anywhere else on the internet.

    Replies
    1. Hi Ankit,
      Sorry but I don't have the code for "fminunc".

  20. grad(2:end) = (1/m)* (X(:,2:end)'*(h_x-y))+(lambda/m)*theta(2:end); can u please explain this..

  21. Hey it says my plot is empty can someone help?

  22. I am facing this type of problem in MATLAB. What can I do? How do I fix it, and where?



    'fminunc' requires Optimization Toolbox.

    Error in ex2 (line 99)
    fminunc(@(t)(costFunction(t, X, y)), initial_theta, options);

  23. In sigmoid
    error in line 6 (the preallocated value assigned to variable 'g' might be unused)

    what should i do

    Replies
    1. How is the value of 'g' unused? 'g' is nothing but the output of the sigmoid function.
      If you are getting some message, it must be a warning, not an error. So don't worry about it and keep the code as it is. (But I don't think you should get any warning like this.)
      Line 6 is just the initialization of the variable.

  24. Hi Akshay can you please explain why we use this X(:,2:end) and theta(2:end) instead of plain X and theta??

    Replies
    1. It's because, as per the theory in the videos, we don't apply regularization to theta_0. Regularization is applied from theta_1 onwards,
      and that's why there are 2 gradient expressions: the 1st corresponds to theta_0 and the other to theta_1 onwards.

  25. And also, why use two gradients?

  26. Good day sir,
    I'm new to this course... I could not fully understand the assignment in week 3... As I enter my code... I think it's still in error.

  27. please explain the predict function

    Replies
    1. The predict function is fairly simple. You have implemented your gradient, and now you just have to predict whether the answer will be 1 or 0... So, what you will do is check the result against 0.5: if it is at or above 0.5, then the prediction will be true (1), otherwise false (0).

    2. @Hassan Ashas Thank you very much for your explanation.

  28. costFunction is not returning a scalar value; it is returning a 1x100 matrix.

  29. Hello Akshay,
    I keep getting this error for the costFunctionReg.m file:

    syntax error
    >>> reg_term = (lambda/2*m)) * sum(theta(2:end).^2);
    ^
    What is the problem here I do not understand.

    Thank you

    Replies
    1. Opening and closing brackets are not matching in your code.
      
      NOTE: check the brackets around "2*m".
      
      YOUR CODE:    reg_term = (lambda/2*m)) * sum(theta(2:end).^2);
      WORKING CODE: reg_term = (lambda/(2*m)) * sum(theta(2:end).^2);

  30. Hello Akshay,
    While computing cost function I am getting so many outputs

    Replies
    1. You should only get [J, grad] as the output of costFunction & costFunctionReg.

  31. Error - theta may not be defined , predict function

  32. Hi, I have a doubt. I took theta as [zeros(n+1),1]; it is giving me 0 and I can't submit the assignment. Can you specify the initial value of theta and the values of X? I am totally confused.

  33. Nothing is working here.
    Every time it shows:
    >> plotData

    error: 'y' undefined near line 14 column 12
    error: called from
    plotData at line 14 column 5
    >>

  34. Regarding J = (1/m)*sum((-y.*log(h_x))-((1-y).*log(1-h_x))): the log in this equation means ln, doesn't it? So shouldn't we write it as log(1-h_x) / log(10)?

  35. I made it this way:

    function [J, grad] = costFunctionReg(theta, X, y, lambda)
    %COSTFUNCTIONREG Compute cost and gradient for logistic regression with regularization
    % J = COSTFUNCTIONREG(theta, X, y, lambda) computes the cost of using
    % theta as the parameter for regularized logistic regression and the
    % gradient of the cost w.r.t. to the parameters.

    % Initialize some useful values
    m = length(y); % number of training examples

    % You need to return the following variables correctly
    J = 0;
    grad = zeros(size(theta));

    % ====================== YOUR CODE HERE ======================
    % Instructions: Compute the cost of a particular choice of theta.
    % You should set J to the cost.
    % Compute the partial derivatives and set grad to the partial
    % derivatives of the cost w.r.t. each parameter in theta

    [J, grad] = costFunction(theta, X, y);
    feats = theta(2:end);
    J = J + lambda / (2 * m) * (feats' * feats);
    grad(2:end) = grad(2:end) + lambda / m * feats;



    % =============================================================

    end

  36. My question is about the provided subroutine 'plotDecisionBoundary.m',
    Line 20: plot_y
    I didn't understand the definition of this.
    In fact, how does this particular code help to plot the decision boundary? Please explain.

  37. So in costFunction, grad is basically you doing gradient descent, right? But what is the use of 1/m? I'm really confused, sorry.

    Replies
    1. While calculating the cost function, we perform a sum (summation) over the 'm' samples and then divide by 'm' in order to scale the output, i.e. average it over the training examples.

  38. I am getting only 55 marks even though I have written the code correctly... I am getting 0 marks in both the logistic regression cost and the regularized logistic regression gradient.

  39. I'm really confused by this assignment. I enjoyed all the stuff that Prof. Ng is doing, but why does it turn into a nightmare when I face the programming assignment?
    In costFunctionReg.m, why did you put
    
    grad(1) = (1/m)* (X(:,1)'*(h_x-y)); What does this mean?
    grad(2:end) = (1/m)* (X(:,2:end)'*(h_x-y))+(lambda/m)*theta(2:end); What does grad(2:end) mean?

    Replies
    1. These 2 lines are for calculating the gradient with regularization.
      Since we don't add the regularization term to the 1st entry, we have to write 2 separate lines of code for it.

  40. Hi dear Akshay. I'm trying to submit week 3 assignment but I keep seeing the error:
    !! Submission failed: unexpected error: Error: File: costFunctionReg.m Line: 22 Column: 3
    Invalid expression. Check for missing or extra characters.
    Can you help me out?

  41. I am getting a syntax error in the exercise file "costFunctionReg.m" at
    grad(1) = (1/m)* (X(:,1)'*(h_x-y));
    Please tell me why I am getting this error.
    Yes, I am running it in Octave, but please don't tell me to go through another link. Please just tell me the issue.

  42. !! Submission failed: Index exceeds array bounds.


    Function: getResponse

    LineNumber: 132 of submitWithConfiguration

  43. Here in the cost function, though y and log(h_x) both have the same dimensions (m x 1), how is the dot product possible between them?

    Replies
    1. We are not taking the dot product of y and log(h_x) while calculating the cost function. Multiplication written with dot-asterisk (.*) means element-wise multiplication in MATLAB.
      E.g. -y.*log(h_x)
      Please check the code once again; a small illustration is sketched below.
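      
      % Small illustration (made-up vectors): element-wise (.*) vs matrix (*) multiplication
      a = [1; 2; 3];
      b = [4; 5; 6];
      a .* b        % element-wise: [4; 10; 18]
      a' * b        % inner product (a scalar): 32
      % a * b       % would error: inner matrix dimensions must agree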
