Coursera: Machine Learning (Week 4) Quiz - Neural Networks: Representation | Andrew Ng

by Akshay Daga (APDaga) - November 13, 2019


▸ Neural Networks - Representation:

Which of the following statements are true? Check all that apply.
- Any logical function over binary-valued (0 or 1) inputs x1 and x2 can be (approximately) represented using some neural network.
- Suppose you have a multi-class classification problem with three classes, trained with a 3-layer network. Let $a^{(3)}_1 = (h_\theta(x))_1$ be the activation of the first output unit, and similarly $a^{(3)}_2 = (h_\theta(x))_2$ and $a^{(3)}_3 = (h_\theta(x))_3$. Then for any input x, it must be the case that $a^{(3)}_1 + a^{(3)}_2 + a^{(3)}_3 = 1$.
- A two-layer (one input layer, one output layer; no hidden layer) neural network can represent the XOR function.
- The activation values of the hidden units in a neural network, with the sigmoid activation function applied at every layer, are always in the range (0, 1).
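The second and fourth statements can be checked numerically. The following is a minimal Python sketch, assuming three independent sigmoid output units; the pre-activation values in `z_out` are hypothetical and not taken from the quiz. It shows that each sigmoid activation stays inside (0, 1) while the three outputs do not have to sum to 1.

```python
import math

def sigmoid(z):
    """Logistic function g(z) = 1 / (1 + e^(-z))."""
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical pre-activations for the three output units (not from the quiz).
z_out = [2.0, 1.0, -0.5]
a_out = [sigmoid(z) for z in z_out]

# Each unit is computed independently, so nothing forces the sum to be 1.
total = sum(a_out)
print(a_out, total)

# Sigmoid activations fall strictly between 0 and 1 for moderate inputs.
samples = [sigmoid(z) for z in (-20.0, -1.0, 0.0, 1.0, 20.0)]
all_in_range = all(0.0 < a < 1.0 for a in samples)
print(all_in_range)
```

With these hypothetical values the sum comes out well above 1, so the "must sum to 1" claim does not hold for plain sigmoid outputs.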

Consider the following neural network, which takes two binary-valued inputs $x_1, x_2 \in \{0,1\}$ and outputs $h_\theta(x)$. Which of the following logical functions does it (approximately) compute?
- AND
  This network outputs approximately 1 only when both inputs are 1.
- NAND (meaning “NOT AND”)
- OR
- XOR (exclusive OR)
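The AND answer can be checked by evaluating the unit on all four inputs. This is a minimal Python sketch, assuming a bias weight of -30 and input weights of 20 and 20 (the usual lecture-style AND example); the figure's actual weights are not reproduced in this post, so `AND_WEIGHTS` and `unit_output` are illustrative names and values.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def unit_output(bias, w1, w2, x1, x2):
    """Single sigmoid unit: h(x) = g(bias + w1*x1 + w2*x2)."""
    return sigmoid(bias + w1 * x1 + w2 * x2)

# Assumed weights (the original figure is missing): bias -30, w1 = 20, w2 = 20.
AND_WEIGHTS = (-30.0, 20.0, 20.0)

# Round each output to 0 or 1 to read it as a truth table.
and_table = {
    (x1, x2): round(unit_output(*AND_WEIGHTS, x1, x2))
    for x1 in (0, 1)
    for x2 in (0, 1)
}
print(and_table)
```

Only the input (1, 1) pushes the weighted sum above zero, so that is the only row that rounds to 1.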

Consider the following neural network, which takes two binary-valued inputs $x_1, x_2 \in \{0,1\}$ and outputs $h_\theta(x)$. Which of the following logical functions does it (approximately) compute?
- AND
- NAND (meaning “NOT AND”)
- OR
  This network outputs approximately 1 when at least one input is 1.
- XOR (exclusive OR)
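As a worked check of the OR answer, assume the unit has a bias weight of $-10$ and input weights of $20$ and $20$, so that $h_\theta(x) = g(-10 + 20x_1 + 20x_2)$. These weights are an assumption borrowed from the usual lecture-style OR example, since the figure is not reproduced in this post.

$$
\begin{aligned}
(x_1, x_2) = (0, 0): &\quad h_\theta(x) = g(-10) \approx 0 \\
(x_1, x_2) = (0, 1): &\quad h_\theta(x) = g(10) \approx 1 \\
(x_1, x_2) = (1, 0): &\quad h_\theta(x) = g(10) \approx 1 \\
(x_1, x_2) = (1, 1): &\quad h_\theta(x) = g(30) \approx 1
\end{aligned}
$$

Only the $(0, 0)$ row gives an output near 0, which matches the OR truth table.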

Consider the neural network given below. Which of the following equations correctly computes the activation $a_1^{(3)}$? Note: $g(z)$ is the sigmoid activation function.
- $a_1^{(3)} = g(\theta_{1,0}^{(2)}a_0^{(2)}+\theta_{1,1}^{(2)}a_1^{(2)}+\theta_{1,2}^{(2)}a_2^{(2)})$
  This correctly uses the first row of $\theta^{(2)}$ and includes the bias term $a_0^{(2)} = 1$ (the “+1” unit).
- $a_1^{(3)} = g(\theta_{1,0}^{(2)}a_0^{(1)}+\theta_{1,1}^{(2)}a_1^{(1)}+\theta_{1,2}^{(2)}a_2^{(1)})$
- $a_1^{(3)} = g(\theta_{1,0}^{(1)}a_0^{(2)}+\theta_{1,1}^{(1)}a_1^{(2)}+\theta_{1,2}^{(1)}a_2^{(2)})$
- $a_1^{(3)} = g(\theta_{2,0}^{(2)}a_0^{(2)}+\theta_{2,1}^{(2)}a_1^{(2)}+\theta_{2,2}^{(2)}a_2^{(2)})$
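The correct equation can be written out in a few lines of Python. This is a minimal sketch with hypothetical numbers: `theta2` and `a2` are made up for illustration, since the network figure is not reproduced here. The point is only which row of $\theta^{(2)}$ and which layer's activations enter the sum.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical parameters mapping layer 2 to layer 3 (not from the quiz).
theta2 = [
    [0.5, -1.0, 2.0],   # row 1: weights into output unit 1
    [1.5, 0.3, -0.7],   # row 2: weights into output unit 2
]
# Layer-2 activations; a2[0] is the bias unit a_0^(2) = 1.
a2 = [1.0, 0.8, 0.1]

# a_1^(3) uses the FIRST row of theta^(2) and the activations of layer 2.
z1 = sum(t * a for t, a in zip(theta2[0], a2))
a3_1 = sigmoid(z1)
print(z1, a3_1)
```

Using `theta2[1]` instead would give the second output unit, and using layer-1 activations or $\theta^{(1)}$ would mix up the layers, which is what the other three options do.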

You have the following neural network:

You’d like to compute the activations of the hidden layer $a^{(2)} \in \mathbb{R}^3$. One way to do so is the following Octave code:

You want to have a vectorized implementation of this (i.e., one that does not use for loops). Which of the following implementations correctly compute $a^{(2)}$? Check all that apply.
- z = Theta1 * x; a2 = sigmoid (z);
  This version computes $a^{(2)} = g(\theta^{(1)}x)$ correctly in two steps: first the matrix multiplication, then the sigmoid activation.
- a2 = sigmoid (x * Theta1);
- a2 = sigmoid (Theta2 * x);
- z = sigmoid(x); a2 = sigmoid (Theta1 * z);
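To see why the matrix form matches a loop, here is a minimal Python sketch. The nested loop is an assumed stand-in for the Octave listing, which is not reproduced in this post, and the values of `Theta1` and `x` are hypothetical. `matvec` is a small helper written for this example; in Octave the same step is simply `Theta1 * x`.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical 3x3 parameters and 3-element input (values not from the quiz).
Theta1 = [
    [0.1, 0.2, -0.3],
    [0.4, -0.5, 0.6],
    [-0.7, 0.8, 0.9],
]
x = [1.0, 0.5, -1.5]

# Loop version: accumulate each weighted sum, then apply the sigmoid.
a2_loop = [0.0, 0.0, 0.0]
for i in range(3):
    for j in range(3):
        a2_loop[i] += x[j] * Theta1[i][j]
    a2_loop[i] = sigmoid(a2_loop[i])

def matvec(M, v):
    """Matrix-vector product M * v for a list-of-rows matrix."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in M]

# Matrix form: z = Theta1 * x, then an element-wise sigmoid.
z = matvec(Theta1, x)
a2_vec = [sigmoid(z_i) for z_i in z]

print(a2_loop)
print(a2_vec)
```

Row `i` of `Theta1` dotted with `x` is exactly the inner loop's running sum, which is why `Theta1 * x` (and not `x * Theta1` or `Theta2 * x`) reproduces it.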

You are using the neural network pictured below and have learned the parameters $\theta^{(1)} = \begin{bmatrix} 1 & 1 & 2.4\\ 1 & 1.7 & 3.2 \end{bmatrix}$ (used to compute $a^{(2)}$) and $\theta^{(2)} = \begin{bmatrix} 1 & 0.3 & -1.2 \end{bmatrix}$ (used to compute $a^{(3)}$ as a function of $a^{(2)}$). Suppose you swap the parameters for the first hidden layer between its two units so $\theta^{(1)} = \begin{bmatrix} 1 & 1.7 & 3.2 \\ 1 & 1 & 2.4 \end{bmatrix}$ and also swap the output layer so $\theta^{(2)} = \begin{bmatrix} 1 & -1.2 & 0.3 \end{bmatrix}$. How will this change the value of the output $h_\theta(x)$?
- It will stay the same.
  Swapping the rows of $\theta^{(1)}$ swaps the two hidden units' activations in $a^{(2)}$. Swapping the matching weights in $\theta^{(2)}$ cancels that change, so the output remains unchanged.
- It will increase.
- It will decrease.
- Insufficient information to tell: it may increase or decrease.
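The swap argument can be verified with the parameters given in the question. This is a minimal Python sketch, assuming the pictured network has two inputs plus a bias unit, two hidden units plus a bias unit, and one output (which is what the 2x3 and 1x3 parameter shapes suggest). The input `(x1, x2)` is a hypothetical test point.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def forward(theta1, theta2, x1, x2):
    """h(x) for a 2-input, 2-hidden-unit, 1-output network with bias units."""
    a1 = [1.0, x1, x2]
    hidden = [sigmoid(sum(t * a for t, a in zip(row, a1))) for row in theta1]
    a2 = [1.0] + hidden
    return sigmoid(sum(t * a for t, a in zip(theta2[0], a2)))

# Parameters from the question, before and after the swap.
theta1 = [[1.0, 1.0, 2.4], [1.0, 1.7, 3.2]]
theta2 = [[1.0, 0.3, -1.2]]
theta1_swapped = [[1.0, 1.7, 3.2], [1.0, 1.0, 2.4]]
theta2_swapped = [[1.0, -1.2, 0.3]]

x1, x2 = 0.5, -0.25  # hypothetical input, chosen only for illustration
h_before = forward(theta1, theta2, x1, x2)
h_after = forward(theta1_swapped, theta2_swapped, x1, x2)
print(h_before, h_after)
```

The two printed values agree up to floating-point rounding, because the output unit sums the same two products, only in a different order.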

Feel free to ask your doubts in the comment section. I will try my best to answer them.
If you find this helpful in any way, please like, comment on, and share the post.
This is the simplest way to encourage me to keep doing such work.

Thanks & Regards,
- APDaga DumpBox

14 Comments

Juan Bomfim, 30 July 2020 at 20:52
Why can't I represent the XOR function without hidden layers? If I have a case like question 2 but with the weights -10, 20, -20, I would get:

x1 | x2 | xor
0 | 0 | 0
0 | 1 | 1
1 | 0 | 1
1 | 1 | 0

wouldn't I?
sun light, 2 August 2020 at 09:58
Please explain question 2 more clearly. I did not understand what the target output is.
sun light, 3 August 2020 at 19:29
In the 2nd question they ask: Which of the following logical functions does it (approximately) compute?
What output should come out to satisfy the truth table? For the first part of question 2 you answered "This network outputs approximately 1 only when both inputs are 1." Is it the same for the second part? How large should the output be to satisfy the truth table (that is, with -30, 20, 10)?
Bye, 12 September 2020 at 01:26
Why is option 2 of question 1 incorrect?
Unknown, 23 September 2020 at 11:24
I didn't understand questions 4 and 5. Can you please explain them in detail?
I mean, how does the vectorized implementation work in place of the for loop? And in question 5, how do I work out whether the output changes or stays the same?
Unknown, 6 December 2020 at 12:14
Consider the MNN with sigmoidal functions and the training data set x1: 0.6, 0.2; x2: 0.1, 0.3; t1: 1, 0; t2: 0, 1.