Show the optimal step to minimize $ax^tQx + b^tx + c$ has $lambda = -fracd^tnabla q(x)d^tnabla^2q(x)d$
Clash Royale CLAN TAG#URR8PPP
up vote
1
down vote
favorite
If a method of descent direction with exact linear search is used to
minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
the optimal step is given by $$lambda = -fracd^tnabla
q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$
So I need to minimize a function of the form $$ax^tQx + b^tx + c$$
So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$
for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.
First let's open the expression so we can take the derivative
$$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$
so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$
but this expression doesn't even have the gradient, neither the hessian.
UPDATE:
I found another book asking a similar exercise:
This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.
calculus linear-algebra multivariable-calculus optimization maxima-minima
add a comment |Â
up vote
1
down vote
favorite
If a method of descent direction with exact linear search is used to
minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
the optimal step is given by $$lambda = -fracd^tnabla
q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$
So I need to minimize a function of the form $$ax^tQx + b^tx + c$$
So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$
for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.
First let's open the expression so we can take the derivative
$$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$
so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$
but this expression doesn't even have the gradient, neither the hessian.
UPDATE:
I found another book asking a similar exercise:
This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.
calculus linear-algebra multivariable-calculus optimization maxima-minima
add a comment |Â
up vote
1
down vote
favorite
up vote
1
down vote
favorite
If a method of descent direction with exact linear search is used to
minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
the optimal step is given by $$lambda = -fracd^tnabla
q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$
So I need to minimize a function of the form $$ax^tQx + b^tx + c$$
So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$
for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.
First let's open the expression so we can take the derivative
$$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$
so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$
but this expression doesn't even have the gradient, neither the hessian.
UPDATE:
I found another book asking a similar exercise:
This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.
calculus linear-algebra multivariable-calculus optimization maxima-minima
If a method of descent direction with exact linear search is used to
minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
the optimal step is given by $$lambda = -fracd^tnabla
q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$
So I need to minimize a function of the form $$ax^tQx + b^tx + c$$
So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$
for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.
First let's open the expression so we can take the derivative
$$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$
so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$
but this expression doesn't even have the gradient, neither the hessian.
UPDATE:
I found another book asking a similar exercise:
This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.
calculus linear-algebra multivariable-calculus optimization maxima-minima
edited Aug 26 at 2:13
asked Aug 22 at 21:34
Guerlando OCs
29321244
29321244
add a comment |Â
add a comment |Â
1 Answer
1
active
oldest
votes
up vote
1
down vote
accepted
Assuming $Q$ to be symmetric positive definite, $a>0$.
$$q(x)=ax^TQx+b^Tx+c$$
then we have
$$nabla q(x) = 2aQx + b$$
$$nabla^2 q(x) = 2aQ$$
Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.
beginalign
fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
&=[2a(x+lambda d)^TQ + b^T]d
endalign
We equate it to zero,
$$[2a(x+lambda d)^TQ + b^T]d = 0$$
$$2ax^TQd+2alambda d^TQd + b^Td = 0$$
$$2alambda d^TQd+2ax^TQd + b^Td = 0$$
$$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$
$$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$
$$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$
The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.
Remark:
- The hessian does appear in the second source that you found, note that is it equal to $Q$.
- For the second source, $d = -nabla f(x)$.
add a comment |Â
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
1
down vote
accepted
Assuming $Q$ to be symmetric positive definite, $a>0$.
$$q(x)=ax^TQx+b^Tx+c$$
then we have
$$nabla q(x) = 2aQx + b$$
$$nabla^2 q(x) = 2aQ$$
Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.
beginalign
fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
&=[2a(x+lambda d)^TQ + b^T]d
endalign
We equate it to zero,
$$[2a(x+lambda d)^TQ + b^T]d = 0$$
$$2ax^TQd+2alambda d^TQd + b^Td = 0$$
$$2alambda d^TQd+2ax^TQd + b^Td = 0$$
$$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$
$$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$
$$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$
The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.
Remark:
- The hessian does appear in the second source that you found, note that is it equal to $Q$.
- For the second source, $d = -nabla f(x)$.
add a comment |Â
up vote
1
down vote
accepted
Assuming $Q$ to be symmetric positive definite, $a>0$.
$$q(x)=ax^TQx+b^Tx+c$$
then we have
$$nabla q(x) = 2aQx + b$$
$$nabla^2 q(x) = 2aQ$$
Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.
beginalign
fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
&=[2a(x+lambda d)^TQ + b^T]d
endalign
We equate it to zero,
$$[2a(x+lambda d)^TQ + b^T]d = 0$$
$$2ax^TQd+2alambda d^TQd + b^Td = 0$$
$$2alambda d^TQd+2ax^TQd + b^Td = 0$$
$$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$
$$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$
$$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$
The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.
Remark:
- The hessian does appear in the second source that you found, note that is it equal to $Q$.
- For the second source, $d = -nabla f(x)$.
add a comment |Â
up vote
1
down vote
accepted
up vote
1
down vote
accepted
Assuming $Q$ to be symmetric positive definite, $a>0$.
$$q(x)=ax^TQx+b^Tx+c$$
then we have
$$nabla q(x) = 2aQx + b$$
$$nabla^2 q(x) = 2aQ$$
Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.
beginalign
fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
&=[2a(x+lambda d)^TQ + b^T]d
endalign
We equate it to zero,
$$[2a(x+lambda d)^TQ + b^T]d = 0$$
$$2ax^TQd+2alambda d^TQd + b^Td = 0$$
$$2alambda d^TQd+2ax^TQd + b^Td = 0$$
$$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$
$$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$
$$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$
The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.
Remark:
- The hessian does appear in the second source that you found, note that is it equal to $Q$.
- For the second source, $d = -nabla f(x)$.
Assuming $Q$ to be symmetric positive definite, $a>0$.
$$q(x)=ax^TQx+b^Tx+c$$
then we have
$$nabla q(x) = 2aQx + b$$
$$nabla^2 q(x) = 2aQ$$
Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.
beginalign
fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
&=[2a(x+lambda d)^TQ + b^T]d
endalign
We equate it to zero,
$$[2a(x+lambda d)^TQ + b^T]d = 0$$
$$2ax^TQd+2alambda d^TQd + b^Td = 0$$
$$2alambda d^TQd+2ax^TQd + b^Td = 0$$
$$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$
$$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$
$$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$
The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.
Remark:
- The hessian does appear in the second source that you found, note that is it equal to $Q$.
- For the second source, $d = -nabla f(x)$.
edited Aug 26 at 3:12
answered Aug 26 at 3:06
Siong Thye Goh
80.6k1453102
80.6k1453102
add a comment |Â
add a comment |Â
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2891469%2fshow-the-optimal-step-to-minimize-axtqx-btx-c-has-lambda-fracdt%23new-answer', 'question_page');
);
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password