Show the optimal step to minimize $ax^tQx + b^tx + c$ has $lambda = -fracd^tnabla q(x)d^tnabla^2q(x)d$

up vote
1
down vote

favorite

If a method of descent direction with exact linear search is used to
minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
the optimal step is given by $$lambda = -fracd^tnabla
q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$

So I need to minimize a function of the form $$ax^tQx + b^tx + c$$

So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$

for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.

First let's open the expression so we can take the derivative

$$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$

so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$

but this expression doesn't even have the gradient, neither the hessian.

UPDATE:

I found another book asking a similar exercise:

enter image description here

This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.

edited Aug 26 at 2:13

asked Aug 22 at 21:34

Guerlando OCs

29321244

add a commentÂ |Â

up vote
1
down vote

favorite

If a method of descent direction with exact linear search is used to
minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
the optimal step is given by $$lambda = -fracd^tnabla
q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$

So I need to minimize a function of the form $$ax^tQx + b^tx + c$$

So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$

for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.

First let's open the expression so we can take the derivative

$$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$

so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$

but this expression doesn't even have the gradient, neither the hessian.

UPDATE:

I found another book asking a similar exercise:

enter image description here

This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.

edited Aug 26 at 2:13

asked Aug 22 at 21:34

Guerlando OCs

29321244

add a commentÂ |Â

up vote
1
down vote

favorite

If a method of descent direction with exact linear search is used to
minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
the optimal step is given by $$lambda = -fracd^tnabla
q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$

So I need to minimize a function of the form $$ax^tQx + b^tx + c$$

So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$

for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.

First let's open the expression so we can take the derivative

$$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$

so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$

but this expression doesn't even have the gradient, neither the hessian.

UPDATE:

I found another book asking a similar exercise:

enter image description here

This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.

edited Aug 26 at 2:13

asked Aug 22 at 21:34

Guerlando OCs

29321244

If a method of descent direction with exact linear search is used to
minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
the optimal step is given by $$lambda = -fracd^tnabla
q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$

So I need to minimize a function of the form $$ax^tQx + b^tx + c$$

So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$

for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.

First let's open the expression so we can take the derivative

$$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$

so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$

but this expression doesn't even have the gradient, neither the hessian.

UPDATE:

I found another book asking a similar exercise:

enter image description here

This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.

edited Aug 26 at 2:13

asked Aug 22 at 21:34

Guerlando OCs

29321244

edited Aug 26 at 2:13

asked Aug 22 at 21:34

Guerlando OCs

29321244

asked Aug 22 at 21:34

Guerlando OCs

29321244

asked Aug 22 at 21:34

Guerlando OCs

29321244

add a commentÂ |Â

1 Answer
1

active

oldest

votes

up vote
1
down vote

accepted

+50

Assuming $Q$ to be symmetric positive definite, $a>0$.

$$q(x)=ax^TQx+b^Tx+c$$

then we have

$$nabla q(x) = 2aQx + b$$

$$nabla^2 q(x) = 2aQ$$

Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.

beginalign
fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
&=[2a(x+lambda d)^TQ + b^T]d
endalign

We equate it to zero,

$$[2a(x+lambda d)^TQ + b^T]d = 0$$
$$2ax^TQd+2alambda d^TQd + b^Td = 0$$

$$2alambda d^TQd+2ax^TQd + b^Td = 0$$

$$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$

$$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$

$$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$

The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.

Remark:

The hessian does appear in the second source that you found, note that is it equal to $Q$.

For the second source, $d = -nabla f(x)$.

edited Aug 26 at 3:12

answered Aug 26 at 3:06

Siong Thye Goh

80.6k1453102

add a commentÂ |Â

Your Answer

StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\$","\$"]]);
);
);
, "mathjax-editing");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "69"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: false,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);

);

draft saved

draft discarded

StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2891469%2fshow-the-optimal-step-to-minimize-axtqx-btx-c-has-lambda-fracdt%23new-answer', 'question_page');

);

Post as a guest

Name

1 Answer
1

active

oldest

votes

1 Answer
1

active

oldest

votes

up vote
1
down vote

accepted

+50

Assuming $Q$ to be symmetric positive definite, $a>0$.

$$q(x)=ax^TQx+b^Tx+c$$

then we have

$$nabla q(x) = 2aQx + b$$

$$nabla^2 q(x) = 2aQ$$

Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.

beginalign
fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
&=[2a(x+lambda d)^TQ + b^T]d
endalign

We equate it to zero,

$$[2a(x+lambda d)^TQ + b^T]d = 0$$
$$2ax^TQd+2alambda d^TQd + b^Td = 0$$

$$2alambda d^TQd+2ax^TQd + b^Td = 0$$

$$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$

$$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$

$$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$

Remark:

The hessian does appear in the second source that you found, note that is it equal to $Q$.

For the second source, $d = -nabla f(x)$.

edited Aug 26 at 3:12

answered Aug 26 at 3:06

Siong Thye Goh

80.6k1453102

add a commentÂ |Â

up vote
1
down vote

accepted

+50

Assuming $Q$ to be symmetric positive definite, $a>0$.

$$q(x)=ax^TQx+b^Tx+c$$

then we have

$$nabla q(x) = 2aQx + b$$

$$nabla^2 q(x) = 2aQ$$

Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.

beginalign
fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
&=[2a(x+lambda d)^TQ + b^T]d
endalign

We equate it to zero,

$$[2a(x+lambda d)^TQ + b^T]d = 0$$
$$2ax^TQd+2alambda d^TQd + b^Td = 0$$

$$2alambda d^TQd+2ax^TQd + b^Td = 0$$

$$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$

$$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$

$$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$

Remark:

The hessian does appear in the second source that you found, note that is it equal to $Q$.

For the second source, $d = -nabla f(x)$.

edited Aug 26 at 3:12

answered Aug 26 at 3:06

Siong Thye Goh

80.6k1453102

add a commentÂ |Â

up vote
1
down vote

accepted

+50

up vote
1
down vote

accepted

+50

Assuming $Q$ to be symmetric positive definite, $a>0$.

$$q(x)=ax^TQx+b^Tx+c$$

then we have

$$nabla q(x) = 2aQx + b$$

$$nabla^2 q(x) = 2aQ$$

Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.

beginalign
fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
&=[2a(x+lambda d)^TQ + b^T]d
endalign

We equate it to zero,

$$[2a(x+lambda d)^TQ + b^T]d = 0$$
$$2ax^TQd+2alambda d^TQd + b^Td = 0$$

$$2alambda d^TQd+2ax^TQd + b^Td = 0$$

$$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$

$$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$

$$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$

Remark:

The hessian does appear in the second source that you found, note that is it equal to $Q$.

For the second source, $d = -nabla f(x)$.

edited Aug 26 at 3:12

answered Aug 26 at 3:06

Siong Thye Goh

80.6k1453102

Assuming $Q$ to be symmetric positive definite, $a>0$.

$$q(x)=ax^TQx+b^Tx+c$$

then we have

$$nabla q(x) = 2aQx + b$$

$$nabla^2 q(x) = 2aQ$$

Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.

beginalign
fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
&=[2a(x+lambda d)^TQ + b^T]d
endalign

We equate it to zero,

$$[2a(x+lambda d)^TQ + b^T]d = 0$$
$$2ax^TQd+2alambda d^TQd + b^Td = 0$$

$$2alambda d^TQd+2ax^TQd + b^Td = 0$$

$$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$

$$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$

$$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$

Remark:

The hessian does appear in the second source that you found, note that is it equal to $Q$.

For the second source, $d = -nabla f(x)$.

edited Aug 26 at 3:12

answered Aug 26 at 3:06

Siong Thye Goh

80.6k1453102

edited Aug 26 at 3:12

answered Aug 26 at 3:06

Siong Thye Goh

80.6k1453102

answered Aug 26 at 3:06

Siong Thye Goh

80.6k1453102

answered Aug 26 at 3:06

Siong Thye Goh

80.6k1453102

add a commentÂ |Â

draft saved

draft discarded

draft saved

draft discarded

Post as a guest

Name

搜尋此網誌

Vtyjkyuk