Show the optimal step to minimize $ax^tQx + b^tx + c$ has $lambda = -fracd^tnabla q(x)d^tnabla^2q(x)d$

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP











up vote
1
down vote

favorite













If a method of descent direction with exact linear search is used to
minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
the optimal step is given by $$lambda = -fracd^tnabla
q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$




So I need to minimize a function of the form $$ax^tQx + b^tx + c$$



So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$



for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.



First let's open the expression so we can take the derivative



$$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$



so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$



but this expression doesn't even have the gradient, neither the hessian.



UPDATE:



I found another book asking a similar exercise:



enter image description here



This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.







share|cite|improve this question


























    up vote
    1
    down vote

    favorite













    If a method of descent direction with exact linear search is used to
    minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
    the optimal step is given by $$lambda = -fracd^tnabla
    q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$




    So I need to minimize a function of the form $$ax^tQx + b^tx + c$$



    So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$



    for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.



    First let's open the expression so we can take the derivative



    $$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$



    so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$



    but this expression doesn't even have the gradient, neither the hessian.



    UPDATE:



    I found another book asking a similar exercise:



    enter image description here



    This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.







    share|cite|improve this question
























      up vote
      1
      down vote

      favorite









      up vote
      1
      down vote

      favorite












      If a method of descent direction with exact linear search is used to
      minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
      the optimal step is given by $$lambda = -fracd^tnabla
      q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$




      So I need to minimize a function of the form $$ax^tQx + b^tx + c$$



      So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$



      for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.



      First let's open the expression so we can take the derivative



      $$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$



      so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$



      but this expression doesn't even have the gradient, neither the hessian.



      UPDATE:



      I found another book asking a similar exercise:



      enter image description here



      This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.







      share|cite|improve this question















      If a method of descent direction with exact linear search is used to
      minimize a quadratic function $q:mathbbR^ntomathbbR$, show that
      the optimal step is given by $$lambda = -fracd^tnabla
      q(x)d^tnabla^2q(x)d$$ where $d$ is the direction used on point $x$




      So I need to minimize a function of the form $$ax^tQx + b^tx + c$$



      So I need to minimize $$f(x -lambda d) = a(x -lambda d)^tQ(x -lambda d) + b^t(x -lambda d) + c$$



      for that I take the derivative with respect to $lambda$ and set it equal to $0$ to get the $lambda$ that minimizes the expression.



      First let's open the expression so we can take the derivative



      $$f(x -lambda d) = a(x -lambda d)^t(Qx -Qlambda d) + b^tx -b^tlambda d + c = \ ax^tQx-alambda xQd-alambda d^tQx+alambda^2d^tQd + b^tx -b^tlambda d + c $$



      so $$f' = -axQd-ad^tQx + 2alambda d^tQd-b^td = 0implies\ 2alambda d^tQd = b^td+axQd + aQx$$



      but this expression doesn't even have the gradient, neither the hessian.



      UPDATE:



      I found another book asking a similar exercise:



      enter image description here



      This one does not have $nabla^2$ so I guess the first is wrong? This one can be more trusted. Anyways, I still can't arrive at the answer.









      share|cite|improve this question













      share|cite|improve this question




      share|cite|improve this question








      edited Aug 26 at 2:13

























      asked Aug 22 at 21:34









      Guerlando OCs

      29321244




      29321244




















          1 Answer
          1






          active

          oldest

          votes

















          up vote
          1
          down vote



          accepted
          +50










          Assuming $Q$ to be symmetric positive definite, $a>0$.



          $$q(x)=ax^TQx+b^Tx+c$$



          then we have



          $$nabla q(x) = 2aQx + b$$



          $$nabla^2 q(x) = 2aQ$$



          Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.



          beginalign
          fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
          &=[2a(x+lambda d)^TQ + b^T]d
          endalign



          We equate it to zero,



          $$[2a(x+lambda d)^TQ + b^T]d = 0$$
          $$2ax^TQd+2alambda d^TQd + b^Td = 0$$



          $$2alambda d^TQd+2ax^TQd + b^Td = 0$$



          $$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$



          $$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$



          $$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$



          The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.



          Remark:



          • The hessian does appear in the second source that you found, note that is it equal to $Q$.

          • For the second source, $d = -nabla f(x)$.





          share|cite|improve this answer






















            Your Answer




            StackExchange.ifUsing("editor", function ()
            return StackExchange.using("mathjaxEditing", function ()
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
            );
            );
            , "mathjax-editing");

            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "69"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            convertImagesToLinks: true,
            noModals: false,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            noCode: true, onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );













             

            draft saved


            draft discarded


















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2891469%2fshow-the-optimal-step-to-minimize-axtqx-btx-c-has-lambda-fracdt%23new-answer', 'question_page');

            );

            Post as a guest






























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes








            up vote
            1
            down vote



            accepted
            +50










            Assuming $Q$ to be symmetric positive definite, $a>0$.



            $$q(x)=ax^TQx+b^Tx+c$$



            then we have



            $$nabla q(x) = 2aQx + b$$



            $$nabla^2 q(x) = 2aQ$$



            Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.



            beginalign
            fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
            &=[2a(x+lambda d)^TQ + b^T]d
            endalign



            We equate it to zero,



            $$[2a(x+lambda d)^TQ + b^T]d = 0$$
            $$2ax^TQd+2alambda d^TQd + b^Td = 0$$



            $$2alambda d^TQd+2ax^TQd + b^Td = 0$$



            $$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$



            $$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$



            $$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$



            The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.



            Remark:



            • The hessian does appear in the second source that you found, note that is it equal to $Q$.

            • For the second source, $d = -nabla f(x)$.





            share|cite|improve this answer


























              up vote
              1
              down vote



              accepted
              +50










              Assuming $Q$ to be symmetric positive definite, $a>0$.



              $$q(x)=ax^TQx+b^Tx+c$$



              then we have



              $$nabla q(x) = 2aQx + b$$



              $$nabla^2 q(x) = 2aQ$$



              Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.



              beginalign
              fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
              &=[2a(x+lambda d)^TQ + b^T]d
              endalign



              We equate it to zero,



              $$[2a(x+lambda d)^TQ + b^T]d = 0$$
              $$2ax^TQd+2alambda d^TQd + b^Td = 0$$



              $$2alambda d^TQd+2ax^TQd + b^Td = 0$$



              $$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$



              $$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$



              $$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$



              The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.



              Remark:



              • The hessian does appear in the second source that you found, note that is it equal to $Q$.

              • For the second source, $d = -nabla f(x)$.





              share|cite|improve this answer
























                up vote
                1
                down vote



                accepted
                +50







                up vote
                1
                down vote



                accepted
                +50




                +50




                Assuming $Q$ to be symmetric positive definite, $a>0$.



                $$q(x)=ax^TQx+b^Tx+c$$



                then we have



                $$nabla q(x) = 2aQx + b$$



                $$nabla^2 q(x) = 2aQ$$



                Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.



                beginalign
                fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
                &=[2a(x+lambda d)^TQ + b^T]d
                endalign



                We equate it to zero,



                $$[2a(x+lambda d)^TQ + b^T]d = 0$$
                $$2ax^TQd+2alambda d^TQd + b^Td = 0$$



                $$2alambda d^TQd+2ax^TQd + b^Td = 0$$



                $$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$



                $$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$



                $$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$



                The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.



                Remark:



                • The hessian does appear in the second source that you found, note that is it equal to $Q$.

                • For the second source, $d = -nabla f(x)$.





                share|cite|improve this answer














                Assuming $Q$ to be symmetric positive definite, $a>0$.



                $$q(x)=ax^TQx+b^Tx+c$$



                then we have



                $$nabla q(x) = 2aQx + b$$



                $$nabla^2 q(x) = 2aQ$$



                Given $x$ and $d$, we want to minimizie $q(x+lambda d)$.



                beginalign
                fracddlambda q(x+lambda d) &= nabla q(x+lambda d)^T d \
                &=[2a(x+lambda d)^TQ + b^T]d
                endalign



                We equate it to zero,



                $$[2a(x+lambda d)^TQ + b^T]d = 0$$
                $$2ax^TQd+2alambda d^TQd + b^Td = 0$$



                $$2alambda d^TQd+2ax^TQd + b^Td = 0$$



                $$lambda d^Tnabla^2q(x)d + (2aQx+b)^Td=0 $$



                $$lambda d^Tnabla^2q(x)d + nabla q(x)^Td=0 $$



                $$lambda =-frac nabla q(x)^Td d^Tnabla^2q(x)d$$



                The expression should be positive since $d$ should be chosen such that $nabla q(x)^Td <0$ to be a descent direction, and the negative should make the whole expression positive. The denominator is positive by the assumption that $Q$ is positive definite.



                Remark:



                • The hessian does appear in the second source that you found, note that is it equal to $Q$.

                • For the second source, $d = -nabla f(x)$.






                share|cite|improve this answer














                share|cite|improve this answer



                share|cite|improve this answer








                edited Aug 26 at 3:12

























                answered Aug 26 at 3:06









                Siong Thye Goh

                80.6k1453102




                80.6k1453102



























                     

                    draft saved


                    draft discarded















































                     


                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2891469%2fshow-the-optimal-step-to-minimize-axtqx-btx-c-has-lambda-fracdt%23new-answer', 'question_page');

                    );

                    Post as a guest













































































                    這個網誌中的熱門文章

                    How to combine Bézier curves to a surface?

                    Mutual Information Always Non-negative

                    Why am i infinitely getting the same tweet with the Twitter Search API?