Perturbation Theory: Derivative of a trace.

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP











up vote
2
down vote

favorite












The problem that I am looking at is the following perturbation problem from the notes on Trace Inequalities and Quantum Entropy on page 12, the following result is said to follow "by the Spectral Theorem and first order perturbation theory":



Let $f$ be a continuously differentiable function on $(0,infty)$ and $B,C$ self-adjoint (complex) $n times n$ matrices. Then
$$fracddtleft|_t=0right.operatornameTr[f(B + tC)] = operatornameTr[f'(B)C].$$



If someone knows a reference where I can learn why this is true (preferably an article, I could find online or through a University library), then that would be great.



Note: I already received a solution to this using some more powerful perturbation theory $C^1$ unitarily diagonalizing, but I was wondering about less complicated things.







share|cite|improve this question






















  • Considering how a trace is simply a sum of terms, the total time derivative evaluated at $t=0$ would just slide into the trace operation. From here, it's simply an extension on the chain rule
    – Captain Morgan
    Aug 22 at 5:45










  • Sure... just why does there exist a chain rule for functions defined via the spectral theorem. Maybe, I would need to prove a version of the convergence lemma in Baby Rudin, where I approximate $f'$ by polynomials that have some antiderivatives that converge to $f$ and then I can try to interchange the derivative limit and the limit of polymonials for which the chain rule obviously applies.
    – 4-ier
    Aug 22 at 6:02










  • For instance, if $f(x) = x^2$, then we know that $f(B + tC) = B^2 + t(BC + CB) + t^2C^2$ so its derivative in $t$ at zero is $BC + CB$ which does not equal $f'(B)C = 2BC$ in general, but does after applying the trace.
    – 4-ier
    Aug 22 at 6:10











  • Yes, but since $rm tr(CB)=rm tr(BC)$ you can combine the terms to obtain $rm tr(2BC)$.
    – lynn
    Aug 23 at 19:12











  • Yes, I agree. What I am saying is that for polynomials $f$, I can see that it works because under the trace, everything commutes. However, for a general continuously differentiable function I am not sure if one can make it work.
    – 4-ier
    Aug 23 at 22:34














up vote
2
down vote

favorite












The problem that I am looking at is the following perturbation problem from the notes on Trace Inequalities and Quantum Entropy on page 12, the following result is said to follow "by the Spectral Theorem and first order perturbation theory":



Let $f$ be a continuously differentiable function on $(0,infty)$ and $B,C$ self-adjoint (complex) $n times n$ matrices. Then
$$fracddtleft|_t=0right.operatornameTr[f(B + tC)] = operatornameTr[f'(B)C].$$



If someone knows a reference where I can learn why this is true (preferably an article, I could find online or through a University library), then that would be great.



Note: I already received a solution to this using some more powerful perturbation theory $C^1$ unitarily diagonalizing, but I was wondering about less complicated things.







share|cite|improve this question






















  • Considering how a trace is simply a sum of terms, the total time derivative evaluated at $t=0$ would just slide into the trace operation. From here, it's simply an extension on the chain rule
    – Captain Morgan
    Aug 22 at 5:45










  • Sure... just why does there exist a chain rule for functions defined via the spectral theorem. Maybe, I would need to prove a version of the convergence lemma in Baby Rudin, where I approximate $f'$ by polynomials that have some antiderivatives that converge to $f$ and then I can try to interchange the derivative limit and the limit of polymonials for which the chain rule obviously applies.
    – 4-ier
    Aug 22 at 6:02










  • For instance, if $f(x) = x^2$, then we know that $f(B + tC) = B^2 + t(BC + CB) + t^2C^2$ so its derivative in $t$ at zero is $BC + CB$ which does not equal $f'(B)C = 2BC$ in general, but does after applying the trace.
    – 4-ier
    Aug 22 at 6:10











  • Yes, but since $rm tr(CB)=rm tr(BC)$ you can combine the terms to obtain $rm tr(2BC)$.
    – lynn
    Aug 23 at 19:12











  • Yes, I agree. What I am saying is that for polynomials $f$, I can see that it works because under the trace, everything commutes. However, for a general continuously differentiable function I am not sure if one can make it work.
    – 4-ier
    Aug 23 at 22:34












up vote
2
down vote

favorite









up vote
2
down vote

favorite











The problem that I am looking at is the following perturbation problem from the notes on Trace Inequalities and Quantum Entropy on page 12, the following result is said to follow "by the Spectral Theorem and first order perturbation theory":



Let $f$ be a continuously differentiable function on $(0,infty)$ and $B,C$ self-adjoint (complex) $n times n$ matrices. Then
$$fracddtleft|_t=0right.operatornameTr[f(B + tC)] = operatornameTr[f'(B)C].$$



If someone knows a reference where I can learn why this is true (preferably an article, I could find online or through a University library), then that would be great.



Note: I already received a solution to this using some more powerful perturbation theory $C^1$ unitarily diagonalizing, but I was wondering about less complicated things.







share|cite|improve this question














The problem that I am looking at is the following perturbation problem from the notes on Trace Inequalities and Quantum Entropy on page 12, the following result is said to follow "by the Spectral Theorem and first order perturbation theory":



Let $f$ be a continuously differentiable function on $(0,infty)$ and $B,C$ self-adjoint (complex) $n times n$ matrices. Then
$$fracddtleft|_t=0right.operatornameTr[f(B + tC)] = operatornameTr[f'(B)C].$$



If someone knows a reference where I can learn why this is true (preferably an article, I could find online or through a University library), then that would be great.



Note: I already received a solution to this using some more powerful perturbation theory $C^1$ unitarily diagonalizing, but I was wondering about less complicated things.









share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited Aug 23 at 22:37

























asked Aug 22 at 2:59









4-ier

5989




5989











  • Considering how a trace is simply a sum of terms, the total time derivative evaluated at $t=0$ would just slide into the trace operation. From here, it's simply an extension on the chain rule
    – Captain Morgan
    Aug 22 at 5:45










  • Sure... just why does there exist a chain rule for functions defined via the spectral theorem. Maybe, I would need to prove a version of the convergence lemma in Baby Rudin, where I approximate $f'$ by polynomials that have some antiderivatives that converge to $f$ and then I can try to interchange the derivative limit and the limit of polymonials for which the chain rule obviously applies.
    – 4-ier
    Aug 22 at 6:02










  • For instance, if $f(x) = x^2$, then we know that $f(B + tC) = B^2 + t(BC + CB) + t^2C^2$ so its derivative in $t$ at zero is $BC + CB$ which does not equal $f'(B)C = 2BC$ in general, but does after applying the trace.
    – 4-ier
    Aug 22 at 6:10











  • Yes, but since $rm tr(CB)=rm tr(BC)$ you can combine the terms to obtain $rm tr(2BC)$.
    – lynn
    Aug 23 at 19:12











  • Yes, I agree. What I am saying is that for polynomials $f$, I can see that it works because under the trace, everything commutes. However, for a general continuously differentiable function I am not sure if one can make it work.
    – 4-ier
    Aug 23 at 22:34
















  • Considering how a trace is simply a sum of terms, the total time derivative evaluated at $t=0$ would just slide into the trace operation. From here, it's simply an extension on the chain rule
    – Captain Morgan
    Aug 22 at 5:45










  • Sure... just why does there exist a chain rule for functions defined via the spectral theorem. Maybe, I would need to prove a version of the convergence lemma in Baby Rudin, where I approximate $f'$ by polynomials that have some antiderivatives that converge to $f$ and then I can try to interchange the derivative limit and the limit of polymonials for which the chain rule obviously applies.
    – 4-ier
    Aug 22 at 6:02










  • For instance, if $f(x) = x^2$, then we know that $f(B + tC) = B^2 + t(BC + CB) + t^2C^2$ so its derivative in $t$ at zero is $BC + CB$ which does not equal $f'(B)C = 2BC$ in general, but does after applying the trace.
    – 4-ier
    Aug 22 at 6:10











  • Yes, but since $rm tr(CB)=rm tr(BC)$ you can combine the terms to obtain $rm tr(2BC)$.
    – lynn
    Aug 23 at 19:12











  • Yes, I agree. What I am saying is that for polynomials $f$, I can see that it works because under the trace, everything commutes. However, for a general continuously differentiable function I am not sure if one can make it work.
    – 4-ier
    Aug 23 at 22:34















Considering how a trace is simply a sum of terms, the total time derivative evaluated at $t=0$ would just slide into the trace operation. From here, it's simply an extension on the chain rule
– Captain Morgan
Aug 22 at 5:45




Considering how a trace is simply a sum of terms, the total time derivative evaluated at $t=0$ would just slide into the trace operation. From here, it's simply an extension on the chain rule
– Captain Morgan
Aug 22 at 5:45












Sure... just why does there exist a chain rule for functions defined via the spectral theorem. Maybe, I would need to prove a version of the convergence lemma in Baby Rudin, where I approximate $f'$ by polynomials that have some antiderivatives that converge to $f$ and then I can try to interchange the derivative limit and the limit of polymonials for which the chain rule obviously applies.
– 4-ier
Aug 22 at 6:02




Sure... just why does there exist a chain rule for functions defined via the spectral theorem. Maybe, I would need to prove a version of the convergence lemma in Baby Rudin, where I approximate $f'$ by polynomials that have some antiderivatives that converge to $f$ and then I can try to interchange the derivative limit and the limit of polymonials for which the chain rule obviously applies.
– 4-ier
Aug 22 at 6:02












For instance, if $f(x) = x^2$, then we know that $f(B + tC) = B^2 + t(BC + CB) + t^2C^2$ so its derivative in $t$ at zero is $BC + CB$ which does not equal $f'(B)C = 2BC$ in general, but does after applying the trace.
– 4-ier
Aug 22 at 6:10





For instance, if $f(x) = x^2$, then we know that $f(B + tC) = B^2 + t(BC + CB) + t^2C^2$ so its derivative in $t$ at zero is $BC + CB$ which does not equal $f'(B)C = 2BC$ in general, but does after applying the trace.
– 4-ier
Aug 22 at 6:10













Yes, but since $rm tr(CB)=rm tr(BC)$ you can combine the terms to obtain $rm tr(2BC)$.
– lynn
Aug 23 at 19:12





Yes, but since $rm tr(CB)=rm tr(BC)$ you can combine the terms to obtain $rm tr(2BC)$.
– lynn
Aug 23 at 19:12













Yes, I agree. What I am saying is that for polynomials $f$, I can see that it works because under the trace, everything commutes. However, for a general continuously differentiable function I am not sure if one can make it work.
– 4-ier
Aug 23 at 22:34




Yes, I agree. What I am saying is that for polynomials $f$, I can see that it works because under the trace, everything commutes. However, for a general continuously differentiable function I am not sure if one can make it work.
– 4-ier
Aug 23 at 22:34










1 Answer
1






active

oldest

votes

















up vote
0
down vote













The result is easy to check if $f(x)$ was a polynomial. For a general continuously differentiable function $f(x)$ and a given $epsilon>0$, choose a polynomial $p(x)$ such that $|f(x)-p(x)|+|f'(x)-p'(x)|<epsilon$ on an interval $I$ containing all of the eigenvalues of $B$ and $C$. One has
$$dfracddtTr(p(B+tC))(0)=Tr(p'(B)C),$$
and so for
$$g(t)=|Tr(p(B+tC))-Tr(p(B))-Tr(p'(B)C)t|$$
we have $g(t)/t rightarrow 0$ as $trightarrow 0$.
First note that $$|Tr(p'(B)C)-Tr(f'(B)C)|=|Tr((p'(B)-f'(B))C)| $$ $$leq |p'(B)-f'(B)| |C|<n epsilon |C|.$$
Moreover,
$$|Tr(p(B+tC))-Tr(f(B+tC))|=|Tr(p(B+tC)-f(B+tC))|<n epsilon,$$
and similarly
$$|Tr(p(B))-Tr(f(B))|<n epsilon.$$
Putting all these inequalities together, we have



$$h(t)=|Tr(f(B+tC))-Tr(f(B))-Tr(f'(B)C)t|leq g(t)+(2+|C|)nepsilon,$$
By letting $epsilon rightarrow 0$, we have $0leq h(t)leq g(t)$ which implies the formula.






share|cite|improve this answer




















  • I'm fine up until getting $h(t) leq g(t)$, because this inequality depended on $|p - f| < epsilon$ and $g = g_p$. Once $epsilon < |p - f|$ the inequality is false. You would need to control $g_p(t)$ as $epsilon to 0$.
    – 4-ier
    Aug 24 at 3:03











Your Answer




StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "69"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: false,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);








 

draft saved


draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2890587%2fperturbation-theory-derivative-of-a-trace%23new-answer', 'question_page');

);

Post as a guest






























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes








up vote
0
down vote













The result is easy to check if $f(x)$ was a polynomial. For a general continuously differentiable function $f(x)$ and a given $epsilon>0$, choose a polynomial $p(x)$ such that $|f(x)-p(x)|+|f'(x)-p'(x)|<epsilon$ on an interval $I$ containing all of the eigenvalues of $B$ and $C$. One has
$$dfracddtTr(p(B+tC))(0)=Tr(p'(B)C),$$
and so for
$$g(t)=|Tr(p(B+tC))-Tr(p(B))-Tr(p'(B)C)t|$$
we have $g(t)/t rightarrow 0$ as $trightarrow 0$.
First note that $$|Tr(p'(B)C)-Tr(f'(B)C)|=|Tr((p'(B)-f'(B))C)| $$ $$leq |p'(B)-f'(B)| |C|<n epsilon |C|.$$
Moreover,
$$|Tr(p(B+tC))-Tr(f(B+tC))|=|Tr(p(B+tC)-f(B+tC))|<n epsilon,$$
and similarly
$$|Tr(p(B))-Tr(f(B))|<n epsilon.$$
Putting all these inequalities together, we have



$$h(t)=|Tr(f(B+tC))-Tr(f(B))-Tr(f'(B)C)t|leq g(t)+(2+|C|)nepsilon,$$
By letting $epsilon rightarrow 0$, we have $0leq h(t)leq g(t)$ which implies the formula.






share|cite|improve this answer




















  • I'm fine up until getting $h(t) leq g(t)$, because this inequality depended on $|p - f| < epsilon$ and $g = g_p$. Once $epsilon < |p - f|$ the inequality is false. You would need to control $g_p(t)$ as $epsilon to 0$.
    – 4-ier
    Aug 24 at 3:03















up vote
0
down vote













The result is easy to check if $f(x)$ was a polynomial. For a general continuously differentiable function $f(x)$ and a given $epsilon>0$, choose a polynomial $p(x)$ such that $|f(x)-p(x)|+|f'(x)-p'(x)|<epsilon$ on an interval $I$ containing all of the eigenvalues of $B$ and $C$. One has
$$dfracddtTr(p(B+tC))(0)=Tr(p'(B)C),$$
and so for
$$g(t)=|Tr(p(B+tC))-Tr(p(B))-Tr(p'(B)C)t|$$
we have $g(t)/t rightarrow 0$ as $trightarrow 0$.
First note that $$|Tr(p'(B)C)-Tr(f'(B)C)|=|Tr((p'(B)-f'(B))C)| $$ $$leq |p'(B)-f'(B)| |C|<n epsilon |C|.$$
Moreover,
$$|Tr(p(B+tC))-Tr(f(B+tC))|=|Tr(p(B+tC)-f(B+tC))|<n epsilon,$$
and similarly
$$|Tr(p(B))-Tr(f(B))|<n epsilon.$$
Putting all these inequalities together, we have



$$h(t)=|Tr(f(B+tC))-Tr(f(B))-Tr(f'(B)C)t|leq g(t)+(2+|C|)nepsilon,$$
By letting $epsilon rightarrow 0$, we have $0leq h(t)leq g(t)$ which implies the formula.






share|cite|improve this answer




















  • I'm fine up until getting $h(t) leq g(t)$, because this inequality depended on $|p - f| < epsilon$ and $g = g_p$. Once $epsilon < |p - f|$ the inequality is false. You would need to control $g_p(t)$ as $epsilon to 0$.
    – 4-ier
    Aug 24 at 3:03













up vote
0
down vote










up vote
0
down vote









The result is easy to check if $f(x)$ was a polynomial. For a general continuously differentiable function $f(x)$ and a given $epsilon>0$, choose a polynomial $p(x)$ such that $|f(x)-p(x)|+|f'(x)-p'(x)|<epsilon$ on an interval $I$ containing all of the eigenvalues of $B$ and $C$. One has
$$dfracddtTr(p(B+tC))(0)=Tr(p'(B)C),$$
and so for
$$g(t)=|Tr(p(B+tC))-Tr(p(B))-Tr(p'(B)C)t|$$
we have $g(t)/t rightarrow 0$ as $trightarrow 0$.
First note that $$|Tr(p'(B)C)-Tr(f'(B)C)|=|Tr((p'(B)-f'(B))C)| $$ $$leq |p'(B)-f'(B)| |C|<n epsilon |C|.$$
Moreover,
$$|Tr(p(B+tC))-Tr(f(B+tC))|=|Tr(p(B+tC)-f(B+tC))|<n epsilon,$$
and similarly
$$|Tr(p(B))-Tr(f(B))|<n epsilon.$$
Putting all these inequalities together, we have



$$h(t)=|Tr(f(B+tC))-Tr(f(B))-Tr(f'(B)C)t|leq g(t)+(2+|C|)nepsilon,$$
By letting $epsilon rightarrow 0$, we have $0leq h(t)leq g(t)$ which implies the formula.






share|cite|improve this answer












The result is easy to check if $f(x)$ was a polynomial. For a general continuously differentiable function $f(x)$ and a given $epsilon>0$, choose a polynomial $p(x)$ such that $|f(x)-p(x)|+|f'(x)-p'(x)|<epsilon$ on an interval $I$ containing all of the eigenvalues of $B$ and $C$. One has
$$dfracddtTr(p(B+tC))(0)=Tr(p'(B)C),$$
and so for
$$g(t)=|Tr(p(B+tC))-Tr(p(B))-Tr(p'(B)C)t|$$
we have $g(t)/t rightarrow 0$ as $trightarrow 0$.
First note that $$|Tr(p'(B)C)-Tr(f'(B)C)|=|Tr((p'(B)-f'(B))C)| $$ $$leq |p'(B)-f'(B)| |C|<n epsilon |C|.$$
Moreover,
$$|Tr(p(B+tC))-Tr(f(B+tC))|=|Tr(p(B+tC)-f(B+tC))|<n epsilon,$$
and similarly
$$|Tr(p(B))-Tr(f(B))|<n epsilon.$$
Putting all these inequalities together, we have



$$h(t)=|Tr(f(B+tC))-Tr(f(B))-Tr(f'(B)C)t|leq g(t)+(2+|C|)nepsilon,$$
By letting $epsilon rightarrow 0$, we have $0leq h(t)leq g(t)$ which implies the formula.







share|cite|improve this answer












share|cite|improve this answer



share|cite|improve this answer










answered Aug 24 at 0:13









Marco

1,3287




1,3287











  • I'm fine up until getting $h(t) leq g(t)$, because this inequality depended on $|p - f| < epsilon$ and $g = g_p$. Once $epsilon < |p - f|$ the inequality is false. You would need to control $g_p(t)$ as $epsilon to 0$.
    – 4-ier
    Aug 24 at 3:03

















  • I'm fine up until getting $h(t) leq g(t)$, because this inequality depended on $|p - f| < epsilon$ and $g = g_p$. Once $epsilon < |p - f|$ the inequality is false. You would need to control $g_p(t)$ as $epsilon to 0$.
    – 4-ier
    Aug 24 at 3:03
















I'm fine up until getting $h(t) leq g(t)$, because this inequality depended on $|p - f| < epsilon$ and $g = g_p$. Once $epsilon < |p - f|$ the inequality is false. You would need to control $g_p(t)$ as $epsilon to 0$.
– 4-ier
Aug 24 at 3:03





I'm fine up until getting $h(t) leq g(t)$, because this inequality depended on $|p - f| < epsilon$ and $g = g_p$. Once $epsilon < |p - f|$ the inequality is false. You would need to control $g_p(t)$ as $epsilon to 0$.
– 4-ier
Aug 24 at 3:03













 

draft saved


draft discarded


























 


draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2890587%2fperturbation-theory-derivative-of-a-trace%23new-answer', 'question_page');

);

Post as a guest













































































這個網誌中的熱門文章

tkz-euclide: tkzDrawCircle[R] not working

How to combine Bézier curves to a surface?

1st Magritte Awards