Understanding the differential $dx$ when doing $u$-substitution

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP











up vote
13
down vote

favorite
9












I just finished taking my first year of calculus in college and I passed with an A. I don't think, however, that I ever really understood the entire $fracdydx$ notation (so I just focused on using $u'$), and now that I'm going to be starting calculus 2 and learning newer integration techniques, I feel that understanding the differential in an integral is important.



Take this problem for an example:



$$int 2x (x^2+4)^100dx$$



So solving this...
$$u = x^2 implies du = 2x dx longleftarrow textwhy $dx$ here?$$



And so now I'd have: $displaystyle int (u)^100du$ which is $displaystyle fracu^101101 + C$ and then I'd just substitute back in my $u$.



I'm so confused by all of this. I know how to do it from practice, but I don't understand what is really happening here. What happens to the $du$ between rewriting the integral and taking the anti-derivative? Why is writing the $dx$ so important? How should I be viewing this in when seeing an integral?







share|cite|improve this question


















  • 2




    You are smart to think about this. Understanding this notation becomes more important the deeper you get into calculus.
    – Ataraxia
    May 25 '13 at 19:35






  • 1




    @nkon $u=x^2 implies u'=2x implies dfracdudx=2x "implies "du=2xdx$, you get the last implication "multiplying by $dx$". This is something which I find absolutely disgusting. It doesn't make sense as it was presented to you, though it can be formalized in such a way that it makes sense. See this.
    – Git Gud
    May 25 '13 at 19:37







  • 3




    @GitGud's comment is right, though I wouldn't go as far as to call it "absolutely disgusting". It's just a convenient piece of notation that helps you remember the substitution rule.
    – Javier
    May 25 '13 at 19:39










  • @JavierBadia Maybe it works for some people. To me, it just confused me and since it's wrong, I classify it as disgusting.
    – Git Gud
    May 25 '13 at 19:41










  • @GitGud so it's just notation to remind me of what I took the derivative of?
    – hax0r_n_code
    May 25 '13 at 19:42














up vote
13
down vote

favorite
9












I just finished taking my first year of calculus in college and I passed with an A. I don't think, however, that I ever really understood the entire $fracdydx$ notation (so I just focused on using $u'$), and now that I'm going to be starting calculus 2 and learning newer integration techniques, I feel that understanding the differential in an integral is important.



Take this problem for an example:



$$int 2x (x^2+4)^100dx$$



So solving this...
$$u = x^2 implies du = 2x dx longleftarrow textwhy $dx$ here?$$



And so now I'd have: $displaystyle int (u)^100du$ which is $displaystyle fracu^101101 + C$ and then I'd just substitute back in my $u$.



I'm so confused by all of this. I know how to do it from practice, but I don't understand what is really happening here. What happens to the $du$ between rewriting the integral and taking the anti-derivative? Why is writing the $dx$ so important? How should I be viewing this in when seeing an integral?







share|cite|improve this question


















  • 2




    You are smart to think about this. Understanding this notation becomes more important the deeper you get into calculus.
    – Ataraxia
    May 25 '13 at 19:35






  • 1




    @nkon $u=x^2 implies u'=2x implies dfracdudx=2x "implies "du=2xdx$, you get the last implication "multiplying by $dx$". This is something which I find absolutely disgusting. It doesn't make sense as it was presented to you, though it can be formalized in such a way that it makes sense. See this.
    – Git Gud
    May 25 '13 at 19:37







  • 3




    @GitGud's comment is right, though I wouldn't go as far as to call it "absolutely disgusting". It's just a convenient piece of notation that helps you remember the substitution rule.
    – Javier
    May 25 '13 at 19:39










  • @JavierBadia Maybe it works for some people. To me, it just confused me and since it's wrong, I classify it as disgusting.
    – Git Gud
    May 25 '13 at 19:41










  • @GitGud so it's just notation to remind me of what I took the derivative of?
    – hax0r_n_code
    May 25 '13 at 19:42












up vote
13
down vote

favorite
9









up vote
13
down vote

favorite
9






9





I just finished taking my first year of calculus in college and I passed with an A. I don't think, however, that I ever really understood the entire $fracdydx$ notation (so I just focused on using $u'$), and now that I'm going to be starting calculus 2 and learning newer integration techniques, I feel that understanding the differential in an integral is important.



Take this problem for an example:



$$int 2x (x^2+4)^100dx$$



So solving this...
$$u = x^2 implies du = 2x dx longleftarrow textwhy $dx$ here?$$



And so now I'd have: $displaystyle int (u)^100du$ which is $displaystyle fracu^101101 + C$ and then I'd just substitute back in my $u$.



I'm so confused by all of this. I know how to do it from practice, but I don't understand what is really happening here. What happens to the $du$ between rewriting the integral and taking the anti-derivative? Why is writing the $dx$ so important? How should I be viewing this in when seeing an integral?







share|cite|improve this question














I just finished taking my first year of calculus in college and I passed with an A. I don't think, however, that I ever really understood the entire $fracdydx$ notation (so I just focused on using $u'$), and now that I'm going to be starting calculus 2 and learning newer integration techniques, I feel that understanding the differential in an integral is important.



Take this problem for an example:



$$int 2x (x^2+4)^100dx$$



So solving this...
$$u = x^2 implies du = 2x dx longleftarrow textwhy $dx$ here?$$



And so now I'd have: $displaystyle int (u)^100du$ which is $displaystyle fracu^101101 + C$ and then I'd just substitute back in my $u$.



I'm so confused by all of this. I know how to do it from practice, but I don't understand what is really happening here. What happens to the $du$ between rewriting the integral and taking the anti-derivative? Why is writing the $dx$ so important? How should I be viewing this in when seeing an integral?









share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited May 25 '13 at 20:42









Sharkos

13k22148




13k22148










asked May 25 '13 at 19:31









hax0r_n_code

1,87353571




1,87353571







  • 2




    You are smart to think about this. Understanding this notation becomes more important the deeper you get into calculus.
    – Ataraxia
    May 25 '13 at 19:35






  • 1




    @nkon $u=x^2 implies u'=2x implies dfracdudx=2x "implies "du=2xdx$, you get the last implication "multiplying by $dx$". This is something which I find absolutely disgusting. It doesn't make sense as it was presented to you, though it can be formalized in such a way that it makes sense. See this.
    – Git Gud
    May 25 '13 at 19:37







  • 3




    @GitGud's comment is right, though I wouldn't go as far as to call it "absolutely disgusting". It's just a convenient piece of notation that helps you remember the substitution rule.
    – Javier
    May 25 '13 at 19:39










  • @JavierBadia Maybe it works for some people. To me, it just confused me and since it's wrong, I classify it as disgusting.
    – Git Gud
    May 25 '13 at 19:41










  • @GitGud so it's just notation to remind me of what I took the derivative of?
    – hax0r_n_code
    May 25 '13 at 19:42












  • 2




    You are smart to think about this. Understanding this notation becomes more important the deeper you get into calculus.
    – Ataraxia
    May 25 '13 at 19:35






  • 1




    @nkon $u=x^2 implies u'=2x implies dfracdudx=2x "implies "du=2xdx$, you get the last implication "multiplying by $dx$". This is something which I find absolutely disgusting. It doesn't make sense as it was presented to you, though it can be formalized in such a way that it makes sense. See this.
    – Git Gud
    May 25 '13 at 19:37







  • 3




    @GitGud's comment is right, though I wouldn't go as far as to call it "absolutely disgusting". It's just a convenient piece of notation that helps you remember the substitution rule.
    – Javier
    May 25 '13 at 19:39










  • @JavierBadia Maybe it works for some people. To me, it just confused me and since it's wrong, I classify it as disgusting.
    – Git Gud
    May 25 '13 at 19:41










  • @GitGud so it's just notation to remind me of what I took the derivative of?
    – hax0r_n_code
    May 25 '13 at 19:42







2




2




You are smart to think about this. Understanding this notation becomes more important the deeper you get into calculus.
– Ataraxia
May 25 '13 at 19:35




You are smart to think about this. Understanding this notation becomes more important the deeper you get into calculus.
– Ataraxia
May 25 '13 at 19:35




1




1




@nkon $u=x^2 implies u'=2x implies dfracdudx=2x "implies "du=2xdx$, you get the last implication "multiplying by $dx$". This is something which I find absolutely disgusting. It doesn't make sense as it was presented to you, though it can be formalized in such a way that it makes sense. See this.
– Git Gud
May 25 '13 at 19:37





@nkon $u=x^2 implies u'=2x implies dfracdudx=2x "implies "du=2xdx$, you get the last implication "multiplying by $dx$". This is something which I find absolutely disgusting. It doesn't make sense as it was presented to you, though it can be formalized in such a way that it makes sense. See this.
– Git Gud
May 25 '13 at 19:37





3




3




@GitGud's comment is right, though I wouldn't go as far as to call it "absolutely disgusting". It's just a convenient piece of notation that helps you remember the substitution rule.
– Javier
May 25 '13 at 19:39




@GitGud's comment is right, though I wouldn't go as far as to call it "absolutely disgusting". It's just a convenient piece of notation that helps you remember the substitution rule.
– Javier
May 25 '13 at 19:39












@JavierBadia Maybe it works for some people. To me, it just confused me and since it's wrong, I classify it as disgusting.
– Git Gud
May 25 '13 at 19:41




@JavierBadia Maybe it works for some people. To me, it just confused me and since it's wrong, I classify it as disgusting.
– Git Gud
May 25 '13 at 19:41












@GitGud so it's just notation to remind me of what I took the derivative of?
– hax0r_n_code
May 25 '13 at 19:42




@GitGud so it's just notation to remind me of what I took the derivative of?
– hax0r_n_code
May 25 '13 at 19:42










4 Answers
4






active

oldest

votes

















up vote
11
down vote



accepted










Okay, so this is a slightly tricksy problem, because at first people think "Oh, it's fine, I just multiply through by this thing and it's fine" but then they think "Hang on, we've never actually defined what we meant by this... it's just some shorthand trickery" - but then if they eventually do differential geometry you think "Aha! So it did make sense all along!"



The main message I want you to take away is that things like $mathrm d x$ are actually well defined things called differential forms which you don't really need to understand in any detail at all to get how they work.



The way they end up working in integration and changes of variable is roughly the following: they come together to make a volume form which just tells you how much volume a small range of your parameters corresponds to. (I say "come together" because if you are doing many integrations like in $int int f(x,y) ;mathrm d x mathrm d y$ then you get a bunch of the $mathrm d [...]$ things all together.) More precisely, remember how you can roughly define integration as a limit of a sum like
$$int_a^b f(x) mathrm d x equiv lim_Ntoinftysum_n=1^N f(x_n) left(x_n-x_n-1right)$$
where $x_0=a,x_N=b$ and the other points $x_i$ are chosen in between, such that all gaps $delta_n = x_n-x_n-1 to 0$ (say, uniformly) as $Nto infty$.
Here, $delta_n = x_n-x_n-1$ is providing some measure of how important the bit of space between $x_n-1,x_n$ is in computing the integral. The $mathrm d x$ is what keeps track of that information.



Suppose you then try $u=x^2$ or $x=sqrt u$. Then in general
$$int_a^2^b^2 f(sqrt u) mathrm d u equiv lim_Ntoinftysum_N f(sqrtu_n) left(u_n-u_n-1right) = lim_Ntoinftysum_N f(x_n) left(x_n^2-x_n-1^2right)neq int f mathrm d x$$
because the weight is different!



But notice that $x_n^2-x_n-1^2 = (x_n-x_n-1)(x_n+x_n-1) approx (x_n-x_n-1)(2x_n)$ in the limit of fine spacing, so
$$int_a^2^b^2 f(sqrt u) fracmathrm d u2x = int_a^b f(x) ; mathrm d x$$



We're really analyzing the difference between the volumes of the little patches of space when we play with the differentials. The trick is to realize that in general, just like here, $mathrm d u = u'(x) mathrm dx$. In higher dimensional integrals, you will discover that the generalization to e.g.
$$int f(x,y) ;mathrm d xmathrm d y = int f(u,v) ; J ; mathrm d umathrm d v$$
where $u=u(x,y),v=v(x,y)$ involves a quantity $J$ called the Jacobian (determinant) which uses all the possible derivatives of $u,v$ with respect to $x,y$ in a particular way.



The notation
$$fracmathrm d umathrm d x = lim_textfine spacingfracdelta udelta x = lim_x_n-x_n-1to 0fracu_n-u_n-1x_n-x_n-1 = u'(x)$$
is now seen to be just a suggestive notation which works for the case of only one variable changing. It's used because it makes it clear how the volume form should be replaced.




When there are many variables, this notation breaks down because the factors are all mixed up together and people write partial derivatives, which you'll see soon if you haven't already, instead. It turns out that it makes sense to use a generalization of the
$$mathrm d u = u'(x) mathrm d x$$
law called the chain rule in which, for $u=u(x,y)$ for example
$$mathrm d u = u_x mathrm d x + u_y mathrm d y$$
where $u_x(x,y)$ is the derivative of $u$ with respect to $x$ when we just think of $y$ as a constant.



You'll have to wait until differential geometry courses to see how to use this to get the Jacobian factor; it turns out that rather than just writing the forms together, you should technically define something called a wedge product such that $awedge b = -bwedge a$ for one-forms like $mathrm d x$; then you get
$$mathrm d u wedge mathrm d v = (u_x mathrm d x + u_y mathrm d y)wedge(v_x mathrm d x + v_y mathrm d y) = (u_x v_y-u_y v_x) mathrm d x wedge mathrm d y$$
so that the Jacobian is (one over) $(u_x v_y-u_y v_x)=det pmatrixu_x & u_y \ v_x & v_y$.



You can get this result directly from thinking about little patches of volume, however, so you'll see this far earlier than any differential form stuff. I just thought that, since you were curious, you should have had the full story mentioned to you along the way.






share|cite|improve this answer






















  • I was reading your answer on that other question which you referred this one to, and I was wondering ; where could I read about the construction of such a wedge product? I know about exterior algebras in the general setting (i.e. the quotient of the tensor algebra blablabla...) but I've never seen how it applies to differential geometry and I'd like to read about it. What do you suggest?
    – Patrick Da Silva
    Jun 11 '13 at 7:21










  • Absolutely any proper course on differential geometry or indeed general relativity (which is the same thing). It's a surprisingly simple formalism. Personally I learned it in a (fairly geometric) GR course - see damtp.cam.ac.uk/user/hsr1000 - for which you can skip the physics.
    – Sharkos
    Jun 11 '13 at 10:00










  • I did have a differential geometry course, but I guess it wasn't that much advanced... thanks though
    – Patrick Da Silva
    Jun 12 '13 at 1:14










  • @Sharkos You wrote $mathrm d u = u_x mathrm d x + u_y mathrm d x$. Is this supposed to be $mathrm d u = u_x mathrm d x + u_y mathrm d y$?
    – FreshAir
    Aug 14 '15 at 1:42







  • 1




    @FreshAir Yup, thanks, have fixed!
    – Sharkos
    Aug 17 '15 at 10:58

















up vote
6
down vote













The use of infinitesimals can only be formalized an analysed with care when working with non-standard analysis. When people first study calculus and the most elementary books define $dx$ as $Delta x$ approaching zero, usually people get confused think "but this should be zero", and in standard analysis it'll really be.



Things are much easier than this, however, we just have to throw away those $dx$ and $dy$. Why? Simply because modern mathematics adhered to methods that are more sophisticated, simpler, and because if you are going to proceed in mathematics you'll really need those modern methods when studying analysis or differential geometry (where $dx$ receives a true definition and gains a fundamental role).



Now you might ask the exact same question I've asked when my I first encountered the rigorous framework: "this guy is mad! My book talks about infinitesimals, my teacher told me it's all right, he must be mad", but it's not like that. I'll show you two examples: the first one is meant to show you that when working with integrals those things appear just as mnemonic rules that allows you to remember the true formula easier when you're starting. The second is to show you when things become confusing when using infinitesimals without care.



First, consider your function:



$$int2x(x^2+4)^100dx$$



The idea is that we see that this can be rewritten in some convenient form. Note that if we set $f(x) = x^2+4$, then $f'(x)=2x$. Then we are integrating:



$$int f'(x) (f(x))^100dx$$



Now, if we set $g(x)= x^100$ note that by composition we are integrating:



$$int g(f(x))f'(x)dx$$



Now, if $G(x)$ is a primitive of $g$ recall the chain rule, the integrand is just $(G(f(x)))'=G'(f(x))f'(x)$, so since the indefinite integral is a primitive, and the integrand is a derivative the result is simply:



$$int (G(f(x)))'dx = G(f(x))$$



Ande since $g(x) = x^100$ the obvious primitive is $G(x) = x^101/101$ and hence:



$$int 2x(x^2+4)^100dx = frac(x^2+4)^101101$$



We can "remember" that by saying that we set $u = x^2+4$ and $du=2xdx$, so is just a rule to remember how to find the formula that's just an application of the chain rule.



The second example is the chain rule itself. Usually people write:



$$fracdfdx = fracdfdu fracdudx$$



But look, on the lhs you are differentiating not $f$, but rather $fcirc u$. So, on the left $f$ means one thing, on the right it means another thing! So using this language of infinitesimals the wrong way arround may lead to confusions and may hide from you the true nature of what you are studying. The book I've used when I moved from the infinitesimals treatment to the rigorous one was Spivak's Calculus. Try it! He develop everything formally, without appeal to those "undefined" creatures and shows you where they appear just as ways to remember formulas.



I hope this helps you. Good luck!






share|cite|improve this answer






















  • Thank you so much for the fantastic post! It has helped me to understand this concept much more clearly.
    – hax0r_n_code
    May 25 '13 at 22:00

















up vote
2
down vote













To understand integration by substitution, you can just use the chain rule in reverse:
beginequation
int f(g (x)) g'(x) dx = F (g (x)) + C,
endequation
where $ F $ is an anti derivative of $ f $. To check this, just take the derivative of the right hand side using the chain rule.






share|cite|improve this answer



























    up vote
    0
    down vote













    $dx$ is what is known as a differential. It is an infinitesimally small interval of $x$:



    $$dx=lim_xtox_0x-x_0$$



    Using this definition, it is clear from the definition of the derivative why $fracdydx$ is the derivative of $y$ with respect to $x$:



    $$y'=f'(x)=fracdydx=lim_xtox_0fracf(x-x_0)-f(x_0)x-x_0$$



    When doing u-substitution, you are defining $u$ to be an expression dependent on $x$. $fracdudx$ is a fraction like any other, so to get $du$ you must multiply both sides of the equation by $dx$:



    $$u=x^2+4$$



    $$fracdudx=2x$$



    $$dxfracdudx=2xspacedx$$



    $$du=2xspacedx$$






    share|cite|improve this answer


















    • 2




      @nkon I get your problem now, I think. You see, another disgusting thing these people do is writing $y$ when they actually mean $y(x)$.
      – Git Gud
      May 25 '13 at 19:48






    • 1




      @nkon it means you're treating $y$ as if it were a function of $x$. It could just as easily be, for example, $fracdydn$ if y were a function of $n$.
      – Ataraxia
      May 25 '13 at 19:54






    • 1




      To think about $dx$ as infinitesimal you must use nonstandard analysis and define the hyperreals. Indeed, with your definition $dx = x$ because using standard analysis $lim_x_0 to 0 x-x_0 = x$.
      – user1620696
      May 25 '13 at 20:16






    • 1




      Even with this definition of $dx$ in standard analysis this is zero. The notion of "a nonzero number less than any other still" can only be made precise in nonstandard analysis.
      – user1620696
      May 25 '13 at 21:05






    • 4




      I would like to emphasize what @user1620696 said too - $dx=lim_xtox_0x-x_0$ is very much worse notation than just using the differential because it is defined, and it doesn't mean this. The expression on the right is identically 0 in any formulation.
      – Sharkos
      May 25 '13 at 23:13










    Your Answer




    StackExchange.ifUsing("editor", function ()
    return StackExchange.using("mathjaxEditing", function ()
    StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
    StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
    );
    );
    , "mathjax-editing");

    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "69"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    convertImagesToLinks: true,
    noModals: false,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    noCode: true, onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );








     

    draft saved


    draft discarded


















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f402303%2funderstanding-the-differential-dx-when-doing-u-substitution%23new-answer', 'question_page');

    );

    Post as a guest






























    4 Answers
    4






    active

    oldest

    votes








    4 Answers
    4






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes








    up vote
    11
    down vote



    accepted










    Okay, so this is a slightly tricksy problem, because at first people think "Oh, it's fine, I just multiply through by this thing and it's fine" but then they think "Hang on, we've never actually defined what we meant by this... it's just some shorthand trickery" - but then if they eventually do differential geometry you think "Aha! So it did make sense all along!"



    The main message I want you to take away is that things like $mathrm d x$ are actually well defined things called differential forms which you don't really need to understand in any detail at all to get how they work.



    The way they end up working in integration and changes of variable is roughly the following: they come together to make a volume form which just tells you how much volume a small range of your parameters corresponds to. (I say "come together" because if you are doing many integrations like in $int int f(x,y) ;mathrm d x mathrm d y$ then you get a bunch of the $mathrm d [...]$ things all together.) More precisely, remember how you can roughly define integration as a limit of a sum like
    $$int_a^b f(x) mathrm d x equiv lim_Ntoinftysum_n=1^N f(x_n) left(x_n-x_n-1right)$$
    where $x_0=a,x_N=b$ and the other points $x_i$ are chosen in between, such that all gaps $delta_n = x_n-x_n-1 to 0$ (say, uniformly) as $Nto infty$.
    Here, $delta_n = x_n-x_n-1$ is providing some measure of how important the bit of space between $x_n-1,x_n$ is in computing the integral. The $mathrm d x$ is what keeps track of that information.



    Suppose you then try $u=x^2$ or $x=sqrt u$. Then in general
    $$int_a^2^b^2 f(sqrt u) mathrm d u equiv lim_Ntoinftysum_N f(sqrtu_n) left(u_n-u_n-1right) = lim_Ntoinftysum_N f(x_n) left(x_n^2-x_n-1^2right)neq int f mathrm d x$$
    because the weight is different!



    But notice that $x_n^2-x_n-1^2 = (x_n-x_n-1)(x_n+x_n-1) approx (x_n-x_n-1)(2x_n)$ in the limit of fine spacing, so
    $$int_a^2^b^2 f(sqrt u) fracmathrm d u2x = int_a^b f(x) ; mathrm d x$$



    We're really analyzing the difference between the volumes of the little patches of space when we play with the differentials. The trick is to realize that in general, just like here, $mathrm d u = u'(x) mathrm dx$. In higher dimensional integrals, you will discover that the generalization to e.g.
    $$int f(x,y) ;mathrm d xmathrm d y = int f(u,v) ; J ; mathrm d umathrm d v$$
    where $u=u(x,y),v=v(x,y)$ involves a quantity $J$ called the Jacobian (determinant) which uses all the possible derivatives of $u,v$ with respect to $x,y$ in a particular way.



    The notation
    $$fracmathrm d umathrm d x = lim_textfine spacingfracdelta udelta x = lim_x_n-x_n-1to 0fracu_n-u_n-1x_n-x_n-1 = u'(x)$$
    is now seen to be just a suggestive notation which works for the case of only one variable changing. It's used because it makes it clear how the volume form should be replaced.




    When there are many variables, this notation breaks down because the factors are all mixed up together and people write partial derivatives, which you'll see soon if you haven't already, instead. It turns out that it makes sense to use a generalization of the
    $$mathrm d u = u'(x) mathrm d x$$
    law called the chain rule in which, for $u=u(x,y)$ for example
    $$mathrm d u = u_x mathrm d x + u_y mathrm d y$$
    where $u_x(x,y)$ is the derivative of $u$ with respect to $x$ when we just think of $y$ as a constant.



    You'll have to wait until differential geometry courses to see how to use this to get the Jacobian factor; it turns out that rather than just writing the forms together, you should technically define something called a wedge product such that $awedge b = -bwedge a$ for one-forms like $mathrm d x$; then you get
    $$mathrm d u wedge mathrm d v = (u_x mathrm d x + u_y mathrm d y)wedge(v_x mathrm d x + v_y mathrm d y) = (u_x v_y-u_y v_x) mathrm d x wedge mathrm d y$$
    so that the Jacobian is (one over) $(u_x v_y-u_y v_x)=det pmatrixu_x & u_y \ v_x & v_y$.



    You can get this result directly from thinking about little patches of volume, however, so you'll see this far earlier than any differential form stuff. I just thought that, since you were curious, you should have had the full story mentioned to you along the way.






    share|cite|improve this answer






















    • I was reading your answer on that other question which you referred this one to, and I was wondering ; where could I read about the construction of such a wedge product? I know about exterior algebras in the general setting (i.e. the quotient of the tensor algebra blablabla...) but I've never seen how it applies to differential geometry and I'd like to read about it. What do you suggest?
      – Patrick Da Silva
      Jun 11 '13 at 7:21










    • Absolutely any proper course on differential geometry or indeed general relativity (which is the same thing). It's a surprisingly simple formalism. Personally I learned it in a (fairly geometric) GR course - see damtp.cam.ac.uk/user/hsr1000 - for which you can skip the physics.
      – Sharkos
      Jun 11 '13 at 10:00










    • I did have a differential geometry course, but I guess it wasn't that much advanced... thanks though
      – Patrick Da Silva
      Jun 12 '13 at 1:14










    • @Sharkos You wrote $mathrm d u = u_x mathrm d x + u_y mathrm d x$. Is this supposed to be $mathrm d u = u_x mathrm d x + u_y mathrm d y$?
      – FreshAir
      Aug 14 '15 at 1:42







    • 1




      @FreshAir Yup, thanks, have fixed!
      – Sharkos
      Aug 17 '15 at 10:58














    up vote
    11
    down vote



    accepted










    Okay, so this is a slightly tricksy problem, because at first people think "Oh, it's fine, I just multiply through by this thing and it's fine" but then they think "Hang on, we've never actually defined what we meant by this... it's just some shorthand trickery" - but then if they eventually do differential geometry you think "Aha! So it did make sense all along!"



    The main message I want you to take away is that things like $mathrm d x$ are actually well defined things called differential forms which you don't really need to understand in any detail at all to get how they work.



    The way they end up working in integration and changes of variable is roughly the following: they come together to make a volume form which just tells you how much volume a small range of your parameters corresponds to. (I say "come together" because if you are doing many integrations like in $int int f(x,y) ;mathrm d x mathrm d y$ then you get a bunch of the $mathrm d [...]$ things all together.) More precisely, remember how you can roughly define integration as a limit of a sum like
    $$int_a^b f(x) mathrm d x equiv lim_Ntoinftysum_n=1^N f(x_n) left(x_n-x_n-1right)$$
    where $x_0=a,x_N=b$ and the other points $x_i$ are chosen in between, such that all gaps $delta_n = x_n-x_n-1 to 0$ (say, uniformly) as $Nto infty$.
    Here, $delta_n = x_n-x_n-1$ is providing some measure of how important the bit of space between $x_n-1,x_n$ is in computing the integral. The $mathrm d x$ is what keeps track of that information.



    Suppose you then try $u=x^2$ or $x=sqrt u$. Then in general
    $$int_a^2^b^2 f(sqrt u) mathrm d u equiv lim_Ntoinftysum_N f(sqrtu_n) left(u_n-u_n-1right) = lim_Ntoinftysum_N f(x_n) left(x_n^2-x_n-1^2right)neq int f mathrm d x$$
    because the weight is different!



    But notice that $x_n^2-x_n-1^2 = (x_n-x_n-1)(x_n+x_n-1) approx (x_n-x_n-1)(2x_n)$ in the limit of fine spacing, so
    $$int_a^2^b^2 f(sqrt u) fracmathrm d u2x = int_a^b f(x) ; mathrm d x$$



    We're really analyzing the difference between the volumes of the little patches of space when we play with the differentials. The trick is to realize that in general, just like here, $mathrm d u = u'(x) mathrm dx$. In higher dimensional integrals, you will discover that the generalization to e.g.
    $$int f(x,y) ;mathrm d xmathrm d y = int f(u,v) ; J ; mathrm d umathrm d v$$
    where $u=u(x,y),v=v(x,y)$ involves a quantity $J$ called the Jacobian (determinant) which uses all the possible derivatives of $u,v$ with respect to $x,y$ in a particular way.



    The notation
    $$fracmathrm d umathrm d x = lim_textfine spacingfracdelta udelta x = lim_x_n-x_n-1to 0fracu_n-u_n-1x_n-x_n-1 = u'(x)$$
    is now seen to be just a suggestive notation which works for the case of only one variable changing. It's used because it makes it clear how the volume form should be replaced.




    When there are many variables, this notation breaks down because the factors are all mixed up together and people write partial derivatives, which you'll see soon if you haven't already, instead. It turns out that it makes sense to use a generalization of the
    $$mathrm d u = u'(x) mathrm d x$$
    law called the chain rule in which, for $u=u(x,y)$ for example
    $$mathrm d u = u_x mathrm d x + u_y mathrm d y$$
    where $u_x(x,y)$ is the derivative of $u$ with respect to $x$ when we just think of $y$ as a constant.



    You'll have to wait until differential geometry courses to see how to use this to get the Jacobian factor; it turns out that rather than just writing the forms together, you should technically define something called a wedge product such that $awedge b = -bwedge a$ for one-forms like $mathrm d x$; then you get
    $$mathrm d u wedge mathrm d v = (u_x mathrm d x + u_y mathrm d y)wedge(v_x mathrm d x + v_y mathrm d y) = (u_x v_y-u_y v_x) mathrm d x wedge mathrm d y$$
    so that the Jacobian is (one over) $(u_x v_y-u_y v_x)=det pmatrixu_x & u_y \ v_x & v_y$.



    You can get this result directly from thinking about little patches of volume, however, so you'll see this far earlier than any differential form stuff. I just thought that, since you were curious, you should have had the full story mentioned to you along the way.






    share|cite|improve this answer






















    • I was reading your answer on that other question which you referred this one to, and I was wondering ; where could I read about the construction of such a wedge product? I know about exterior algebras in the general setting (i.e. the quotient of the tensor algebra blablabla...) but I've never seen how it applies to differential geometry and I'd like to read about it. What do you suggest?
      – Patrick Da Silva
      Jun 11 '13 at 7:21










    • Absolutely any proper course on differential geometry or indeed general relativity (which is the same thing). It's a surprisingly simple formalism. Personally I learned it in a (fairly geometric) GR course - see damtp.cam.ac.uk/user/hsr1000 - for which you can skip the physics.
      – Sharkos
      Jun 11 '13 at 10:00










    • I did have a differential geometry course, but I guess it wasn't that much advanced... thanks though
      – Patrick Da Silva
      Jun 12 '13 at 1:14










    • @Sharkos You wrote $mathrm d u = u_x mathrm d x + u_y mathrm d x$. Is this supposed to be $mathrm d u = u_x mathrm d x + u_y mathrm d y$?
      – FreshAir
      Aug 14 '15 at 1:42







    • 1




      @FreshAir Yup, thanks, have fixed!
      – Sharkos
      Aug 17 '15 at 10:58












    up vote
    11
    down vote



    accepted







    up vote
    11
    down vote



    accepted






    Okay, so this is a slightly tricksy problem, because at first people think "Oh, it's fine, I just multiply through by this thing and it's fine" but then they think "Hang on, we've never actually defined what we meant by this... it's just some shorthand trickery" - but then if they eventually do differential geometry you think "Aha! So it did make sense all along!"



    The main message I want you to take away is that things like $mathrm d x$ are actually well defined things called differential forms which you don't really need to understand in any detail at all to get how they work.



    The way they end up working in integration and changes of variable is roughly the following: they come together to make a volume form which just tells you how much volume a small range of your parameters corresponds to. (I say "come together" because if you are doing many integrations like in $int int f(x,y) ;mathrm d x mathrm d y$ then you get a bunch of the $mathrm d [...]$ things all together.) More precisely, remember how you can roughly define integration as a limit of a sum like
    $$int_a^b f(x) mathrm d x equiv lim_Ntoinftysum_n=1^N f(x_n) left(x_n-x_n-1right)$$
    where $x_0=a,x_N=b$ and the other points $x_i$ are chosen in between, such that all gaps $delta_n = x_n-x_n-1 to 0$ (say, uniformly) as $Nto infty$.
    Here, $delta_n = x_n-x_n-1$ is providing some measure of how important the bit of space between $x_n-1,x_n$ is in computing the integral. The $mathrm d x$ is what keeps track of that information.



    Suppose you then try $u=x^2$ or $x=sqrt u$. Then in general
    $$int_a^2^b^2 f(sqrt u) mathrm d u equiv lim_Ntoinftysum_N f(sqrtu_n) left(u_n-u_n-1right) = lim_Ntoinftysum_N f(x_n) left(x_n^2-x_n-1^2right)neq int f mathrm d x$$
    because the weight is different!



    But notice that $x_n^2-x_n-1^2 = (x_n-x_n-1)(x_n+x_n-1) approx (x_n-x_n-1)(2x_n)$ in the limit of fine spacing, so
    $$int_a^2^b^2 f(sqrt u) fracmathrm d u2x = int_a^b f(x) ; mathrm d x$$



    We're really analyzing the difference between the volumes of the little patches of space when we play with the differentials. The trick is to realize that in general, just like here, $mathrm d u = u'(x) mathrm dx$. In higher dimensional integrals, you will discover that the generalization to e.g.
    $$int f(x,y) ;mathrm d xmathrm d y = int f(u,v) ; J ; mathrm d umathrm d v$$
    where $u=u(x,y),v=v(x,y)$ involves a quantity $J$ called the Jacobian (determinant) which uses all the possible derivatives of $u,v$ with respect to $x,y$ in a particular way.



    The notation
    $$fracmathrm d umathrm d x = lim_textfine spacingfracdelta udelta x = lim_x_n-x_n-1to 0fracu_n-u_n-1x_n-x_n-1 = u'(x)$$
    is now seen to be just a suggestive notation which works for the case of only one variable changing. It's used because it makes it clear how the volume form should be replaced.




    When there are many variables, this notation breaks down because the factors are all mixed up together and people write partial derivatives, which you'll see soon if you haven't already, instead. It turns out that it makes sense to use a generalization of the
    $$mathrm d u = u'(x) mathrm d x$$
    law called the chain rule in which, for $u=u(x,y)$ for example
    $$mathrm d u = u_x mathrm d x + u_y mathrm d y$$
    where $u_x(x,y)$ is the derivative of $u$ with respect to $x$ when we just think of $y$ as a constant.



    You'll have to wait until differential geometry courses to see how to use this to get the Jacobian factor; it turns out that rather than just writing the forms together, you should technically define something called a wedge product such that $awedge b = -bwedge a$ for one-forms like $mathrm d x$; then you get
    $$mathrm d u wedge mathrm d v = (u_x mathrm d x + u_y mathrm d y)wedge(v_x mathrm d x + v_y mathrm d y) = (u_x v_y-u_y v_x) mathrm d x wedge mathrm d y$$
    so that the Jacobian is (one over) $(u_x v_y-u_y v_x)=det pmatrixu_x & u_y \ v_x & v_y$.



    You can get this result directly from thinking about little patches of volume, however, so you'll see this far earlier than any differential form stuff. I just thought that, since you were curious, you should have had the full story mentioned to you along the way.






    share|cite|improve this answer














    Okay, so this is a slightly tricksy problem, because at first people think "Oh, it's fine, I just multiply through by this thing and it's fine" but then they think "Hang on, we've never actually defined what we meant by this... it's just some shorthand trickery" - but then if they eventually do differential geometry you think "Aha! So it did make sense all along!"



    The main message I want you to take away is that things like $mathrm d x$ are actually well defined things called differential forms which you don't really need to understand in any detail at all to get how they work.



    The way they end up working in integration and changes of variable is roughly the following: they come together to make a volume form which just tells you how much volume a small range of your parameters corresponds to. (I say "come together" because if you are doing many integrations like in $int int f(x,y) ;mathrm d x mathrm d y$ then you get a bunch of the $mathrm d [...]$ things all together.) More precisely, remember how you can roughly define integration as a limit of a sum like
    $$int_a^b f(x) mathrm d x equiv lim_Ntoinftysum_n=1^N f(x_n) left(x_n-x_n-1right)$$
    where $x_0=a,x_N=b$ and the other points $x_i$ are chosen in between, such that all gaps $delta_n = x_n-x_n-1 to 0$ (say, uniformly) as $Nto infty$.
    Here, $delta_n = x_n-x_n-1$ is providing some measure of how important the bit of space between $x_n-1,x_n$ is in computing the integral. The $mathrm d x$ is what keeps track of that information.



    Suppose you then try $u=x^2$ or $x=sqrt u$. Then in general
    $$int_a^2^b^2 f(sqrt u) mathrm d u equiv lim_Ntoinftysum_N f(sqrtu_n) left(u_n-u_n-1right) = lim_Ntoinftysum_N f(x_n) left(x_n^2-x_n-1^2right)neq int f mathrm d x$$
    because the weight is different!



    But notice that $x_n^2-x_n-1^2 = (x_n-x_n-1)(x_n+x_n-1) approx (x_n-x_n-1)(2x_n)$ in the limit of fine spacing, so
    $$int_a^2^b^2 f(sqrt u) fracmathrm d u2x = int_a^b f(x) ; mathrm d x$$



    We're really analyzing the difference between the volumes of the little patches of space when we play with the differentials. The trick is to realize that in general, just like here, $mathrm d u = u'(x) mathrm dx$. In higher dimensional integrals, you will discover that the generalization to e.g.
    $$int f(x,y) ;mathrm d xmathrm d y = int f(u,v) ; J ; mathrm d umathrm d v$$
    where $u=u(x,y),v=v(x,y)$ involves a quantity $J$ called the Jacobian (determinant) which uses all the possible derivatives of $u,v$ with respect to $x,y$ in a particular way.



    The notation
    $$fracmathrm d umathrm d x = lim_textfine spacingfracdelta udelta x = lim_x_n-x_n-1to 0fracu_n-u_n-1x_n-x_n-1 = u'(x)$$
    is now seen to be just a suggestive notation which works for the case of only one variable changing. It's used because it makes it clear how the volume form should be replaced.




    When there are many variables, this notation breaks down because the factors are all mixed up together and people write partial derivatives, which you'll see soon if you haven't already, instead. It turns out that it makes sense to use a generalization of the
    $$mathrm d u = u'(x) mathrm d x$$
    law called the chain rule in which, for $u=u(x,y)$ for example
    $$mathrm d u = u_x mathrm d x + u_y mathrm d y$$
    where $u_x(x,y)$ is the derivative of $u$ with respect to $x$ when we just think of $y$ as a constant.



    You'll have to wait until differential geometry courses to see how to use this to get the Jacobian factor; it turns out that rather than just writing the forms together, you should technically define something called a wedge product such that $awedge b = -bwedge a$ for one-forms like $mathrm d x$; then you get
    $$mathrm d u wedge mathrm d v = (u_x mathrm d x + u_y mathrm d y)wedge(v_x mathrm d x + v_y mathrm d y) = (u_x v_y-u_y v_x) mathrm d x wedge mathrm d y$$
    so that the Jacobian is (one over) $(u_x v_y-u_y v_x)=det pmatrixu_x & u_y \ v_x & v_y$.



    You can get this result directly from thinking about little patches of volume, however, so you'll see this far earlier than any differential form stuff. I just thought that, since you were curious, you should have had the full story mentioned to you along the way.







    share|cite|improve this answer














    share|cite|improve this answer



    share|cite|improve this answer








    edited Aug 8 at 13:43

























    answered May 25 '13 at 20:30









    Sharkos

    13k22148




    13k22148











    • I was reading your answer on that other question which you referred this one to, and I was wondering ; where could I read about the construction of such a wedge product? I know about exterior algebras in the general setting (i.e. the quotient of the tensor algebra blablabla...) but I've never seen how it applies to differential geometry and I'd like to read about it. What do you suggest?
      – Patrick Da Silva
      Jun 11 '13 at 7:21










    • Absolutely any proper course on differential geometry or indeed general relativity (which is the same thing). It's a surprisingly simple formalism. Personally I learned it in a (fairly geometric) GR course - see damtp.cam.ac.uk/user/hsr1000 - for which you can skip the physics.
      – Sharkos
      Jun 11 '13 at 10:00










    • I did have a differential geometry course, but I guess it wasn't that much advanced... thanks though
      – Patrick Da Silva
      Jun 12 '13 at 1:14










    • @Sharkos You wrote $mathrm d u = u_x mathrm d x + u_y mathrm d x$. Is this supposed to be $mathrm d u = u_x mathrm d x + u_y mathrm d y$?
      – FreshAir
      Aug 14 '15 at 1:42







    • 1




      @FreshAir Yup, thanks, have fixed!
      – Sharkos
      Aug 17 '15 at 10:58
















    • I was reading your answer on that other question which you referred this one to, and I was wondering ; where could I read about the construction of such a wedge product? I know about exterior algebras in the general setting (i.e. the quotient of the tensor algebra blablabla...) but I've never seen how it applies to differential geometry and I'd like to read about it. What do you suggest?
      – Patrick Da Silva
      Jun 11 '13 at 7:21










    • Absolutely any proper course on differential geometry or indeed general relativity (which is the same thing). It's a surprisingly simple formalism. Personally I learned it in a (fairly geometric) GR course - see damtp.cam.ac.uk/user/hsr1000 - for which you can skip the physics.
      – Sharkos
      Jun 11 '13 at 10:00










    • I did have a differential geometry course, but I guess it wasn't that much advanced... thanks though
      – Patrick Da Silva
      Jun 12 '13 at 1:14










    • @Sharkos You wrote $mathrm d u = u_x mathrm d x + u_y mathrm d x$. Is this supposed to be $mathrm d u = u_x mathrm d x + u_y mathrm d y$?
      – FreshAir
      Aug 14 '15 at 1:42







    • 1




      @FreshAir Yup, thanks, have fixed!
      – Sharkos
      Aug 17 '15 at 10:58















    I was reading your answer on that other question which you referred this one to, and I was wondering ; where could I read about the construction of such a wedge product? I know about exterior algebras in the general setting (i.e. the quotient of the tensor algebra blablabla...) but I've never seen how it applies to differential geometry and I'd like to read about it. What do you suggest?
    – Patrick Da Silva
    Jun 11 '13 at 7:21




    I was reading your answer on that other question which you referred this one to, and I was wondering ; where could I read about the construction of such a wedge product? I know about exterior algebras in the general setting (i.e. the quotient of the tensor algebra blablabla...) but I've never seen how it applies to differential geometry and I'd like to read about it. What do you suggest?
    – Patrick Da Silva
    Jun 11 '13 at 7:21












    Absolutely any proper course on differential geometry or indeed general relativity (which is the same thing). It's a surprisingly simple formalism. Personally I learned it in a (fairly geometric) GR course - see damtp.cam.ac.uk/user/hsr1000 - for which you can skip the physics.
    – Sharkos
    Jun 11 '13 at 10:00




    Absolutely any proper course on differential geometry or indeed general relativity (which is the same thing). It's a surprisingly simple formalism. Personally I learned it in a (fairly geometric) GR course - see damtp.cam.ac.uk/user/hsr1000 - for which you can skip the physics.
    – Sharkos
    Jun 11 '13 at 10:00












    I did have a differential geometry course, but I guess it wasn't that much advanced... thanks though
    – Patrick Da Silva
    Jun 12 '13 at 1:14




    I did have a differential geometry course, but I guess it wasn't that much advanced... thanks though
    – Patrick Da Silva
    Jun 12 '13 at 1:14












    @Sharkos You wrote $mathrm d u = u_x mathrm d x + u_y mathrm d x$. Is this supposed to be $mathrm d u = u_x mathrm d x + u_y mathrm d y$?
    – FreshAir
    Aug 14 '15 at 1:42





    @Sharkos You wrote $mathrm d u = u_x mathrm d x + u_y mathrm d x$. Is this supposed to be $mathrm d u = u_x mathrm d x + u_y mathrm d y$?
    – FreshAir
    Aug 14 '15 at 1:42





    1




    1




    @FreshAir Yup, thanks, have fixed!
    – Sharkos
    Aug 17 '15 at 10:58




    @FreshAir Yup, thanks, have fixed!
    – Sharkos
    Aug 17 '15 at 10:58










    up vote
    6
    down vote













    The use of infinitesimals can only be formalized an analysed with care when working with non-standard analysis. When people first study calculus and the most elementary books define $dx$ as $Delta x$ approaching zero, usually people get confused think "but this should be zero", and in standard analysis it'll really be.



    Things are much easier than this, however, we just have to throw away those $dx$ and $dy$. Why? Simply because modern mathematics adhered to methods that are more sophisticated, simpler, and because if you are going to proceed in mathematics you'll really need those modern methods when studying analysis or differential geometry (where $dx$ receives a true definition and gains a fundamental role).



    Now you might ask the exact same question I've asked when my I first encountered the rigorous framework: "this guy is mad! My book talks about infinitesimals, my teacher told me it's all right, he must be mad", but it's not like that. I'll show you two examples: the first one is meant to show you that when working with integrals those things appear just as mnemonic rules that allows you to remember the true formula easier when you're starting. The second is to show you when things become confusing when using infinitesimals without care.



    First, consider your function:



    $$int2x(x^2+4)^100dx$$



    The idea is that we see that this can be rewritten in some convenient form. Note that if we set $f(x) = x^2+4$, then $f'(x)=2x$. Then we are integrating:



    $$int f'(x) (f(x))^100dx$$



    Now, if we set $g(x)= x^100$ note that by composition we are integrating:



    $$int g(f(x))f'(x)dx$$



    Now, if $G(x)$ is a primitive of $g$ recall the chain rule, the integrand is just $(G(f(x)))'=G'(f(x))f'(x)$, so since the indefinite integral is a primitive, and the integrand is a derivative the result is simply:



    $$int (G(f(x)))'dx = G(f(x))$$



    Ande since $g(x) = x^100$ the obvious primitive is $G(x) = x^101/101$ and hence:



    $$int 2x(x^2+4)^100dx = frac(x^2+4)^101101$$



    We can "remember" that by saying that we set $u = x^2+4$ and $du=2xdx$, so is just a rule to remember how to find the formula that's just an application of the chain rule.



    The second example is the chain rule itself. Usually people write:



    $$fracdfdx = fracdfdu fracdudx$$



    But look, on the lhs you are differentiating not $f$, but rather $fcirc u$. So, on the left $f$ means one thing, on the right it means another thing! So using this language of infinitesimals the wrong way arround may lead to confusions and may hide from you the true nature of what you are studying. The book I've used when I moved from the infinitesimals treatment to the rigorous one was Spivak's Calculus. Try it! He develop everything formally, without appeal to those "undefined" creatures and shows you where they appear just as ways to remember formulas.



    I hope this helps you. Good luck!






    share|cite|improve this answer






















    • Thank you so much for the fantastic post! It has helped me to understand this concept much more clearly.
      – hax0r_n_code
      May 25 '13 at 22:00














    up vote
    6
    down vote













    The use of infinitesimals can only be formalized an analysed with care when working with non-standard analysis. When people first study calculus and the most elementary books define $dx$ as $Delta x$ approaching zero, usually people get confused think "but this should be zero", and in standard analysis it'll really be.



    Things are much easier than this, however, we just have to throw away those $dx$ and $dy$. Why? Simply because modern mathematics adhered to methods that are more sophisticated, simpler, and because if you are going to proceed in mathematics you'll really need those modern methods when studying analysis or differential geometry (where $dx$ receives a true definition and gains a fundamental role).



    Now you might ask the exact same question I've asked when my I first encountered the rigorous framework: "this guy is mad! My book talks about infinitesimals, my teacher told me it's all right, he must be mad", but it's not like that. I'll show you two examples: the first one is meant to show you that when working with integrals those things appear just as mnemonic rules that allows you to remember the true formula easier when you're starting. The second is to show you when things become confusing when using infinitesimals without care.



    First, consider your function:



    $$int2x(x^2+4)^100dx$$



    The idea is that we see that this can be rewritten in some convenient form. Note that if we set $f(x) = x^2+4$, then $f'(x)=2x$. Then we are integrating:



    $$int f'(x) (f(x))^100dx$$



    Now, if we set $g(x)= x^100$ note that by composition we are integrating:



    $$int g(f(x))f'(x)dx$$



    Now, if $G(x)$ is a primitive of $g$ recall the chain rule, the integrand is just $(G(f(x)))'=G'(f(x))f'(x)$, so since the indefinite integral is a primitive, and the integrand is a derivative the result is simply:



    $$int (G(f(x)))'dx = G(f(x))$$



    Ande since $g(x) = x^100$ the obvious primitive is $G(x) = x^101/101$ and hence:



    $$int 2x(x^2+4)^100dx = frac(x^2+4)^101101$$



    We can "remember" that by saying that we set $u = x^2+4$ and $du=2xdx$, so is just a rule to remember how to find the formula that's just an application of the chain rule.



    The second example is the chain rule itself. Usually people write:



    $$fracdfdx = fracdfdu fracdudx$$



    But look, on the lhs you are differentiating not $f$, but rather $fcirc u$. So, on the left $f$ means one thing, on the right it means another thing! So using this language of infinitesimals the wrong way arround may lead to confusions and may hide from you the true nature of what you are studying. The book I've used when I moved from the infinitesimals treatment to the rigorous one was Spivak's Calculus. Try it! He develop everything formally, without appeal to those "undefined" creatures and shows you where they appear just as ways to remember formulas.



    I hope this helps you. Good luck!






    share|cite|improve this answer






















    • Thank you so much for the fantastic post! It has helped me to understand this concept much more clearly.
      – hax0r_n_code
      May 25 '13 at 22:00












    up vote
    6
    down vote










    up vote
    6
    down vote









    The use of infinitesimals can only be formalized an analysed with care when working with non-standard analysis. When people first study calculus and the most elementary books define $dx$ as $Delta x$ approaching zero, usually people get confused think "but this should be zero", and in standard analysis it'll really be.



    Things are much easier than this, however, we just have to throw away those $dx$ and $dy$. Why? Simply because modern mathematics adhered to methods that are more sophisticated, simpler, and because if you are going to proceed in mathematics you'll really need those modern methods when studying analysis or differential geometry (where $dx$ receives a true definition and gains a fundamental role).



    Now you might ask the exact same question I've asked when my I first encountered the rigorous framework: "this guy is mad! My book talks about infinitesimals, my teacher told me it's all right, he must be mad", but it's not like that. I'll show you two examples: the first one is meant to show you that when working with integrals those things appear just as mnemonic rules that allows you to remember the true formula easier when you're starting. The second is to show you when things become confusing when using infinitesimals without care.



    First, consider your function:



    $$int2x(x^2+4)^100dx$$



    The idea is that we see that this can be rewritten in some convenient form. Note that if we set $f(x) = x^2+4$, then $f'(x)=2x$. Then we are integrating:



    $$int f'(x) (f(x))^100dx$$



    Now, if we set $g(x)= x^100$ note that by composition we are integrating:



    $$int g(f(x))f'(x)dx$$



    Now, if $G(x)$ is a primitive of $g$ recall the chain rule, the integrand is just $(G(f(x)))'=G'(f(x))f'(x)$, so since the indefinite integral is a primitive, and the integrand is a derivative the result is simply:



    $$int (G(f(x)))'dx = G(f(x))$$



    Ande since $g(x) = x^100$ the obvious primitive is $G(x) = x^101/101$ and hence:



    $$int 2x(x^2+4)^100dx = frac(x^2+4)^101101$$



    We can "remember" that by saying that we set $u = x^2+4$ and $du=2xdx$, so is just a rule to remember how to find the formula that's just an application of the chain rule.



    The second example is the chain rule itself. Usually people write:



    $$fracdfdx = fracdfdu fracdudx$$



    But look, on the lhs you are differentiating not $f$, but rather $fcirc u$. So, on the left $f$ means one thing, on the right it means another thing! So using this language of infinitesimals the wrong way arround may lead to confusions and may hide from you the true nature of what you are studying. The book I've used when I moved from the infinitesimals treatment to the rigorous one was Spivak's Calculus. Try it! He develop everything formally, without appeal to those "undefined" creatures and shows you where they appear just as ways to remember formulas.



    I hope this helps you. Good luck!






    share|cite|improve this answer














    The use of infinitesimals can only be formalized an analysed with care when working with non-standard analysis. When people first study calculus and the most elementary books define $dx$ as $Delta x$ approaching zero, usually people get confused think "but this should be zero", and in standard analysis it'll really be.



    Things are much easier than this, however, we just have to throw away those $dx$ and $dy$. Why? Simply because modern mathematics adhered to methods that are more sophisticated, simpler, and because if you are going to proceed in mathematics you'll really need those modern methods when studying analysis or differential geometry (where $dx$ receives a true definition and gains a fundamental role).



    Now you might ask the exact same question I've asked when my I first encountered the rigorous framework: "this guy is mad! My book talks about infinitesimals, my teacher told me it's all right, he must be mad", but it's not like that. I'll show you two examples: the first one is meant to show you that when working with integrals those things appear just as mnemonic rules that allows you to remember the true formula easier when you're starting. The second is to show you when things become confusing when using infinitesimals without care.



    First, consider your function:



    $$int2x(x^2+4)^100dx$$



    The idea is that we see that this can be rewritten in some convenient form. Note that if we set $f(x) = x^2+4$, then $f'(x)=2x$. Then we are integrating:



    $$int f'(x) (f(x))^100dx$$



    Now, if we set $g(x)= x^100$ note that by composition we are integrating:



    $$int g(f(x))f'(x)dx$$



    Now, if $G(x)$ is a primitive of $g$ recall the chain rule, the integrand is just $(G(f(x)))'=G'(f(x))f'(x)$, so since the indefinite integral is a primitive, and the integrand is a derivative the result is simply:



    $$int (G(f(x)))'dx = G(f(x))$$



    Ande since $g(x) = x^100$ the obvious primitive is $G(x) = x^101/101$ and hence:



    $$int 2x(x^2+4)^100dx = frac(x^2+4)^101101$$



    We can "remember" that by saying that we set $u = x^2+4$ and $du=2xdx$, so is just a rule to remember how to find the formula that's just an application of the chain rule.



    The second example is the chain rule itself. Usually people write:



    $$fracdfdx = fracdfdu fracdudx$$



    But look, on the lhs you are differentiating not $f$, but rather $fcirc u$. So, on the left $f$ means one thing, on the right it means another thing! So using this language of infinitesimals the wrong way arround may lead to confusions and may hide from you the true nature of what you are studying. The book I've used when I moved from the infinitesimals treatment to the rigorous one was Spivak's Calculus. Try it! He develop everything formally, without appeal to those "undefined" creatures and shows you where they appear just as ways to remember formulas.



    I hope this helps you. Good luck!







    share|cite|improve this answer














    share|cite|improve this answer



    share|cite|improve this answer








    edited May 25 '13 at 22:05

























    answered May 25 '13 at 20:38









    user1620696

    11k337103




    11k337103











    • Thank you so much for the fantastic post! It has helped me to understand this concept much more clearly.
      – hax0r_n_code
      May 25 '13 at 22:00
















    • Thank you so much for the fantastic post! It has helped me to understand this concept much more clearly.
      – hax0r_n_code
      May 25 '13 at 22:00















    Thank you so much for the fantastic post! It has helped me to understand this concept much more clearly.
    – hax0r_n_code
    May 25 '13 at 22:00




    Thank you so much for the fantastic post! It has helped me to understand this concept much more clearly.
    – hax0r_n_code
    May 25 '13 at 22:00










    up vote
    2
    down vote













    To understand integration by substitution, you can just use the chain rule in reverse:
    beginequation
    int f(g (x)) g'(x) dx = F (g (x)) + C,
    endequation
    where $ F $ is an anti derivative of $ f $. To check this, just take the derivative of the right hand side using the chain rule.






    share|cite|improve this answer
























      up vote
      2
      down vote













      To understand integration by substitution, you can just use the chain rule in reverse:
      beginequation
      int f(g (x)) g'(x) dx = F (g (x)) + C,
      endequation
      where $ F $ is an anti derivative of $ f $. To check this, just take the derivative of the right hand side using the chain rule.






      share|cite|improve this answer






















        up vote
        2
        down vote










        up vote
        2
        down vote









        To understand integration by substitution, you can just use the chain rule in reverse:
        beginequation
        int f(g (x)) g'(x) dx = F (g (x)) + C,
        endequation
        where $ F $ is an anti derivative of $ f $. To check this, just take the derivative of the right hand side using the chain rule.






        share|cite|improve this answer












        To understand integration by substitution, you can just use the chain rule in reverse:
        beginequation
        int f(g (x)) g'(x) dx = F (g (x)) + C,
        endequation
        where $ F $ is an anti derivative of $ f $. To check this, just take the derivative of the right hand side using the chain rule.







        share|cite|improve this answer












        share|cite|improve this answer



        share|cite|improve this answer










        answered Aug 17 '15 at 11:33









        littleO

        26.2k540102




        26.2k540102




















            up vote
            0
            down vote













            $dx$ is what is known as a differential. It is an infinitesimally small interval of $x$:



            $$dx=lim_xtox_0x-x_0$$



            Using this definition, it is clear from the definition of the derivative why $fracdydx$ is the derivative of $y$ with respect to $x$:



            $$y'=f'(x)=fracdydx=lim_xtox_0fracf(x-x_0)-f(x_0)x-x_0$$



            When doing u-substitution, you are defining $u$ to be an expression dependent on $x$. $fracdudx$ is a fraction like any other, so to get $du$ you must multiply both sides of the equation by $dx$:



            $$u=x^2+4$$



            $$fracdudx=2x$$



            $$dxfracdudx=2xspacedx$$



            $$du=2xspacedx$$






            share|cite|improve this answer


















            • 2




              @nkon I get your problem now, I think. You see, another disgusting thing these people do is writing $y$ when they actually mean $y(x)$.
              – Git Gud
              May 25 '13 at 19:48






            • 1




              @nkon it means you're treating $y$ as if it were a function of $x$. It could just as easily be, for example, $fracdydn$ if y were a function of $n$.
              – Ataraxia
              May 25 '13 at 19:54






            • 1




              To think about $dx$ as infinitesimal you must use nonstandard analysis and define the hyperreals. Indeed, with your definition $dx = x$ because using standard analysis $lim_x_0 to 0 x-x_0 = x$.
              – user1620696
              May 25 '13 at 20:16






            • 1




              Even with this definition of $dx$ in standard analysis this is zero. The notion of "a nonzero number less than any other still" can only be made precise in nonstandard analysis.
              – user1620696
              May 25 '13 at 21:05






            • 4




              I would like to emphasize what @user1620696 said too - $dx=lim_xtox_0x-x_0$ is very much worse notation than just using the differential because it is defined, and it doesn't mean this. The expression on the right is identically 0 in any formulation.
              – Sharkos
              May 25 '13 at 23:13














            up vote
            0
            down vote













            $dx$ is what is known as a differential. It is an infinitesimally small interval of $x$:



            $$dx=lim_xtox_0x-x_0$$



            Using this definition, it is clear from the definition of the derivative why $fracdydx$ is the derivative of $y$ with respect to $x$:



            $$y'=f'(x)=fracdydx=lim_xtox_0fracf(x-x_0)-f(x_0)x-x_0$$



            When doing u-substitution, you are defining $u$ to be an expression dependent on $x$. $fracdudx$ is a fraction like any other, so to get $du$ you must multiply both sides of the equation by $dx$:



            $$u=x^2+4$$



            $$fracdudx=2x$$



            $$dxfracdudx=2xspacedx$$



            $$du=2xspacedx$$






            share|cite|improve this answer


















            • 2




              @nkon I get your problem now, I think. You see, another disgusting thing these people do is writing $y$ when they actually mean $y(x)$.
              – Git Gud
              May 25 '13 at 19:48






            • 1




              @nkon it means you're treating $y$ as if it were a function of $x$. It could just as easily be, for example, $fracdydn$ if y were a function of $n$.
              – Ataraxia
              May 25 '13 at 19:54






            • 1




              To think about $dx$ as infinitesimal you must use nonstandard analysis and define the hyperreals. Indeed, with your definition $dx = x$ because using standard analysis $lim_x_0 to 0 x-x_0 = x$.
              – user1620696
              May 25 '13 at 20:16






            • 1




              Even with this definition of $dx$ in standard analysis this is zero. The notion of "a nonzero number less than any other still" can only be made precise in nonstandard analysis.
              – user1620696
              May 25 '13 at 21:05






            • 4




              I would like to emphasize what @user1620696 said too - $dx=lim_xtox_0x-x_0$ is very much worse notation than just using the differential because it is defined, and it doesn't mean this. The expression on the right is identically 0 in any formulation.
              – Sharkos
              May 25 '13 at 23:13












            up vote
            0
            down vote










            up vote
            0
            down vote









            $dx$ is what is known as a differential. It is an infinitesimally small interval of $x$:



            $$dx=lim_xtox_0x-x_0$$



            Using this definition, it is clear from the definition of the derivative why $fracdydx$ is the derivative of $y$ with respect to $x$:



            $$y'=f'(x)=fracdydx=lim_xtox_0fracf(x-x_0)-f(x_0)x-x_0$$



            When doing u-substitution, you are defining $u$ to be an expression dependent on $x$. $fracdudx$ is a fraction like any other, so to get $du$ you must multiply both sides of the equation by $dx$:



            $$u=x^2+4$$



            $$fracdudx=2x$$



            $$dxfracdudx=2xspacedx$$



            $$du=2xspacedx$$






            share|cite|improve this answer














            $dx$ is what is known as a differential. It is an infinitesimally small interval of $x$:



            $$dx=lim_xtox_0x-x_0$$



            Using this definition, it is clear from the definition of the derivative why $fracdydx$ is the derivative of $y$ with respect to $x$:



            $$y'=f'(x)=fracdydx=lim_xtox_0fracf(x-x_0)-f(x_0)x-x_0$$



            When doing u-substitution, you are defining $u$ to be an expression dependent on $x$. $fracdudx$ is a fraction like any other, so to get $du$ you must multiply both sides of the equation by $dx$:



            $$u=x^2+4$$



            $$fracdudx=2x$$



            $$dxfracdudx=2xspacedx$$



            $$du=2xspacedx$$







            share|cite|improve this answer














            share|cite|improve this answer



            share|cite|improve this answer








            edited May 25 '13 at 20:51

























            answered May 25 '13 at 19:42









            Ataraxia

            4,77321647




            4,77321647







            • 2




              @nkon I get your problem now, I think. You see, another disgusting thing these people do is writing $y$ when they actually mean $y(x)$.
              – Git Gud
              May 25 '13 at 19:48






            • 1




              @nkon it means you're treating $y$ as if it were a function of $x$. It could just as easily be, for example, $fracdydn$ if y were a function of $n$.
              – Ataraxia
              May 25 '13 at 19:54






            • 1




              To think about $dx$ as infinitesimal you must use nonstandard analysis and define the hyperreals. Indeed, with your definition $dx = x$ because using standard analysis $lim_x_0 to 0 x-x_0 = x$.
              – user1620696
              May 25 '13 at 20:16






            • 1




              Even with this definition of $dx$ in standard analysis this is zero. The notion of "a nonzero number less than any other still" can only be made precise in nonstandard analysis.
              – user1620696
              May 25 '13 at 21:05






            • 4




              I would like to emphasize what @user1620696 said too - $dx=lim_xtox_0x-x_0$ is very much worse notation than just using the differential because it is defined, and it doesn't mean this. The expression on the right is identically 0 in any formulation.
              – Sharkos
              May 25 '13 at 23:13












            • 2




              @nkon I get your problem now, I think. You see, another disgusting thing these people do is writing $y$ when they actually mean $y(x)$.
              – Git Gud
              May 25 '13 at 19:48






            • 1




              @nkon it means you're treating $y$ as if it were a function of $x$. It could just as easily be, for example, $fracdydn$ if y were a function of $n$.
              – Ataraxia
              May 25 '13 at 19:54






            • 1




              To think about $dx$ as infinitesimal you must use nonstandard analysis and define the hyperreals. Indeed, with your definition $dx = x$ because using standard analysis $lim_x_0 to 0 x-x_0 = x$.
              – user1620696
              May 25 '13 at 20:16






            • 1




              Even with this definition of $dx$ in standard analysis this is zero. The notion of "a nonzero number less than any other still" can only be made precise in nonstandard analysis.
              – user1620696
              May 25 '13 at 21:05






            • 4




              I would like to emphasize what @user1620696 said too - $dx=lim_xtox_0x-x_0$ is very much worse notation than just using the differential because it is defined, and it doesn't mean this. The expression on the right is identically 0 in any formulation.
              – Sharkos
              May 25 '13 at 23:13







            2




            2




            @nkon I get your problem now, I think. You see, another disgusting thing these people do is writing $y$ when they actually mean $y(x)$.
            – Git Gud
            May 25 '13 at 19:48




            @nkon I get your problem now, I think. You see, another disgusting thing these people do is writing $y$ when they actually mean $y(x)$.
            – Git Gud
            May 25 '13 at 19:48




            1




            1




            @nkon it means you're treating $y$ as if it were a function of $x$. It could just as easily be, for example, $fracdydn$ if y were a function of $n$.
            – Ataraxia
            May 25 '13 at 19:54




            @nkon it means you're treating $y$ as if it were a function of $x$. It could just as easily be, for example, $fracdydn$ if y were a function of $n$.
            – Ataraxia
            May 25 '13 at 19:54




            1




            1




            To think about $dx$ as infinitesimal you must use nonstandard analysis and define the hyperreals. Indeed, with your definition $dx = x$ because using standard analysis $lim_x_0 to 0 x-x_0 = x$.
            – user1620696
            May 25 '13 at 20:16




            To think about $dx$ as infinitesimal you must use nonstandard analysis and define the hyperreals. Indeed, with your definition $dx = x$ because using standard analysis $lim_x_0 to 0 x-x_0 = x$.
            – user1620696
            May 25 '13 at 20:16




            1




            1




            Even with this definition of $dx$ in standard analysis this is zero. The notion of "a nonzero number less than any other still" can only be made precise in nonstandard analysis.
            – user1620696
            May 25 '13 at 21:05




            Even with this definition of $dx$ in standard analysis this is zero. The notion of "a nonzero number less than any other still" can only be made precise in nonstandard analysis.
            – user1620696
            May 25 '13 at 21:05




            4




            4




            I would like to emphasize what @user1620696 said too - $dx=lim_xtox_0x-x_0$ is very much worse notation than just using the differential because it is defined, and it doesn't mean this. The expression on the right is identically 0 in any formulation.
            – Sharkos
            May 25 '13 at 23:13




            I would like to emphasize what @user1620696 said too - $dx=lim_xtox_0x-x_0$ is very much worse notation than just using the differential because it is defined, and it doesn't mean this. The expression on the right is identically 0 in any formulation.
            – Sharkos
            May 25 '13 at 23:13












             

            draft saved


            draft discarded


























             


            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f402303%2funderstanding-the-differential-dx-when-doing-u-substitution%23new-answer', 'question_page');

            );

            Post as a guest













































































            這個網誌中的熱門文章

            tkz-euclide: tkzDrawCircle[R] not working

            How to combine Bézier curves to a surface?

            1st Magritte Awards