Professional Documents
Culture Documents
Section 1.1 - Why Study Calculus Section 1.2 - Why Not to Study Calculus
However, the code of ethics and morals we live by turn out to be nothing more than a set of ironic contradictions. We find ourselves surrounded by a sea of chaos. Life then carries us through this sea of confusion, never stopping anywhere to drop us at a destination. Life appears futile with no meaning in sight. Is man confined to this helpless state of existence? To be honest, no one knows the answer. We can, however, look for hope in the mind. The power of thought, reason and logic are what allow man to seize control of life and ultimately serve as a guide to enlightenment and wisdom. Aristotle felt that happiness could only be achieved by cultivating the mind. He wrote: Now the peculiar excellence of man is his power of thought; it is by this faculty that he surpasses and rules other forms of life; and as the growth of this faculty has given him supremacy, so, we may presume, its development will give him fulfillment and happiness. This leads to the question, How do we define reason?. This is synonymous with asking how do thoughts define behavior? As a child you had little control over your actions. There did not seem to be any purpose behind what you did or what happened to you. Life carried you along, giving you the freedom to enjoy the ride. The idea of free-will offers a more plausible explanation of human actions. It says that man has the freedom of choice to decide on a certain course of action. Actions are the results of choices; choices which we are free to make. The branch of philosophy known as determinism takes free-will a step further to explain why man chooses to act in one way over another. When man is confronted with a situation, the line of action he decided on taking is based on experience, personal interests and preferences. The decisions we make are thus entirely influenced by our past experiences. The will is not free to behave on its own. For example my decision to write this book was not based on some impulsive instinct but on a culmination of circumstances I was exposed to and reacted against. Any action from the most random to the most perverse can be explained by the set of situations, experiences, thoughts and feelings that preceded that action. Essentially determinism says that our lives are predetermined to the extent that regardless of how we live we can never change our course of life. As a French thinker once said , " We change yet we remain the same ". The logic behind this statement is that if our past can never be changed then our future will always remain the same. Each action is dictated by the previous one. It is at this point where science and art diverge. For the scientist, determinism is an accurate enough explanation of human life, as it says that everything that occurs, occurs for a set of reasons. Observing and understanding these set of reasons becomes the work of the scientist. The artist, however, interprets determinism as saying, 'Since everything in life is predetermined then life is meaningless'. Life may have no meaning but it is the goal of the artist to question that statement by exploring the mysterious depths of human nature and the heart. Perhaps it is the utter randomness of life that causes us to ignore our unchanging destiny. No human being has any control over actions that he or she will be exposed to. Fate begins to lose meaning as one never knows what will happen to oneself. It takes the risks of the artist to create chaos from which wisdom evolves.
On the other hand, the study of nature is more precise and is less likely to be influenced by a wide variety of unrelated factors. Within natures all actions, occurrences, or changes are dependent on a few factors that can be carefully isolated and studied individually. Science is specifically about analyzing these interacting systems and then forming hypothesis that can accurately explain them. What makes observing phenomena in nature so interesting is that they always occur in a closed setting where external factors can easily be removed to leave behind just a few interacting objects. It is these objects along with their properties, that become the focus of study. Any attempt to logically explain their unique interaction must come from the objects themselves and not from imaginary external factors. Through reasoning and observation, nature can be understood, such that the future can be determined from the present. As Sherlock Holmes would often warn Watson, " You see but you do not observe!" . Often enough, human beings fail to grasp this simple rule of nature by ignorantly attributing any naturally occurring phenomena to the Gods, heavens, or some mysterious substance with superpowers. Understanding and accepting the truth requires an open and critical mind. Charlotte Bronte humorously wrote about this ironic flaw of human nature in her popular novel, Shirley. Note well! Whenever you present the actual, simple, truth, it is somehow always denounced as a lie: they disown it, cast it off, throw it on the parish; whereas the product of your own imagination, the mere figment, the sheer fiction, is adapted, termed pretty, proper, sweetly natural: the little spurious wretch gets all the comfits - the honest, lawful bantling all the cuffs. Such is the way of the world...". Not only is science stymied by ignorance and deception, but it is also muddled by the work of the pseudo-scientist. The pseudo-scientist is described by the Spanish philosopher, Jose Ortega, in his stunning book on modern western civilization, The Revolt of the Masses: "... By 1890 a third generation takes command in the intellectual world, and we find a type of scientist without precedent in history. He is a person who knows, of all that a routinely dutiful man must know, only something of one specific science; even of this science, he is well informed only within that limited area in which he is an active researcher. He may even go so far as to claim he has an advantage in not cultivating what lies outside his own narrow field, and he may declare that curiosity about general knowledge is the sign of the amateur, the dilettante. Immured within his small area, he succeeds in discovering new facts, advances the science which he scarcely knows, and increases perforce the encyclopedia of knowledge of which he is conscientiously ignorant..." Physics Nobel laureate Richard Feynman wrote the following observations about the different types of pseudo-scientists he met at a conference. Most of the scientists' work fell into one of these categories: 1. Completely ununderstandable 2. Vague and indefinite 3. Something correct that is obvious and self-evident, worked out by a long and difficult analysis and presented as an important discovery 4. A claim based on the stupidity of the author that some obvious and correct thing accepted and checked for years is in fact false
5. An attempt to do something probably impossible but of certainly no utility which it is finally revealed at the end fails 6. Just plain wrong One of the remarkable masterpieces of the mind is the science of mathematics, often called the science of deductive reasoning. While science is a logical system of thought used to study the natural world; mathematics is the precise language of science. It is the form of communication for scientific analysis. Number and symbols are nothing more than vague abstractions unless they refer to something specific. Before mathematics can exist there must be a situation to give it meaning. It is scientific analysis that determines the structure of mathematics. Through mathematics we are able to define the present. The present is only dependent on the conditions that exist within the short frame of time that it occupies. Quickly it vanishes before our eyes, becoming a memory. The goal of science is to define the objective world in terms of existing quantifiable conditions expressed by mathematics. Our dimensions or properties remain fixed and do not change. It is when our dimensions change that our study becomes a bit more complicated and calculus arises. But first, what is meant by change? To understand change we need to explain the concept of time. By definition, time is a passage of events, such that for time to pass, something must change with respect to itself. For example a moving object implies a changing distance covered from a reference point. This comprises an event that defines time. Or a rising temperature implies that the temperature is changing , thus occupying time. Changes are the results of actions that comprise a situation. While calculus is the study of mathematically defined change, it is not necessarily the study of time alone. In science other dimensions can be changing with respect to each other. For example velocity can change with height, temperature change with energy, density change with depth, force change with mass etc. When a dimension is changing with respect to itself, we say it is changing with respect to time. When factors change with respect to each other, we disregard the effect time has on the factors and proceed to only analyze the interacting dimensions. We assume our factors are constant that change with respect to each other, not with time. Calculus is thus the branch of mathematics used to study any phenomena involving change . Change is a relative concept that can involve any pair of dimensions, time, force, mass, length, temperature etc. This may sound a bit abstract but it will become much clearer as we follow through the course. This concludes the answer to what calculus is about. You may not feel I answered the original posted question, " Why study calculus?". The purpose of studying calculus is simply to introduce your mind to the scientific method of analysis. Through science, practical problems can be identified, explanations generated and logical solutions selected. The aim is for you to understand how to apply your mind in a systematic manner toward understanding the world around you. Engineering relies more on this fundamental logical approach toward problem solving than it does on sheer number crunching and formula manipulation. For this reason many engineers have forgotten all the theorems of calculus, but what remains is the important conceptual framework of method and application. Engineers often jokingly admit " We don't know all the math but we do know the end-result ! "
Good engineering sense is defined as the ability to quickly identify a problem, come up with practical solutions and then select the most efficient option. Many real-world problems are independent of any complex mathematics, but the same systematic scientific approach is required to solve them. The application of sound logic is all that is required to reduce complexities to simplicities. Early in the introduction I stated that it was an over-statement to claim that one needs calculus to become a scientist. Those engineers and scientists who got by with a cursory knowledge of the subject fall into the realm of the pseudo-scientist and charlatan. As Ortega wrote, the pseudo-scientist, "advances the science which he scarcely knows, and increases perforce the encyclopedia of knowledge of which he is conscientiously ignorant...". It is the thrill and enlightenment of understanding nature that drives the scientist. The engineer takes science a step further to control nature to suit man's needs. Both the engineer and scientist share a deep appreciation for the workings of nature; an appreciation that develops into wisdom. Therefore, the purpose of studying calculus is two-fold. First is to introduce you to the basic concepts of mathematics used to study almost any type of changing phenomena within a controlled setting. Second, studying calculus will develop invaluable scientific sense and practical engineering problem solving skills in you. You will understand how to think logically to reduce even the most complex systems to a few interacting components. As you study the main concepts, theories and examples in this book, your mind will develop into a powerful systematic instrument. To conclude on a humorous note, read the following satirical essay from The Onion in the chapter on 1925 in their book, Our Dumb Century. Should we teach our children facts? No, I say, a thousand times no! As they grow into tomorrow's farmers, housewives, mill workers, and coal miners, facts are the last things they will need. Manners! Subservience! Above all, obedience! To speak only when spoken to, and not to cause trouble! These are the principles upon which our educational system was grounded. Why in the name of God should we replace it with a system that actually encourages an ignorant man to ask questions? A good citizen does nothing of the sort. He is content with the reasons he is given by his betters.
Questions
1. Has God created man or man created God? Some say man and nature are too well made to happen at random. The many physical laws of this universe must be made by a creator just as traffic laws are made by man. While evolution may appear random, it is actually an accumulation of infinite changes over time. What we see is just the current state, not the infinite layers that have been set down over eternity. For this reason, it is easy to conclude the current state is too perfect to understand. But when you study how each layer of evolution happens, it becomes quite easy to understand how complexity can be achieved from the simplest of changes accumulated over time. Comment on this question with regards to your views on religion and science. 2. The paradox of life is that one must suffer to enjoy it. Or consider the quote by the father of the modern Olympic movement, " The most important thing in the Olympic Games is not to win but to take part, just as the most important thing in life is not the triumph but the struggle. The essential thing is not to have conquered but to
have fought well. " What do you think? Is life about achievements or the journey to those achievements? 3. The world contains thousands of cultures. Each culture has its own set of morals and standards which the society tries to live by . What one culture considers good, another culture may consider it taboo. While most people are fundamentally good, there is little consistency among socially accepted behavior. Are morals absolute or do they exist only relative to each other? In other words do you think certain actions can be judged as pure good or evil, regardless of personal upbringing? What aspects of your culture could be perceived as strange or negative to someone of a different culture?
The submarine's sixteen cylinder diesel and electric engines are a marvel of wires and pistons all intricately connected in tight confines. Furthurmore, pumps, gauges and hydraulics are plastered all over within it. One can only imagine the tragic consequences should any one of these many components fail.
The complexity of this 60 year old submarine makes one wonder how this was built with any degree of reliability? Was it the creation of an isolated team of engineers who were just lucky to design it right?
The reality is the design evolved over decades from the field history of many prior designs. Each design took the collective effort of teams of engineers, technicians and skilled laborers. Specialized teams only worked on their niche. For example, the torpedos have complex mechanical gyroscopes, acoustic honing devices, electric motors and magnetic detonators. Each alone took dedicated engineering to design successfully.
Cumulatively, millions of man hours went into making one successful design. It then astounds your mind to grasp the massive effort invested into making not just one, but all the hundreds of submarines in use during the war! Consider another example, an aircraft engine which puts many engineering disciplines into conflict: aerodynamics, mechanical stress, noise and vibration, heat transfer, material properties, reliability, fatigue life prediction and more. Furthuremore, aircraft manufacturers demand efficient operation, long life and short delivery cycles. It used to take ten years to develop a new aircraft engine but the industry today aims for an average of only two. The picture below shows the incredible amount of cutting-edge analysis that goes into the design of just one component of the engine, the compressor blade.
At the same time, aircraft engine makers are targeting low costs for the design, manufacturing and maintenance. So it is no surprise the industry of making all this possible is competitive, demanding and expensive. In other words, it is highly evolved and larger than any single person could do themselves. After considering all this, one realizes that the war industry takes an entire nation to support. People can not put blame on a "single crazed leader". It is everyday people toiling away that creates these complex weapons of destruction. There is a saying in Haiti, "Don't play with the hungry dog". People under desperate circumstances will abandon all sense of humanity in order to preserve their way of life. Herd mentality takes over, wherein viscous behavior is acceptable so long as everyone else thinks nothing of it. A famous quote from the Vietnam-US conflict was, " You will kill 10 of our men, and we will kill 1 of yours, and in the end it will be you who tire of it. " On the other hand one can not be judgemental and blame things on "pure evil". Another saying that shows how things need to be viewed relative to their circumstances is, "One man's genocide is another man's freedom fight". Great cruelty occurs when the ignorant populace will use any justification to preserve their threatened way of life. It is a kill or be killed world where self-survival is all that matters. " You will never see the source of a genocide. It is buried too deep in grudges, under an accumulation of misunderstandings that we were last to inherit. We came of age at the worst moment in Rwanda's history: we were taught to obey absolutely, raised in hatred, stuffed with slogans. We are an unfortunate generation." Joseph-Desire, killer - from Rwandan Stories
Unfortunately, there is no overnight solutions to complex situations like these. They evolve over decades of accumulated prejudices and injusticies until they reach a tipping point. Simple education, understanding and empathy for our fellow human beings is usually enough to disperse these feelings. Unfortunately, even this is asking too much as people prefer a life of complacency to one of education. Furthuremore, human beings are inherently selfish and there is little that can be done to change the way we are "hardwired". Whatever its scale, war is made up of individual acts, and individual choices to perform them. So human individuals need to make the commitment, as early in life as possible, that they will have no part with it. We must remember we are all human beings and everyone in this planet is our family. We all experience pain and joy equally and labels to divide us are meaningless. The Universal Declaration of Human Rights which was adopted by the UN General Assembly in 1948 states: All human beings are born free and equal in dignity and rights.They are endowed with reason and conscience and should act towards one another in a spirit of brotherhood. No one shall be subjected to torture or to cruel, inhuman or degrading treatment or punishment
Last, consider what Dwight David Eisenhower, said: "Every gun that is made, every warship launched, every rocket fired signifies, in the final sense, a theft from those who hunger and are not fed, those who are cold and are not clothed. This world in arms is not spending money alone. It is spending the sweat of its laborers, the genius of its scientists, the hopes of its children. " As this article has shown, engineering can be used to create incredible designs meant to do terribly cruel and inhuman things. Broadly educate yourself so you can make decisions of morality based on your experiences, not based on what someone tells you to believe. After all, it is Adolf Hitler who said, " What luck for the rulers that men do not think ".
A number therefore does not depend on any specific quality of the object. 10 balls and 10 eggs are the same number of things. Now that we have defined the number as something created by the human mind to count objects, then let us proceed to examine the Laws of Arithmetic. The laws of arithmetic are based on the addition and multiplication of numbers. But what does it mean to add two or three numbers together? To begin with addition assumes that all your numbers refer to something and that thing is the same throughout. So 3 pens added with 3 pens is six pens. We are bringing together all our things and then counting the sum. So:
In multiplication we are doing something a bit different, we are creating multiples of a number. When we multiply two numbers x and y : we are creating y multiples of x which is the same as adding x with itself y times. This can also be thought of as a grid of x rows and y columns where the product of the numbers is the sum of all the objects inside the grid. So 4 * 6 is:
In this example we only added 4 with itself 6 times and summed up the number we got. This is all that multiplication is about, creating multiples of a number or just repetitive addition. Multiplication of two numbers x and y can therefore be expressed as:
or 3*7 is: 7 + 7 + 7
You are probably asking what is the use of such an operation or why is multiplying numbers an important thing in mathematics. It is fairly obvious how it is used when a baker wants to find the total cost of 42 cupcakes at $2/cupcake. The answer is 42*2. But is not so easy to understand for example why in Newton's second Law of Motion:
The reason multiplication dominates these equations and so many others depends entirely on the context of the equation and where it is derived from. As we study many equations in this book the difference between multiplication and addition will become clearer. We have studied integers which refer to full existences such as 2 books, 4 cats and 110 bananas. But what about half a candy bar or a third of a day? When the existence or object is broken up into equal parts we say we are dividing and we use fractions to represent the division. Division is essentially the opposite or inverse operation of multiplication. Whereas multiplication creates equal sets of a number, division creates equal sub-sets within the number. The fractions that are used to represent division are often written as: where the bottom number refers to how many pieces the object has been broken into. For
example the following rod has been broken into 8 partitions where each small block is: of the original object:
th
We can now define x divided by y to be where y indicates the number of equal parts an object is subdivided into and x indicates how many parts you have. The relationship between multiplication and division is best expressed by the equation, x on top of the y it tells us that there are x times or x parts of If there is an
Whereas division creates sub-sets within an object, multiplication creates sets outside the object. Dividing an object by another number x, creates x sub-sets within the object such that multiplying one sub-set by x give the original object again. For this reason multiplication and division are called inverse operation of each other. We can now state the five laws of arithmetic: 1) A + B = B + A
2) A + ( B + C)= ( A + B) + C 3) AB = BA
4) A(BC) = (AB)C
The fourth property is saying that multiplication of more than two numbers is independent of the order they are multiplied in. 5) A( B + C)= AB + AC
To this point our discussion of numbers has been limited to positive numbers but what about negative numbers? Before discussing what it means to have -3 books let us look at the number 0. Just as numbers define an amount of something, so we must have a way of defining nothing. Zero, therefore represents nothing or no-existence. It is just as much a number as 6 or 7 is. For example:
As we shall study, zero is an important number as it helps in solving equations and defining infinity. Negative numbers, in the simplest sense, are used to define those numbers that are the result of a larger number being subtracted from a smaller one. Imagine a man who has six pens. A young man comes to buy eight pens. How many are left? The answer is not 0 but 2. The shopkeeper sells all six, but he still needs two to give to the man. Negative numbers can therefore be thought of as numbers that do not exist but should exist. They become positive when they take on a physical existence otherwise they are missing pieces in a puzzle. Our discussion of numbers has been limited to objects. This need not be the case. Numbers can be used to represent abstract existences such as 2 days, 3 years, 4 classes, etc. Recall how we defined the number to be meaningless without referring to something. Three books meant one book + one book + one book. The number can also be thought of as a repetition of an occurrence. A day is defined as the rising of the sun, setting, then rising again. This action is cyclical and continues indefinitely . We can therefore refer to each cycle of the sun rising as one day and four days would then represent four cycles of the sun rising. Here we have extended the definition of the number to not only include objects but also actions. The difference between an action and an object is a difficult one to explain. While an object represents something we can see, touch, hold and feel, an action is only that which happens. Actions refer to change or movement. Therefore, an action's existence is defined by what takes place during that time frame of a beginning and an end. Numbers are not limited to objects and actions alone, they can also be used to count ideas and other abstract concepts that exist only in the mind. Simply by existing in the mind means it can be counted. When somebody says I have two things to say to you, each thing
is made up of a different choice of words, phrases, not to mention the action or object it may refer to. To conclude our discussion of the number here is a short definition. Number - A symbol used to express an object's, action's or abstraction's repetitive nature.
The picture above on the right is called the 'Optimal load path' . The way to interpret it is areas in red indicate areas of highest stress, so need material there. Areas in green are also under stress, while areas in blue have almost no stress. Actually most of the sphere is blue, however, all the blue parts of the model were taken out of the sphere so you could see the internal green and red parts of the model.
Before continuing, it will be useful to understand what stress means. Stress is an engineering concept and is simply the load or force divided by the area over which it is applied. Materials fail when the internal stress from external loads exceeds their allowable yield stress. Materials like steel have a high yield stress while those like rubber have a low yield stress. Think of this as a pin pricking you. A pin with a sharp point will be a lot more painful than a pin with a blunt point. The reason is the same load is going through a small area, thereby increasing the stress you feel. Now suppose the same problem of designing the stool were given to another engineer. This engineer will use a different software and methodology to build his virtual model. In areas where the sphere is taking no load, the results from both engineer's model should be blue or the stress should theoretically be close to zero. However one engineer says the stress in the blue area is 1 MPa, while the other says it is 2 MPa. Does this mean the second engineer is producing results with 50% error ? No , it means both engineer's analysis are outputting very small numbers which are close to zero and therefore the SAME number. Now consider you are looking at load near one of the three support points at the base. Theoretical results say the stress from load should be infinity as you have a high load going through an infinitely small point. Now one engineer says it is 6000 MPa, while the other engineer's analysis shows 3000 MPa. Again does this mean the results have 50% error? No, it means both engineer's analysis are outputting very large numbers which correlate to the trend of infinite stress and are therefore the SAME number. Too many engineers only focus on the absolute numbers when comparing analysis results. Good engineering judgment involves looking at relative differences and also seeing how large or small the numbers are when making comparisons. Another way of thinking of this is in terms of order of magnitude. Small or very large numbers within the same order of magnitude are essentially the same number for all practical purposes. This is not only true in science but all life in general. It is important to understand the concept of interpreting numbers as even many engineers with PhD's do not get it. Often those in the scientific community are too focused on the absolute details and fail to see the bigger picture around them. Another problem scientists have is reporting results with too many significant digits. For example an atmospheric scientist who says the temperature is 97.6756 Farenheit or 22.22323 Celsius is only confusing you by adding so many significant digits. They may be more 'accurate', however by increasing confusion they defeat the purpose behind reporting the number! Only report just enough digits that are truly significant. So one should report temperature to within 1 whole degree as few people can perceive differences of a tenth of a degree. Of course there are many situations where modeling the absolute response is of critical importance. It takes good engineering judgment to know when one needs to look at absolute numbers and when one needs to only look at relative differences in numbers. You are now in a much better position to interpret numbers now that you understand how relative comparisons work for large and small numbers.
unavoidable crash, especially in fog. To heed the 3 second rule at 70 Mph ( 110 Kmh ), one must back off over 100 yards. Most people follow at only a few car lengths while ignoring the laws of physics. Instead they rely on their perception of supernatural braking power or angels overhead. Neither are respected by Murphy and his law. Those who apply rational and predictive driving skills behave the opposite of the speed-atany-cost mentality. Once a person truly comprehends risk, a calmness takes over, speeds drop and following distances increase. It is a rare thing to see: someone valuing staying alive over the perception of "lost time." Once you die, there is no time to contemplate how much time 90 MPH saved you on Monday! The truth is that most motorists are essentially ignorant people in a big mob, hurrying all the time. They do not understand basic risk assessment. There is no rhyme or reason to it, just impatience and self-indulgence. But the excuses flow like water! Successfully applying risk assessment involves making minor changes to your behavior with the understanding that your overall risk of a tragedy will be greatly reduced. Most people are unable to perform risk assessment because they are unable to look at long-term benefits. Instead they focus on short-term gains, i.e driving fast gets me there x minutes faster so it must be better. Unfortunately, there are no hard and fast rules in life. As this article and the prior article attempt to prove completely opposite points using the same facts. That is why it is so important to understand what you learn and not memorize blanket statements and generalizations. Everything has meaning only in the context to which it applies. If you continuously seek to educate yourself on everything around you then you will be in a much stronger position to make independent decisions based on reason and experience.
Engineering and science both follow the same systematic analysis to solve problems and understand the world. Science deals more with objective and idealized situations while engineering is often concerned with subjective situations that require sound judgment. Situations are not always defined by fixed quantifiable conditions. The importance of the scientific method is its emphasis on clear, logical, and focused thinking that can analyze a specific situation in relation to the larger system that it is part of. The method is the quickest cure for frustration and desperation. Mathematics is a reflection of the orderly analysis and conclusions of the scientific method. It is the language of science used to communicate ideas, theories and observations in a concise manner. From the scientific method, notice how the word, condition, appeared often. In mathematics condition are referred to as dimensions. The following section will take a closer look at what a dimension represents.
From "Piled Higher and Deeper" by Jorge Cham - www.phdcomics.com In a perfect world all scientific research could be trusted at face value. Unfortunately, the world is complex and science is not black or white. There is a lot of grey area where it is difficult to differentiate fact from opinion. Read the words of the Dalai Lama commenting on suicide bombings: " You know, science or knowledge is just a method. The thing is how to use that. Science itself is wonderful. Sometimes we use that knowledge for destruction. It's not science's mistake, it's our mistake. Similarly, some people manipulate religion in the wrong way. It's not the fault of religion, it is the fault of people and politics. People talk about dirty science. There is also dirty religion and dirty politics. " The Dalai Lama is saying that science, just like anything else, can be manipulated to prove any point. So how does one distinguish 'dirty science' from 'good science'? Unfortunately, the answer is not a simple one. There are shades of good and bad science in all research. It takes a lot of experience to know how to filter out the nonsense. This article will summarize a few categories most bad science typically falls under. If you see any research that shares elements of these categories then you should be skeptical of the results.
So when you actually get around to trying to test it, you end up with this morass of confusing and conflicting data out of which people can pick just the elements they want to support their preconceived opinions. And that sometimes make the particular researcher look very sure that he knows the answer or she knows the answer, but unfortunately that is not how you do good science. "
Lack of complexity implies unattractive conclusion which lack glamor and perceived importance
Consider the excerpt from an article in The New Yorker by eminent surgeon Dr. Atul Gawande. In this excerpt, Dr. Gawande describes how Dr. Pronovost developed simple medical checklists that have proven to save countless lives . Gawande writes, " If a new drug were as effective at saving lives as Peter Pronovost's checklist, there would be a nationwide marketing campaign urging doctors to use it. " As you will read below, his checklists have fallen on deaf ears due to their lack of perceived complexity! We have the means to make some of the most complex and dangerous work we do in surgery, emergency care, and I.C.U. medicine more effective than we ever thought possible. But the prospect pushes against the traditional culture of medicine, with its central belief that in situations of high risk and complexity what you want is a kind of expert audacity.the right stuff, again. Checklists and standard operating procedures feel like exactly the opposite, and that's what rankles many people. The still limited response to Pronovost's work may be easy to explain, but it is hard to justify. If someone found a new drug that could wipe out infections with anything remotely like the effectiveness of Pronovost's lists, there would be television ads with Robert Jarvik extolling its virtues, detail men offering free lunches to get doctors to make it part of their practice, government programs to research it, and competitors jumping in to make a newer, better version. That's what happened when manufacturers marketed central-line catheters coated with silver or other antimicrobials; they cost a third more, and reduced infections only slightly.and hospitals have spent tens of millions of dollars on them. But, with the checklist, what we have is Peter Pronovost trying to see if maybe, in the next year or two, hospitals in Rhode Island and New Jersey will give his idea a try. Pronovost remains, in a way, an odd bird in medical research. He does not have the multimillion-dollar grants that his colleagues in bench science have. He has no swarm of doctoral students and lab animals. He's focused on work that is not normally considered a significant contribution in academic medicine. As a result, few other researchers are venturing to extend his achievements. Yet his work has already saved more lives than that of any laboratory scientist in the past decade. Furthermore, often times new discoveries are based on very simple incremental improvements to existing ideas. For this reason established scientists find it very easy to ridicule them. They claim how can something so simple replace decades of their research and experience? Society agrees with them since they are conditioned into believing all science is beyond their understanding. Since it is society that ultimately funds scientist work, then without the support of society, the new ideas will never take ground and flourish.
have almost no meaning. So you have to always question the variability of the conditions and ask the scientist what cumulative effect they have on the uncertainty of their results. A good scientific experiment must be repeatable. The experiment has to be able to be performed many times and always produce the same result. If repeatability has not been established then you must scrutinize the variability of each condition.
It is interesting to note the difference in perspective in the clinical scientist and the engineer. Scientists are interested in 'exactness'; in the minutiae of detail. Engineers are more interested in systems and the dynamics of how things actually work. Clinicians will always argue they are more accurate while failing to understand the problems at hand in the real world. In conclusion, the world is not only made of one color. Stay open-minded and appreciate all the interacting diversity surrounding you. The best way to avoid the trap of believing bad science is to read from multiple sources to identify common elements and conclusions. When numerous people you respect and trust say the same thing, then you can be more sure you are reading mostly good science. Keep in mind the real world is not black and white at all. Between pure science and the incorrect interpretation of data, there is a huge gray area, and this is where most research is done. Unfortunately, as Thomas Edison said, " Opportunity is missed by most people because it is dressed in overalls and looks like work ". This leads to another famous saying, " The less one understands something, the greater one's faith in its capabilities ". It is unfortunate that human nature prefers ignorance and blind faith over hard-work and skepticism. On the other hand, if you have something worthwhile, people will come. But you better be prepared to show proof. If you have an idea for a machine that flies around with no physical prototype, nobody will care, you can bet on it. Talk is cheap. Demonstrate your machine and the world will beat a path to your door, regardless of what the scientific community has to say. To end on a humorous note, the following table translates commonly used expressions found in research papers:
Published
Translation
" It has long been known...." " Of great theoretical and practical importance...." " A definite trend is evident... " Three of the examples were chosen for detailed study.." " Typical results are shown.. " It is believed that..
I didn't look up the original reference. Interesting to me. The data seem practically meaningless The others made no sense. The best results are shown. I think
" It is generally believed that.... " It is clear that much additional work will be required before a complete understanding of these phenomena is possible.. " Correct within an order of magnitude.." " Statistically oriented projection of the findings.. " Highly significant area for exploratory study.."
A couple of other guys think so too. I don't understand it. Wrong Wild guess. A totally useless topic suggested by my committee.
Source: Unknown
themselves, dimensions are meaningless. They must refer to certain conditions specific to the situation being studied. One cannot refer to just mass or length. One has to specify mass and length of what part of the system? The units of the various dimensions, meter, seconds, Celsius, kg, serves as standards to measure the dimension relative to. Understanding which dimensions to include in your scientific analysis all depends on the type of situation being studied. The more familiar you are with the fundamental and derived dimensions, the easier it will be for you to understand which dimensions are significant and how they define the situation.
This is read as, a situation is a function of certain conditions. The conditions are called dimensions. For example, consider the mathematical function for calculating the force acting on a body of mass, m. Force = Mass * Acceleration The force on a body is a situation defined by two conditions, mass and acceleration. In a closed system , the force acting on an accelerating mass, is the product of the mass and its acceleration. Thus, force is a function of two dimensions, mass and acceleration while force is the third dimension dependent on m and a of the system. Therefore force is called the dependent dimension while mass and acceleration are the independent dimensions that together define the dependent dimension. The dependent dimension, F, is placed outside the brackets, while the independent dimension are inside the brackets. Remember . the language of mathematics is as concise as possible. The function is more commonly written as:
For example, the force required to accelerate a 5 kg object at 3 meters per second per second ( acceleration is by definition the change in speed per unit time ) is 15 N, where N represents Newtons, the unit of measurement representing force:
One is free to use any value of m and a to calculate the corresponding force. For this reason, m and a are called independent dimensions, while F is the dependent dimension defined by them. Note that m and a must come from the same system, and the calculated force refers to that situation only. What if I were given the acceleration of a body and the force acting on it and wanted to calculate its mass? In other words, given F and a, what is m? This is done by solving the force function for m or:
Therefore, mass as a function of its acceleration and the force acting on it, is the force divided by the acceleration. Here mass is the dependent dimension dependent on force and acceleration. Once again, the three dimensions, mass, force, and acceleration must all refer to the same situation. It is not uncommon in engineering to encounter twelve to fifteen dimensional functions. For example, consider the five dimensional function for calculating the elongation of a cable being stretched with some tensional force.
The total elongation of the cable is dependent on four conditions. The tension acting on it, its length, the cross-sectional area, and a material property called elasticity. The point to understand is that the n'th condition is entirely dependent on the other ( n - 1 ) independent conditions. The ( n - 1 ) independent dimensions represent a set of fixed or constant conditions that together define the n'th dimension .
Questions
The distance covered by a falling object near the surface of the earth is given by:
Simplify this function using d for distance, t for time etc. The acceleration can assumed to be a constant 9.8 m/s/s. Substitute this constant into the function and write the expression for distance as a function of time. To find approximately how tall you are, drop an object from the top of your head and using a stopwatch, find the time taken for it to hit the ground. Substitute this time into the distance function just derived. The answer should be close to your height. Try this at least ten times to get a close approximation. Calculate your height in feet using the following function:
You might think your answer will be different if you had dropped a metal object , paper or a balloon. You are correct, however, it is due to air resistance which is a function of amount of surface area exposed to the direction of travel. So you should get the same answer no matter how heavy your object is as long as it is relatively immune to air resistance.
IF we were given the masses of two bodies and the distance separating them, then the force of attraction would be a simple matter of plug and chug. Suppose we were only given the mass of one object as 1000 kg, and the distance between it and object 2 to be equal to 1,000,000 meters. The function would reduce to:
The fixed independent dimensions give rise to a constant that can be separated from the function such that:
was replaced by a constant c. The gravitation force is now only a function of the first object mass, since all other conditions are given. Thus:
The force depends on whatever the mass we choose to enter multiplied by some constant that is dependent on the situation already known. Thus we have reduced a four dimensional function to two dimensions by entering conditions that are already known. Therefore, any ndimensional function can be reduced to two dimensions, provided ( n-2 ) dimensions are already given/
Questions
1) Play with the gas savings calculator and try and derive the equations used 2) The amount of interest earned by money in a savings account is given by the function:
Or simply:
Given a deposit of $1000 and an interest rate of 6 %, write a two dimensional function of interest earned as a function of time Find the interest earned after 2, 3 and 10 years
Using a constant mass of 4 kg, the function reduces to the following two-dimensional function.
The total energy of the body is now entirely dependent only on the square of its velocity, whatever it may be. If the velocity were 1 m/s, the energy would be 2 Joules. If it were 2 m/s, the energy would be 8 Joules. As the velocity of the body increases, its kinetic energy also increases. This makes sense, since the faster a body is moving, the more energy it has. The way to visualize the changing relationship between the independent and dependent dimension is by graphing the function on a set of perpendicular axes. The horizontal axis is labeled after the independent dimension (condition), while the vertical axis is labeled after the dependent dimension (situation). The graph of a function is a continuous line that reflect a horizontal change in the independent dimension with a vertical change in the dependent dimension. Each point on the graph represent a unique situation . The graph of the function would be:
It is, however, not obvious as to how the graph of a function is a well-defined continuous line through all the points at which the function exists. To better understand the important relationship between a function and its graph, consider a hypothetical function . In this function, f represents an imaginary situation dependent on some condition x. Therefore x, is our independent dimension, while f is our dependent dimension. For the next few chapters we will only be studying abstract fs and xs. This will allow our study of the mathematical function to be more specialized by avoiding the confusion that arises from dealing with so many, Fs, ms, as, ls, Es etc. It is difficult for the mind to keep track of so many abbreviations, much less understand where they were derived from.
With intervals of 1 the difference between each evaluated point of our independent dimension x is 1. This gives a fairly generalized idea of how the dependent dimension, f, changes. From x=0 to x=1, observe that f is zero at both these points. The conclusion is that the functions value is approximately zero from x=0 to x=1. After x=1 notice that f gets substantially larger, particularly between x=2 and x=3. This leads to the observation that as x increases f gets larger and larger. Also notice that over each sub-interval the net change in y is different. From x=0 to x=1, f goes from 0 to 0 or the net change in f is 0 From x=1 to x=2, f goes from 0 to 2 or the net change in f is 2 From x=2 to x=3, f goes from 2 to 6 or the net change in f is 6 - 2 = 4 In each case the change in x is 1 but the corresponding change in f increases as x increases. Until now our discussion of the function has been limited to just four points, x= 0,1,2, and 3. The concept to understand is that a function is defined at every point and the continuous line connecting the points corresponds to the graph of the function. In order to gain a better understanding of the function, it needs to be analyzed at more points. Though before continuing, an important symbol will be introduced, the Greek letter Delta, . is a symbol for a quantifiable change, not metaphorically speaking! It represents a definite difference or change between two values. For example if x changed from 8 to 21, instead of saying the change in x is 21-8 or 13, it is more accurate to say x is 13 or x = 13 . This symbol will be used throughout the book to reflect the amount a dimension changes by.
Returning to our example, , we observed how as x increased, f increased at a greater rate. However, between x=0 and x=1 there appeared to be some inconsistency in the function since its value was 0 at both endpoints. Let us now analyze the function at intervals of x differing by or x = .333. By reducing be analyzed at ten points instead of only four. x from 1 to .33, the function can
The change in our independent dimension x will be .333 and we will be evaluating points at this intervals. Plotting these points gives us:
The pattern from x=1 to x=3 remains the same with f increasing as x increases. However, from, x=0 to x=1 something new appears. Notice how the function seems to decrease until x = 0.5, and then increases afterward. In analyzing these different graphs (the first graph consisting of only four points) of the same function we can state an important observation. The behavior of a function as reflected by its graphs is only as accurate as the number of points plotted over the interval of interest. That should mean if x were infinitely small, then the graph of the function would be the line connecting the infinite amount of points. However, even this does not tell much. Through an infinite number of points we get a more and more accurate look at the functions graph or behavior, but between any two points x and (x + x) where x is the infinitely small distance separating the two points, there exists a further infinite amount of points such that we cannot be sure that our graph takes them into consideration? In other words how do we know if the graph is continuous over the small interval separating the points? Our graph could suddenly jump up or down as it did between x=0 and x=1. To prove that the graph of a function is continuous over an infinitely small interval consider a simpler function, nearly zero. Therefore if a point located a distance An infinitely small interval implies a change in x, is defined at any value for x, x from x. x, equal to
If the distance, x between the two points x and (x + x) is allowed to go to zero, what we are doing is called taking the limit as x goes to a number that gets closer and closer to zero. Mathematically this is expressed as:
This proves that points on the graph that are in between two infinitely close points are all close to the values of the endpoints and do not jump up and down in a random fashion. Each successive point in this interval differs from the previous point by an amount that is related to the change in the independent dimension. The graph is uniquely defined as a function of the independent dimension such that for a small x the f is proportionally small. Intuitively it is not difficult to understand why the graph of a function is continuous over an infinitely small interval. By definition the mathematical function expresses a relationship between dimensions, such that one dimension is said to be dependent on the value of the other dimension. The graph of a function reflects this relationship by relating a horizontal change of the independent dimension with a vertical change in the dependent dimension. Now if an imaginary function were to change by an infinitely small amount then the f value would also change a similar minimal amount, because the value of f is dependent on the value of x. The graph of a function is then the continuous line drawn through an infinite set of points. For the graph to be continuous it only needs to be defined at all points where it is being studied.
The graph of can now be drawn as a continuous line that corresponds exactly to the behavior of the function, f, as the independent dimension, x, changes:
Questions
Whenever an object is thrown in the air towards some direction, the path it follows is called a trajectory and the study of its motion is the focus of projectile mechanics. The motion of a projectile is dependent on several conditions such as its initial velocity when set in motion, the angle of inclination, the force of gravity, air-resistance etc. What makes projectile motion so interesting is that gravity only restricts the vertical rise of the body, but has no effect on its horizontal motion parallel to the earths surface. This situation gives rise to the curved trajectory or path of the projectile. The goal of the following problem is to find the equation for the trajectory (graph) of a ball struck in motion by a bat at an angle of 45 degrees with an initial velocity of 50 meters per second. The height of the ball in motion is dependent on the horizontal distance covered and is given by the equation:
Substitute the known values for the given conditions (use g=9.8) to reduce the function to one of two dimensions, involving vertical height as a function of horizontal distance covered. Get a rough idea of the graph of this projectile motion by plotting points at intervals of = 1 meter. As soon as the height is negative stop plotting. Use a d=.5 m, and determine the maximum height reached and the distance from the firing point when it hits the ground. Use an angle of 60 degrees and see how the graph changes. Double the initial velocity while using the same 45 degree initial angle of inclination. d
is:
refers to an interval over which we are analyzing the change in the dependent dimension, f.. The second point plus the change in the variable, can be written in terms of the first point
Therefore the change in the dependent variable over an interval from , which can also be written as:
to
is
is 5 - 3 = 2
Questions
1. Hooke's law states that the force required to stretch a spring is directly proportional to the amount stretched or
F = Force s = distance stretched k = spring stiffness constant i.e Force required to stretch spring a unit distance Graph this function for k=1000 N/m. Calculate the change in force required to stretch the spring from 1m to 4 m. 2. The distance a free-falling object covers from an initial dropping point is given by the function:
d=distance
Given that acceleration is equal to gravity or , reduce the three dimensional function to one of two-dimensions for time as a function of distance, d. Graph the function at intervals of , from d = 0 to d = 100m , over each interval.
What can you conclude about the change in time covered between d = 10m and d = 20m versus the change in time covered between d = 90 and d = 100m? o Solution:
The change from 10m to 20m is 0.59 seconds and from 90m to 100m is 0.23 seconds. So the time is much less from 90m to 100m since the object is travelling alot faster.
The question we now must ask ourselves is how can we measure the relative change of the dependent variable with respect to the independent variable? In other words, how do we calculate how much more or less To calculate how much more changed compared to , we simply divide the by the interval over we
change in f over the change in x for the interval. Thus we divide, which are evaluating it, which is equal to respect to x over an interval is defined as:
While this expression may seem rather simple, it does require some explanation. By dividing the change in f by the change in x what we are doing is calculating how much more f changed for a given change in x. For example in the function, , when x changed from 3 to 5, f changed from 81 to 375. Over this interval of from x=3 to x=5, the was 294. Thus the relative change in f with respect to a change in the independent variable x is:
The value of 147 tells us that f changes 147 times more than x over that interval of from x=3 to x=5 only. Thus for each unit change in x, , the corresponding change in f is 147. We can therefore define the rate of change of a function with respect to its independent variable to be:
The value, called the rate of change of the function, refers to how much more or less changes for a unit change in x. It is only valid over the interval under consideration,
Another way of understanding what rate of change of a function means is to look at the steepness of the line connecting the two endpoints of the interval under consideration.
As increases, the steepness of the line connecting the two endpoints will increase. Thus, the greater the rate of change of the function, the greater its slope or steepness over the interval under consideration. Since slope and rate of change are synonymous, then how is rate of change defined for functions whose graphs do not have constant slopes? For example, from x=9 to x=12 of , the change in f is 2997. Thus the rate of change of the function over the interval is:
This value is significantly higher than the rate of change calculated for the previous interval from x = 3 to x = 5. We can only conclude that the rate of change or slope of the graph must be increasing and is not constant over an interval the function to understand how this might be so: . Look at the graph of
Since the rate of change of a function can change, then we have to come up with a more refined definition of rate of change. We can define the average rate of change of a function over an interval , to be equal to
In the next section we will take a closer look at how we can define the exact or instantaneous rate of change of a function.
From x = 1 to x = 3,
x = 2, and
x = 2,
equals 6. The average rate at which f changed with respect to x is by definition, . For each unit change in x, the change in f is 3. This tells us that f is changing three times faster that x is changing over the interval from x = 1 to x = 3. We can now look at the interval from x = 2 to x = 4 where x equals 2.
Thus
Over this interval, the rate of change is the same constant, 3. This leads us to conclude that the rate of change of the function over any interval is a constant, 3. This can be proven by the definition of rate of change:
x.
slope or steepness equal to c. For this reason functions of the form are also called linear functions since their graphs are straight lines with a constant rate of change. On the other hand, the graph of the function is not a straight line. Unlike a line, the rate of change of is not the same constant over any interval x. To define the rate of change for the function we will have to derive a more precise way of defining rate of change of function. To do this we will analyze the function over small intervals of x
The first point to notice is that the rate of change of the function varies with x. When x is small, f does not change that much as compared to how much it changes when x is large. We can conclude that the rate of change of f with respect to x, is not constant over any interval, x, but varies with x. To begin our analysis let us divide up the graph into intervals of happening in each of these intervals separately. = 2 and study what is
From x = 0 to x = 2,
From x = 6 to x = 8,
Clearly, as x increases the rate of change of the function is increasing and is not a constant as in our study of the line where the function changed at the same rate as x increased. Returning back to the graph of ; we calculated that from x = 0 to x = 2 the change in f was 4. We can then conclude that from x = 0 to x = 2 the average rate of change of the function over that particular interval is 2 or:
Remember it is called average because this rate of change is only valid from x = 0 to x = 2. Now let us consider the interval from x = 2 to x = 4 where once again the change in x, 2. The change in f of the graph is equal to: is
Thus the average rate of change of the function over this interval is equal to
This value is greater than the value we observed from x = 0 to x = 2. This implies that from x = 2 to x = 4, f(x) is increasing at a greater rate than from x = 0 to x = 2. This is despite the fact that in both cases the = 2. Therefore, the rate of change is not constant over the interval, but is increasing with x. When we say the rate of change of f(x) from x = 2 to x = 4 is 6, it is only the average value for the given interval since it assumes rate of change is constant over that interval. Let us move on to the next interval of x = 4 to x = 6. Once again the change in x or equals 2 but the corresponding change in f(x) is not the same as before.
Note that this corresponds to the same value we would get by:
This is still larger than the rate of change for the previous interval which was 6. Remember rate of change , by definition, refers to how much the function changes with respect to a change in the independent variable. The steepness or slope of the line over the interval provides a geometric understanding for this concept. The average rate of changes calculated over each interval can be used to approximate the graph of f(x) from x = 0 to x = 8.
This roughly corresponds to the original graph, but as we can see the rate of change or slope is not constant through out the interval from x = 0 to x = 8, but increases as xincreases. In order to get more accurate answers we need to reduce our interval of =2 to a much smaller one. The idea being we need to analyze our graph over a small interval, , to see what exactly is going on at each instant the function is changing.
Here is where we begin our study of Calculus. We break down and freeze a changing situation into an infinite series of actions and analyzing what is going on in each individual actions.
Remember the
in the f(x +
) expression.
Similarly the rate of change from x = 7 to x = 7.5 and x = 7.5 to x = 8 is 14.5 and 15.5 respectively. Notice how the rate of change of the function slightly increases as x increases. This implies that there must be a way to instantaneously define the rate of change of function through one point on the graph. Before continuing, take a look at how the average rate of changes or slopes just calculated, approximate the graph from x = 6 to x = 8.
As you can see from the above graph, the smaller interval of = 0.5 gives a more accurate approximation of the graph of f(x). Our results are only averages because the rate of change of f(x) is increasing with x and is not constant over an interval To define the instantaneous rate of change, we need to define as an infinitely small distance separating two point x and (x+ ) on the graph of f(x). The rate of change calculated through these two points will then give us the exact rate of change of f(x) at that point, x. Since through any one point there exists an infinite many lines, we are in actuality giving the slope of the line through the two points x and (x+ ) where goes to zero. The two points are assumed to be infinitely close to each other such that the rate of change f(x) over that interval is constant. As we let these two points (x) and (x+ ) come closer and closer together, the slope of the line through these two points corresponds to the actual graph itself. For example with = 2, the line through x = 6 and x = 8 only roughly represented the actual graph. However, by breaking up the interval from x = 6 to x = 8 into four sub-intervals of = 1/2, we got a better approximation to the actual graph itself. The instantaneous rate of change can now be defined as the rate of change of the line through a point x and (x+ ) where is near to zero. This is called taking the limit as goes to zero. In the graph this looks like.
By letting the difference or between the two points go to zero we are able to accurately describe the behavior of the function at any large we can only approximate the functions behavior over the interval. This infinitely small allows us to define the rate of change of f(x) through any one point on the graph, where point is by definition an infinitely small interval between x and x + over which the rate of change is assumed to be constant or does not change. Thus, the instantaneous rate of change can also be thought of as the slope of the tangent to the graph at any point. The point is actually two simultaneous points on the graph, separated by an infinitely small distance. For example, the slope of the tangent to the graph at x = 6 is given by:
We can now go on to define the rate of change at any point x of the function
as:
This function is called the derivative and gives us the rate of change or tangent to graph of at any point x. It is extremely important to realize that the functions derivative is also a function of x. In other words its value changes with x to reflect a change in the rate of change of the original function. This may sound a bit confusing but I encourage you to spend a few minutes and think how it is so. You are probably asking yourself how can we assume that the derivative is constant over an infinitely small interval ? Since the functions rate of change is changing then it can only be constant over an infinitely small interval or point. It is through this point that we define the derivative by taking the limit as goes to zero. In effect we define the graph of the function through a series of connected lines with changing slopes. In the following paragraph we will mathematically prove that the derivatives value converges to the exact value as all errors go to zero as goes to zero. Since the graph of the function is continuous over an infinitely small interval then so will its derivative be defined for that interval or point. Note that and no longer refer to discrete values. When we let becomes infinitely small such that: go to zero, its value
Well now see how to find the derivative of any simple function, derivative is also a function , we shall call it f(x), f prime of x where,
. Since the
From the Binomial expansion theorem we get: ( The theorem is just algebra and you can find a proof of it by clicking here ).
This simplifies to
, with
Take note of how x goes from x to nth power to the 0 power while x goes from the 0 power to the nth power in the last term. If we divide through by x we get:
As we take the limit as x goes to zero, every term after the first one goes to zero and hence cancels out; This leaves us with:
where n is a positive integer. The derivative reflects the instantaneous rate of change of the function at any value x. The derivative is also a function of x whose value is dependent on x. Take a look at the left side of the function, dependent variable, f, is whose derivative is When differentiating a function of the form variable is, , the derivative of the dependent . Thus . The derivative is By definition the derivative of a
, which is the instantaneous rate of change of f with respect to , represents the independent variable
differentiating a function results in a new function of x, where called respect to the independent variable, x.. If , then:
gives the instantaneous rate of change of f(x) as a function of any value, x. Remember that the rate of change of a function other than a line is not constant. Its value changes as x changes. If f(x) were equal to a constant multiplied by a function of x such as:
Thus the derivative of f(x) with respect to x, is the constant multiplied by the derivative of the function of x, A(x).
Combing the A(x) and B(x) terms together simplifies the above expression to:
Therefore if f(x) is a sum of two functions of x, then its derivative with respect to x is the sum of the derivatives of the functions with respect to x. Thus:
Similarly if f(x) is defined in terms of a difference among some functions of x, then the sum of the difference among the derivatives of the functions.
is
We can differentiate products of functions by using the definition of the derivative. A small change in f can be written as: Next, divide by to calculate the rate of change of f with respect to x:
Taking the limit as goes to zero gives us the instantaneous rate of change of f with respect to x, or the derivative of f(x).
Since both A(x) and B(x) are functions of x, then substituted with substitution only holds true for
can be
reduces it to:
Thus the derivative of a function f(x) that is a product of two functions of x, is simply the product of the first function and the derivative of the second function plus the product of the second function and the derivative of the first function.
of any function of x of the form is always where n is any real number, positive, negative, or fraction. This is no coincidence but is because of the way exponents are defined as a continuous operation for any n. Let us consider the first case of , where n is any positive integer and 1/n is a fraction.
This concludes the proof that if then , for any positive n, integer or fraction. We will prove it also holds true for n as a negative number. We can use the product rule to prove that the derivative of n, negative real numbers also. Consider the function: for all
yields:
The left side of the equation represents a product of two functions, f and side is a constant function,
derivatives must be equal. Using the product rule to differentiate the left side with respect to x results in:
Setting the derivatives of the left and right side equal to each other:
Remember that:
Therefore if
for positive real numbers and negative integers. What remains is to prove it true for negative fractional powers as well. To do this let,
gives us:
Thus if for any n, positive or negative real number. This is true because of the way exponents are defined as a uniform operation for any n. The purpose of going through all the proofs for the different cases of n was to give you a better understanding of how to differentiate functions of x with respect to x.
because we also need to take into consideration . We can replace with g(x) and get:
To find the derivative of with respect to x, we first need to find the derivative of f with respect to g. From the definition of the derivative:
Next we find the derivative of g(x), the inside function, with respect to x.
to get df / dx :
goes to zero,
or:
Similarly,
As dx goes to zero,
becomes
in the
denominator of
in the numerator of
Thus the derivative of a function of a function of x with respect to x, is the derivative of the outer function with respect to the inner function, multiplied by the derivative of the inner function with respect to x. Some examples will show how this is done.
The graph is a linearly increasing straight line, where the distance covered increases directly with time. The steepness or rate of change of this graph is by definition the change in distance over the change in time:
Since motion is characterized by a change in position relative to time, then velocity is only defined over that definite time interval. If an object moves from a to b, the velocity of the object over this interval is this interval distance covered, to cover this distance, respect to time: divided by the time taken . Thus, velocity is the rate of change of distance covered with
For example, the car above takes one hour to go from a station 400 kilometers down the highway to another station 500 kilometers down the same highway. The velocity of the car is then Since the graph of the cars distance covered with respect to time is a straight line, it has a constant rate of change, such that = v, where v is a constant velocity. This tells us that . The graph of this velocity
the velocity over the entire journey is a constant or function is just a straight line, or is the same at any time t.
This confirms logic since throughout the journey the speedometer reads a constant 100 km./hr. Therefore, distance covered is directly related to the elapsed time, in each hour it will cover a 100 more kilometers. Now let us define acceleration. Acceleration is the rate of change of velocity with respect to time. Acceleration is to velocity as what velocity is to distance. The concept of acceleration refers to a changing velocity per unit time. The units of acceleration are
For example if the acceleration of a Ferrari is 4 km/hr per second, all this means is that each second the velocity increases or changes by 4 km/hr. If the velocity at t=3 seconds is 48 km/hr, then the velocity at t=6 seconds will be:
Lets see how this applies to a car moving with constant acceleration. Since the acceleration is constant, the graph is a horizontal line, where the acceleration of the car is at any time t is the same.
Acceleration is therefore the rate of change of velocity with respect to time or . An acceleration of 4 m/s/s means that every second the velocity increases by 4 m/s. The concept of acceleration and velocity are fairly obvious to understand when dealing with constant accelerations and velocities. We now need to define a more precise way of explaining velocities as the derivative of the position or distance function with respect to time and acceleration as the derivative of the velocity function with respect to time.
If we freeze the moment when 4 seconds have past then its speedometer will read a velocity of exactly 60 km/hr at that instant only. However at t= 4.1 seconds the velocity will be slightly higher since the car is accelerating. This is why we use Calculus to analyze how these accelerations give rise to a changing velocity that results in a changing distance that is covered. The graph of its distance from the starting point as a function of time is:
From the graph we can see that at t = 4 seconds the car has covered 33 meters. Since the speedometer at that moment reads 60 km/hr, then we can say that at t= 4 seconds, its velocity is a constant 60 km/hr. At t = 4.1 seconds, the velocity has changed due to the car's acceleration. So the speedometer now reads 61 km/hr. We can assume that from t= 4.0 seconds to t=4.1 seconds the velocity of the car is a constant 60 km/hr. By definition velocity is the distance covered divide by the time taken to cover the distance or Thus, the distance covered by the car in this small time interval, divided by the time, .1 seconds, will give us 60 km/hr. Since the velocity of the car is increasing, due to its constant rate of acceleration, the velocity of the Porsche at any instant, t, will be whatever the speedometer reads at that moment.
If we were given the relationship for distance covered as a function of time t, then velocity of the car at any time t can be found by calculating the distance covered a time interval, : over
Since the car is accelerating, its velocity is not constant over the interval assume the velocity is constant over an infinitely small time interval,
. We can
Therefore we have to take the limit as goes to zero to find the instantaneous rate of change of distance with respect to time. The instantaneous rate of change of distance will correspond exactly to what the speedometer reads at time t.
The velocity at any time t is the instantaneous rate of change of the distance function at a time t. By definition the derivative is the instantaneous rate of change of a function over an infinitely small interval. thus the derivative of the distance function, time is the velocity function for the object We now need to derive an expression for acceleration as function of time. In the same way that velocity is the rate of change of distance with respect to time, acceleration is the rate of change of velocity with respect to time. with respect to
To find the instantaneous acceleration at any time t, we need to take the limit as goes to zero. Without taking the limit, is a discrete value such that the calculated acceleration is the average acceleration is for that interval. By taking the limit as , we are assuming the acceleration is constant over that time interval.
This proves that acceleration is the derivative of the velocity function with respect to time. Since velocity is the first derivative of the distance function with respect to time, the acceleration function is the second derivative of the distance function. In other words the acceleration function is obtained by differentiating the distance function twice. Our results can be summarized as follows:
We can define a Newton as the force equipped accelerate a body of unit mass, 1 kg, Therefore to accelerate a body of mass, m, the required force would be m times a.
This is read as, the force required to accelerate a body is directly related to its mass and the magnitude of the acceleration of the mass. The important concept to understand is that forces are defined as accelerations or changes in velocity. It requires no force to keep a body in motion Once in motion it will remain in motion. A force is required only to change its velocity or accelerate it. Thus force is a quantifiable measurement of a masss resistance to a change in motion. If an object had no resistance to a change in motion then there would be no such thing as force! This might seem to contradict reason. One can better understand this by considering an airplane flying in space, where space is some imaginary place that contains no matter or force fields inside it. If its four engines produce an acceleration and the mass of the
plane is kg, then the thrust or force acting on the plane is . If we assume an inexhaustible and weightless fuel source then theoretically the engines will push the plane forward with a constant force of 4 million newtons. Now since the plane is flying in an imaginary space under a constant force, it is free to accelerate forever. Remember forces are defined as accelerations and not velocities. The plane will accelerate at a constant acceleration of velocity would increases and increase at the rate of function of time would be a linearly increasing function: . This means the planes The graph of its velocity as a
The fundamental concept to understand here is that a force is required only to change an objects velocity. A change in velocity is by definition an acceleration. Therefore forces are required only to accelerate an object. A constant force acting on a body will accelerate the body with a constant acceleration, which means the bodys velocity will increase and increase forever, all due to a constant force. Furthermore the greater the mass, the greater its resistance to a change in velocity. Thus, the force required to accelerate a mass is directly proportional to its mass.
To derive this, think of a unit mass of 1 kg separated a distance from a larger mass, M. The gravitational force is the force of attraction the larger mass expresses for the unit mass and vice-versa. Observation confirms that the gravitational force is proportional to mass the larger body and decreases with the square of the distance separating them. The reason it is distance squared and not just directly related to the distance is because masses are 3dimensional. In 3-dimensional space, properties are related to the projected areas as opposed to 2-dimensional geometry that are dependent on the length only. Therefore, the gravitational force between the two masses is:
If the unit mass was replaced by a mass m, the force of attraction would be m times the amount it was with the unit mass.
G replaces c as a gravitational constant determined from experiments. The gravitational force of earth acting on a body of mass located near the surface of the earth is then:
Since most of our free falling bodies occur near the surface of the earth, we can take d, to be the radius of the earth. Substituting the known values in along with the value for G, reduces the equation to:
This important result tells us that the acceleration of a body of any mass is 9.8 meters per second per second near the surface of the earth. The gravitational force acting on a body near the surface of the earth would be its mass, m, times the constant acceleration, 9.8. For those who have a ground to hold them up this does not mean much but for a free-falling body in air its acceleration as it falls toward the earth will be a constant , regardless of its mass. A body of twice the mass will be pulled in by twice the force, but the acceleration due to the force of gravity remains the same.
This may sound a bit confusing but just remember any body will fall to the earth with a constant acceleration, independent of its mass. In terms of particle motion, mass means nothing for a falling body! While the gravitational force increases with mass, the acceleration remains the same. We can now write the acceleration function for a falling body near the earths surface as:
Since acceleration as a function of time is by definition the derivative of the velocity function with respect to time then what we have is the same as:
The derivative of the velocity function (acceleration) is 9.8. Since we know that the derivative of any function is we know what its derivative is. then we can easily find the velocity function since
The process of finding a function, given its derivative is known as anti-differentiation. In this case 9.8 can also be written as . We see that:
Therefore n equals 1. Consequently the anti-derivative of hopefully be obvious since the derivative of that velocity as a function of time is: is just
is
The graph of the velocity function is a linearly increasing function with constant rate of change or slope
This graph of the velocity function gives us the objects velocity as any time t, assuming that air-resistance is negligible. For example at t= 10 seconds, the objects velocity is:
Or almost 30,000 km/ hr. Due to air-resistance no object reaches such high velocities. Remember the greater the height it is dropped from the more time it has to increase its velocity or accelerate before it slams into the earth. Now how do we find the distance function or the distance covered from the point of dropping the object. From the definition of velocity we know that:
Since is
We can graph the distance function . The graph gives us the vertical distance traveled from where it was dropped at any time t.
Clearly as time, t, increases, the rate at which distance is being covered is very great . For example between t= 0 s to t =5s, the object has covered totally.
Or the object has covered 112.5 meters in the first five seconds of its free-fall. However from t = 20 s to t = 25 s, the object has covered:
The object has covered more than a kilometer during this five second interval!! This should make sense because initially the bodys velocity is small and thus does not cover much distance over a time interval ;. However, after some time its velocity has increased ( look at the graph of the velocity function graph), such that over a same interval , the object covers a greater distance. Remember constant acceleration means that the velocity is increasing linearly with time and distance increases with the half square of time.
From the definition of the derivative, the derivative of a constant function of the form is zero because:
Looking back at our free-falling body, we know that its acceleration was now be written as:
. This can
Remember that the anti-derivative of zero is a constant. At t=0 we have the initial condition:
We call , or the velocity of the object at t= 0. It represents a situation that may exist when the initial condition is zero. When anti-differentiating we need to remember to add a constant along to reflect the initial conditions that may exist in the situation. For example if a car is traveling down the highway at 82 mph , suddenly sees a cop, and then steps on the pedal, accelerating the cart at 3 mph/s, then its velocity at any time t, were t is measured as soon as he steps on the gas is:
The anti-derivative of this function gives us the distance covered as a function of time:
is the objects initial position at t=0. Our result can be generalized for initial acceleration or distance as follows:
Questions
1. Consider the following three cases for a free-falling body: 1 - A ball is dropped from rest from the top of a building. 2 - A similar ball is dropped from the same spot with an initial velocity of 10 m/s Find the distance function (distance covered as a function of time) for each case. 2. Consider three cars that pass a certain starting point. Car 1 - starts from rest with a constant acceleration of 6 m/s/s Car 2 - Has an initial velocity of 50 km/hr and maintains this constant velocity with no acceleration. Car 3 - Has an initial velocity of 20 km/hr and an acceleration of 4 km/hr/ sec Car 4 - starts 5 km in from of all the other cars with no initial velocity but an acceleration of 7 km/hr/sec. Derive the distance function for each car with reference to the distance covered from the staring point. Graph the distance function for each car. 3. Last, determine at which distance each car will pass each other. You can do this by either looking at where the graphs of the paths intersect or by setting the distance functions equal to each other and solving for time.
Solutions
d(t) = 4.9t^2
d(t) = 4.9t^2 + 10t car1: d(t)=10.8t^2 car2: d(t)=50t car3: d(t)=2t^2 + 20t car4: d(t)=3.5t^2 + 5 car1 and car2 intersect at about 235km at a time of about 4.8hours car1 and car3 intersect at about 55km at a time of about 2.4hours car1 and car4 intersect at about 10km at a time of about 1hr car3 and car4 intersect at about 6km at a time of about .5 hr car2 and car3 seem to never intersect, unless the intersection is at time much greater than 6 hours
Horizontal tangent may also exist, but not necessarily, when the concavity of a graph changes.
A change in concavity occurs at points on the graph called inflection points. As we shall soon study, the derivatives value at an inflection does not have to be zero. Therefore we will restrict our definition of equilibrium points to reflect either maximum or minimum values on the graph. For example to find equilibrium points for the function, differentiate it to get, . The derivative, . we first need to
, at any point x. Since we want to find points where the rate of change of equal to zero to find those values of x which the satisfy results in: . Doing this for
This tells us that at both x=0 and x=2/3 there exists an equilibrium point, which is confirmed by the graph of the function,
Having found the critical points, how do we classify them as either maximums or minimums? Obviously this can be done by just looking at the graph of purpose of this chapter is to understand how to only use the derivative, , but the to
approximate the behavior of the function . To determine whether our equilibrium points are either maximum or minimums we need to evaluate points left and right of the equilibrium points to determine where the function, , is increasing or decreasing.
A minimum is defined as the bottom of a U-shaped or concave up graph. If the graph is concave up then the slope or rate of change is positive to the right of an equilibrium point and the function is increasing to the right of that point. To the left of the equilibrium point,
the slope is negative which means the function is decreasing till it reaches the equilibrium point. To better understand this, look at the following graph of a concave up portion of a graph. Notice how the function decreases till it reaches the equilibrium point then rises after passing it.
Similarly if the rate of change of were negative on the right side and positive on the left side of the equilibrium point, then we get an inverted U shaped or concave down graph. A concave down graph thus reflects a maximum value at the equilibrium point. If the slope is both positive or negative on either side of the equilibrium point then we get an inflection point that represents where the concavity changes from a U shape to an upside down U shape or vice-versa. The following graph summarizes the above conclusions.
The first derivative allows us to define equilibrium points on the graph of a function, . By evaluating points to the left and right of the equilibrium point we can classify these points as either maximums or minimums and thus determine the concavity of the graph. Without having found the equilibrium points it is extremely difficult to determine the behavior of a function over an interval. The sign of the first derivative only tells us if a function is increasing or decreasing; however, a function can increase or decrease in two way. For example consider the graphs of the following two different functions.
In both cases the function is increasing and the first derivative is always positive; however each function increases in a different way i.e. one increases concave up and the other increases concave down. Using the first derivative only, we would have to know not only where its positive or negative but also how the first derivative is changing i.e. positive and increasing, negative and increasing etc. For the first graph of is concave down. The process of looking only at the graph of the first derivative to understand how behaves is an extremely abstract and difficult one. To quicken and simplify our work we can use the functions second derivative to conclude where the graph is concave up or down. This information along with the fact that the derivative is either positive or negative over an interval will be enough to accurately determine a functions behavior. Recall that a property of a concave up part of a graph is that its slope or rate of change is always increasing. is positive and increasing thus the graph of f(x) is increasing and is also positive but is decreasing. Thus the graph
On the left side the slope is negative; however, as x increases the slope gets less and less, 5, -3, -2, till it reaches 0, from where on it increases to 1, 5, 6, etc. We can then conclude that the rate at which the slope is changing must be positive or the graph of the derivative is increasing. Since the derivatives value is constantly increasing, then the rate of change of the derivative, given by will be positive. Remember that positive rate of change implies that the function is increasing over that interval, while a negative rate of change implies the function decreases as x increases. In a concave up graph the derivative is increasing, such that the second derivative over this interval will be positive. Working in reverse we arrive at an important conclusion. If the second derivative is positive over an interval, then the first derivative is increasing, implying that the graph of the original function, is concave up. This is true because the rate of change of a concave up graph is always increasing. The reverse is true for concave down graphs. If the second derivative is negative then the first derivative is decreasing, implying that the original functions graph is concave down over the interval. The following graph summarizes the conclusions:
Though all the information concerning the behavior of f(x) can be obtained from studying its derivative, we can quicken and confirm our sketches by looking at the functions second derivative. Without having found any equilibrium points we can accurately determine the behavior of over an interval by using the signs of both the functions first and second derivative simultaneously. Four possibilities may exist for the signs of the derivatives. Both up. 2. and are positive over an interval. Therefore is increasing and concave
is positive but
is negative. Thus
3. 4.
and
is negative but
have to exist at equilibrium points. For example in the graph of derivatives value at the inflection point (x=0.45) is not equal to zero.
In some rare cases, the value of the derivatives can be of the same sign before and after an equilibrium point. In such cases the graph of the function, has to be determined by carefully looking at both the functions first and second derivative. To make this analysis simpler, let us go through a step by step process for predicting the behavior of only and , using
To begin drawing f(x), first find maximum and minimum values by setting f(x) equal to zero, and solving for x. Next find inflection points by setting f(x) = 0 and solving for x. Then plot two number lines for both f(x) and f(x) with plus and minus signs to indicate where they are positive and negative. First look at f(x) to understand where the graph is increasing or decreasing. Next look at f(x) to find out how the graph is increasing or decreasing, concave up or down. The number line for the example is:
Having drawn this important number line, how do we make interpret it. Remember positive rate of change implies that is increasing while the reverse is true for negative rate of change. Similarly a positive second derivative implies that the graph is concave up while negative values represent concave down. Then use equilibrium points to draw the graph through them.
This gives us the exact change in over the infinitely small interval dx. Remember how the instantaneous rate of change was defined as taking the limit as a discrete value for a change in x, goes to zero.
The derivative gives us the instantaneous rate of change of a function over an infinitely small interval, . If we multiply both sides by dx, we get:
and
allows us to approximate the change in a function, . Since the derivative is only defined over an infinitely small interval, we can not use its value to give us the exact change in a function over an interval This is because the derivative of a function changes after each interval, dx and is constant over the interval Therefore the following equation only represents an approximation to the net change in as it assumes that the rate of change is constant. over a discrete interval
Look at the following graph and notice how the approximation gets less and less accurate as x increases.
Replacing the infinitesimal values with discrete changes, both sides by , yields:
and
, and multiplying
Using the derivative we got an error of 4 or 20 %. As x get smaller, the error gets less such that this method becomes more accurate as We can state our observations in a short theorem:
As long as we know a functions value at some initial point we can calculate its value at a nearby point x, by evaluating the derivative at x = a, then multiplying it by x=(x-a) and adding the f to f(a). This only gives an approximate answer because it assumes the rate of change of the function is constant over the interval x. For example to approximate we can use for f(a). The derivative at this point is
All that we have studied about the derivative until now suggests that the entire graph of a function can be drawn exactly by using the derivative only. This should be of no surprise since the derivative of a function is derived from the function itself, it only remains to undo what is done in the derivation to get the original f(x) back, given only f'(x) The accuracy of our results depends on how small our interval x is. This is because, with the exception of linear graphs, the derivative or rate of change of a function is different at each and every point on f(x).
The change in the function is only valid for the derivative evaluated at a point multiplied by an infinitely small dx The derivative is only constant over an infinitely small interval, . The only precise way of defining f(x) in terms of f'(x) is by evaluating f'(x) x over infinitely small intervals, keeping in mind that f.(x) is different over each interval. Therefore if we multiply the derivative by x we get:
The change in f is exactly equal to the derivative evaluated at a point x, multiplied by an infinitesimal dx. To get accurate approximations of f(x) we need to divide the interval into smaller sub-intervals and evaluate the rate of change at each point. For example if we were to divide the interval from x=4 to x=6 of into 3 intervals to calculate the change in the function.s value from x=4 to x=6 we would have:
Direct calculations tell us that from x=4 to x=6 , the change in f(x) is:
We now want to find this answer using the derivative only. As opposed to our last example we have divided the interval into three sub-intervals where the rate of change is different at each of the three points. In each subinterval x = 2/3 =0 .67. From x=4 to x=4.67 the approximate change in f over this interval is:
From x=4.67 to x=5.33 the approximate change in f over this interval is:
Therefore the net change in f(x) over the interval from x=4 to x=6 is the sum of the three approximate change in f.s that we calculated. Before stating the results let me first explain some of the notation about to be used. The first sign is:
This tells us to evaluate the change in a function, = b. The second sign is a summation sum:
The summation signs tells us to add up all the individual values of x from Returning back to original example we can state our results as, the change in the x=4 to x=6 is approximately equal to the sum of the Mathematically this becomes:
. from
The result corresponds to an error of just 6.6 %, far better than the 20% error we got when we used only one interval of x = 2. The reason we still have an error is due to the fact that the derivative is constantly changing over the discrete interval, We only calculated the derivative at three point from x=4 to x=6. The accurate way to find the precise change in the function would be to let x go to zero and then calculate the infinite sum of the along the interval from x=4 to x=6. This is true because according to the definition of the derivative, the rate of change of a function at any point is based on going to zero:
Thus the derivative is defined to be constant only over an infinitely small interval, dx. We shall study this more in depth in the next chapter. Until now we can state the following conclusion:
As n gets larger,
What this definition does is analyze a changing function over an infinitely small interval to calculate the rate of change of the function at that instant. By letting go to zero, the terms interval and point become synonymous such that the derivative gives is the rate of change at any point x. If we multiply both sides of the equation by dx ,where we get:
The answer is only an approximation because the equation assumes that the rate of change or derivative is constant over the interval . For example consider the following linear function:
or is a constant. At any point on the graph of , the rate of change is reflected by the constant steepness of the graph. Now let me pose a simple question. How much does the function, at 0 and 4 we get: , change when x goes from 0 to 4? By evaluating
The new symbol [ means to evaluate the function at the two values specified on the symbol and subtract the difference. When x changes from 0 to 4, the dependent variable, f, changes by 8. How else could we arrive at the same answer? From the definition of the derivative we know that:
Over an interval x, the change in f(x), can be approximated by evaluating for the interval. Let us see how this applies to the function we were studying. We have shown that as x changes from 0 to 4, f changed from 0 to 8. Now let us find the same answer using the above equation:
The answer obtained from the derivative is exact because the rate of change of the function is constant over the discrete interval,
But what about if the derivative is not a constant but varies with x? As we shall soon prove, the total change of a function f(x) from x = a to x = b is found by taking the infinite sum of from a to b where each over the infinitely small interval, dx. gives an infinitely small change in the function, df,
As x changes the rate of change of f(x) increases. We can begin by asking ourselves what is the change in f(x), when x changes from 0 to 8? In the graph this translates to:
How else could we calculate the change in f? From the derivative we know that:
Since the derivative of the function changes with x, then we need to evaluate the above equation over small intervals of x to get an accurate answer. Remember the rate of change of f(x) is changing and is not constant over a discrete interval, x.
from x = 0 to x = 8 by using the derivative only. A small f of the over each interval,
The net change in f(x) is found by summing up the individual approximations of each interval.
, for
Adding up the , we get 28. This corresponds to an error of 4 since we calculated the actual f of f(x) to be 32 as:
To approximate this
to calculate the net change in the function, change to the slope of the tangent is:
By evaluating the derivative over each interval x we got the approximate f of f(x) over that interval only. How could we get a more accurate result? The solution is to further divide the interval into a smaller , such that the derivative is assumed to be constant over the infinitely small interval. If we let equal 0.5, we will have sixteen intervals over which we can calculate the
You can clearly see that by further reducing the interval, the error is difference between the approximate and actual is very small. By using sixteen intervals, we reduced the error from 4 to 2. We can go on to conclude that as the interval, , goes to zero, then the net change in f(x), will correspond exactly to the infinite sum of the derivatives evaluated over each interval,
Thus the net change in a function from a to b is exactly equal to the infinite sum of its derivative evaluated over an infinitely small interval, One argument against such a conclusion is that as decreases, the error gets smaller for each individual interval however the total error remains the same. This is because as , we are adding up an infinite sum of small errors as opposed to a discrete sum of 8 or 10 for example. We have shown intuitively that this does not happen, but how can we prove it?
Returning to the definition of rate of change between two points on a graph, separated by a distance
Notice that we are not taking any limit as two points on the graph of
If we let
Therefore
is:
is:
Remember that
, therefore:
This important result confirms the fact that the sum of the derivatives evaluated over infinite times over infinitely small intervals of dx from x = a to x = b corresponds exactly to the net change in the anti-derivative, f(x). We have proven this for the case of , but how do we prove it for the general case for any f(x) and its derivative? More specifically we want to prove the following fundamental theorem of Calculus.
Here, the integral sign, , replaces the summation sign, , and represents the infinite sum of the derivative evaluated over an infinitely small interval, dx. To understand this let us take a more conceptual look at how the rate of change of a function is defined. The following formula give us the average rate of change of f(x), through any two points on the graph of f(x).
As we let , the two points, come closer together. The average rate of change converges to the instantaneous rate of change of the function at that point, x.
the rate of change converges to a constant value . Since the instantaneous rate of change,
given by is constant over the interval, dx, then an infinitely small change in f(x), df, is found by multiplying the definition of the derivative by dx, where
converges to
represents
the net change in the function, from x = a to x = b, where . Finally the last term is the infinite sum of the derivative evaluate over each interval, dx. This infinite sum is represented by the integral sign,
This the fundamental theorem of Calculus. For example if f'(x)= 2x and we sum up 2xdx as delta x goes to zero over an interval, we get the CHANGE in the anti-derivative function F(x) over that same interval, where F(x) = x^2 .
Assuming f and x represent two different conditions, then the situation F is dependent on their product. In other words f and x are two constants whose product defines the dependent dimension, F.
F is therefore, a function of two independent conditions, f and x. If we let x be a variable and f be a constant; we have:
As x changes, F also changes, since its value is only dependent on x and the constant f. Thus:
This should be rather obvious to you. Now what if the condition, f, is also a function of x? In other words, the value of f changes as x changes and is therefore, not a constant over an interval from x = a to x = b. This modifies the relationship to:
Since F is a function of f and x and f is a function of x, then F is also some function of x only, such that;
As the condition, f, is a function of x, then its value changes as x goes from a to b. Therefore can be assumed to be constant only over some infinitely small interval, dx. multiplied by the infinitely
This extremely important result defines the instantaneous rate change of respect to x to be equal to . The relationship of to is the same as
with is to
i.e. a function and its derivative. The only difference is the change in notation from . Multiplying both sides by dx again:
The result is still a bit abstract and may not seem to be anything new to you. The important point to understand is that F is a function of two independent conditions, f and x. If f is also a function of x, then as x changes, f also changes such that its value is not constant over an
interval from x = a to x = b. Therefore an instantaneous change in F, df, needs to be defined over an infinitely small interval over which f(x) is assumed to be constant.
Integrating
from some value x = a to x = b gives us the net change in is defined as the anti-derivative of
The result is the fundamental theory behind integral Calculus. Take some time to think deeply and abstractly about what is going on in this derivation. Mathematically it represents the same theory derived in the previous chapter, but conceptually it explains integration in a new way. It allows us to analyze a changing situation in terms of infinitesimal changes which we can sum up to find the net change in the system as one variable changes.
What if the height of a particular shape were changing with respect to its length, expressed by the function, ? The graph of the function and shape would look like this:
What is peculiar about the shaded area? The only apparent characteristic is that its height is changing with the length such that as the length increases the height increases. Therefore the area of the object is not a simple matter of:
In the graph, height is a function of length. The area equation for the shape now becomes:
Over an infinitely small length, dl, the area of one rectangle has constant height, and length, dl . Over an infinitely small interval, the area under the curve changes by the area of one rectangle. This can be expressed as:
The infinite sum of the inscribed rectangles over an interval of l=a to l=b corresponds exactly to the net change in the area of the shape, or the area under the curve from l=a to l=b. Thus the area of the shape from l=1 to l=3 is:
then in this unique case will give us the area the shape for any length, l. However keep in mind that we are calculating how much a situation changes, as its defining conditions change.
Now let us look at volumes. Let us say we take the same shape with length, l, and height, and revolve it around the x-axis. How do we find the volume of the conical cylinder generated?
All we have is a cylinder of varying or changing radius. The radius is give as a function of length by;
As the length goes from some value, a to b the radius changes with respect to the length. The equation can only hold true over an infinitely small interval, dl. The change in the volume of one infinitely small cylinder with length, dl, is
The net change in the volume of the cylinder as the length changes from 0 to 6 is found by integrating the differential,
Step 2- Identify which dimension is changing with respect to another dimension and determine the independent variable. If f were given as a function of x, then its value would change as x changed. Thus x is our independent variable.
Step 3- Write the differential dF, as a product of f(x) and an infinitely small change in the independent variable x, dx.
Step 4- Integrate both sides of the function from some value x=a to x=b to calculate the net change in the dependent dimension F.
To understand how to apply the systematic approach, Consider the problem of finding the work done by a gravitational field on a space shuttle. Jules Verne, the master of science fiction, wrote in his popular book Round the Moon about a space shuttle that was launched into orbit by literally firing it from a large cannon with a long muzzle. The ship veered of course and instead of landing on the moon, it flew towards the side of the moon, around the dark-side or back of the moon then came out the other side, directed towards earth. Luckily it did find its way back to earth safely, by landing in the sea. Enough said of that, the problem we want to consider is how much TNT would be needed to send the shuttle to the moon.
There are essentially two forces, both due to gravity, acting on the shuttle. That of the earth, which attracts the shuttle back towards the earth and that of the moon which attracts it forward. The equation for the gravitational force between two bodies is given by Newton's Law of gravitation is where G is a constant, M the mass, and d the distance between the two bodies. The derivation of this formula is not difficult but what exactly is gravity and what causes it, nobody knows. The fact is that there is an observed force determined by this equation. We can simplify this problem by finding the location of the point where the gravitational force of the earth is equal and opposite to the moons gravitational force. We do this by setting the two equation equal to each other:
Solving for d gives us 200,000 km from the center from the earth, remember gravitational forces are measured from the centers of the body. Once the shuttle reaches this point, the moons gravity will be stronger than the earths and will pull the shuttle towards it. No extra energy will be required. We can now state the problem as find the total amount of energy required to send the shuttle to this midway point. Then use this answer to determine how much TNT to use. In our systematic approach. step one is to always write the equation first. In this case we are looking for total energy and energy is given by the simple equation:
Step two is to identify which variable is changing, or which variable changes with respect to another constant. From the given equation for gravitational force between two bodies we had force given as function of distance:
There are two forces acting in opposite directions on the shuttle, that of the earth and that of the moons, where the earth's force will always be greater till the midpoint is reached.
This tells us that as the force is changing with respect to the distance and is not constant over a distance d. In step three we determine our independent variable and then write our differential over an infinitely small interval. From the above equation it is clear that Energy is entirely dependent on distance which tells us that d is our independent variable, since everything else, including force, is dependent on it. We can now turn the d after the f(d) into an infinitely small d. When doing this we must then convert the Energy also into an infinitely small E. This tells us that the force at a particular distance, multiplied by small distance d gives the energy done by the force over that interval or:
In the fourth and final step we determine which values, the dependent variable goes from and ends at. These values become our limits of integration so we can now take the infinite sum of F(d) d over an interval to find the total energy done. In our example the distance goes from the radius of the earth, 6400 km to the midpoint, 200,000 km away:
Therefore the shuttle must have at least 100000000 Joules of energy before leaving the earth's atmosphere, by the time it reaches the mid-way point it will have almost no energy left as all was used up by the opposing gravitational pull of the earth and the moon. Given that 1 kg of TNT will give 10 Joules of energy, 10000000 kg of TNT will be needed to blast the shuttle out of the canon.
If a load/force is applied at the end of the beam, the beam will bend downwards. Try this with a ruler in your hand to see how it bends. When a load is applied at the end the beam will experience the highest stress at the end where it is fixed. The stresses it experiences are proportional to how high the load is and how far the load is from the fixed end. In engineering, the term 'Bending Moment' is calculated from the product of the load multiplied by the distance . The greater the bending moment , the greater the chance it will break . Therefore: Bending Moment = Load X Distance In the example below of a single load at the end of the beam , the bending moment at the fixed end would be Load times the distance, d.
How would we find the bending moment for the case shown below?
Here a distributed load is increasing along the span of the beam with a triangular distribution. Triangular distributed loads are found commonly when a liquid is exerting pressure on a wall with the pressure increasing with the depth . An example would be the walls of your swimming pool. As the water goes deeper, they exert a linearly increasing pressure on the walls of the pool. Since our distributed load is changing with the span of the beam, we need to apply our systematic approach to integration to solve the problem. Step 1 - Determine the form of the functional relationship between the interacting conditions. This is synonymous with writing the equation, where every dimension is assumed to be a constant:
In Civil Engineering, a distributed load is expressed as a constant in units of load per unit distance. For the case where the loading is a uniform rectangular distributed load over the span as shown below:
We can write: Load = some constant * distance The constant is in units of load/distance and its value depends on the magnitude of the distributed load. Therefore the constant multiplied by distance equals the total load acting over that distance.
Step 2 - Identify which dimension is changing with respect to another dimension and determine the independent variable. In our triangular distributed loading case the constant changes linearly with distance or:
Step 3 - Write the differential dF, as a product of f(x) and an infinitely small change in the independent variable x, dx. Substituting back into our equation for bending moment:
Step 4 - Integrate both sides of the function from some value x=a to x=b to calculate the net change in the dependent dimension F. In the triangular loading case, c(d) just equals some constant, c multiplied by d. We could certainly have parabolic and even exponential distributed loading functions. But for triangular loading, we just need to replace c(d) with constant times d or:
Questions
Determine the bending moment for this parabolic type of loading:
From Funny exam questions Your gut feel for a situation is essentially the integral of your experiences from your birth till the present moment. The ability to trust your gut is the most fascinating trait of human intelligence. It reduces years of experience and knowledge into a single quantifiable dimension. Understanding the gut can be quite complex as it involves a knowledge of many interacting disciplines, e.g. neurology, psychology, sociology etc. I will not bog you down in complex medical terminology but rather give you a layman's perspective on, 'What does this all mean to me ?'. While we like to believe the world is black or white , the reality is people are grey. We can distinguish the infinite shades of grey by assessing what our gut feel is for a situation. Why rely on a conclusion based on one piece of information when you can rely on your gut which is based on countless pieces of information?
The gut has many limitations as well, so understanding them will help train your gut better. A lot of current authors have written extensively about the gut's critical role in decision making, as in the following excellent books: Blink: The Power of Thinking Without Thinking by Malcolm Gladwell How Doctors Think by Jerome Groopman The Wisdom of Crowds by James Surowieck
The fundamental principle of the gut is its ability to accumulate information and store it in a distinct location in the brain. We all know it is much easier to recall something rather than remember. For example you will find it impossible to remember all your steps the previous day. However, if you watched a video of yourself moving about yesterday , you would instantly recall each situation with ease. The same goes for places you visit, details of people you interact with, knowledge you read in books etc. While it is impossible to remember the details, you will always be able to recall the details when given a little hint. So how does this translate to the gut? The brain stores information in containers in your brain. Now the fascinating part is the brain stores both the information and intensity of the information. For example let's say someone told you Hitler was a bad man. Somewhere in your brain that would get filed away in the Hitler department with a score of -1 . Now each time someone told you Hitler was a bad man, that same part of the brain will increment the counter by one . So if I asked you today what you think of Hitler, your brain will immeadiately fetch the intensity of the information on Hitler and report it to you as a gut feel. That is the remarkable aspect of the gut. While it is fairly obvious that each of us has a Hitler file somewhere, it is not obvious that our brain also stores the intensity of the information as well. Therefore, what is most amazing is this intensity can be recalled on a moment's notice when needed. Another example that demonstrates how the brain stores increments of intensity of information is through the 'nuisance accumulator'. The phrase explains how the brain collectively logs each recurring nuisance or annoyance. Humans are forced to change when the nuisance accumulator becomes full and needs to be emptied. . For example if you buy a new car, chances are the first year everything seems great. But with time, each minor annoyance starts to grow on you until it becomes easier to purchase a new car rather than live with weight of all the accumulated nuisances in your brain! Another powerful feature of the brain is chunkification. Essentially it is the ability for the brain to conglomerate details of knowledge into a single conclusion. When this happens the details are forgotten, but the conclusion remains in long-term memory. This is important because it lets your brain just store single pieces of information instead of infinite details. Also chunks of knowledge can be further chunkified into new conclusions. As we grow older we are basically taking existing knowledge and simplifying them into concise analogies that summarize all the information into a single saying. The gut is then able to probe these chunks. However, the gut is not able to probe the details that contributed to the making of these chunks since that information is forgotten. A good example that demonstrates this is to pick up an interesting article you read a few years ago. You will notice how all the details are forgotten, but the overall message of the article will seem very familiar and obvious to you. Chunkification explains why most older
generations of humans appear to be consciencously ignorant and inferior to later generations. All generations and societies do the best they can with the tools and knowledge they have available. However, the knowledge they develop becomes the building blocks for future generations to build larger chunks of conclusions from. The same feature that makes the gut feel so powerful also makes it prone to distorted perceptions on life. As your gut feel for a certain piece of information strengthens, your brain goes into a mode where it quickly rejects any information that is in opposition to your gut feel. For example, if someone started preaching to you Hitler is a good man, your brain would not start decreasing your negative gut feel for the man. Instead it would outright reject the information and not change the intensity of the existing information. This has nothing to do with psychology but is an evolutionary survival mechanism. As we grow older our brains need to be able to act quickly on gut information in order to survive. Furthermore, our brains have to learn to carry out complex tasks with ease. If you have ever watched a professional athlete, you will quickly observe how 'easy' they make it look. The reality is they have conditioned their brains and body through years of training to perform the most complex movements without a second thought. It is impossible to become proficient at a task without building a firm base of fundamental knowledge . Ask any professional and they will all agree. If our brain was always like clay as a child's brain, then our behavior would constantly be changing . Habits and methods would not develop. Consequentially it would be nearly impossible for us to progress by not being able to build upon established layers of complex systems. Cognitive dissonance refers to the brain's desire to resist information that would require the individual to act in ways that depart from their established habits. The social animal by Elliot Aronson is an excellent book that uses many interesting examples to explain the concepts of social psychology, including cognitive dissonance. This is why a child's early development essentially remains the foundations for their personality for the rest of their life. It does not matter what happens to them after maturity, their core self will hardly be unchanged. If you ask any 80 year old how they are different then when they are 17, they will respond " I am essentially the same person, except the things I believed when I was 17, I now believe with far more certainty". To summarize the following aspects of the brain and gut: The brain stores information and intensity of that information The intensity automatically increases/decreases depending on new knowledge learned until an adolescent's maturity The gut can retrieve the intensity of this information instantly when needed Chunkification lets the brain condense many bits of information into one conclusion to free up space for new information to be digested As you get older the brain autommatically rejects information that goes against the gut feel for that information in the brain
You are probably asking , 'So what does this all mean for me?'. First, since your brain so easily absorbs information like a sponge, spend time reading and learning as much as you can about the world. It does not matter if you can not remember most of what you learn. Rest assured that the information has been processed and filed away in the brain for easy
recollection when needed. In other words you are feeding your gut by being a vociferous consumer of knowledge. By understanding how your gut works you should then learn to trust your gut above any one-dimensional argument. You will gain confidence in your gut's ability to make the correct decision in complex multi-dimensional situations. Last, you need to be self-aware to not fall into the evolutionary survival trap of rejecting information that goes against your gut feel. Most humans spin in a spiral of a self fulfilling prophecy, whereby they only seek out information that strengthens what they already feel to be true. Therefore, you need to spend considerable time training your mind to be open and tolerant. Obviously you need to strike a balance between nurturing your habits and absorbing totally contrary ideas that can change your established habits.
Chapter 12 - Differentials
We have at last arrived at the final chapter of this book. The goal of this chapter is to extend the systematic approach of integration to include other types of mathematical functions involving relative change of variables. The process of identifying which dimension is changing with respect to another one is the first step in the systematic approach. The second step involves defining an instantaneous change for the function by writing what is called a differential. Differentials are of the form:
Dividing both sides by the infinitely small, dx, defines the derivative or instantaneous rate of change of with respect to x. What is important to understand is that is assumed to be constant over the interval, dx. Thus the infinite sum of these dx. s is given by integrating the function. The net change in the function, F(x) , over an interval form x = a to x = b is thus:
Let us now look at various combinations of f(x) and x, and see how we differentials are defined for each case. Consider case 1, where some situation, F is a function of two independent conditions f and x,
If f is a function of x, f(x), then its value changes as x changes. The differential is defined by converting the independent x into an infinitely small dx.
The differential is then integrated to determine how much the situation, F, changes as x changes from x = a to x = b.
In this case,
to get:
If f is given as a function of x, then we will first have to re-write f(x) as a function of A. We know that:
If
, then
. Once we have
This approach to writing and integrating the differential for case 2 functions is confusing and abstract. An easier and more logical approach is to integrate with respect to x only: For example if,
First let
, then
or:
The result is the same function we started with. If f were a function of x, then its value would change as x goes from a to b. The differential can be written as:
This results in the same numerical answer as the previously defined method of writing the differential. To summarize, if ; and f is a function of x, then:
Here
, where
Once again,
, then:
Our results can be generalized as follows. If F is a function of two variables, f and x such as:
If A is some function of x, and f is also a function of x then both these values change as x changes. It is important to understand that is a constant value, dependent on x, that defines F. Therefore, the differential is defined as:
Substituting this back into the differential gives us the general expression:
to find the net change in F over the interval. and , where and
Which is Newton.s second law, or the force acting on a body of mass, m, is a function of the acceleration given to it. We are free to input any a and what we get out is the force. The inverse of this Force function, according to the definition, will give us the acceleration as a function of Force. This is done by simply solving for the independent variable, a:
Now I can let F be anything and then find the acceleration as a function of it. The inverse of a function, f(x), is commonly written as, . Now we will look at the more general case of graphing a function and its inverse in the same co-ordinate planes. Given the function , to calculate its inverse we only have to solve this for x to get
. Notice that we have not really changed the function at all, we have only solved for the independent variable. The graph of these two functions would be exactly the same. Our definition of the inverse function therefore has to be slightly modified. After finding the inverse of a function we just interchange x and y to get: What does this do to the inverse function? This essential flips the graph of f(x) about the line y=x such that for every point (x,y) there is a corresponding point (y,x) on the graph of the inverse function. Now both the functions can be graphed in the same x-y plane. Remember that if we just solve for the dependent, we are not changing the equation but merely re-writing it. For this reason its graph is the same. By flipping the x and y, we get another function of x, whose relation to f(x) is that it has been graphed as though the x-axis were the y-axis and vice-versa. It is best we look at the two graphs:
Notice how every point (x, y) has a corresponding point (y,x) on the inverse function. The graph of the inverse function is therefore exactly the same as the original function except that the x and y-axis have been switched:
Since every point (x, y) has a corresponding point (y,x) then any point y from the inverse function when inputted in the original function should yield x:
Remember the a function and its inverse are both function.s of x. The way they are related is that the inverse function represents the original function by just having its dependent and independent variable switched around. As you can see from the first graph, when the two function.s are graphed together, the inverse function contains all the point (x, y), of the first
function, plotted as (y, x) with the exception that y is given as function of x. For this reason
What is important to understand about the inverse function is that it is obtained by solving for the independent variable, then replacing it with y, to create a function that is also a function of x and can be graphed along with the original function. Now that we know how a function and its inverse function are closely related, it brings us to the question, how are the derivatives related? Logic would tell us that instead of should just find we
Or the derivative of
Let us examine the graph of f(x) and its inverse function to see what exactly is going on.
Note that at x=2, the slopes are not reciprocals but are reciprocals only at y values of on the inverse function or through (x, f(x)) and (f-1(f(x)), x). Or the point (3,9) will have a reciprocal slope at (9,3) since at this point x and y are reversed hence the slope becomes the reciprocal or This is the important point to understand about the function and its inverse, they only behave as opposites at point (a,b) and (b,a). This means that at point a something different is going on. The question is then how can we find the derivative of the inverse function with respect to the x-axis? Looking again at:
What we have just done is calculated the derivative of the inverse function only by looking at the original function and its derivative. The reason the derivative was not just the reciprocal of y=2x was because we forgot to do the following two steps: 1) Replace x with its equivalent expression in terms of y.
By replacing x with we can find the derivative with respect to the same x-axis but instead with a y-value.
derivative of the inverse function with respect to the axis by taking the reciprocal of the expression and then replacing every y with x and vice-versa.
This last expression is the derivative for the function.s inverse with respect to the x-axis. To summarize we can state the following theorem: To find the derivative of the inverse function, 1) Remember, an inverse function is related to the main function in that if you reflect it over the line y=x, you will land on the main function. 2) First find the derivative of f(x) 3) Replace any x in the derivative with its y-equivalent, so as to be able to find the derivative with any given y-value.
5) Since the inverse function is graphed with respect to x, replace every y with x and x with y to find the derivative of the inverse function. To summarize further:
Where a is called the base, and x is the exponent or power. Therefore a raised to the x power is defined to be:
For example:
Since multiplication can be called repetitive addition then exponent or raising something to a power can be though of as repetitive multiplication. This is really all that we mean by exponents. They can be reduced to multiplication and multiplication can be further reduced to exponents. Now let us take a look at some properties of exponents. The first being the rule for multiplying two exponents of the same base.
Notice how multiplication of exponents has been reduced to addition of exponents. For example:
It is really this simple. The second important property of exponents states that a to the x power raised to the y power is just a to the product of x and y power or:
All this is saying is that an exponent raised to another exponent can be reduced to multiplying the two exponents out or:
Take note of how this rule reduces exponents of exponents to just multiplication of the two exponents whereas rule one reduces multiplication of exponents to addition of exponents, ( assuming a common base a). These two important properties of exponents are the fundamental ways of defining what exponents are and how they relate to repetitive multiplication, where multiplication is just repetitive addition. It now remains for us to define what we mean by raising an exponent to a fractional power. This is actually much simpler than it sounds. Since we defined whole number exponents to denote repetitive multiplication, we would want fractional powers to represent repetitive division. Fractional exponents therefore are called roots and tell us into how many equal multiples a number has been divided into ,such that the product of these roots gives us the original number a back.
This definition of fractional exponents remains consistent with the first and second rule of adding and multiplying exponents. Last but not least we need to define what we mean by raising exponents to negative numbers. This is done by remembering that positive exponents refer to multiplication, therefore negative exponents would refer to division. In order for our definition to remain in accordance with the two rules of exponents we need to define it this way:
This is rather a long way of deriving this as you can clearly see from the following example how negative exponents are defined.
Increasing the power means multiplying it more times by itself, while decreasing the power means dividing it more times by itself. Therefore multiplication and division reduce to addition and subtraction in exponents. Since a raised to the first power is a multiplied by itself once, or just a, by definition then what is a raised to the zero power. Once again this definition must remain consistent with the rules of exponents. Instead of raising a to zero power, we can raise it to 1/ or one over infinity which is close to zero. Therefore a raised to the negative 1 over infinity is:
From the definition of fractional exponents, we are asking ourselves what number multiplied by itself an infinite amount of times gives a? If this number were slightly less than 1, then as you multiply infinite times, you get a smaller and smaller number or zero. If this number were slightly greater than 1 then you would get a larger and larger number each time you multiply it by itself, or eventually infinity. Therefore only the number one satisfies both limits as you approach it from either 1/ or -1/. Hence:
This definition of zero exponent power tells us an important property of exponential.s. In multiplication you are adding, where adding nothing is zero. In exponents you are multiplying, where multiplying nothing is one, not zero. Therefore all exponents are expressed as repetitive multiplication of numbers greater than or equal to one. We can say that the base of exponents is therefore 1.
This is defined for all values of x, as defined in the previous chapter. Let us now look at the inverse of this function or what we shall refer to as the Logarithmic of this function. Remember the inverse function is found by interchanging y and x so we get:
This tells us that we must do something to an inputted x value to find y. Remember x refers to the number we get by raising a to some power y. Therefore if we were given x then in order to get y, we must ask ourselves to what power has, a, been raised to so as to get y? This question can be replaced with the word . Log., where the Log, base a, of a number is the power a or the base has been raised to so as to get x. Stated mathematically:
The question marks refers to our output or y values. Remember we are inputting values of x, making x our independent variable, then calculating what power the base must be raised to get x. This answer is our dependent variable as its value depends on what number x we have inputted. Hence:
This is really not that difficult, just remember that the Logarithmic function is the inverse or
opposite of the exponential function. Now let us move on to examine some unique properties of the Logarithm.
Or the Log of a product is simply the sum of the Log.s of each number. To understand this, recall the property of exponential multiplication, where the product of two exponential numbers of the same base is simply the sum of the exponents with the same base.
we begin by assuming that c and d can be expressed as some exponent of the base. Remember that let: . Therefore we can
and
respectively.
The proof is based on property 1, excepts that we must keep in mind that Thus is expressed as difference of two Logarithms and not a sum, since we are dividing not multiplying. The final and most important property of Logarithms is that:
as
where . When we raise x to the n'th power we must raise the left side of the equation also to the n'th power:
This is based on the third property of exponential functions that tells us that an exponents raised to an exponent can be reduced to the product of the exponents. We can now write as which is the same as
Having proved the unique propertied of Logarithms, all of which related to the property of exponents, we shall now calculate the derivative of the logarithmic function. We begin with the function:
From the second property of Logarithms the numerator: Can be simplified to:
Consequently:
Clearly as goes to zero, / x also goes to zero regardless of what value x is. Outside the brackets, 1/ tends to infinity as goes to zero. This means we can rewrite:
x/ is therefore the reciprocal of / x and it also tends to infinity as approaches zero, regardless of what value x takes on. Thus from the third property of Logarithms:
We shall now make a small change in variables to help evaluate the limit inside the Log brackets. We replace / x with 1/n and x / with n. The derivative can now be rewritten as:
The sum of the terms after the first two will always be less than the sum of the
corresponding infinite series whose sum can be proven to be one. Just think that it will take an infinite number of steps to cover one meter if in each step you can only cover half the remaining distance.
This is true because: . The same reasoning can be applied to the remaining n terms; the numerator will always be less than one and the denominator will be greater than the denominator of the corresponding n.th term in the series. For example:
As we shall see later, this limit tends to the number e, where e=2.71828...Returning back to our derivative we get.
Therefore:
For this reason we denote the Logarithm of base, e, to be called the Natural Logarithm or ln x , whose derivative is simply 1/x .The derivative of logarithms of other bases is , where the Log e is some other constant other than 1. Having shown that the derivative of ln x is 1/x we can go on to prove the derivative of the inverse of the logarithm function, the exponential function, using our algorithm presented in the chapter on inverse functions. Before doing it that way let us use the definition of the derivative to find the exponential function.s derivative. To do this we follow similar steps to those we used to calculate the derivative of the Logarithm of base a, function. Instead of
Therefore As
and
We will now make a small change in variable to simplify the limit calculation. We replace with . We now have:
The value within the parenthesis will tend to zero or . We now want to solve derivative will be Solving the part in parenthesis gives us:
Becomes:
Remember this only happens when a=e. The derivative of is or the function itself. This tells us that the rate of change of the function at a point x is proportional to the function.s value at that point. This will be discussed further in the next chapter. Now let us examine the graphs of the Natural Logarithm function, , and the
exponential function of base e, . Remember these two functions are inverses of each other, since only the dependent and independent variable have switched places, or x and y.
Since we know that the derivative of y = ln x is simply 1/x, we can then find the derivative of the inverse function, the exponential function, with respect to the x-axis, using our algorithm presented in the chapter on inverse functions. It goes as follows:
We have just found proven that the derivative of is the function itself or y. = y. This was done by just examining the function.s inverse, the natural logarithm and its derivative with respect to x. This shows the close relationship the Exponential Function has with the Logarithmic function, the two being inverses of one another. This concludes our study of these two function.s and their derivatives. We will explain later how e is actually a number defined as a limit. For now just consider how in calculating the derivative of the multiplicative function, the exponential function, involved an analysis of each term in the series of the binomial expansion, whereas in the study of repetitive addition functions, polynomials, all the terms except the second, went to zero.
This tells us that the change in a function over an interval is independent of the constants that are added on. The constant are there to define the initial condition, such that for any, x, F(x) is simply plus or minus a constant. This will become clearer later on as we look at
examples. If constants are added on to polynomial functions, then they are multiplied on to exponential functions. What do we mean by this? When we are integrating the Logarithmic and exponential functions we can multiply constants that have no effect on the derivative, they are there only to satisfy initial condition. For example, the integral of 1/x is:
Note that we have not attached any limits to the integral, this is called the indefinite integral as it simply refers to any x over any interval. Suppose we were to integrate over a definite interval from a to b, then just as the constants in the polynomial functions canceled out, so will the constant in ln cx:
Therefore the constant plays no role in the change of function over an interval and is only there to define the initial condition.. Now let us move on to some applications, first a look at populations. Population growth is generally expressed as a percentage of the population per year. Such that a 3% growth rate means a population of 1000 people will see 30 new babies for the first year, 3% of 1030 for the next year and so on. No matter how large the population, there will always be 3% of the population wanting to have babies every year. We can express this relationship as follows:
The only function whose rate of change or derivative is proportional to itself is the exponential function or:
A more accurate way of solving this, is separating the variables and integrating the change in population from some initial to some final population, and integrating time from 0 to t.
We can now integrate both sides over the limits just defined:
Note how the initial condition of a given population at t=0 is multiplied on to the exponential function and not added on, such that at t=0 we have:
Consider the population of Pakistan, 120,000,000 people. If the birth rate is 3% per year, then what will be the population in 10 years? We do not take the death rate into consideration is because the birth rate, takes into account both the birth and death rate such that it reflects the net or true growth in
The population will therefore be over 148,484,468 people which represents an increase of 38,484,468 people in ten years. Note that this answer was not found by taking 3% of 110 million and multiplying by ten. This fails to work since it assumes that every year the population is 110 million, but this is clearly not the case, since the population is constantly growing and growing and 3% of the population is a larger and larger number. That is why 38,484,468 > 33,000,000. If you are a skeptical person, like me, you will probably doubt how accurate our answer is. One argument might be that since the population is increasing every second and not only once a year, then 38,484,468 is not the exact increase as it is based on a yearly increase in population growth. In fact our answer does indeed take into consideration that babies are born every second and not only one a year. To understand this, go back to our proof of the derivative of the exponential function:
We are taking the limit as time goes to zero and similarly the integral:
Is also based on t going to zero. Furthermore the growth rate is a constant value and will always be 3%, in this case, regardless of how much time has passed in years, months or seconds. The fact remains that there will always be 3% of the population desiring to have babies. is based on time going to zero, how can we be sure that when we solve for t =10, it is referring to ten years and not ten months or ten seconds later. The way we are sure that 10 refers to years and not anything else, is by remembering that 3% growth rate is a per year figure. So.
means that even though the growth rate is always 3%, it doesn't take into consideration that it takes around a year for a baby to be born, so that 3% per year means that the there will be .03P new babies per year, not per month. Another fault with our equation is that it assumes that the 3 percent of the entire population at any given time t, has an equal desire to have a baby in one year. This is obviously untrue. Therefore our population equation is clearly flawed but the general idea is correct. The continent of South Asia has an average growth rate of 2.5% per year. If the current population is 1.1 billion, in how many years will the population double or be 2.2 billion. Our equation:
In 27 years the Population of South Asia will double. In this next example we will take a closer look at the exponential function's role as a solution to the equation: . Hopefully all doubts will be cleared up now.
When one deposits money in a savings account, it is generally for letting the money earn interest. Interest is just money paid for the use of money. Interest rates are quoted as per year or 6% per year or 9% per annum( per year). So a 6% rate would mean that $1000 would earn once a year. . This is called simple interest since the money only earn interest
Compound interest on the other hand, is where the interest earns interest. So $10,000 put into an account that earns 6% interest, compounded quarterly or every three months, undergoes a series of calculation. In each quarter we add the interest earned from the previous quarter to the amount of money in the bank to get our new balance for the next quarter. Therefore the interest earned in the previous quarters adds to the interest earned in the following quarters Quarter 1:
Quarter 2:
Quarter 3:
End of 1 Year:
The difference is not much since the amount of interest earning interest is small. The balance in the bank after one period for money compounded n times a year can be found by:
This last formula gives us the new balance after the first time it was compounded. To find the amount for the second compounding period we multiply by:
Factoring out a
gives us:
The money in the bank after the 3.rd and 4.th it has been compounded is given by . By observing the pattern we can conclude that the balance at the end of the year, when the money has been compounded n times is:
If $10,000 were put into a savings account that gives an interest rate of 6%. compounded quarterly,
If this new balance were put into the same account for another year we would have:
The formula then for calculating the new balance, or how much money is in the bank after t years compounded n times is:
The exponent (nt) refers to how many times the money has been compounded. If in one
year it is compounded n times, then in t years it is compounded nt times over the t years. If the same money were compounded infinite times this would mean that the interest is constantly earning interest. The interest earned over a small change in time is added to the previous balance, creating a new balance on which the interest earned over the next small time interval is based on. Therefore the rate at which the money is changing is proportional to the amount of money in the bank times the interest rate.
Let us examine this familiar looking limit in the brackets, . It looks similar to the definition for e, except for the r, interest rate which is constant throughout the time interval. To evaluate this recall that the limit as x goes to zero of x/x and x/x were infinity and zero, respectively. This is true regardless of the value of x, since its value was fixed and did not change. Therefore by raising to the 1/r power gives us:
Now as n tends to infinity, r/n goes to zero, and n/r goes to infinity. This gives us the familiar answer, e.
Returning back to our initial problem of calculating the Principal in an account that compounds interest infinite times per year:
This important answer could also have been arrived at by realizing that money that is put into an account that compounds interest infinitely is changing at a rate proportional to the amount of money in the bank, times the interest rate. For example if at t=4, the amount of money in the bank was $10000, and the interest rate was 6%, then at that instant the money would be changing at the rate of $600 a year. As the money changes then so does the amount of money in the bank, such that our equation becomes:
We can integrate this equation to solve for P in terms of r and t. We integrate time from 0 to t, and Principal, from P at t=0, to some final value of P.
This last answer is the same answer we derived using the definition of compound interest, which gives us the amount of money in a bank after t years and infinite compounding with an initial deposit of P0. Differentiating this equation with respect to time:
Despite how difficult it may seem to explain this change the fact remains that the money is changing at a rate proportional to the amount of money itself, such that as the money begins to change over a small change in time, the amount of money in the bank changes by a small amount creating a new balance that is only accurate for the next t . It is really this simple. In this example we will compare simple interest to compound interest. Consider a poor college student who decides to put his summer earnings of $2000 into a savings account for 3 years. His choices are to City Bank that offers a simply interest rate of 3.2% per year or Carolina Bank that offer an interest rate of 2.9% compounded continuously. Where should he put his money? Also note that City Bank offers a free T-shirt for opening a savings account. Lets first consider City Bank. The rate at which his money will be changing is the rate of interest time the amount of money in the bank or (3.2/100)(2000) = $64/year. Remember this is the definition of simple interest, the rate at which your money is changing is only dependent on how much Money you initially open the savings account with.
With Carolina Bank, the interest rate is 2.9% compounded continuously. We can just go directly to our equation for continuous compounding:
This tells us that the interest earning interest only contributed to an extra $9.52, barely
more than the cost of the shirt. This is because the interest earned was small, and the interest it earns is even smaller. Now if we were to put the money on for 30 years, then the interest earned will have enough time to earn some substantial interest.
In simple interest the amount of money in the bank remains constant, while in compound interest the balance is increasing constantly. Just as exponential functions increase with rates proportional to the function itself, they can also decrease. This happens when:
The evolution of living beings is also based on the priniciples of exponential growth. Most people can not understand how small changes to populations per generation can lead to large differences in species over long periods of time. This works exactly the same way money grows to a large amount when interest is compounding. Sean B. Carroll writes in his book, The Making of the Fittest: In the case of evolution, the amount of the initial investment is analagous to the number of individuals in a population with a trait. And the interest rate is analogous to the small selected advantage that trait confers on those individuals that carry it. While the time required for one trait to become prevalant in a population is greater than one lifetime, it is often no more than a few hundred generations. This is a blink of an eye in geologic time. This tells us that small differences among individuals when compounded by natural selection over time really do add up to the large differences we see among species. This concludes our study of the exponential and logarithmic function. It is important to understand the differences and similarities to the polynomial function. The base of the exponential function is multiplication and exponential growth can be thought of as repetitive multiplication. In this chapter we saw how this applies to multiplying populations and multiplying money.
Questions
1. The power of compound interest lets you amass large wealth as long as you start investing early. In the early years your investment will grow linearly , but after some years you will see the investment grow very fast. Make a bar graph in excel for investing $10,000 once a year for 30 years. If you do not have excel, you can download either Open Office or Gnumeric , both are free spreadsheet softwares.
Assume the interest is compounded continuously at a rate of 6%. Write a formula to calculate the value of the $10,000 at the end of the first year. Then in the next row add $10,000 to the new principal and use the same formula to find the value of the new principal afer that year ends. The graph you make should illustrate the power of investing early in life and reaping large rewards towards the end . 2. Exponential decay occurs whenever the rate at which the function is decreasing is proportional to the function itself. Consider the element Uranium that undergoes radioactive decay at the rate of 12% of the material per year. This decay rate, 12%, is independent of the amount of material present. Find the time taken for half of Uranium to disappear, or the half life.
This half life is true for any amount of material. A 1000 kg sample and a 1 kg sample will both take the same time to reduce to half their original mass.
almost any study of oscillating or vibrating systems, the Sine function arises, for reasons that are not attributed to mere coincidence and chance. In all simplicity the Sine function is the complex mathematical function to describe circles. From the lines from T.S Eliot's Wasteland and from your own experience circles represent a never ending, repetitive cycle. Motion along a circle passes through the same point infinite times. It is this complex nature of the circle, that causes its study to be so intense and intricate. Before we can talk about circle, we need to first understand triangles. Let us begin our study of the Sine function with a look at right triangles. In all simplicity the Sine of an angle ( The issue of what is an angle and how to describe it will be dealt with later) is the ratio of the opposite side to the hypotenuse:
Or more directly:
From the calculator or a Trig. Table we can find the angle approximately. The Cosine functions similar to the Sine function except that it measures the adjacent side, not the opposite side, ratio to the hypotenuse. For example:
The last trig. Function, the Tangent, is the ratio of the opposite side to the adjacent side.
Thus in right-triangle ABC, The remaining Trigonometric functions; Secant, Cosecant, and Cotangent are simply the reciprocal values of the Cosine, Sine, and Tangent respectively.
It is not important to remember the reciprocal functions as they unnecessarily add confusion to one.s understanding of Sine.s and Cosine.s and Tangent.s. It is enough to remember that they are just 1 over the value for the Sine, Cosine or Tangent. To further make life easier one can express Tan (as just Sin (over Cos ( This is easy to see as:
This reduces the study of Trigonometry to only two functions, the Sin (and Cos ( Any other Trig function can be expressed as a variation of these two. To state them again they are:
At this stage your mind should only be focused on knowing what the Sine and Cosine of an angle mean. Sine is opposite over hypotenuse, and Cosine is adjacent over hypotenuse. Now there are two very important concepts to understand about the Sine and Cosine of a given angle. First, Sines and Cosines apply only to right triangles, when there is one right angle of 90 degrees, whose opposite side is called the hypotenuse. If one were asked to find the Sine of the following angle.
Then the answer is not 2.8/5.5 or 2.8/7 as triangle ABC is clearly not a right triangle and
side AC is not a hypotenuse. To find Sin (we would have to somehow construct a right triangle, probably the easiest way by drawing an altitude from C perpendicular to AB, like this:
The values for h and x can be found by writing two simultaneous equations based on
Pythagoreans's theorem:
The second important point to understand about Sines and Cosines of angles is that their values are independent of the dimensions of the triangle. What this means is that the Sine of a 62 degree angle will always be .883, regardless of the size of triangle it is measured in.
Size is a meaningless quantity unless it is used with reference to something else. If you examine a right triangle individually then you will find it impossible to describe its size. It is only when you look at it with respect or relative to another triangle that you can correctly say the following concerning these two right triangle.
Triangle A is large. (Relative to B only) Now if we were to compare Triangle A with let us say another Triangle C we would say:
Triangle A is small. ( relative to C) Finally we can compare Triangle A with two other triangles and say the following:
Triangle A is of average size. ( relative to b and C). This brings us to a logical question; how can Triangle A be small, large, and even average? Since Triangle A is unique and does not change then it is our adjectives which are at fault in describing the attribute of size of the triangle. We are now in a position to state an important philosophical idea: Adjectives are only as accurate as the number of objects an object is described relative to. In other words to describe something one must have at least a standard to compare it to. As we mentioned before descriptions are only as accurate as the number of objects it is compared relative to. For this reason calling Triangle A average was more accurate than calling it large or small. This is because, in describing Triangle A as average we were comparing it relative to two triangles and not just one as in the other two cases. The importance of understanding relativity cannot be understated in describing any situation, phenomena, object or attribute. To name but a few examples from the natural world, weight, velocity, size, distance and objects and light. Relativity can also be taken a step further to ask what in this world is absolute, or whose existence is independent of anything else. In the SI units of measurement, every quantity except length, time and mass are considered derived. For example volume, density , force etc.. can all be expressed in terms of the fundamental quantities of length, time and mass. But are length, mass and time, independent of each other? Returning back to our discussion of the Sine function we saw that the Sine of an angle was simply the opposite side over the hypotenuse. We also now see how all other trigonometric functions can be expressed as variations of the Sine function alone, with the help of the
Pythagorean theorem. Therefore before continuing let us go through a simple proof of the Pythagorean theorem, a theorem which as you already know, arises almost everywhere in Mathematics. We begin this proof by drawing an altitude in a right triangle ABC,
The altitude with height, h, divides the triangle into two other right triangles, both similar to triangle ABC.
Taking the theorem a step further we can prove the following important identity relating sine with cosine. In any right triangle ABC
Cos (q) would then be equal to a/c; however by the Pythagorean theorem; , we can then re-write a as Hence:
Remember that
This identity holds true for any right triangle. For remembrance sake it can also be written as; This useful identity tells us that even the Cosine of an angle can be expressed in terms of the Sine of the angle, therefore the Sine function is the fundamental Trigonometric function, as all other functions can be derived from it, including the Cosine function. Until now we have restricted our study to triangles, where the Sine of an angle is the ratio of the opposite side to the hypotenuse. We can therefore define the Sine function to be the function that outputs a unique ratio for an inputted angle. But we must ask ourselves what is an angle, and how can we express the sine of an angle mathematically without having to look up Trig. Tables? The solution is left for the next chapter that will go into detail to define the Sine function mathematically by using a circle and arclength.
Type in any questions, correctio for this page here.
Email:
Remember a function is by definition only a function when the output is directly related to the input. For example the following function gives the Volume of water in a bowl as function of height:
Input any height and one can easily calculate the volume. However the function,
is not a function as none of the variables; opposite, hypotenuse, or angle (theta) are related in any way. Give me an angle and I can not give you the Sine of it, that is without looking it
up in some table. To begin our study of the Sine function, we need to first find a way of defining it as a mathematical function, such that the Sine of an angle is a function of the angle inputted. It therefore behooves us to define what exactly is an angle. Till now we have assumed an understanding of angles and we looked at Triangles to show what they mean. Before defining what an angle is, let us now take an important look at circles to see how angles can be defined by them. From the Pythagorean theorem we showed that in any right triangle, equation : . The
Is therefore the set of all points a distance r from the origin, from the distance formula which is based on the Pythagorean theorem. The important point to realize here is that the co-ordinates of each point on the circle gives us the dimension of a unique right-triangle with base, x , height, y and hypotenuse r. Each point satisfies the Pythagorean theorem for right triangles, ; though the radius or hypotenuse remains fixed, the x and y.s can take on infinite values. To visualize these concepts, the Graph of the circle
Let us now look at how to define an angle in terms of the circle. An angle in all simplicity is the measurement of rotation between two intersecting lines. This rotation can be expressed in terms of the arc length of the circle. If you follow the path of a point at (3,0) to (0,3) along the arc of the circle then you will notice that the radius drawn to each point on the path will increase at a steady rate.
You can notice two points here. First the circumference of this circle is given by the formula:
This tells us the total arc length of this circle is just , where r is the radius. Now if we define there to be 360 degrees in a circle, what this means is that a line that rotates a full circle to return to its original position will have covered 360 degrees ( a completely arbitrary number). Since degrees are a measurement of an angle, or the amount a line or r in the graph has rotated then we must figure out some method of relating the arc length, a measurement of rotation to degrees. If we define there to be 360 degrees in one full rotation the ratio of some number of degrees to 360 degrees must equal the ration of the arc length covered to the total arc length of the circle.
There is still one thing left, this definition of degrees is entirely dependent on the radius of the circle. If you recall from our discussion of the relativity of size in mathematics as being meaningless without reference to something else, then theoretically 30 degrees should be independent of the size of the circle it is measured in. To account for this we introduce an important new concept, the radian. The radian is a relativistic measurement for arc length that gives us the arc length measurement in terms of the radius of the circle. For example in a circle of radius 2, the circumference is 4 . An arc length of 2 radians is then just 4 units long or twice the radius. Since the circumference of the circle is directly related to its radius then an arc length in radians will always have the same relative size to its circle, regardless of the size of the circle. This probably sounds more confusing than it really is, but this is all because we are looking for a way to define degrees in terms of arc length. If we replace arc length with radians*r, where r represents the constant radius and radians is any number which could be a fraction we get:
This important equation now tells us how to define an angle in terms of the radian measurement of an arc length, measured in radians. We can re-write it as:
The important point to realize throughout this discussion of what an angle is, is that angles can only be first defined in terms of arc length rotations between two lines measured in
radians. Having established radians, only then can we also use degrees, which is just another unit of measurement of angles entirely based on radians. Instead of calling an angle .86 radians we can say 60 degrees:
Degrees are directly related to radians, and are more often used when there is no reference circle to define an angle easily. Let us now return to our discussion of circles. Having established just what an angle refers to conceptually we now need to define an angle mathematically. We shall select for our study a circle of radius one, which is often called the unit-circle, for reasons soon to be seen. If we draw a small triangle in the unit-circle we can see that:
In such a triangle
What this means is that for any point on the unit circle, the Sine of the arc length measured from (0,) to (x, y) will be the y value of that particular arc length. For example:
In terms of triangles and degrees, this says that the ratio of the opposite side to the hypotenuse of a 45 degree angle is just . It is extremely important to forget about triangles and degrees for the moment but just concentrate on circles and arc lengths which are both mathematically defined shapes. In terms of the unit circle whose the angle measurement 45 degrees is expressed in terms of the arc length of a circle whose radius is 1. pi/4 then refers to an arc length of 0.78 radii. Error in picture below, it should say 0.78, not 1.56
We are now ready to define the Sine function as that function that outputs the y-value for an inputted arc length of a unit circle. The cosine functions gives us the x-value for that same arc length. It is here where the difficulty arises. We need to find a way of expressing the arc length of a circle a unit circle in terms of x and y. By being able to calculate the arclength in terms of x and y values, the Sines and Cosines will just be the x and y values of the point until which we are calculating the arc length. Therefore the Sine function is really an arc length function as we shall soon see. To begin let us recall our formula for calculating the length of curves of graphs of equations. It was defined as the following differential that gives the length of a hypotenuse used to
Relating this to our study of the circle, the arc length can be expressed as follows:
The arc length from zero to a where, a is less than or equal to 1 is:
There are two important points to understand here. First for any inputted the output will be an arc length measured in radians since we are dealing with the unit circle.
For example if I were to evaluate the integral from x=0 to x=1 I should get The second point to keep in mind is that therefore Returning to our integral we have: is the same as
The question now arises, . how do we interpret this or relate it to the Sine function. To answer this let us study the graph of the circle once more. Since we commonly measure arc lengths in radians or degrees beginning at (1,0) it will require a moments thought to realize that now we are measuring angles from (0,1).
First take a look at section 1. It represents the way we initially defined the Sine Function, where Sine( arc length) = y. Now examine section 2. It is exactly the same as section 1 excepts instead of y we have x. The sections are identical because the graph of a circle is symmetric about the line y= x. We can therefore define the Sine function as follows.
Since x is our output and not our input here, we need to find a better way of defining the Sine function. If you recall from our study of inverse functions, we saw that the inverse function always yields x when inputted in the original function since a function and its inverse were essentially the same function except that the x and y were switched around, and the two functions were then graphed in the same x-y plane. For example if we had then its inverse was found by replacing y with x to get point (x, y) on the graph of inverse; example: then inputing y values from in or . Since any has a corresponding point on (y, x) on the graph of its would output x values. For
This may sound more complicated then it really is but we can now conclude that since the Sine of the arc length integral outputs x, then that integral is the inverse of the Sine function:
The inverse sine function is commonly called the arcsine function which is analogous to asking what angle or arc length has sin x?
At this point is important to stop thinking about angles, degrees, and triangles. What is essential now are circles and arc lengths measured in radians. The Sine function, mathematically speaking is then just the function gives the corresponding x value of an arc length measured from to 0 to x on the circle. The inverse sine of a number x is then just the arc length from 0 to x on the graph of a circle. This is all it is, nothing more, nothing less. The inverse sine function is therefore our most important function since it serves as the base for calculating arc lengths for any
This concludes our definition of the Sine function as mathematical function to express the arc length of a unit circle, or any circle. You can already begin to see the problems we will encounter in integrating it since the arc length approach infinity towards x=1.
Once again if you recall from the Chapter on inverse we learned that a function and its inverse are essentially the same function. We can therefore easily find an expression for based on the definition for integrate . The only problem here is that we have to . We do however know how
By following the algorithm presented in the chapter on inverses for finding the derivative of a functions inverse we can find the derivative of the Sine function without having found a way to define it yet. We first replace any x with its y equivalent so as to be able to find the derivative with any given y-value.
From the graph of the circle this tells us that the derivative of the inverse sine function is just 1 over the y value of that particular arc length.
From the graph you can see that this value corresponds to the Cosine of the arc length or
small change in the arc length. On the other hand the y in the is just referring to the point until which this arc length is being measured in terms of the unit circle. We can now write:
are the same thing from our definition of the angle. In the
graph of the inverse sine function, x is our independent variable and or arc length is our dependent variable. Since the Sine function is the inverse of the inverse Sine function, then is our independent variable and x is our dependent variable. This means that the derivative of the Sine function is given by:
Since we already calculated the derivative of the inverse Sine function to be which is simply 1 over the Cosine of the angle, then the derivative of the Sine function is just the reciprocal of that:
This important conclusion tells us that the derivative of the Sine function is just the Cosine of that particular angle or arc length. The Sine function inputs arc lengths and outputs the corresponding x-value of that arc length. Therefore its derivative with respect to arc length is just the Cosine of that arc length.
. This tells is that the derivative of the Cosine function with respect to arc length is just the negative Sine of that arc length. To summarize:
We have now found the derivative of the Sine function in terms of the Sine and Cosine of the arc length, . There still remains the stumbling block, what is or arc length? Though we have shown rather crudely what is the derivative of the Sine and Cosine function are , we still have not found a way of defining inverse function: . The only obstacle was our definition for the
There is absolutely no way we can evaluate this integral. You can understand this in two ways. First the derivative approaches infinity as x goes to 1 and second from the graph of the circle, arc length is continuous never ending length. It goes round and round the circle forever. Though we have defined a beginning there is no end. It is this complex nature of the arc length of a circle that tells us that there is no way to solve its using real numbers. It is what we call an irrational integral and can only be solved with complex or imaginary numbers, . The best way of thinking of what imaginary numbers are is that they are numbers that basically tell us there is no solution to the equation. There is nothing magical or transcending about them. If we ever see them, they only alert is that the equation has no solutions. Using complex numbers the integration goes as follows: First factor out -1 in the denominator:
Factoring out a
gives us:
The derivative of
is:
Factoring out a i or
from
gives us:
Eulers equation probably makes little sense to you. All it really is is another way of expressing the relationship between the complex logarithm and circular functions ( Stillwell 221 ). The sine and cosine functions are therefore solutions to such irrational integrals. There is still one way of defining which is through an infinite series of terms based on the binomial expansion theorem, which defines an infinite series for sums raised to negative and fractional exponents:
or more simply:
Though we will study infinite series in more detail in the following chapter, it would do no harm to begin the study with our inverse Sine dilemma. As the expansion for for is valid for all values of n, then we are able to write the integral
in a different way:
This is clearly an infinite series with never ending terms since Integrating the series will give us the series for
The series is valid for -1 < x < 1. For each term in the series will get larger, causing the series to diverge. We will talk more about how this series works in the next chapter. Summary In these three lengthy chapters we undertook an in depth study of defining the Sine function. Beginning with the simplest definitions we moved on to defining its inverse function as an arc length:
The
We showed how angles can only be measured in radians, a relative measurement that relates arc length to the rotation of two lines and the angle they define. Thus:
This relates degrees, measured in radians, as a function of x. The integral being un-integrable due to the unique properties of the circle, could only be solved with imaginary numbers. We then went on to prove the complex relationship between the exponential function and the Sine function, or .
Last but not least we were shown, rather simply, how the inverse sine function can be
expressed as an infinite series of terms based on the binomial theorem for negative fractional exponents. Our study of the Sine function will resume in the next chapter when we study the function in yet another way, actually the same way but by using some new theorems from Calculus.
If we assume that the block with mass m rests on a friction less surface then the block is free to oscillate with any horizontal force. According to Hooke's law the force required to stretch a spring is directly proportional to the length of the spring. Each spring therefore has a constant, known as the spring constant k which tells us the force required to stretch the spring a given distance. For example if the spring constant, k, of a rather stiff spring such as those used in cars as opposed to a slinky, were 1800 newtons per meter or k= 1800 N/m then to stretch the spring one meter would require a force of 1800 Newtons and to further stretch it three meters would require a force three times as great or 5400 N. This is all in accord with Hookes Law, which states that the more you stretch it the greater the force required. Consider now the block above of mass m at equilibrium position. By equilibrium position we mean the object is at rest with no unbalanced force acting on it. This tells us that the spring is being neither compressed nor stretched but is just lying there attached to the block. If we stretch the block a distance x from its equilibrium position and let go of it, then it will oscillate back and forth. Our goal is to mathematically describe this motion.
The amplitude of oscillation is 2x and the maximum force exerted by the spring on the block is kx. Keep in mind however that the force is zero at the equilibrium position , since the spring is un-stretched there and at this point the blocks entire energy is kinetic while at the endpoints of its oscillation it possesses entirely potential energy stored in the spring and its velocity is zero. Since the spring exerts a force kx when stretched, the work done stretch a spring a distance x is given by:
The work required to stretch a spring a distance x is equal to . When any mass m is stretched a distance x then let go of it will be pulled back with a force that varies with the distance stretched. The total energy of the system varies between kinetic and potential energy depending on the masss position. The kinetic energy of the block in motion is always equal to ,
where v is its velocity at a particular point. At that point x the potential energy will be or the energy used to stretch the spring to that position. The sum of the potential and kinetic energy of the block at any point is a constant that never changes. It corresponds to the net energy of the oscillating system:
At x=0, the velocity is a maximum and when v=0, x is a maximum and the block is at its
endpoint.
Recall that E stood for the total energy of the system. At its endpoints the velocity of the block is zero. The total energy is then equal to where equals half the amplitude
of oscillation.
from
This integral now has the familiar solution that we have studied:
where
and
Now that we have solved it, how do we interpret it or make sense of this solution? Basically this solution tells us that the time required for the mass to a distance x from the equilibrium position. We can therefore easily calculate the period of the oscillation, the time required for the mass to complete one cycle. The period would be four times the time required for the mass to move from x=0 to x= . Substituting this into our solution gives:
This answer tells us that the period of an oscillating mass on a spring is independent of the amplitude of the oscillation. For this reason springs are used in watches. Despite airresistance and friction between the spring and other parts that would cause the Amplitude to gradually decrease over time, the period of vibration would remain unchanged. Just as we are able to find time as a function of x, we can find x as a function of time using the Sine function or the inverse Sine Function. We saw that:
gives us:
Taking the Sine of both sides just switching around the dependent and independent variable:
with
and simplifies calculations.) Now that we have the position x as a function of time we can differentiate the function one to find the velocity and twice to find the acceleration.
This expression
in
or position since
Since Force is by definition mass* acceleration, if we multiply the function for acceleration by m we get:
This is what we originally started with, Hookes Law. The force in a spring is proportional to the distance it is stretched and acts in the opposite direction of the motion to oppose the motion. Before ending you might be wondering what happened to the constants? I decided not to put them in as to avoid any unnecessary variables or numbers that would impede your understanding of the important concepts being studied. The solutions with the constants included should here forth read:
Summary
In this chapter we saw how in a spring the force required to stretch the spring is directly proportional to the length stretched. Recalling that force is defined as mass* acceleration or F= ma, where acceleration is the second derivative of the position function of an object in motion:
The only function whose second derivative is proportional to negative of the original function is the Sine function, since:
Therefore this chapter was in essence devoted to the understanding of how the Sine function and its derivative apply to applications where the second derivative is directly proportional to the negative of the original function. This is why the Sine function is restricted to Oscillatory motion, since a force -kx, will always act so as to oppose the motion of the body in any direction. From the Conservation of Energy the body will then rebound back in the opposite direction once whatever force acting on it brings its velocity to zero because all the energy will now have been transferred from the mass to the system. In such a way this back and forth process goes on forever until interrupted by other forces such as friction.
Type in any questions, correctio for this page here.
Email:
derivative
Before continuing it is important to keep in mind that from the definition of the derivative, the derivative of , gives us the slope at any point on the graph of
The area under the graph of the derivative from x=4 to x=6 is given by:
, of the graph of the anti-derivative from x=4 to x=6 is exactly the area under its derivative bounded by the same interval. Remember from the definition of the integral:
we calculated to be 10. If we draw a line through (4,8) and (6,18), and draw an imaginary base of length , then we get somewhat of a triangle. The slope of this line is . Therefore the area under the graph also equals the slope of this line multiplied by the change in x, 2 or 5*2=10.
the same slope as the line connecting the endpoints of the interval (4,8), (6,18). The reason we know that this is true is because the line connecting the two endpoints is the direct way to link the two points. Since the graph of takes a curved way to connect the points then somewhere along its path its direction must be parallel to the connecting line if it is to ever reach the endpoint. Now that we know such a point exists and that the derivative of the graph is defined at every point, then the change in y, 10, can be found by finding the slope of the line that is parallel to the line through (4,8) and (6,18). This slope is . This tells us that there exists a value c between 4 and 6, such that the derivative evaluated at this point c, gives the slope of the line parallel to the line connecting the two endpoints on the anti-derivative. Since this line must lie between x=4 and x=6, then c must also lie in this interval, and equals the value at which this line exists. We can express this as:
This is the mean value theorem; there exists a number c between a and b on the graph of f.(x) such that f.(c) gives the slope of a tangent through f(c) that is parallel to the line through (a, f(a)) and (b, f(b)). Conceptually speaking it is important to remember this as:
The area under the curve of a graph can be expressed either by evaluating the integral at the endpoints or by finding a number c, between a and b, and multiplying it by the change in x, (b-a). In other words the net change in a function.s anti-derivative can be found by just using the function itself. As we shall see, finding that constant, c, is another problem
itself.
are determined by the functions derivatives. For example the constant on the function.s third derivative, on the sixth derivative or and so on.
is based
It is obvious that a function with a finite number of derivatives would have a finite number of terms as . On the other hand ,infinitely differentiable functions such as exponential and trigonometric functions would be expressed as an infinite series, whose accuracy in expressing the function would be determined by the number of terms of the series used The proof of Taylor.s Theorem involves a combination of the Fundamental theorem of Calculus and the Mean value theorem, where we are integrating a function, . These two theorems say: to get
. We therefore know:
is just a constant, and is equal to the n.th derivative evaluated at some point c, between a and x. And Integrating the right side:
Integrating this entire mass a fourth time, where we started with the function gives as you may have already guessed from the pattern:
By now the pattern should be clear. If we integrate n successive times to eventually find f(x) we arrive at a remarkable conclusion:
What this is saying is that any function can be expressed as a series of its (n-1) derivatives, each evaluated at any point, .a., plus its n.th derivative evaluated at a point between a and x. Since the first few terms of the series are:
One needs to not only know the value of f(x) at f(a) but also at f.(a), f..(a), f...(a) etc... in order to find f(x),where x, is any number other than a. The last term of the Taylor series differs slightly from its preceding terms. One needs to be able to calculate for a c somewhere in between a and x.
In this case, = , which will always be 3 no matter what you input in the function. Remember n.th derivative refers to the last derivative of the function.
In order to write or calculate a Taylor series for we first need to calculate its n derivatives, which we have already done above. The Taylor Series is defined as:
Simplifying it we get:
The easiest number to choose for a is probably 1, though you can choose whatever number you want to for a , so long as its n derivatives are all defined at a. Substituting a for 1 gives:
The sum of the series of terms corresponds exactly; however, as you can see, writing a Taylor Series for a faintly differentiable function is not a practical thing to do. For example in this series we had to calculate in the last term in order to find f(6) or . Now if we directly
were able to calculate with ease then would it not be much easier to find without having to go through a series of calculations and summations? The answer is yes and thus the life of finite Taylor Series is short-lived.
The situation is a bit more complex when dealing with infinitely differentiable functions such as . In this case one can easily go on differentiating the function as many times as you desire to create an infinite Taylor series where each term is evaluated at some point a. The problem is that no matter how many terms you use, there will always be that last term that needs to be evaluated at an unknown point c, between a and x. Even if we use an infinite number of terms for our Taylor series to be evaluated at some known point a, how de we know that our infinite plus one) or last term does not completely throw off our answer? From the definition of Taylor Series:
f(x) is expressed as a series of consecutive positive integer powers of x from . The last term is hence:
with
The limit in the brackets is zero as the factorial grows much faster than the exponential, remember x can take on any value but it can never be larger than infinity or n.
No matter what value of the last term will always be zero provided an infinite number of terms are used in the series. The same is true of:
As a is only a constant; however, the closer x is to a the more accurate your answer will be using fewer terms. To conclude the Taylor Series for a function of x is:
Where the last term for a function of finite derivatives is some constant, independent of c, multiplied by . In a function of infinite derivatives the last term goes to zero after an infinite number of preceding terms. The closer x is to a then the few the terms needed to obtain an accurate answer Therefore Taylor Series hold true for all functions so long as one is able to evaluate its n derivatives as a point a. Let us move on to an important example of such infinitely differentiable functions such as . This is an infinitely differentiable function and as we shall see the infinite Taylor series provides an important way of relating square roots of a number through a series of terms than can be easily evaluated. The answer can be obtained from the Binomial expansion, but let us use Taylor series to show the relation between the binomial theorem and Taylor series. We will first use eight terms by differentiating the function seven times:
Since we can evaluate each of the n-derivatives at x=1, then we will let a=1, which simplifies the series to:
Take note of how the fraction preceding each term gets smaller and smaller. This is a check, that tells us that the last terms get closer and closer to zero, and hence the Taylor series will be defined for all x.
Therefore using eight terms we found the square root of two to an error of only .50% or half
a percent. Had we used more terms the error would have been even less. With an infinite amount of terms the error would be zero or next to nothing.
Let us begin with a look once again at the series for the infinitely differentiable function, the square root of x. The theories that we are to develop will hold true for any function raised to a positive or negative fractional exponent.
Remember we let a=1 to find the square root of any number x. Take a close look at the fractions preceding each term:
It seems that they are slowly going to zero, but now look at the terms of the Taylor Series:
Since each term represents a power of some number (x-1), then we can conclude that if this number is greater than 1, then each term in the series will get larger and larger and approach infinity, in the infinite term. However if the absolute value of (x-1) were less than 1, then as the number of terms in the series increased, the value of each term will decrease. Therefore we can say that the following series is only valid for:
This tells us that for x in this range of values, the infinite term will go to zero and the series will converge. Remember, the more times you multiply a fraction by itself, the smaller it becomes.
Now that we have defined the series for a certain range of numbers then we can extend this range to include all the real numbers. For example we can find the square root of any fraction . This means we can find the square root of any number, greater , then we just
than 2, by taking the reciprocal of its square root. If we had to find the
rewrite it as
. Since
by letting x=
, then taking the reciprocal of this value. Now let us move on to finding Taylor Series for the exponential and Complex Sine Function. The Taylor series for The series is then: >can be easily found since its n derivatives are all the same, .
Since the limit as n goes to infinity of is zero, regardless of what value x is, then the series is valid for any value of. Letting x=1 and using only the first eight terms gives us the value for e:
The calculator value for e is 2.718281828 which corresponds to an error of less than .001 % using only eight terms. The more terms used the more accurate your answer will be. Now that we have found the series for >we can find the Taylor series for y= ln (x), which is also an infinitely differentiable function. Fist let is find its n. derivatives:
Using the first seven derivatives we write the following Taylor series:
Letting a equal 1:
There is one small problem here. Though the Natural Log of x is defined for all values of x greater than zero, the Taylor series on the other hand is only valid for 1<x<2. If x were three for example, the series on the right would diverge as each term, larger. In an infinitely differentiable function as in will get larger and
went to zero as n went to infinity. This was based on the fact that
However in the case of the series for the natural log of x, the last few terms become:
The limit of this function clearly goes to infinity as long as the absolute value of
is
greater than 1. Conversely if is less than 1 or , then the last term will go to zero and the Taylor Series will hold true. By making the modification to the Taylor Series for ln(x), the series will converge for . Since a =1 the series converges for:
is valid for x between 0 and 2. Despite the restricted value of x, we are still able to calculate the ln (x) for any x, by just taking reciprocal values. If we wanted to find the ln (40), 40>2 we would have to calculate which equals
The Taylor Series for Sin(x) and Cos(x) are also quite easy to find. Since we know the derivative of Sin(x) is Cos(x) and Cos(x) is -Sin(x) and we can evaluate these functions at a=0, as Sin(0)=0 and Cos(0)=1, the Taylor Series are as follows. First find the first few derivatives.
Using this to write the Taylor Series for the first eight terms is:
Letting a=1
Since the limit the infinite term in this series goes to zero as n goes to infinity, then the series is convergent for all values of x. Differentiating the series gives us the Taylor series for Cos(x):
This solution is remarkable .It allows us to define the Sine and Cosine functions mathematically in terms of an infinite series. You might be wondering how the series is define for any arc length, etc. From the graph of the circle it is clear that its arc length is continuous and passes throughout the same point infinite times as it completes its rounds. For this reason our integral for the inverse sine function could only be solved using imaginary numbers. Our study of the Sine function began with simplicity, rose to reason, climaxed in the abstract, fell into the imaginary, and now ends in perfection. Our study of Taylor Series showed us how by integrating a function , n times, we were able to express f(x) in terms of its n derivatives evaluated at a point a with the last terms being evaluated at some point c, between a and x. The series took the form:
The important point to realize here is that n stands for an integer, such that a finite differentiable function can be expressed as a series of its n derivatives evaluated at some point a. The last term, can be found regardless of the value of c sine the last derivative of a finite differentiable function is always a constant. As we saw, expressing finite differentiable functions in terms of a series of powers of x turned out to be impractical. It is here where we saw the value of extending Taylor.s theorem to infinitely differentiable functions. This would mean we would have an infinite series with infinite derivatives. The only way to prove that such a series could be written
and would hold true for any value of x was by taking the limit of:
If the limit was zero then we know that , or the last term of the series also goes to zero and therefore the series converges for al values of x . If this limit did not go to zero then we had to modify or restrict the values of such that the series would converge for those values of x. We could then extend this interval for all real numbers.
If in any of the above functions, x was replaced by u, where u is another function of x, then by the Chain rule:
From the product rule, the derivative of any product uv, the derivative was:
The rules for Integration, as we already know are unfortunately not so straightforward as in differentiation. The reason for this is because there are numerous ways to write a function, yet only one of these ways will resemble a differentiable form. Therefore the key to integration lies in finding that integratable form and as we shall soon see, that form is not necessarily the simplest form. Let us take a look at two simple functions and show the variety of ways it can be simplified, expanded and re-written.
or another example:
Even though all these functions are equal to the original function, they can not be integrated directly as they do not appear in a differentiable form. The only two functions that can be integrated directly are the original functions , whose integrals
are and respectively. If you closely examine the physically different, yet mathematically equal functions, you can notice that in most cases the integrable form is quite evident and can easily be found with a few quick simplifications such as:
This final and important chapter is devoted to the study of functions that at first glance look impossible to integrate; however, after the application of certain simplifications and rewriting techniques, the function can be integrated. The first of these techniques is called integration by parts. It allows us to evaluate the integral of a product, in terms of the integral of one factor multiplied by the derivative of the other. From the product rule we have.
or just:
Solving for
gives us:
For example to integrate the product of Now that we have expressed the derivative as a sum of derivatives to get: , we can integrate the
The reason why this integration technique works is based on the product rule which allows us to differentiate a product as a sum of two functions, . Even when not integrating a product, one can always let such that v=x. The next technique that we shall study is based on the derivatives for the Sine and inverse Sine function. From the previous chapter we saw how was defined as:
or by the definition of the integral the derivative of derivative of Sin(x) was found to be Cos(x).
is
Furthermore we showed how all other trigonometric function such as Tan(x) and Sec(x) could be expressed in terms of Sin(x) and Cos(x), or just Sin(x) since Cos(x) is also dependent on Sin(x). With this in mind let us see how we can express the inverse tangent ans secant functions in terms of the inverse sine function. The function is defined as the arclength from 0 to x on a circle or:
If we define the inverse tan function to give us the ratio of the x value to the y value for a particular arclength then:
Where
gives us an arclength for that particular value of x, and the tangent of that . If we let and then differentiate both sides we
not
in terms of
. The
function tell us the x value for a given arc length. Now if for the same arclength , the Tangent were x, then what would be the Sine of the Arclength. To answer this, let is consider the following graph again:
In this circle, the tangent of the angle is x, and the Sine of the angle is
. Hence:
or more importantly.
By similar means we can find the derivative of other inverse trigonometric functions, by first defining it in terms of the Sine of the angle and then differentiating it.
The underlying concept in these definitions is the fact that the inverse tangent, secant and any other trigonemetric functions can be expressed in terms of integrating them is much easier as we already have them defined. such that
As we mentioned before, the difficulty of integration lies in the fact that a function could be written in so many different ways.