Transcript for:
Introduction to Quantum Mechanics Concepts

I will be giving a set of lectures pertaining to an introductory course in quantum mechanics so this is going to be quantum mechanics many of you would be familiar with concepts from classical physics presumably you know what is meant by independent degrees of freedom this is all from classical physics generalized coordinates of course generalized momenta phase space and so on some of you might even be aware of the richness of the structure of phase space now when it comes to quantum mechanics these concepts could undergo some alterations the alterations were not obvious and the history of quantum mechanics is as revealing as it was dramatic the reason is that classical mechanics could really not give any pointers to understanding the quantum world it made mockery of intuition of human beings and it became a challenge to the best and brightest of minds to find out how exactly the quantum world works the features relating to the quantum world were really obvious only through the atomic and submicroscopic phenomena that is not to say that quantum mechanics does not hold for macroscopic objects in fact several macroscopic phenomena I can think of ferromagnetism right away can be explained only on the basis of quantum physics having said that we normally can understand most of things that happen around us in daily life most of the macroscopic world just based on our physical intuition our prejudices when our common sense which has really been in us due to our experiences in daily life so as long as we are dealing with objects of length scales masks ales and time scales that we are used to in daily life most of the time we can get by with our physical intuition and with classical laws which seem to confirm predictions based on our intuition unfortunately and interestingly we cannot do that with quantum laws so it turns out that in the history of quantum mechanics it is in the context of atomic phenomena and sub microscopic particles that these laws really came out into the open Einstein is supposed to have remarked that our physical intuition is a collection of prejudices that we have developed till age 18 and prejudices and intuition do not really explain things in the quantum world many concepts in quantum physics are very dramatic it is very difficult for the human mind to comprehend it at least it was at that stage in history and I am talking about the end of the 19th century where most classical laws were pretty well understood the first thing that was very different when it comes to quantum physics was the fact that an object can be thought of as a wave and as a particle this is normally called the wave-particle duality in most books and a series of experiments showed that matter behaves like waves and also like particles I am normally fond of two of this class of experiments Young's double-slit experiment which demonstrates the wave nature of objects and Einstein's discovery of the photoelectric effect and this demonstrates the particle nature of objects so I will just give a rather brief description of the Young's double-slit experiment first and we will take it up from there let us first look at it from a classical point of view start with the source of bullets the bullets are shot out in various directions and they hit a target the target is a screen now as the bullets strike the target you could perhaps use detectors and count the number of bullets that have hit the target clearly there will be an integer number of bullets you could modify this experiment by putting a screen out here with two slits and now see what is the pattern that emerges on the target screen by pattern I mean that if you plot the distance along this versus the number of particles that strike the screen you would like to see how exactly this plot looks so from the source there are bullets that go and you would therefore expect the plot of distance versus number of bullets to have a peak out there and similarly out here because of bullets that come here the pattern would be that of two Peaks one there and one here and nothing else in between now this is what you would expect from classical physics let us replace this source of bullets with an electron gun so that we do a quantum experiment now we start off with source of electrons once more we have a screen on which these electrons fall and if we have detectors and counts the number of electrons that reach the screen we will count an integer number of electrons once more we can put a screen out here with two slits through which the electrons have to pass in order to hit the target you would normally expect once more a pattern like that if you plot a distance versus number of electrons that strike if the electrons behave like bullets or like particles on the other hand something very dramatic happens on the screen instead of a pattern like this what one sees is alternate bright and dark fringes this is the kind of property that you would expect if the electrons behave like waves and if they were waves that emerged out of these slits they could interfere and depending on whether there is constructive interference or destructive interference you would expect an interference pattern like this this is intensity versus distance from the screen a more intense area then a less intense area then a more intense area and so on this is very striking because this region for instance is a region where no electrons can reach if the electrons are thought of as particles on the other hand the fact that there is an overlap between waves that interfere has created this pattern the Young's double slit experiment therefore demonstrates very clearly the wave nature of the electron consider on the other hand the photoelectric effect now in the photoelectric effect light is Shawn on matter perhaps a metal and electrons are emitted now if the light we're thought off is electromagnetic waves and electrons come out of this metal plate you would normally expect the energy of the electron that comes out to be related to the intensity of the incident light in other words as the intensity of the incident light changes you would expect the energy of the electron to change for light of very low intensity you would expect low energy electrons for light of high intensity you would expect high energy electrons that's not what happens and what happens can be explained only on the basis of the corpuscular theory in other words you treat light as a collection of particles namely photons each one with energy given by H nu where nu is the frequency of the light and H is Planck's constant has the same dimensions as action jewels second those could be the units so it turns out that photons interact with the electrons in the metal plate and transfer energy to them and pull them out so that the energy of the electron does not really change with the intensity of the incident light the more intense the incident light the more the number of photons the less intense the incident light the less the number of photons and therefore the energy of the emitted electrons do not depend upon the intensity of the light on the other hand they would depend upon the frequency of the light the higher the frequency the more the energy now these two statements have been experimentally verified clearly demonstrating that particles and waves are both attributes of the same object the electron it's not merely the electron if you look at photons protons neutrons or any form of matter you can associate waves with all forms of matter and therefore even for large systems there are matter waves associated with them it is simply that as I said before you understand many things about macroscopic systems based on classical laws classical mechanics is really an approximation it is not the exact theory and finally when everything was in place it was clear that when the Planck's constant in the limit Planck's constant going to zero and if the limits are taken properly you can retrieve the classical laws from the quantum laws so given these two we understand that in the quantum world there is this wave particle nature attributed to all objects we need a theory to explain this we have to learn to accept waves and particles being properties of the same entity and not waves or particles which is the kind of lesson that the classical world has taught us to explain this was difficult Einstein and flanked and many other physicists were involved in this explanation and something called the old quantum theory was proposed primarily by Bohr Jordan however it was also realized soon enough that the old quantum theory was rather ad-hoc in nature and cannot be used as a strong edifice or a good mathematical structure for explaining what happens in the quantum world a new theory had to be brought in think of it this way physical intuition is not helpful and in a manner of speaking classical physics has merely shrugged her shoulders there is a certain helplessness there are no pointers as to what should be the central objects around which the theory should be built what should be the central objects which should be used in explaining things like the wave-particle nature of matter and then physicists brilliant minds working in different parts largely in Europe Copenhagen Berlin got into aggressive debates about what should be the right theory of the quantum world this was a period in the early 20th century which was very dramatic educative and revealing because by the 1920s it culminated in what we now know as quantum mechanics so I will not make much ado about the old quantum theory was not very satisfactory and what we now work with is quantum mechanics as was given to us largely by two master minds Heisenberg and Schrodinger the primary architects quantum mechanics as we see in the textbooks today there are alternative formalisms of quantum mechanics but during the course of my lectures I will be only talking about Heisenberg's matrix mechanics and schrödinger's wave mechanics both being equivalent formalisms of quantum mechanics having got no pointers from classical physics as to what should be the central issues central objects around which the quantum theory should be built Heisenberg thought that observables that is objects that you measure in experiments should be the focus of attention and the theory should be built around these objects in classical mechanics we are familiar with objects that can be experimentally measured like position momentum energy and so on now these objects are the observables in an experiment in principle a clever experiment on any physical system should be able to tell us what is the value of observables and in quantum mechanics Heisenberg said that observables are represented by matrices matrix mechanics itself came about because of the inspiration that Heisenberg drew from light atom interactions because there are atomic transitions that result due to interaction of the atom with light and parameters pertaining to these atomic transitions which can be measured it was realized by Heisenberg and others that these observables are best represented through matrices because the eigenvalues of the matrices are the outcomes of experiments so suppose one way measuring energy in an experiment the value of the energy which one would get on measurement would be one of the set of eigen values of the matrix that represents energy energy is observable the state of the system itself would be the eigenvector corresponding to that eigenvalue of course there are cases where for a given eigenvalue there is more than one eigenvector but such degeneracies will be dealt with by me later in subsequent lectures and therefore you have I gain value equations there is an observable I put a hat on that to show that it is an observable or an operator it acts on an eigen vector for the moment I call that sigh it is a column vector because this is a matrix that gives me a number may be a number a which is the eigen value or a set of eigen values this is a typical eigenvalue equation there are matrix acts on a column vector or eigen vector and you have a number with the same eigen vector on that side if i diagonalize the matrix the diagonal elements would be the eigenvalues and the outcome of the experimental measurement would be one of this set of eigen values there is a crucial difference between classical mechanics and quantum mechanics it's a very interesting difference and is being studied even now in great detail in measurement theory in a classical system if I make a measurement of the energy and I get a certain value I can insist that before I made the measurement the energy of the system was the same the same value that I got on measurement whereas in quantum systems I cannot do that the measurement outcome merely tells me that on measurement this is the energy of the system and this is the state of the system I cannot extrapolate and say that before measurement the energy of the system was the same nor can I insist that the state of the system after measurement is the same as the state of the system there was before the measurement now this is another crucial difference between classical and quantum physics so while Heisenberg developed matrix mechanics by which observables were given importance as central objects and outcomes of measurements are the eigenvalues of these observables there was an alternative formalism given by Schrodinger wave mechanics let us go back to the Young's double-slit experiment the Young's double-slit experiment there was a nonzero probability of the electron being in the region between the slits on the screen recall recall the experiment since we had fringes here also clearly the electrons where there the waves were interfering even in this region so there is a nonzero probability of finding the electron here here here here and of course all over there this makes the whole subject probabilistic in nature you can only talk of the probability of the electron being there or the electron being here or the electron being there or the electron being there you cannot say with certainty that the electron was there or was here and not here because if it were not here one would not see fringes out here it is this wave-like nature of the electron and any other physical system which perhaps inspired Schrodinger to develop wave mechanics because Schrodinger starts with the state of the system being a wavefunction and in fact we will see in subsequent lectures that psy could in general be complex we could work with psy as a function of space-time and even if we were working in a situation where we are not worried about the time evolution of the system is the probability amplitude of the system being in the region x - x + DX y - y plus dy + Z - Z plus DZ psystar psy is therefore the probability density and the total probability of seeing the system anywhere in space will be 1 so observables are represented by matrices and the eigenvalues of observables are the outcomes of experiments when you do a measurement the answer must be a real number and therefore observables are represented by hermitian matrices because eigenvalues of hermitian matrices are real it is possible to make measurements of two or more observables each observable represented by a hermitian matrix and as I said earlier the system collapses to an eigenstate the eigenstate is going to be a common eigenstate of all these observables it was soon realized that matrices are really representations of operators in an appropriate linear vector space the state of the system the quantum state lives in this linear vector space now this needs some explanation the linear vector space should not be confused with the usual physical space in which we live where we have the usual Cartesian coordinates XY and Z and time T I'll give an example to illustrate this point consider a free particle moving along the x axis so in principle the particle can be anywhere it continuously moves from minus infinity to infinity it's pretty clear therefore that if I measure the position of the particle the observable is position there will be an infinity of eigenvalues available in this problem which means that the hermitian matrix representing the position observable should be an infinite dimensional matrix and therefore the linear vector space in consideration is an infinite dimensional linear vector space so here we have a problem of a particle moving in one dimensions but the linear vector space in question is an infinite dimensional linear vector space there is another wrinkle to this problem the eigenvalues themselves form a continuous infinity it is not as if the eigenvalues are discrete like this it is a continuous infinity of eigenvalues because the particle is gliding along the x axis and therefore we have to modify our opinions of matrices and extend it and we have to allow for objects which have a continuous infinity of rows and a continuous infinity of columns in other words we have to accept the concept of operators in linear vector spaces the state of the system in this problem is of course an infinite column vector in a physical situation we would like to track the state of the system as it moves in space-time and therefore there is a prescription this prescription tells us how to go from this abstract eigenvector in some linear vector space to a function sigh of space-time coordinates now given this prescription it is possible to talk about the dynamics of the system as time evolves and the dynamics is normally given in terms of differential equations function spaces have in a very natural fashion gotten into the picture because this is a function of X Y Z and T thus linear vector spaces function spaces operators matrices differential equations all these have become a very natural ingredient of the understanding of quantum mechanics so we go on to the next question how do we make measurements how do we find out what is the value of the observable in question normally when we do an experiment we conduct several trials on a system repeated measurements many trials and then take the rithmetic mean of all the values that we have got as outcomes of the measurement and that is going to be the value of the observable now in quantum mechanics as I have already stated if you make one measurement on a system the state is no longer the original state which was their pre measurement post measurement the state collapses to one of the possible eigenstates of the system immediately after measurement the system collapses to an eigenstate and therefore there is no point in repeating the measurement on this eigenstate or the new state of the system because this was not the system initially the only way to handle this problem is to make an ensemble identical copies of the system on which we conduct the measurement and then make one measurement on each copy take the rhythmatic mean and that is going to be the value the average value of the observable in question the average is represented like this where X is the observable this is merely a notation and the average value is always with reference to a particular state of the system so this is the way you represent quantum averages now we have an interesting question suppose we make the measurement and suppose we make simultaneous measurement of two observables or more observables is it possible in a simultaneous measurement of these observables the word simultaneous is very important is it possible to get accurate values for both these observables the answer in quantum mechanics is generally no it is not possible and you can trace it back to the fact that the matrices corresponding to the two observables need not commute with each other in general they do not and when two matrices do not commute with each other we can show that it is not possible to get accurate values for both the observables in question of course we are allowing for and and if we make simultaneous measurements that's a very important thing you allow for human errors you allow for instrumentation errors and after that also it is not possible in a simultaneous measurement of two observables in general to get accurate values for both of them this is an inherent feature of quantum mechanics this is quite unlike classical physics it is an inherent feature of quantum mechanics and it has led to very many interesting relations one of them being the Heisenberg uncertainty relation normally refer to as the uncertainty principle so what is the Heisenberg uncertainty relation suppose we consider two observables x and y and delta x is the variance in X I make measurements in a certain state of the system so Delta X is simply equal to X minus expectation value of x the whole squared average this as most of you know is expectation x squared minus expectation X the whole square that is the variance in X similarly I define Delta Y in the particular case of a particle moving along the x axis Let X denote the position of the particle and P sub X the momentum linear momentum the Heisenberg uncertainty principle in this context says the Delta X Delta P sub X should always be greater than or equal to H cross by 2 where H is the Planck's constant and it crosses H by 2 pi in a minimum uncertainty state the equality is satisfied in general it is greater than this bound there is no way by reducing instrumentation errors improving human effort and bringing this down to zero whereas in classical physics it is possible that the measurement gives you Delta X Delta px equal to zero such a thing is not possible in quantum physics it is not just true for position and momentum but for any two observables that do not commute there are corresponding uncertainty relations if you look for instance at the ground state of quantized linear harmonic oscillator that turns out to be a minimum uncertainty state in the position and momentum variables there are very interesting states in optics states of light which obey the minimum uncertainty relation you look at ideal laser light it's a coherent state of light and there in terms of another pair of variables which I will call the quadrature variables such a relation is satisfied such an equality relation to satisfied we should remember that there X does not mean position and certainly the momentum is not involved but they are two pairs of a pair of observables for that context then there is qui state of light squeeze state of light is one where either Delta X or Delta P X is less than root of H cross by 2 so squeezed light squeezed state of light once again I should emphasize that X and P sub X would not stand for position and linear momentum but for two appropriate observables so in quantum physics it is possible to have different types of States satisfying uncertainty relation in a subsequent lecture I will talk about the Heisenberg uncertainty relation and also about variations and extensions of the uncertainty principle so already we have seen some unfamiliar things we have seen wave particle duality quantum mechanics is inherently probabilistic in contrast to classical mechanics which is inherently deterministic and in this context I have to mention something else in classical physics suppose there were a particle which is kept on this side of a barrier an impenetrable barrier the probability of seeing the particle on that side of the barrier is strictly zero whereas in quantum physics because of the wave nature there is a nonzero probability of seeing the particle on this side of the barrier as well if it's a quantum particle this phenomenon that there is a nonzero probability of seeing the particle there here there and there is due to something called tunneling this again is a third very important difference which which manifests itself in the quantum world which you do not see in the classical world it is very important that states of a system can be tampered with particularly in the context of quantum optics you will see very many interesting states of light in atom light interactions it is possible to tune the state of the atom it's possible to make atomic transitions various types of light sources could be used for this purpose and even in active areas like quantum information and computation we work with quantum states specific quantum states think of with them use them for logic gate operations like you would do logic gate operations with classical systems with bits you could use quantum states to store quantum information use it for teleporting information from one place to another and these are areas of extreme research activity even now all this leads to a certain picture of the state of the system in classical physics the state of the system is completely specified classical mechanics the state is completely specified if if the relevant generalize coordinates and momenta are specified and if you have a phase space of these coordinates and momenta every point is a state of the system time is a parameter and these coordinates and momenta evolve in time these become the observables in quantum physics time is a parameter both in classical and quantum physics observables are represented by matrices and the state of the system in quantum mechanics is prescribed by the wave function sy it is pretty clear that this is dramatically different from classical physics when all intuition fails and you want to create her theory you have to take recourse to something very exact something that will speak the truth help comes only in the form of mathematics and the various branches of mathematics involved in the understanding of quantum mechanics are linear vector spaces differential equations matrix algebra group theory so an understanding of the quantum world really amounts to seeing how exactly mathematical structures are useful in describing the physics in the quantum world and in explaining quantum phenomena and therefore in subsequent lectures I will be first visiting some of these branches of mathematics starting from something we know and explaining some of the essential features of linear vector spaces in the next lecture and taking recourse to all these wherever necessary in order to explain the quantum phenomenon in this lecture I have attempted to give you a first glimpse of the quantum world I've tried to emphasize the contrast between a world of classical laws and a world of quantum laws the surprises and the unexpected I propose to do the following in subsequent lectures as I have already indicated the state of the system lives in an abstract linear vector space and therefore for something like the first half of my series of lectures I would be only looking at the abstract state of the system in order to illustrate various examples I would workout exercises as we go along rather explicitly taking into account the fact that perhaps you're a heterogeneous audience of listeners and therefore I will try to work out a lot of the mathematics explicitly now after about 20 lectures I'll move on to the concept of the function space and talk about the wave function psy as a function of space we will study that for a while and then proceed to do dynamics where psy is also an explicit function of time now and then I will intersperse my lectures with exercises in order to illustrate the salient features this is a good place to stop by way of introduction in the next lecture I would begin with finite dimensional linear vector spaces calling upon ideas that we already have from vectors in two dimensions vectors in three dimensions and so on and proceed from there you