The question: What is the necessity of the tail current source in the differential amplifier?
A naive approach leads us to think that: ultimately the result we want to achieve is to nullify the part that occurs due to the part of the input, in the amplified signal. With a setup as shown above, it seems as if we can successfully achieve that by taking the difference as it seems to cancel the effect due to due to it being the same on both the arms. Let's examine that through equations -
that makes
Similarly,
We see that the output taken in this way is linearly dependent on which in the beginning may not make sense intuitively because we tend to think that: even though the MOSFETS on both the arms are non-linear devices due to the 'square' dependence of on , but as is exactly the same on both the arms it would shift the DC operating point by the same amount either up or down. So the due to this should be and will be the same through both the arms. And upon that is where we give our differential input signals and , assuming these are small signals, they should ideally experience the same about the DC operating point decided by on both the arms. In case we wish not to assume the to be linearized in the proximity of , I can argue that the non linearity of itself is exactly the same on both the arms and should be cancelled when taken the difference. Now taking the difference of the outputs of both the arms will result in cancelling of the voltage gain caused by as and either drop or rise by the exact same amount due to changes in . But the signal on will cause to rise by a certain amount and the signal will cause to drop by the exact same amount. So, the only thing that should be visible in the difference of and is twice the change of the of each arm.
Similarly,
Now adding the effects of both the signals, we get
But this is in stark contrast with the result we previously obtained. What are we missing?
One might swiftly say that the property of Additivity does not apply to non-linear systems. Our attempt to split each input into and , examining the output due to each of them and then adding those outputs together expecting that to be equal to the output when the input is the combination of both and is outright wrong. And you may say that due this wrong assumption that we are erroneously arriving at the result that the effect of is nullified when we do .
But that is only half the answer.
We applied the property of additivity only after linearizing the system in the proximity of . In essence, is same for both MOSFETS, hence in that instant of time, both are biased at the exact same point. For a small signal over , the system looks linear. So, as we always do, we can absolutely add the small signal to the . So this approach is correct and let us examine the error introduced due to our assumption of linearity in the small signal proximity of in comparison to the actual value calculated using the large signal equation.
We shall first linearize the system about and find the at that point.
The output due to is
The output due to small signals and is
Adding the above two, we get the total outputs and .
The small signal analysis neglects the non-linearity caused due to 'squaring' operation and assumes a constant all over the near surroundings of . Shockingly enough, we see that the result obtained by assuming that the proximity of is linear is equal to the result obtained without that assumption. This says that, indeed the non linearity due to the 'square' is getting cancelled inherently when we take differential output .
Let me reiterate our findings so far:
The non-linearity caused by the 'square' in the equation is inherently being cancelled when differential output is taken.
So, we can effectively use the property of additivity to add the outputs due to and individually to arrive correctly at the output due to combined
Even though the non-linearity (that we were thinking was the cause of this problem) is nullified, the dependence of the differential output on is still present.
The answer to this would seem obvious and trivial to anyone who is slightly more observant than I am.
For a given the output is linear regardless of the model used (small or large). To reemphasize, the individual outputs and have not become linear, but it's their difference that has become linear with respect to and (devoid of any higher order terms).
At any given the due to that particular is exactly the same on both arms and gets cancelled. Only the due to the small signal (the differential input) remains, but the factor by which this input gets amplified in the output strongly depends upon itself. In other words, has become linear for differential inputs (for large signals as well), but this linear amplification factor has a dependency on
The graph above shows vs for different . It is clear that for any the exclusively due to () is . If the changes with time the gain of the amplifier (slope of vs curve) also changes. The that we were calling our bias point till now itself is one of the inputs, and as the gain of the system turns out to be dependent on the input, the system is non-linear. And neither is the a bias point as it is a variable.
The non-linearity in is due to the varying . To overcome this we should somehow nullify the effect due to on the circuit. Connecting a constant tail current source will make the source terminals of both the MOSFETS into a floating net. is external noise input that we cannot control, increasing or decreasing the will adjust the source node's floating net voltage such that remains constant to support half the current of the tail current source through each arm (as the circuit is all symmetric). We can picture it as the current is constant, any momentary increase in cause a momentary rise in current which will accumulate charge (as all of it is not allowed through the constant current source) at the source net rising its voltage bringing it back to normal. Effectively, this has fixed our 'bias' point and removed the dependence of on . Now the differential input is a perturbation over the fixed by the tail current source rather than over as in the earlier case. Can we then replace with this in the expression?
If this expression is correct then we will have successfully removed the dependence of on and perfectly linearized the output of the differential amplifier. But is it correct?
The sum of currents through the two arms must be equal to (tail current source).
When a differential input is applied:
Likewise when no differential input is applied:
We see that there's a term in excess to when differential input is applied. That means we cannot replace with our new 'bias' point . So to keep the total current the should adjust itself with the differential input . That means of the two MOSFETS is not and but something different. The term will stay as it is an external input, but the commonly shared should be varying to keep the total current constant.
As calling the commonly shared as a bias is incorrect due to its varying nature we will call it . Solving for while keeping the constant total current constraint we get:
Without the differential input the we see that as expected. But this common voltage is not constant, it shrinks with increasing differential input to keep the total current constant. Are we back to square one again? In the previous case without the tail current source , variation in is very huge as compared to the variation in (as it is often a small signal), but now, even though the variation in still exists, it doesn't depend on anymore.
By having the constant tail current source, we have significantly reduced the variations in by changing its dependence from to . Now only depends on not , but it is still non-linear due to the term. For as long as is a small signal we can ignore the effect due to the term and approximate to be linear.