Why not loose the tail

(R_D)

(R_D)
(VDD)
(\frac{W}{L})
(\frac{W}{L})
(M1)
(M2)
(V_{CM}...
(V_{CM}...
(...
(...
(I_{D1})
(I_{D2})
Text is not SVG - cannot display

The question: What is the necessity of the tail current source in the differential amplifier?

A naive approach leads us to think that: ultimately the result we want to achieve is to nullify the part that occurs due to the VCM part of the input, in the amplified signal. With a setup as shown above, it seems as if we can successfully achieve that by taking the difference Vout1Vout2 as it seems to cancel the effect due to VCM due to it being the same on both the arms. Let's examine that through equations -

Vout1=VDDID1RD$$Assumingeverythingisbiasedinsaturation$$ID1=k((VCM+x)Vt)2

that makes

Vout1=VDDk((VCM+x)Vt)2RD

Similarly,

Vout2=VDDk((VCMx)Vt)2RD$$where$k=12μnCox$Now,$$Vout1Vout2=4kx(VCMVt)

We see that the output taken in this way is linearly dependent on VCM which in the beginning may not make sense intuitively because we tend to think that: even though the MOSFETS on both the arms are non-linear devices due to the 'square' dependence of ID on (VGSVt) , but as VCM is exactly the same on both the arms it would shift the DC operating point by the same amount either up or down. So the ID due to this VCM should be and will be the same through both the arms. And upon that VCM is where we give our differential input signals +x and x , assuming these are small signals, they should ideally experience the same gm about the DC operating point decided by VCM on both the arms. In case we wish not to assume the gm to be linearized in the proximity of VCM , I can argue that the non linearity of gm itself is exactly the same on both the arms and should be cancelled when taken the difference. Now taking the difference of the outputs of both the arms will result in cancelling of the voltage gain caused by VCM as Vout1 and Vout2 either drop or rise by the exact same amount due to changes in VCM . But the signal +x on M1 will cause Vout1 to rise by a certain amount and the signal x will cause Vout2 to drop by the exact same amount. So, the only thing that should be visible in the difference of Vout1 and Vout2 is twice the change of the Vout of each arm.

Vout1|VCM=UVout2|VCM=UVout1|VCMVout2|VCM=UU=0

Similarly,

Vout1|V+x=+DVout2|Vx=DVout1|V+xVout2|V+x=+D(D)=2D

Now adding the effects of both the signals, we get

Vout1|VCM+xVout2|VCMx=2D

But this is in stark contrast with the result we previously obtained. What are we missing?

One might swiftly say that the property of Additivity does not apply to non-linear systems. Our attempt to split each input into VCM and x , examining the output due to each of them and then adding those outputs together expecting that to be equal to the output when the input is the combination of both VCM and x is outright wrong. And you may say that due this wrong assumption that we are erroneously arriving at the result that the effect of VCM is nullified when we do Vout1Vout2 .

But that is only half the answer.

We applied the property of additivity only after linearizing the system in the proximity of VCM . In essence, VCM is same for both MOSFETS, hence in that instant of time, both are biased at the exact same point. For a small signal over VCM , the system looks linear. So, as we always do, we can absolutely add the small signal vout to the Vout|VCM . So this approach is correct and let us examine the error introduced due to our assumption of linearity in the small signal proximity of VCM in comparison to the actual value calculated using the large signal ID equation.

We shall first linearize the system about VCM and find the gm at that point.

gm|VCM=μnCoxWL(VCMVt)=2k(VCMVt)

The output due to VCM is

ID|VCM=k(VCMVt)2Vout|VCM=VDDID|VCMRD=VDDkRD(VCMVt)2

The output due to small signals +x and x is

vout1=gmRDx=2kRDx(VCMVt)vout2=gmRDx=2kRDx(VCMVt)

Adding the above two, we get the total outputs Vout1 and Vout2 .

Vout1=Vout|CM+vout1Vout2=Vout|CM+vout2Vout1Vout2=4kx(VCMVt)

The small signal analysis neglects the non-linearity caused due to 'squaring' operation and assumes a constant gm all over the near surroundings of VCM . Shockingly enough, we see that the result obtained by assuming that the proximity of VCM is linear is equal to the result obtained without that assumption. This says that, indeed the non linearity due to the 'square' is getting cancelled inherently when we take differential output Vout1Vout2 .
Let me reiterate our findings so far:

  1. The non-linearity caused by the 'square' in the ID equation is inherently being cancelled when differential output is taken.
  2. So, we can effectively use the property of additivity to add the outputs due to VCM and x individually to arrive correctly at the output due to combined VCM+x
  3. Even though the non-linearity (that we were thinking was the cause of this problem) is nullified, the dependence of the differential output Vout1Vout2 on VCM is still present.

The answer to this would seem obvious and trivial to anyone who is slightly more observant than I am.

For a given VCM the output is linear regardless of the model used (small or large). To reemphasize, the individual outputs Vout1 and Vout2 have not become linear, but it's their difference that has become linear with respect to x and (devoid of any higher order terms).

At any given VCM the Vout due to that particular VCM is exactly the same on both arms and gets cancelled. Only the Vout due to the small signal (the differential input) remains, but the factor by which this input gets amplified in the output strongly depends upon VCM itself. In other words, Vout has become linear for differential inputs (for large signals as well), but this linear amplification factor has a dependency on VCM
(V_{out})
more
more (V_{CM})
less
less (V_{CM})
Text is not SVG - cannot display

The graph above shows Vout vs x for different VCM . It is clear that for any VCM the Vout exclusively due to VCM (x=0) is 0. If the VCM changes with time the gain of the amplifier (slope of Vout vs x curve) also changes. The VCM that we were calling our bias point till now itself is one of the inputs, and as the gain of the system turns out to be dependent on the input, the system is non-linear. And neither is the VCM a bias point as it is a variable.

The non-linearity in Vout is due to the varying VCM . To overcome this we should somehow nullify the effect due to VCM on the circuit. Connecting a constant tail current source will make the source terminals of both the MOSFETS into a floating net. VCM is external noise input that we cannot control, increasing or decreasing the VCM will adjust the source node's floating net voltage such that VGS remains constant to support half the current of the tail current source through each arm (as the circuit is all symmetric). We can picture it as the current is constant, any momentary increase in VGS cause a momentary rise in current which will accumulate charge (as all of it is not allowed through the constant current source) at the source net rising its voltage bringing it back to normal. Effectively, this has fixed our 'bias' point and removed the dependence of Vout on VCM . Now the differential input x is a perturbation over the VGS fixed by the tail current source rather than over VCM as in the earlier case. Can we then replace VCM with this VGSbias in the Vout expression?

Vout=4kx(VGSbiasVt)

If this expression is correct then we will have successfully removed the dependence of Vout on VCM and perfectly linearized the output of the differential amplifier. But is it correct?
The sum of currents through the two arms must be equal to ISS (tail current source).
When a differential input is applied:

ID1+ID2=k(VGSbias+xVt)2+k(VGSbiasxVt)2=2k((VGSbiasVt)2+2x2)

Likewise when no differential input is applied:

Ibias=ISS=2k(VGSbiasVt)2VGSbias=ISS2k+Vt

We see that there's a 4kx2 term in excess to ISS when differential input is applied. That means we cannot replace VCM with our new 'bias' point VGSbias . So to keep the total current ID1+ID2=ISS the VGS should adjust itself with the differential input x. That means VGS of the two MOSFETS is not VGSbias+x and VGSbiasx but something different. The x term will stay as it is an external input, but the commonly shared VGS should be varying to keep the total current constant.

As calling the commonly shared VGS as a bias is incorrect due to its varying nature we will call it Vcom . Solving for Vcom while keeping the constant total current constraint we get:

k(Vcom+xVt)2+k(VcomxVt)2=ISSVcom=ISS2kx2+Vt

We can replace the VCM in the previously derived Vout with this expression of Vcom. This is done in the Analysis of Differential Amplifier

Without the differential input x the we see that Vcom|x=0=VGSbias as expected. But this common voltage is not constant, it shrinks with increasing differential input x to keep the total current constant. Are we back to square one again? In the previous case without the tail current source Vcom=VCM , variation in VCM is very huge as compared to the variation in x (as it is often a small signal), but now, even though the variation in Vcom still exists, it doesn't depend on VCM anymore.

By having the constant tail current source, we have significantly reduced the variations in Vcom by changing its dependence from VCM to x. Now Vout only depends on x not VCM , but it is still non-linear due to the x2 term. For as long as x is a small signal we can ignore the effect due to the x2 term and approximate Vout to be linear.