At 4-bit quantization it should already fit quite nicely.
Unfortunately not with a reasonable context length.
Unfortunately not with a reasonable context length.