Transformers for Machine Learning: A Deep Dive