CLUTCH: Contextualized Language model for Unlocking Text-Conditioned Hand motion modelling in the wild (arXiv version)