AIUI the paper's examples are from a version of Claude not Llama? The thinking process is going to be extremely model-specific.