As in the home computing days, you could use inline assembly for the decoding loop.
Back then we used DATA blocks; now you can use @micropython.asm_thumb.