Concurrent IO in Go is so accessible that even trivial IO transform scripts (e.g. compression, base64, md5sum/cksum) are very easy to spread across multiple cores.
You'd be astonished at how much faster even seemingly fast local IO can go once you unblock it.