Ah, busted Unicode in a programming language string parser, what a surprise. :\

Could you do the splitting at the raw data level before it stringifies it? (Betting not likely)

Or make the splitting character dynamic, and pick one that doesn’t conflict with post content? (PITA)

Or use a length + text rather than text + separator approach? (Might be most promising)

/