I was pushing some old data through an API at work when I received a weird error message. The API coughed up a hairball. It responded that I had u200b at position 154, and if I needed that character, I’d have to encode it. But I looked at position 154 and it was a number. Nothing weird. So what’s u200b, why does a problematic character exist, and how do you clean it up?
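U+200B is the Unicode zero-width space, a non-printing character that takes up no visible room on screen or on paper. It’s meant to assist typographers with layout by marking a spot where a line is allowed to break, and it’s extremely useful in languages such as Thai and Khmer that don’t put spaces between words. But those of us who write in the Roman alphabet may go a lifetime without needing it.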
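Because the character is invisible, eyeballing the data won't find it, which is why position 154 looked like an ordinary number to me. Here is a minimal Python sketch of how you might locate and strip it. The helper names `find_zwsp` and `strip_zwsp` are my own, not part of any library, and I'm assuming the data is already loaded as a Python string. Note that the positions here are 0-based; the API that complained may count differently.

```python
ZWSP = "\u200b"  # U+200B ZERO WIDTH SPACE


def find_zwsp(text):
    """Return the 0-based index of every zero-width space in text."""
    return [i for i, ch in enumerate(text) if ch == ZWSP]


def strip_zwsp(text):
    """Return text with every zero-width space removed."""
    return text.replace(ZWSP, "")


# Hypothetical sample: looks like "1234" on screen, but has 5 characters.
sample = "12" + ZWSP + "34"

print(len(sample))                # 5
print(find_zwsp(sample))          # [2]
print(repr(strip_zwsp(sample)))   # '1234'
```

Removing the character outright is the right call only if you're sure nothing depends on it. If the text is in a language that uses it for line breaking, you would want to encode it instead, as the error message suggested.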