Unicode's confusables.txt and NFKC normalization disagree on 31 characters