What are all of the allowable characters for people’s names? [closed]

There’s good article by the W3C called Personal names around the world that explains the problems (and possible solutions) pretty well (it was originally a two-part blog post by Richard Ishida: part 1 and part 2) Personally I’d say: support every printable Unicode-Character and to be safe provide just a single field “name” that contains … Read more

How to reliably guess the encoding between MacRoman, CP1252, Latin1, UTF-8, and ASCII [duplicate]

First, the easy cases: ASCII If your data contains no bytes above 0x7F, then it’s ASCII. (Or a 7-bit ISO646 encoding, but those are very obsolete.) UTF-8 If your data validates as UTF-8, then you can safely assume it is UTF-8. Due to UTF-8’s strict validation rules, false positives are extremely rare. ISO-8859-1 vs. windows-1252 … Read more