Tips and wisdom from 15 years of DBA experience

Wednesday, September 2, 2009

WE8MSWIN1252, WE8ISO8859P1 and UTF-8

In an ideal world, all applications would be Unicode compliant. In reality, we sometimes have to deal with applications that use an encoding like WE8MSWIN1252 and do not do proper character set conversion when they encounter a database in UTF-8. So here is a very good listing of the 27 code points where WE8MSWIN1252 differs from Unicode and WE8ISO8859P1. Symbols like the Euro symbol are not even defined in WE8ISO8859P1, so they will show up as an upside-down question mark under some circumstances. Exactly which circumstances I have yet to fully understand; there is a lot of complexity in the conversion process.
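To see those 27 code points without an Oracle instance, here is a minimal Python sketch. It assumes Python's built-in cp1252 and latin-1 codecs are reasonable stand-ins for WE8MSWIN1252 and WE8ISO8859P1; Oracle's own conversion tables may differ in edge cases. The function name cp1252_differences is made up for this illustration.

```python
import unicodedata


def cp1252_differences():
    """Map each byte whose cp1252 meaning differs from latin-1 to its cp1252 character."""
    diffs = {}
    for byte in range(256):
        raw = bytes([byte])
        try:
            ch = raw.decode("cp1252")
        except UnicodeDecodeError:
            # Python's cp1252 codec leaves a few bytes (e.g. 0x81) unassigned.
            continue
        # latin-1 maps every byte N straight to code point U+00NN.
        if ch != raw.decode("latin-1"):
            diffs[byte] = ch
    return diffs


diffs = cp1252_differences()

# The Euro sign has no slot in latin-1, so a lossy conversion substitutes
# a replacement character. Python uses "?"; Oracle shows an upside-down one.
euro_as_latin1 = "\u20ac".encode("latin-1", errors="replace")

if __name__ == "__main__":
    for byte, ch in sorted(diffs.items()):
        print(f"0x{byte:02X} -> U+{ord(ch):04X} {unicodedata.name(ch)}")
    print(len(diffs), "differing code points")
```

All of the differing bytes fall in the 0x80 to 0x9F range, which ISO 8859-1 reserves for control characters. That is why text pasted from Windows applications (curly quotes, dashes, the Euro sign) is what usually gets mangled.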

