diff options
Diffstat (limited to 'ChangeLog')
-rw-r--r-- | ChangeLog | 26 |
1 files changed, 26 insertions, 0 deletions
@@ -1,3 +1,29 @@ +2011-10-10 Kaz Kylheku <kaz@kylheku.com> + + Improved support for broken unicode. + Regex support for extra-large character sets not compiled in + if wchar_t is not wide enough for it. + The utf-8 properly throws exceptions when encountering characters + that it cannot represent, instead of silently ignoring the + situation and continuing with incorrectly computed data. + + * regex.c (FULL_UNICODE): New macro. + (CHAR_SET_L3, CHAR_SET_L2_LO, CHAR_SET_L2_HI): Only defined + if full unicde is available. + (CHSET_XLARGE, cset_L3_t, struct xlarge_char_set, + L2_full, L3_fill_range, L3_contains): Ditto. + (unon char_set): Member x1 present only under FULL_UNICODE. + (char_set_destroy, char_set_add, char_set_add_range, + char_set_contains): CHSET_XLARGE cases only available on + FULL_UNICODE. + (char_set_compile): Default cst variable to CHSET_LARGE. + + * utf8.c (FULL_UNICODE): New macro. + (conversion_error): New function. + (utf8_from_uc): Throw error if not FULL_UNICODE and character is + outside the BMP. + (utf8_decode): Likewise. + 2011-10-09 Kaz Kylheku <kaz@kylheku.com> * HACKING: Documented portability hacks for narrow wchar_t. |