summaryrefslogtreecommitdiffstats
path: root/ChangeLog
diff options
context:
space:
mode:
Diffstat (limited to 'ChangeLog')
-rw-r--r--ChangeLog26
1 files changed, 26 insertions, 0 deletions
diff --git a/ChangeLog b/ChangeLog
index 3f691d57..c9f9119a 100644
--- a/ChangeLog
+++ b/ChangeLog
@@ -1,3 +1,29 @@
+2011-10-10 Kaz Kylheku <kaz@kylheku.com>
+
+ Improved support for broken unicode.
+ Regex support for extra-large character sets not compiled in
+ if wchar_t is not wide enough for it.
+ The utf-8 properly throws exceptions when encountering characters
+ that it cannot represent, instead of silently ignoring the
+ situation and continuing with incorrectly computed data.
+
+ * regex.c (FULL_UNICODE): New macro.
+ (CHAR_SET_L3, CHAR_SET_L2_LO, CHAR_SET_L2_HI): Only defined
+ if full unicde is available.
+ (CHSET_XLARGE, cset_L3_t, struct xlarge_char_set,
+ L2_full, L3_fill_range, L3_contains): Ditto.
+ (unon char_set): Member x1 present only under FULL_UNICODE.
+ (char_set_destroy, char_set_add, char_set_add_range,
+ char_set_contains): CHSET_XLARGE cases only available on
+ FULL_UNICODE.
+ (char_set_compile): Default cst variable to CHSET_LARGE.
+
+ * utf8.c (FULL_UNICODE): New macro.
+ (conversion_error): New function.
+ (utf8_from_uc): Throw error if not FULL_UNICODE and character is
+ outside the BMP.
+ (utf8_decode): Likewise.
+
2011-10-09 Kaz Kylheku <kaz@kylheku.com>
* HACKING: Documented portability hacks for narrow wchar_t.