Merge remote-tracking branch 'origin/topic/timw/cleaner-utf8'

* origin/topic/timw/cleaner-utf8:
  GHI-486: Switch over to using LLVM utf8-checking code to better validate characters

I addressed a buffer over-read during the merge and added test cases for it.
Johanna Amann 2019-07-29 09:12:52 -07:00
commit 486bf1e713
3 changed files with 57 additions and 53 deletions

@@ -2,11 +2,23 @@
{"s":"\b\f\n\r\t\\x00\\x15"}
{"s":"ñ"}
{"s":"\\xc3("}
{"s":"\\xc0\\x81"}
{"s":"\\xc1\\x81"}
{"s":"\\xc2\\xcf"}
{"s":"\\xa0\\xa1"}
{"s":"₡"}
{"s":"࣡"}
{"s":"\\xe0\\x80\\xa1"}
{"s":"\\xe2(\\xa1"}
{"s":"\\xed\\xa0\\xa1"}
{"s":"\\xe2\\x82("}
{"s":"𐌼"}
{"s":"\\xf0(\\x8c\\xbc"}
{"s":"񀌼"}
{"s":"􀌼"}
{"s":"\\xf0\\x80\\x8c\\xbc"}
{"s":"\\xf2(\\x8c\\xbc"}
{"s":"\\xf4\\x90\\x8c\\xbc"}
{"s":"\\xf0\\x90(\\xbc"}
{"s":"\\xf0(\\x8c("}
{"s":"\\xf4\\x80\\x8c"}
{"s":"\\xf0"}