From 1065bbb0165b4608bd715866332751be9213cd51 Mon Sep 17 00:00:00 2001 From: Jay Berkenbilt Date: Tue, 15 Feb 2022 08:29:29 -0500 Subject: Handle odd PDFDoc codepoints in UTF-8 during transcoding (fixes #650) There are codepoints in PDFDoc that are not valid UTF-8 but map to valid UTF-8. We were handling those correctly with bidirectional mapping. However, if those same code points appeared in UTF-8, where they have no meaning, they were left as fixed points when converting to PDFDoc, where they do have meaning. This change recognizes them as errors. --- libtests/qtest/qutil/qutil.out | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) (limited to 'libtests/qtest') diff --git a/libtests/qtest/qutil/qutil.out b/libtests/qtest/qutil/qutil.out index aedf49e1..fa284237 100644 --- a/libtests/qtest/qutil/qutil.out +++ b/libtests/qtest/qutil/qutil.out @@ -88,7 +88,8 @@ alternatives 2: 83a9e99e 0: 717561636b done alternatives -w˘wˇwˆw˙w˝w˛w˚w˜w�w�w +w˘wˇwˆw˙w˝w˛w˚w˜w�w�w�w +w?w?w?w?w?w?w?w?w?w?w-w done other characters ---- whoami quack1 -- cgit v1.2.3-70-g09d2