From patrick.may at mac.com Tue Oct 15 20:18:40 2013 From: patrick.may at mac.com (Patrick May) Date: Tue, 15 Oct 2013 16:18:40 -0400 Subject: [closure-devel] Corrupted UTF-8 input Message-ID: <1382570F-794A-4503-A2E4-57EE9E5129FC@mac.com> Hi, I'm using chtml for a simple experimental web crawler. I'm occasionally getting this error (Slime output): 0: (RUNES-ENCODING::XERROR "Corrupted UTF-8 input (initial byte was #b~8,'0B)" 255) 1: (# :UTF-8 #(255 216 255 0 0 0 ...) 0 3 #(65535 0 0 0 0 0 ...) 0 8191 NIL) 2: (NIL #) 3: (# #) 4: (SGML::READ-TOKEN # #) 5: (SGML::READ-TOKEN* # #) 6: (SGML:SGML-PARSE # #) 7: (CLOSURE-HTML::PARSE-XSTREAM # #) Choosing the restart continuation seems to get past it, but I'd like to understand what's going on and how to automatically detect and work around it. Any input appreciated. Thanks, Patrick -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 495 bytes Desc: Message signed with OpenPGP using GPGMail URL: