[drakma-devel] Bug handling bad html?

Jeffrey Cunningham jeffrey at cunningham.net
Sat Feb 24 17:07:25 UTC 2007


I was playing with drakma and it dropped into the debugger while
retrieving a commercial page. It looks like it might be a bug in
flexi-streams, but I don't know how to isolate the offending input
any more specifically than what came up here:


Unexpected value #xA0 at start of UTF-8 sequence.
   [Condition of type FLEXI-STREAMS:FLEXI-STREAM-ENCODING-ERROR]

Restarts:
 0: [ABORT] Abort SLIME compilation.
 1: [ABORT] Return to SLIME's top level.
 2: [TERMINATE-THREAD] Terminate this thread (#<THREAD "worker" {CDD6631}>)

Backtrace:
  0: (FLEXI-STREAMS::SIGNAL-ENCODING-ERROR
      #<FLEXI-STREAMS::FLEXI-UTF-8-IO-STREAM {12CF8379}>
      "Unexpected value #x~X at start of UTF-8 sequence."
      160)
  1: (FLEXI-STREAMS::SIGNAL-ENCODING-ERROR
      #<FLEXI-STREAMS::FLEXI-UTF-8-IO-STREAM {12CF8379}>
      "Unexpected value #x~X at start of UTF-8 sequence.")
  2: ((FLET #:BODY-FN327))
  3: ((SB-PCL::FAST-METHOD STREAM-READ-CHAR
       (FLEXI-STREAMS::FLEXI-UTF-8-INPUT-STREAM))
      #<unavailable argument>
      #<unavailable argument>
      #<unavailable argument>)
  4: ((SB-PCL::FAST-METHOD TRIVIAL-GRAY-STREAMS:STREAM-READ-SEQUENCE
       (FLEXI-STREAMS:FLEXI-INPUT-STREAM #1="#<...>" . #1#))
      #<unused argument>
      #<unused argument>
      #<unavailable argument>
      #<unavailable argument>
      #<unavailable argument>
      #<unavailable argument>)
  5: (READ-SEQUENCE
      "y make a difference this holiday season. Our gift ideas

     are unique and of high quality.<br/><br/></p>

     

     <p><a href=\"http://www.1giftidea.com/\" target=\"_blank\" title=\"Christmas Gift Ideas\">Gift ideas for every occasion, Christmas, Birthday, Mother's day...</a><br/>

     Gift ideas for every occasion, Christmas, Birthday, Mothers day, Graduation, Fathers day, Anniversary, Wedding, & Baby Shower.<br/><br/></p>

     

     <p><a href=\"http://www.mixedblessing.com/\" target=\"_blank\" title=\"Mixed Blessing\">Hanukkah card, Christmas gift idea and Holiday greeting cards from MixedBlessing</a><br/>

     Greeting Cards for Interfaith and Multicultures from MixedBlesing. Hanukkah cards, Holiday cards, Christmas Gift Ideas, Holiday Gifts and more.. Find great gifts now!<br/><br/></p>

     
 ..)
  6: (DRAKMA::READ-BODY
      #<FLEXI-STREAMS::FLEXI-UTF-8-IO-STREAM {12CF8379}>
      ((:DATE . "Sat, 24 Feb 2007 06:30:03 GMT")
       (:SERVER . "Apache/2.0.46 (Red Hat)")
       (:SET-COOKIE
        . "GS_UUID=24.18.193.65.1172298603635841; path=/,PHPSESSID=e009a521cb2bf134a00df925e4f4d510; path=/,cart_hash=e009a521cb2bf134a00df925e4f4d510; expires=Tuesday, 27-Feb-07 06:30:03 GMT; path=/")
       (:X-POWERED-BY . "PHP/4.4.0")
       (:EXPIRES . "Thu, 19 Nov 1981 08:52:00 GMT")
       (:CACHE-CONTROL
        . "no-store, no-cache, must-revalidate, post-check=0, pre-check=0") ..))
  7: ((LABELS DRAKMA::FINISH-REQUEST) NIL NIL)
  8: (HTTP-REQUEST
      #<URI http://www.gifttree.com/Christmas/Christmas-gift-idea.html>
      :PROXY
      NIL)
  9: (RETRIEVE-URI
      "http://www.gifttree.com/Christmas/Christmas-gift-idea.html"
      NIL)
 10: (WALK-SITE
      "http://www.gifttree.com/Christmas/Christmas-gift-idea.html"
      #<unavailable argument>
      #<unavailable argument>
      #<unavailable argument>
      #<unavailable argument>
      #<unavailable argument>
      #<unavailable argument>)
 11: (SB-FASL::FOP-FUNCALL)
 12: (SB-FASL::LOAD-FASL-GROUP
      #<SB-SYS:FD-STREAM for "file /tmp/fileIQGlqR.fasl" {CDF1089}>)
 13: (SB-FASL::LOAD-AS-FASL
      #<SB-SYS:FD-STREAM for "file /tmp/fileIQGlqR.fasl" {CDF1089}>
      NIL
      #<unavailable argument>)
 14: (SB-FASL::INTERNAL-LOAD
      #P"/tmp/fileIQGlqR.fasl"
      #P"/tmp/fileIQGlqR.fasl"
      :ERROR
      NIL
      NIL
      :BINARY
      NIL)
 15: (SB-FASL::INTERNAL-LOAD
      #P"/tmp/fileIQGlqR.fasl"
      #P"/tmp/fileIQGlqR.fasl"
      :ERROR
      NIL
      NIL
      NIL
      :DEFAULT)
 16: (LOAD #P"/tmp/fileIQGlqR.fasl")
 17: ((LAMBDA (STRING &KEY #1="#<...>" . #1#))
      "(print (walk-site \"http://www.gifttree.com\"))
     "
      :BUFFER
      "seo.lisp"
      :POSITION
      27060
      :DIRECTORY
      #<unused argument>)
 18: ((LAMBDA ()))
 --more--
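
One observation that may help narrow this down: the value 160 in frame 0
is the byte #xA0, which has the bit pattern of a UTF-8 continuation byte
and so can never start a UTF-8 sequence. In ISO-8859-1 the same byte is a
NO-BREAK SPACE, which is common in HTML. The sketch below only illustrates
that byte-level behaviour; it is written in Python rather than Lisp so that
it does not depend on any flexi-streams or drakma API, and the helper name
try_decode is made up for the example. Whether the page is really served as
Latin-1 is an assumption, since the headers in the backtrace are truncated.

```python
# Byte #xA0 (decimal 160), the value reported in frame 0 of the backtrace.
raw = bytes([0xA0])


def try_decode(data, encoding):
    """Return the decoded text, or None if the bytes are invalid in that encoding."""
    try:
        return data.decode(encoding)
    except UnicodeDecodeError:
        return None


# 0xA0 is 0b10100000: a continuation byte, so it is rejected as a lead byte.
as_utf8 = try_decode(raw, "utf-8")

# Every byte is valid in Latin-1; 0xA0 maps to U+00A0 NO-BREAK SPACE.
as_latin1 = try_decode(raw, "latin-1")

print(repr(as_utf8), repr(as_latin1))
```

If that is what is happening here, the stream is being decoded as UTF-8
while the body is in some 8-bit encoding, which would point at the choice
of external format rather than at the HTML itself.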


--Jeff



