[Bese-devel] Re: Parsing UTF-8 mime content

Marco Baringer mb at bese.it
Sat Apr 8 17:57:02 UTC 2006


"Aycan iRiCAN" <aycan.irican at core.gen.tr> writes:

> Hi,
>
> I had some problems with utf-8 encoded forms because we're using non
> ascii characters. I just looked at src/backend/common.lisp and
> realized that when a form has been submitted, ucw constructs a
> character array and parses the content byte by byte. This is a problem
> when we use UTF-8 content. So I changed the array to unsigned-byte and
> then added raw bytes. In termination-callback, I converted the array
> to a proper string using octets-to-string.
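
(for readers of the archive: a minimal sketch of the idea described above,
written in python rather than ucw's lisp. it is not the patch itself; the
function names are made up for illustration, and only the "collect raw
octets, decode once at the end" approach comes from the description.)

```python
def decode_per_byte(octets: bytes) -> str:
    # Treats every octet as its own character, which is what filling a
    # character array byte by byte amounts to. Multi-byte UTF-8
    # sequences come out as several wrong characters.
    return "".join(chr(b) for b in octets)


def decode_at_end(chunks, encoding: str = "utf-8") -> str:
    # Collect the raw octets first, then convert the whole buffer to a
    # string in one step once the content is complete.
    buffer = bytearray()
    for chunk in chunks:
        buffer.extend(chunk)
    return buffer.decode(encoding)


raw = "Türkçe".encode("utf-8")
# Hypothetical chunking that splits the two-byte sequence for 'ü'.
chunks = [raw[:2], raw[2:]]

broken = decode_per_byte(raw)
fixed = decode_at_end(chunks)
```

since decoding only happens once the buffer is complete, a multi-byte
character that straddles two chunks is still decoded correctly.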

nice, thanks. i'd like to apply this but i keep getting:

soma:~/lisp/ucw_dev mb$ darcs apply utf8-mime-content.patch 
darcs: bug in get_extra.
Most likely this is caused by a bug that existed in darcs prior
to version 1.0.1.  Details for dealing with this issue can be found
at http://darcs.net/DarcsWiki/Issues1.0.1

can you try applying this to a clean ucw_dev tree and see if it's a
problem with my setup?

> It's not the perfect solution but it works for me.

the only issue i have is with hard-coding the form's encoding in the
parser. supposedly browsers will send data with the same encoding as
the page, but browsers don't always do this, and even when they
do they don't always do it reliably. still, i think it's the best we
can hope for.
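
(a small python sketch of what a fixed encoding in the parser implies,
again not ucw code. the function name and the choice to substitute a
replacement character are assumptions made for illustration only.)

```python
def decode_form_value(octets: bytes, page_encoding: str = "utf-8") -> str:
    # The parser assumes the browser used the page's encoding. Octets
    # that are not valid in that encoding are replaced with U+FFFD
    # instead of raising an error.
    return octets.decode(page_encoding, errors="replace")


# Browser sent UTF-8, as the page asked: decodes cleanly.
as_expected = decode_form_value("ç".encode("utf-8"))

# Browser sent Latin-1 instead: the single octet 0xE7 is not valid
# UTF-8, so the value is damaged.
mismatched = decode_form_value("ç".encode("latin-1"))
```

when the browser ignores the page encoding, the value arrives but the
non-ascii characters are lost, which is the risk described above.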

-- 
-Marco
Ring the bells that still can ring.
Forget the perfect offering.
There is a crack in everything.
That's how the light gets in.
	-Leonard Cohen



