[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Search]
Re: XSLT and charset encoding
- To: emacspeak@xxxxxxxxxxx
- Subject: Re: XSLT and charset encoding
- From: Dmitry Paduchih <paduch@xxxxxxxxxxx>
- Date: Fri, 22 Feb 2002 14:51:41 +0500
- In-Reply-To: <15477.42212.860561.587560@xxxxxxxxxxx> ("T. V. Raman"'smessage of "Thu, 21 Feb 2002 17:54:44 -0800")
- Resent-Date: Fri, 22 Feb 2002 05:00:00 -0500 (EST)
- Resent-From: emacspeak@xxxxxxxxxxx
- Resent-Message-ID: <"kanr6B.A.fnC.oUhd8"@hub>
- Resent-Sender: emacspeak-request@xxxxxxxxxxx
- User-Agent: Gnus/5.090006 (Oort Gnus v0.06) Emacs/21.1(i386-debian-linux-gnu)
Raman,
I just want to experiment for my own because I have some difficulties
with displaying Russian pages. So if you know how to do the xslt
transparent, couldn't you give me a hint?
besides, I have the following reasons why I think xslt should be
transparent with respect to charset encoding.
First, if browser has gotten content-type through the http protocol,
then according to html standard these data have priority over those
found in the corresponding <meta
http-equiv...> tag. That is, xslt processor may make wrong assumption
about source charset encoding based on the meta tag.
Second, if browser notices that encoding has been changed in between
it may use this new encoding to send form data despite the original
one which may make form interaction behave badly.
Third, many html pages do not contain meta tag for content-type at
all. In this case xsltproc does not perform actual conversion and
simply adds "right" (as it thinks) charset= declaration.
These issues except latter are only theoretical though and require
additional testing. Currently My problem is that w3m-el displays
numeric references like п produced by xsltproc by garbage
characters.
>>>>> "TVR" == T V Raman <raman@xxxxxxxxxxx> writes:
TVR> that's too error prone >>>>> "Dmitry" == Dmitry Paduchih <paduch@xxxxxxxxxxx> writes:
Dmitry> Hello,
Dmitry> Currently I am playing with the xslt feature of
Dmitry> Emacspeak and have one question. Is it possible
Dmitry> to adjust xsltproc so that it wouldn't change
Dmitry> charset encoding in the html document? That is,
Dmitry> I would prefer if it does not convert characters
Dmitry> into any particular charset encoding and just
Dmitry> treats characters as a raw data.
Dmitry> Regards, Dimitri
[...]
TVR> --
TVR> Best Regards,
TVR> --raman
[...]
Regards,
Dimitri
-----------------------------------------------------------------------------
To unsubscribe from the emacspeak list or change your address on the
emacspeak list send mail to "emacspeak-request@xxxxxxxxxxx" with a
subject of "unsubscribe" or "help"