Welcome, Guest. Please login or register.
Did you miss your activation email?
May 25, 2012, 09:23:05 AM

Login with username, password and session length
Search:     Advanced search
Interested in joining the WebsiteBaker team?
For more Information read here or on our new website.
155482 Posts in 21708 Topics by 7734 Members
Latest Member: adrielkhollis4
* Home Help Search Login Register
Pages: [1]   Go Down
Print
Author Topic: News reader module (RSS) - Multi lingual character Bug  (Read 615 times)
solbu

Offline Offline

Posts: 22


« on: March 25, 2009, 05:43:46 AM »

We are using the news reader to parse RSS feeds from other sites into our own site. But we have problems with norwegian special characters in the links from the RSS feeds, resulting in an error message: An XML error occurred on line 25: not well-formed (invalid token).

This only happens on RSS feeds where the feed uses non-english special language characters (these three -> ÆØÅ).

This feed works: http://www.dagenmagazinet.no/rss/rssfeed.aspx?subject=13
This feed does not work: http://www.vl.no/rss/kristenliv/index.xml

Does anyone know what we need to change in the reader to make the feed work with norwegian characters, or does anyone know of another RSS Reader for WebsiteBaker that parses european languages corectly?
Logged

My posts are licensed under a Creative Commons license
BerndJM

Offline Offline

Posts: 1764



« Reply #1 on: March 25, 2009, 06:36:40 AM »

Hi,

maybe I'm totaly wrong, but I think that's not an issue of the news reader module.
If you have a look at the generated source code of the two feeds you linked you'll see:
the one who works starts with:
Code:
<?xml version='1.0' ?>
<rss version='2.0'>
and has the "special chars" encoded like this:
Code:
Den r&#248;dgr&#248;nne regjeringens politikk tr&#229;r stadig inn p&#229; kirkenes domene

The other starts with:
Code:
<?xml version="1.0" encoding="iso-8859-1"?>
And the special chars are not encoded.

Maybe this causes the pain, if your site works "per default" with utf-8 ?


Regards Bernd
Logged

In theory, there is no difference between theory and practice. But, in practice, there is.
solbu

Offline Offline

Posts: 22


« Reply #2 on: March 25, 2009, 07:49:11 AM »

Quote from: BerndJM
The other starts with:
Code:
<?xml version="1.0" encoding="iso-8859-1"?>
And the special chars are not encoded.

Maybe this causes the pain, if your site works "per default" with utf-8 ?
Our site use the default utf-8 charset.
I just tried, as a test, to change our website charset from utf-8 to iso-8859-1, it had no effect on the feed. We have even tried various combinations with the charset converter included in the module, also to no avail.

I have even used an rss feedvalidator, the feed validates without any warnings. (http://www.feedvalidator.org/check.cgi?url=http://www.vl.no/rss/kristenliv/index.xml)
« Last Edit: March 25, 2009, 07:58:48 AM by solbu » Logged

My posts are licensed under a Creative Commons license
Pages: [1]   Go Up
Print
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.16 | SMF © 2011, Simple Machines Valid XHTML 1.0! Valid CSS!