Google News feeds

Well, congratulations guys, what kept you? Gnews2RSS dates back to 2002 (I think).

But look at the output. Here's a typical description field.

<br><table border=0 width= valign=top cellpadding=2 cellspacing=7><tr><td valign=top><a href="http://www.charlotte.com/mld/observer/news/local/states/north_carolina/counties/gaston/12267732.htm">True location of <b>lemurs</b>' love still a mystery</a><br><font size=-1><font color=#6f6f6f>Charlotte Observer, NC -</font> <nobr>Jul 31, 2005</nobr></font><br><font size=-1><b>...</b> Catawba Science Center exhibited the two adult <b>lemurs</b> -- primates of Madagascar and the Comoro Islands, with fox-like faces -- Feb. 5 to May 15. <b>...</b> </font><br></table>

- What's that line break doing at the start?
- Tables? Ugh! I strip those on all incoming RSS so they're gone.
- Tables with no closing TD or TR? Double Ugh!
- Repeating the title in the first line of the description. Why?
- More line breaks. Why?
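
For anyone wondering what stripping the tables looks like in practice, here is a rough sketch using Python's standard html.parser. It is not the code I actually run on incoming RSS; the names DROPPED_TAGS, DescriptionCleaner and clean_description are made up for illustration, and the list of tags to drop is just my guess at a sensible set. It throws away the layout tags (tables, fonts, line breaks) while keeping the text, links and bold. It does not try to remove the repeated title.

```python
import html
import re
from html.parser import HTMLParser

# Hypothetical choice of layout tags to discard; the text inside them is kept.
DROPPED_TAGS = {"table", "tr", "td", "font", "nobr", "br"}


class DescriptionCleaner(HTMLParser):
    """Rebuild a description without layout tags, keeping links and bold."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in DROPPED_TAGS:
            if tag == "br":
                # Replace a line break with a space so words don't run together.
                self.parts.append(" ")
            return
        rendered = []
        for name, value in attrs:
            escaped = html.escape(value or "", quote=True)
            rendered.append(' %s="%s"' % (name, escaped))
        self.parts.append("<%s%s>" % (tag, "".join(rendered)))

    def handle_endtag(self, tag):
        # Unclosed TD/TR are harmless here: we never emit them at all.
        if tag not in DROPPED_TAGS:
            self.parts.append("</%s>" % tag)

    def handle_data(self, data):
        self.parts.append(html.escape(data, quote=False))


def clean_description(raw):
    cleaner = DescriptionCleaner()
    cleaner.feed(raw)
    cleaner.close()
    # Collapse runs of whitespace (this also affects attribute values).
    return re.sub(r"\s+", " ", "".join(cleaner.parts)).strip()
```

Feed it a description string and you get back one line of text with the anchor and bold tags intact and nothing else.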

So that's a solid 4 out of 10, and I'll stick to my scraper, thank you.


[ 09-Aug-05 8:29pm ]