My newest time-waster

I’ve spent an inordinate amount of time playing with the healthBase beta, which claims to be NetBase’s ‘research solution for healthcare’, since reading this forum entry on TheDailyWtf.com.

Described as ‘when semantic extraction goes horribly wrong’… these are the results of a disastrously untested search engine being let loose on the net… and screenshots of howlers are growing day by day on forum and blog posts. At release, an article on TechCrunch revealed that a search for ‘Causes of AIDS’ rather unfortunately listed ‘Jew’, due to it’s inability to distinguish between the disease and the verb ‘to aid’, which could be found in a Wikipedia article on Jewish history. A company representative offered an explanation to TechCrunch author Leena Rao:

“This is an unfortunate example of homonymy, i.e. words that have different meanings.The showcase was not configured to distinguish between the disease “AIDS” and the verb “aids” (as in aiding someone). If you click on the result “Jew” you see a sentence from a Wikipedia page about 7th Century history: “Hispano-Visigothic king Egica accuses the Jews of aiding the Muslims, and sentences all Jews to slavery. ” Although Wikipedia contains a lot of great health information it also contains non-health related information (like this one) that is hard to filter out.”

LanguageLog also blogged the site under the most appropriate title ‘Semantic FAIL’, with readers posting their own examples of search results… my favourite so far was the suggested treatment for Hookers: ‘One Bourbon, One Scotch, One Beer’.

Perhaps the W3C’s dream of the ‘semantic web‘ needs a little refinement… before we all end up playing, as Monty Python once termed it, ‘word association football’.
It’s being patched daily… but some of my favourites so far include:
Food and Plants for Obama: Blackberry
Pros and Cons of Oracle: Downright ugly
Complications of Kittens: “Your crazed kitten ruined your last pair of pantyhose”
Food and Plants for Poison: 1-Up Mushroom
There are still some unfixed offensive errors: the entry on Causes of ‘fun’ currently lists ‘Bus passenger’, ‘Nasty spill’ and ‘Underpriveleged black high school student’….. Semantic FAIL indeed.

Post a Comment

You must be logged in to post a comment.