Applying Semantic Web Ideals
The New York Times recently ran a story about the notion of the Semantic Web. You have to be a subscriber to access the article on the paper's site, so here's an excerpt:

"From the billions of documents that form the World Wide Web and the links that weave them together, computer scientists and a growing collection of start-up companies are finding new ways to mine human intelligence.

"Their goal is to add a layer of meaning on top of the existing Web that would make it less of a catalog and more of a guide -- and even provide the foundation for systems that can reason in a human fashion. That level of artificial intelligence, with machines doing the thinking instead of simply following commands, has eluded researchers for more than half a century.

"Referred to as Web 3.0, the effort is in its infancy, and the very idea has given rise to skeptics who have called it an unobtainable vision. But the underlying technologies are rapidly gaining adherents, at big companies like I.B.M. and Google as well as small ones. Their projects often center on simple, practical uses, from producing vacation recommendations to predicting the next hit song.

"But in the future, more powerful systems could act as personal advisers in areas as diverse as financial planning, with an intelligent system mapping out a retirement plan for a couple, for instance, or educational consulting, with the Web helping a high school student identify the right college."

The article goes on to discuss social networking sites, artificial intelligence, and the ontology and taxonomy efforts of Cycorp and IBM, both working to build a layer of intelligence across the entire web. Bloggers from all corners of the web critiqued the piece, ridiculing the use of the term "Web 3.0" and expressing skepticism over the existence of Web 2.0.
Most articulate, in my opinion, was Nick Bradbury, architect of client solutions at NewsGator, who wrote, "The goals of the Semantic Web are good ones, and I believe many of those goals will be met in my lifetime. But too much of the Semantic Web relies on data being valid - that is, valid XML, XHTML, RDF, etc. - and too many of us will never publish valid data. … If the Semantic Web hopes to exist, it's going to have to deal with invalid HTML, badly-formed XML, and RSS with vague entity escaping. It's also going to have to filter out every new variation of spam, and be smart enough to know when people lie. The Semantic Web may happen, but if it does, it's going to be a helluva lot messier than the architects would like."

These are great points -- data formatting and data quality are serious issues on the web and probably always will be. I wonder if it's even possible to create layers of meaning that would be universally understandable and intelligible to everyone, or to filter out inaccuracies and junk across the web. As Albert Einstein pointed out, "Whoever undertakes to set himself up as a judge of Truth and Knowledge is shipwrecked by the laughter of the gods."

But within companies and websites, a semantic layer can be initiated -- albeit with much time and care -- with the development of ontologies and taxonomies that provide definitions and structure to categories of content. We look at these in detail in our December cover story, "Search in Focus."

Do you have any thoughts on this topic? Please email me at pcrosman@cmp.com.












