Primed for the Semantic Web
by Leigh Dodds
November 08, 2000
Prompted by Edd Dumbill's "The
Semantic Web: A Primer" article published last week on XML.com,
the RDF
Interest Group have been discussing their own views of the
Semantic Web. This week's column summarizes the discussion and reports
on other progress in the RDF arena.
Vision
Noting that Dumbill's primer article is a useful starting point,
Dan Brickley invited
others to share their vision of the Semantic Web.
This strikes me as a useful article, both in
terms of providing a discussion strawman on what 'Semantic Web' might
mean in practical terms, as well as for surveying some key
technologies, making the point that XHTML, XSLT and mainstream XML
apps will be critical tools for the SW.
Rather than gabble on about what I think Semantic Web might
mean I'm pretty curious to hear what folk on this list understand by
the phrase...
Outlining his own viewpoint, Brickley believed that the
Semantic Web is the Web as it was originally conceived.
... the Semantic Web will be
the Web - we're not talking about a 'new' Web here, just finishing off
the job with the existing one.
With this in mind, Brickley encouraged
others to research Web history, particularly the archived design
discussions on the W3C www-talk mailing
list.
I encourage anyone interested in Future of the
Web ("Semantic Web" or whatever you prefer to call it) to spend some
time digging into Web history. Both the original proposal and the
www-talk archives are worth spending some time reading.
...we need to keep Semantic Web discussions grounded in a
sense of history and evolving Web architecture...
The Semantic Web then is not something that will be built from
scratch: it's the goal many have been pursuing for the last ten
years. Brickley discusses this further in "Nodes and
Arcs 1989-1999, The WWW Proposal and RDF: Then and Now"
Responding to Brickley's call for discussion, Graham Klyne
described his vision of the Semantic Web as an "end-to-end
architecture for content".
The semantic web aims to do for information what
the Internet and WWW have done for data. Today, the Internet, and the
WWW in particular, allow data to be exchanged between an arbitrary
pair of systems, without regard for or discrimination among different
system types, data content or the service to which the data
relates.
Today, information tends to be bound up in the data format
of the application for which it is designed. Without the right
application, the information content of data is inaccessible. The
semantic web will make the information content of exchanged data
accessible to any application that understands semantic web
protocols.
Bill deHora thought that the
Semantic Web is about "significance," something that the Web is
currently lacking.
The big idea behind the Semantic Web is that we
can do more significant stuff than turn some words blue and draw lines
underneath them. We can for example, turn some words blue and draw
lines underneath them and then say "what's on the end of this link is
written by the same person as this is". Link significance makes the
difference and significance is in the eye of the beholder. In other
words the current Semantic Web isn't very rich in
significances. Richness was traded for Acceptance, a very useful
deployment pattern, but you borrow some trouble
nonetheless.
Echoing deHora's final comment, Matt Jensen drew a parallel between
descriptions of the Semantic Web and the flaws in our current
situation. Jensen noted that the successes of the Web were realized
by
sacrificing the scope of what was trying to be achieved.
There were many people in the 80s working on
hypermedia systems, and a significant reason that they stalled and the
WWW took off is that they cared about ensuring consistency,
bidirectional links, etc., and Tim was willing to let go of that. The
result is >1 billion WWW pages, and probably >10 billion links.
A small percentage of the pages are broken, but on the whole the WWW
provides tremendous value.
Similarly, I view most of what has been done in AI as
focused on consistency, correctness, etc., which (so far) has
limited the successes it can claim. If you're looking for a
Semantic Web that can give you "truth", we've got a long wait. If
you're looking for something that improves search results through
related concepts and simple inferences, in a few years you should be
able to get something that's useful, but not perfect.
Seth Russell also commented
on the importance of acceptance.
There is a trade off between precision of
expression and growth and acceptance. The more precise and logically
consistent we insist upon RDF validations, the more difficult we make
implementations, and the less likely anyone will use them. This thing
should unfold like an onion, with the outer shells being super simple
and robust and the inner shells being rich and complex. But we'll
never survive long enough to get to that rich inner core unless the
outer shells gain wide acceptance.
The interesting aspect to this discussion is the positioning of the
Semantic Web as an evolutionary progression rather than alternative
architecture. It shows signs that the RDF community is keen to begin
identifying areas for development, converting the lofty aims into
realizable milestones. Projects such as the DAML Program are amongst the first to start
tackling the Semantic Web head-on.
Momentum
As Dumbill's primer noted, there are existing tools that can be
used to begin developing the Semantic Web. But the RDF community is
not letting the grass grow under its feet: more tools are appearing
all the time.
In the past week a new version of Redland,
the RDF Application Framework, has been released, along with an early
version of RDFStore. RDFStore
is a series of Perl modules used for manipulating RDF databases. Of
course, if you have an RDF database, then you'll be needing an RDF
query language. Enter RQL,
which "propose[s] a new data model and a query language for RDF
descriptions and schemas."
The RQL effort seems initially targeted at supporting the requirements of
the many Community Portals which are springing up across the web. In fact,
even the Semantic Web now has its own community site: SemanticWeb.org. The introductory pages
note that:
[w]ork is going on to realize tools and
techniques, which will help to create the Semantic Web. This site is
dedicated to collect these approaches, to explain them and to be a
forum for people interested in the Semantic Web.
And there is now even a
Semantic Web Journal that's concerned with
modeling semantics of web information, and
covers theory, methods, and applications.
The surge of interest in RDF which the Deviant has noted
over the last few months appears to be continuing with some of that
momentum being transferred to Semantic Web efforts.