Printer Friendly
The Free Library
14,507,882 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

W3C Completes Bridge Between HTML/Microformats and Semantic Web.


GRDDL GRDDL Gleaning Resource Descriptions from Dialects of Languages (technique for obtaining RDF data from XML documents)  Gives Web Content Hooks to Powerful Reuse and Data Integration

http://www.w3.org/ -- Today, the World Wide Web Consortium completed an important link between Semantic Web and microformats communities. With "Gleaning Harvesting for free distribution to the needy, or for donation to a nonprofit organization for ultimate distribution to the needy, an agricultural crop that has been donated by the owner.  Resource Descriptions from Dialects of Languages", or GRDDL (pronounced "griddle"), software can automatically extract information from structured Web pages to make it part of the Semantic Web. Those accustomed to expressing structured data with microformats in XHTML (EXtensible HTML) A markup language for Web pages from the W3C. XHTML combines HTML and XML into a single format (HTML 4.0 and XML 1.0). Like XML, XHTML can be extended with proprietary tags. Also like XML, XHTML must be coded more rigorously than HTML.  can thus increase the value of their existing data by porting it to the Semantic Web, at very low cost.

"Sometimes one line of code can make a world of difference," said Tim Berners-Lee, W3C (World Wide Web Consortium, www.w3.org) An international industry consortium founded in 1994 by Tim Berners-Lee to develop standards for the Web. It is hosted in the U.S. by the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT (www.csail.mit.edu/index.php).  Director. "Just as stylesheets make Web pages more readable to people, GRDDL makes Web pages, microformat tags, XML XML
 in full Extensible Markup Language.

Markup language developed to be a simplified and more structural version of SGML. It incorporates features of HTML (e.g., hypertext linking), but is designed to overcome some of HTML's limitations.
 documents, and data more readable to Semantic Web applications, opening more data to new possibilities and creative reuse."

Getting Data into and out of the Web; how is it happening today?

One aspect of recent developments some people call "Web 2.0" involves applications based on combining -- in "mashups" -- various types of data that are spread all around on the Web. A number of active communities innovating on the Web share the goal of sharing data such as calendar information, contact information, and geopositioning information. These communities have developed diverse social practices and technologies that satisfy their particular needs. For instance, search engines have had great success using statistical methods while people who share photos have found it useful to tag their photos manually with short text labels. Much of this work can be captured via "microformats". Microformats refer to sets of simple, open data formats built upon existing and widely adopted standards, including HTML HTML
 in full HyperText Markup Language

Markup language derived from SGML that is used to prepare hypertext documents. Relatively easy for nonprogrammers to master, HTML is the language used for documents on the World Wide Web.
, CSS (1) See Cascading Style Sheets.

(2) (Content Scrambling System) The copy protection system applied to DVDs, which uses a 40-bit key to encrypt the movie.
 and XML.

This wave of activity has direct connections to the essence of the Semantic Web. The Semantic Web-based communities have pursued ways to improve the quality and availability of data on the Web, making it possible for more intensive data-integration and more diverse applications that can scale to the size of the Web and allow even more powerful mash-ups. The Web-based set of standards that supports this work is known as the Semantic Web stack. The foundations of the Semantic Web stack meet the requirements for formality of some applications such as managing bank statements, or combining volumes of medical data.

Each approach to "getting your data out there" has its place. But why limit yourself to just one approach if you can benefit, at low cost, from more than one? As microformats users consider more uses that require data modelling, or validation, how can they take advantage of their existing data in more formal applications?

A Bridge from Flexible Web Applications to the Semantic Web

GRDDL is the bridge for turning data expressed in an XML format (such as XHTML) into Semantic Web data. With GRDDL, authors transform the data they wish to share into a format that can be used and transformed again for more rigorous applications.

GRDDL Use Cases provides insight into why this is useful through a number of real-world scenarios, including scheduling a meeting, comparing information from various retailers before making a purchase, and extracting information from wikis See wiki.  to facilitate e-learning. Once data is part of the Semantic Web, it can be merged with other data (for example, from a relational database, similarly exposed to the Semantic Web) for queries, inferences, and conversion to other formats.

The Working Group has reported on implementation experience, and its members have come forward with statements of support and commitments to implement GRDDL.

GRDDL Test Cases is also published today, which describes and includes test cases for software agents to support GRDDL. The Working Group has produced a GRDDL service that allows users to input a GRDDL'd file and extract the important data.

About the World Wide Web Consortium [W3C]

The World Wide Web Consortium (W3C) is an international consortium where Member organizations, a full-time staff, and the public work together to develop Web standards. W3C primarily pursues its mission through the creation of Web standards and guidelines designed to ensure long-term growth for the Web. Over 400 organizations are Members of the Consortium. W3C is jointly run by the MIT Computer Science and Artificial Intelligence Laboratory It has been suggested that and be merged into this article or section.  (MIT MIT - Massachusetts Institute of Technology  CSAIL CSAIL Computer Science & Artificial Intelligence Laboratory (MIT)
CSAIL Computer Science and Artificial Intelligence Lab
) in the USA, the European Research Consortium for Informatics and Mathematics (ERCIM ERCIM - European Research Consortium on Informatics and Mathematics. An association of European research organisations promoting cooperative research on key issues in Information Technology. ) headquartered in France and Keio University in Japan, and has additional Offices worldwide. For more information see http://www.w3.org/
COPYRIGHT 2007 Business Wire
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2007, Gale Group. All rights reserved.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Publication:Business Wire
Date:Sep 11, 2007
Words:733
Previous Article:Explore Why the NAND Technology Market Has Grown to Sales of $16 Billion in Less Than Ten Years.
Next Article:HandHeld Entertainment Brings Exclusive and Entertaining Video Clips from WatchMojo.com to the ZVUE Networks.
Topics:



Related Articles
Kenzo Tange (1913-2005).(outrage)(Obituary)
Dream teams that shape L.A.'s skyline.(SPECIAL REPORT: WHO'S WHO IN REAL ESTATE)(City overview)
Internet Referee: Sergeant Laboratories.(New Products: THE LATEST OFFERINGS IN HARDWARE, SOFTWARE and THE INTERNET)
Rick & Steve: The Complete First Season.(DVD PICKS)
Tax savings opportunities for taxable contract acquisitions.(ACCOUNTING METHODS & PERIODS)
Company Watch - US Airways.
Airline Finance News - Africa / Middle East.
Faith in action.(signs of the times)
Church Father blogger.(catholic tastes)
Personal Trainer Schools

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles