CambridgeDocs Previews xDoc Converter at XML Conference and Exposition; New Tool Unlocks Value of Unstructured Content by Introducing XML Structure.Business Editors/High-Tech Writers XML XML in full Extensible Markup Language. Markup language developed to be a simplified and more structural version of SGML. It incorporates features of HTML (e.g., hypertext linking), but is designed to overcome some of HTML's limitations. Conference & Expo 2002 BALTIMORE--(BUSINESS WIRE)--Dec. 10, 2002 CambridgeDocs, a leader in the emerging market for XML-based content integration, is demonstrating its soon to be released xDoc Converter, a tool for migrating unstructured content from legacy sources, including Microsoft Word, HTML HTML in full HyperText Markup Language Markup language derived from SGML that is used to prepare hypertext documents. Relatively easy for nonprogrammers to master, HTML is the language used for documents on the World Wide Web. , and Adobe PDF (Portable Document Format) The de facto standard for document publishing from Adobe. On the Web, there are countless brochures, data sheets, white papers and technical manuals in the PDF format. documents into any XML schema (XSD (XML Schema Definition) The informal name for the XML schema from the W3C. See W3C XML Schema. XSD - XML Schema Definition ) or DTD (Document Type Definition) A language that describes the contents of an SGML document. The DTD is also used with XML, and the DTD definitions may be embedded within an XML document or in a separate file. for improved searching and indexing across the enterprise at the XML Conference. The xDoc Converter can work in any industry and can migrate to any DTD or XML Schema, such as DocBook, LegalXML, HR-XML HR-XML Human Resources eXtensible Markup Language , NewsML, SCORM SCORM Shareable Content Object Reference Model (web-based e-learning standard) SCORM Shared Courseware Object Reference Model SCORM Shareable Courseware Object Reference Model , XHTML (EXtensible HTML) A markup language for Web pages from the W3C. XHTML combines HTML and XML into a single format (HTML 4.0 and XML 1.0). Like XML, XHTML can be extended with proprietary tags. Also like XML, XHTML must be coded more rigorously than HTML. . Subsequent products will address other issues of content interoperability in the enterprise. Companies are investing millions of dollars to buy and implement content management systems, document management systems, and enterprise portals. However, many of the benefits associated with these new systems for managing enterprise content are lost without the ability to bring old documents into these systems in a meaningful way. XML provides a presentation-independent way to represent content, and is fast becoming the standard way to create new content. Unlike other tools, that can only generate stylistic XML, which looks like HTML, the xDoc Converter can actually extract meaning out of the document and assign it to appropriate tags. For this reason, the xDoc Converter can generate semantic or "meaningful" XML, which can be very complex schemas defined within each industry. "To date, there has been a big gap between how documents exist today - as unstructured Microsoft Word documents, HTML files, text files, PDF files - and how they will exist in the future as part of a enterprise content management and publishing strategy - which is predicated on them being structured XML documents," said Rizwan Virk, Chairman and CEO (1) (Chief Executive Officer) The highest individual in command of an organization. Typically the president of the company, the CEO reports to the Chairman of the Board. of CambridgeDocs. "This 'content gap' was because all of the existing documents needed to be re-typed or re-formatted in order to make them available or people had to write conversion code. Our goal is to narrow that gap with the xDoc Converter by making virtually seamless content integration possible regardless of the file and its original format." Prior to xDoc, companies had to write lots of custom parsing and formatting code, or had to manually re-type documents. With the xDoc Converter, companies can quickly and easily migrate large amounts of legacy content into meaningful XML. Many organizations are moving to manage their content - documents, memos, reports, intranet pages, brochures, and other documents- as XML because of its inherent ability to support the management and publishing of content. The benefits of having documents in XML include: -- Separation of content from presentation -- Write once, publish anywhere - from XML to HTML, WML, PDF, etc. -- Save time and money from reduced authoring and publishing costs -- Ability to assemble new documents from existing pieces more effectively About CambridgeDocs CambridgeDocs is a leader in the emerging market for XML-based content integration. This market deals with the integration of legacy content with new XML-based systems (e.g. Content Management, Enterprise Information Portals, EAI, and Web Services) and standards (e.g. DocBook, HRXML HRXML Human Resources eXtensible Markup Language HRXML Human Resources Xml , RIXML RIXML Research Information Exchange Markup Language , IRXML, FPML, DAS-XML, NewsML, any custom XML schema/DTD's, etc.). Towards this end, CambridgeDocs provides a technology platform & services for taking existing unstructured and semi-structured internal and external content (e.g. MS Word, HTML, PDF, Quark, etc.), and transforming it into "meaningful XML". Once transformed, the content can be made available for delivery through XML-based Web Services, classified and indexed within Enterprise Information Portals, and aggregated, assembled and published in multiple different formats including support for wireless and mobile devices. |
|
||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion