CambridgeDocs unveils XML conversion tool

news
Jan 13, 20032 mins

CAMBRIDGEDOCS ON MONDAY took the wraps off a new tool designed to migrate unstructured content from legacy sources into any XML schema for improved searching and indexing. Through a point-and-click interface, the xDoc Converter can take content from Microsoft Word and Adobe PDF documents, HTML, Quark, and other formats and transform it into “meaningful” XML, such as DocBook XML, LegalXML, NewsML, and SCORM, which can be used for content management, publishing, or other applications, according to company officials in Boston. One of the biggest challenges in deploying a CMS (content management system) is converting large amounts unstructured information into XML schema so it can be management by a CMS, said Irfan Virk, CEO of CambridgeDocs. “Up to 40 percent of effort in deploying a CMS is converting the content into the system,” he said. Once content is stored in XML it can be republished easily into a variety of different formats for the Web, handheld devices, and printers, Virk said. Having content appropriately tagged in XML can also help reduce authoring costs because fragments can be pulled from different sources to create personalized Web pages or reports, he said. The xDoc Converter is a J2EE platform with a .Net interface, and it runs on Windows 2000 and XP.