Abstract
The separation of content and presentation information is one of the principal benefits of applying XML technologies to the electronic publishing domain. This concept has been successfully demonstrated with textual publications using a combination of XML and XSL. However, the same model has rarely been applied to the production of graphics and images. Adaptive graphics (a.k.a. data-driven graphics or customised/personalised graphics) is the term used to describe graphics that are produced through a combination of content and presentation information, usually stored in separate data files.
This paper examines the concepts of adaptive graphics and their relationship to well established document publication methodologies. It also outlines possible applications and hypothetical solutions with the help of examples from different application domains. Some of the most obvious scenarios include statistics graphs and diagrams such as event maps or venue planners that need to be updated on a regular basis. A more advanced scenario is the creation of graphics that are localised and/or customised according to user preferences and requirements.
Keywords
Table of Contents
Since the emergence of XML in early 1998 and it's subsequent adoption across diverse application domains, one of the key benefits it enabled was the separation of content and presentation [Bos97]. XML borrowed this model (along with other important concepts) from the Standard Generalised Markup Language (SGML). An SGML document consists of logically structured content and uses a separate file (style sheet) to specify how the content should be formatted for presentation. For this purpose, the Document Style Semantics and Specification Language (DSSSL), was developed. It formats SGML documents for presentation/printing, but can also transform the physical structure of the document. XML documents can be transformed and formatted in a similar manner using the extensible Stylesheet Language (XSLT), a style sheet language developed specifically for XML. In addition to transformation and formatting, other layers have emerged (and been defined through formal specification) to complete the conceptual reference architecture for XML publishing. These additional layers are generally grouped into schema (Document Type Definition (DTD), XML Schema, RELAX-NG), logic (XSP [Apa02]) and semantic layers (RDF [RDF00], TopicMaps [XTM01]). In the electronic publishing domain, the advantage of separating content from transformation and presentation information has proven very successful, particularly where content must be customised and delivered in many output formats (e.g. to suit different devices and applications). Analogous to database design normalisation techniques, final document representations were "normalised" into their basic architectural components (content, transformation, presentation). Figure 1 shows relationships and typical cardinalities between the individual components using the entity-relationship syntax.
The four-level publication architecture Table 1 illustrates the publishing workflow that occurs in order to generate a final document:
| Content Layer (CL) |
The CL represents the physical storage or data-source for the basic content of the publication (e.g. document or image). Storage technologies could range from traditional file systems (e.g. plaintext files, files with comma-separated values, XML files) to document management and database systems (relational and object-oriented). Additional processing may be necessary to acquire the content in XML syntax. In many cases, database and document management systems support transformation from internal representations (e.g. database result sets) into XML syntax. |
| Transformation Layer (TL) |
The TL receives the source data from the CL in an XML format (possibly marked up according to a specific DTD, e.g. DocBook XML). It transforms the data into the target schema (Presentation layer DTD, e.g. XHTML) using transformation technologies (e.g. XSLT, DSSSL). The transformation process may modify the source data in a number of ways including deletion (filtering) and modification of information as well as generation of text such as "Table of Contents", internal cross-references (textual publishing) and "Legend" (graphical publishing). |
| Presentation Layer (PL) |
The PL does not physically change any of the textual information passed from the TL layer. It defines application-specific properties for presenting the document on the user's hardware device, e.g. CSS for visual and aural presentation. |
| Application Layer (AL) |
Finally, the AL receives the data in an expected data format and renders it using the appropriate application (e.g. HTML browser, PDF viewer). The AL uses information from the PL to render XML syntax consistent with user and/or application needs. |
The most widely used graphics formats on the Web are based on raster technology, where an image is composed of a flat two-dimensional grid of pixels. A pixel is the basic unit of programmable colour on a computer display or in a computer image. Graphics packages often allow images to be assembled from various graphical objects (e.g. geometrical shapes, text, other images) grouped on virtual layers. Saving the image as a raster image flattens the graphical objects into a grid of pixels removing whatever logical structures may be present. At this point, making minor changes to the image such as changing a piece of text becomes very difficult.
Vector Graphics offer a more flexible alternative to raster graphics whereby an image is composed of graphical objects (e.g. geometrical shapes, text etc.) each of which can be individually selected and/or modified. Furthermore, a saved vector graphic preserves its logical (object-) structure rather than flattening it. There is no increase in complexity between modifying a vector image at creation time or at a later stage. As a result, maintaining a vector graphic image is generally far easier than maintaining a raster graphic image. Macromedia Flash is one of the most popular vector graphic formats available and is on widespread use on the Web.
In 2001, the Scalable Vector Graphics (SVG) Specification was published by the W3C. This specification defines a vector graphic format using XML syntax that can be used to produce graphics for the Web and beyond. Like other content formats based on XML, SVG is non-proprietary, platform independent and interoperable with any existing XML aware software and tools. SVG is an important development in Web graphics and may prove to be as significant in the graphics domain as SGML and XML have been in the publishing domain.
Analysis of the logical image publishing architecture exposes equivalent layers to the document publication model. However, the tasks and properties of each distinct layer are not as obvious as they are for textual content. Irrespective of the final graphics format, the following four tiers (with connecting interfaces) can be identified:
| Content Layer |
There is no significant difference in the CL for textual and graphical content. Graphics content can be stored in content management systems, general databases or plain text files depending on the designer's hard- and software equipment. Often, these storage systems are also connected to graphic software systems such as Computer Aided Design (CAD)/Computer Aided Manufacturing (CAM) and Geographic Information Systems (GIS). Though, it is not ruled out that graphical data is already stored in an XML graphics format (e.g. SVG). As mentioned before, the CL has to pass information to the next layer in XML syntax in order to support further XML processing. |
| Transformation Layer |
Equivalent to the document publishing architecture, the TL serves two central purposes: Firstly, the conversion from the content format into an XML based graphics format, e.g. SVG, Computer Graphics Metafile (CGM)[CGM92], to be passed on to the Presentation Layer. Secondly, programmatic modifications such as additions, updates and deletions of information are important tasks of the TL. |
| Presentation Layer |
The PL, analogous to it's function in the model for document publishing, does not have an immediate effect on the textual content resulting from the TL. Instead, it defines presentational properties that can be used (together with the TL outcome) in the Presentation Layer to visualise the resulting graphic (e.g. CSS). |
| Application Layer |
The AL assures compliance with existing standards to present the final image. It uses the resulting content from the TL together with presentation information from the PL to visualise the resulting image. Examples of AL implementations suitable for SVG include Adobe's SVG viewer [Ado02] and Batik from Apache [Bat02]. The latter allows the final image to be converted into raster image formats (e.g. JPG or PNG). |
Table 1.
| Textual Data | Graphical Data | |
|---|---|---|
| Application Layer | e.g. XHTML | e.g. SVG |
| Presentation Layer | e.g. CSS, FO | CSS |
| Transformation Layer | e.g. XSLT, DSSSL | |
| Content Layer | Various XML vocabularies (DTD, XML schema), (XML-) databases |
This paper outlines two adaptive graphics categories, each with distinct characteristics. On the one hand, graphics with qualitative changes are those, which will remain valid and useful for long periods of time without being changed. Modifications are only required to adapt the content of the illustration (and here mostly textual content) to local conventions (localisation). They often occur in combination with localisation of documents. On the other hand, graphics with quantitative changes are defined as images with parts that undergo changes on a regular basis. Pictures that require both qualitative and quantitative changes are also likely and defined as having mixed changes.
In all examples, we detect components of the image that can be grouped into static and dynamic layers. The static parts remain unchanged in each version of the graphic whereas the dynamic part may change according to the users needs and requests.
The static part (here: the background) of Entertainment Maps contains a plan (layout) of the area including e.g. streets, houses and waterways. The most common example will be a street guide or area map, containing an outline of all places of interest for the user. The image contains qualitative (e.g. names and descriptions) as well as quantitative (e.g. opening times) change areas.
Depending on his or her preferences, a user can select from a list of attractions including historical monuments and sporting/entertaining venues, all of which can be integrated into the image during the transformation process. A legend with explanations of the selected attractions is also automatically generated and inserted.
Examples of time critical changes include:
Opening hours of shops and museums,
Collection times at letterboxes,
Special events at museums and theatres,
Departures of public transport vehicles (e.g. buses and trains),
Availability of accommodation,
Volume of traffic at selected points in town.
Each of the above mentioned examples could then be rendered according to a distinct presentational scheme, e.g.
Shops appear in different colours according to their opening status,
Streets appear in different colours according to the traffic situation.
The qualitative changes in this scenario could adapt the information available on the map to each users nationality (localisation) or visualisation requirements (e.g. high contract colours). Both textual adaptation (e.g. translation of content) as well as graphical adaptation (e.g. icon variations for different cultures) are part of the publication process.
The modelling process for creating and visualising conference maps is similar to entertainment maps. They consist, of a static part (the architectural layout of the venue, e.g. rooms, corridors, exhibition spaces/booths) and the dynamic components that can be modified according to quantitative and qualitative changes. This layer includes e.g. logos of booth occupants at specific dates and room seating layout.
Adaptive Graphics will play an important role in the Medical Imaging domain. Here, the dynamic part can be the background image (e.g. X-ray, MRT (Magnetic Resonance Tomography)) as well as the outlined view of various body-part shapes of the specified area. Two scenarios shall be examined here:
The first situation is envisaged for educational purposes: The static part contains an X-ray from the abdominal part of the body. Using adaptive graphics, the user/student can select and place dynamic layers with sketches of the different organs to e.g. identify exact locations. A possibility to manually mark and highlight interesting points can be seen as important preparation for 2.).
In the second scenario, the background image becomes the dynamic layer of the adaptive graphic. It changes from, e.g. an X-ray to an MRT or even a medical textbook image. According to the points made in the previous section, navigation within the image is simplified.
Meteorological maps undergo continuous modifications according to the core (quantitative and qualitative) transformation model. Two main logical components can be identified, the geographical layer (e.g. countries, mountains and waterways) and the meteorological layer (e.g. isobars, isotherms, sun and rain):
The geographical layer is merged with the meteorological layer to present climate and weather conditions for a specific region at a specific time. Nevertheless, the geographical layer is not a static layer in this scenario. It also provides adaptability to the user's needs with information of e.g. political, statistical data.
Meteorological Maps often have to be localised to support regional variations of names, numbers and units (e.g. conversion from Celsius and Fahrenheit and vice versa).
Examples of all three changes (qualitative, quantitative and mixed) are given in Figure 4.
The purpose of separating graphical and textual content of cartoons simplifies the process of localising the story to the user (language) and therefore widens the audience with minimal additional effort.
Both parts are obvious at first sight. Whereas the static part contains all background information (i.e. the image), the dynamic part holds information about scene, speaker and text. On request, both data files are processed and the final image is generated before sent to the reader.
Adaptive graphics technology physically separates the graphical and the descriptive elements within an image. Graphical elements include shapes such as circles and squares and their relation to each other. One possible format for representing graphical elements is SVG. Depending on the requester's properties, the SVG file is processed and completed with descriptive information and transferred in SVG format to the client.
In this case, the graphical part of the image (e.g. the layout of the rooms in a conference centre) is static and the most persistent part of the image. It is stored separately from the event specific data, e.g. occupation of rooms, title of presentations. On request, both files are merged and transformed into the final, data driven and customised image (adaptive graphic).
Another important area of adaptive graphics is the localisation domain of technical diagrams. Having separated the graphical from the descriptive elements, diagrams from domains such as the automotive, aeronautical or medical domain can easily be fitted with textual and graphical information in the reader's native language and according to their culture.
It is widely accepted that separation of content and presentation information within the area of electronic publishing is important. Equally we have learned that normalisation of graphical content can yield similar benefits for the creator and user as well as the person responsible for maintaining the content. SVG is an important technology and is well suited to the process of customising and adapting graphics to users' needs. It has the potential to be as liberating for the graphics community as XML has been for the publishing community.
Should we also consider normalising the remaining multimedia formats, i.e. audio and video? What advantages can we achieve by doing so? Does the overhead involved in creating the necessary metadata outweigh the possible advantages? Can we use existing specifications such as Synchronized Multimedia Integration Language (SMIL)[SMI01] and HyTime [HyT97] as a basis for this? These are interesting questions, complex to specify and implement.
[Bos97] Bosak, J., Overview: XML, HTML, and all that, Sun Microsystems, presented on April 11, 1997, http://www.oasis-open.org/cover/bosakWWW6-over.html.
[CGM92] International Standards Organisation (ISO), Computer Graphics Metafile - Metafile for the Storage and Transfer of Picture Description Information, ISO/IEC 8632:1992.
[HyT97] International Standards Organisation (ISO), Hypermedia/Time-based Structuring Language (HyTime), ISO/IEC 10744:1997, 2nd Edition.
[RDF00] W3C, Resource Description Framework, (RDF) Schema Specification 1.0, W3C Candidate Recommendation 27 March 2000, http://www.w3.org/TR/rdf-schema/.
Application Layer
Computer Aided Design
Computer Aided Manufacturing
Computer Graphics Metafile
Content Layer
Document Style Semantics and Specification Language
Document Type Definition
Geographic Information Systems
Presentation Layer
Standard Generalised Markup Language
Synchronized Multimedia Integration Language
Scalable Vector Graphics
Trinity College Dublin
Transformation Layer
extensible Markup Language
extensible Stylesheet Language
![]() ![]() |
Design & Development by deepX Ltd. 2002 |