Abstract
This paper/presentation discusses the development of Comic Book Markup Language (CBML) , or Comic Book Markup Language, an XML vocabulary (with Document Type Definition (DTD) and schema representations) designed to accommodate the XML encoding of comic books and graphic novels.
With the emergence of scholarly disciplines such as cultural studies and new areas of interest in traditional scholarly fields such as English and American studies, comic books and graphic novels have recently become the subject of serious critical attention. Additionally, comic books and the mythologies they have spawned continue to be a vital part of our popular culture and national consciousness. Witness the surprising and almost unprecedented popularity of the recent [ Spider-Man ] feature film. Before that, the [ X-Men ] comic book hero film was also enormously successful. In the 70s, 80s, and 90s, the [ Superman ] and [ Batman ] film franchises produced regular blockbusters. In 1992, Art Spiegelman won the Pulitzer Prize for his [ Maus ] , a comic book narrative of holocaust survival. More recently in 2001, Michael Chabon won the Pulitzer Prize for his novel The [ Amazing Adventures of Kavalier & Clay ] , which relates the experience of two Jewish cousins working in the nascent comic book industry at the beginning of WWII. These few examples, which are frequently discussed in school and university classrooms, serve to demonstrate the continuing and perhaps increasing importance of comic books as an art form and cultural touchstone.
Unfortunately, many of the comic books that might be possible subjects of scholarly and critical attention are not widely available to researchers. The reprints currently available are generally expensive and/or woefully incomplete. For instance, the Marvel Comics "essential" series provides affordable reprints of many of Marvel's classic comics, but these editions are printed on cheap newsprint and in black and white, lacking the bold colors of the original comics. In addition, the vast majority of comic book reprints lack the many interesting advertisements that are an integral part of these publications—especially when they are considered by scholars as cultural artifacts. For instance, when studying [ The Fantastic Four ] comic books from the 1960s, it is enlightening to juxtapose the sometimes stereotypical female roles assigned to Sue Storm, the Invisible Girl, the lone woman among the super hero team, with the advertisement recruiting young boys to sell [ Grit ] newspaper—the ad contains a mail-in form which requires the sender to answer the question, "Are you a boy?"
In order to facilitate the preservation, study, and analysis of these important cultural artifacts, the original comic books need to be digitized, and an XML vocabulary suitable for capturing the varied and complex data and metadata of the comic book needs to be developed. My presentation and paper will discuss some of the challenges encountered in developing the CBML XML vocabulary, and in keeping with the conference theme of technological integration and interoperability, I will also discuss issues related to the integration of XML and XML query technologies with various imaging technologies to provide users with an interface that combines XML-encoded textual data and metadata with various representations of the digitized page image. Comic books and graphic novels present a unique combination of text and graphics. The text—from the familiar speech and thought balloons to the graphically rendered POW! SMASH! BANG! sound effects—is inextricably bound with the image. The digitized comic book—no matter how meticulously encoded—cannot be sufficiently represented in XML alone; the page image is also required. An interface that integrates the comic book page images with XML-encoded textual data and metadata provides an extremely powerful tool for researchers, scholars, and students interested in comic books as art form and cultural touchstone.
Keywords
Table of Contents
This paper discusses the development of Comic Book Markup Language, a XML vocabulary, with both DTD and schema (W3C and RELAX NG) representations, for encoding comics, comic books, and graphic novels. For further introductory and background information, please see the abstract above. In the sections that follow, I do not give a detailed explanation of the DTD and all its elements and attributes. Instead, I provide an overview of the data I am seeking to encode, the motivations behind the development of CBML, the importance of metadata to the project, and the basic structure of comics. I also provide a number of illustrations and examples. The DTD and schema and other detailed documentation about the project will be available by the time of the conference at http://www.cbml.org/.
The intention of CBML is to provide a useful XML vocabulary for encoding documents that are variously called comics, comic books, and graphic novels. So what are these things? In his [ Understanding Comics ] , Scott McCloud provides a useful definition of comics: "Juxtaposed pictorial and other images in deliberate sequence, intended to convey information and/or produce an aesthetic response in the viewer" [ McCloud ] . This definition makes some interesting distinctions. It emphasizes sequence and juxtaposition, and so excludes single panel comic strips such as Gary Larson's [ The Far Side ] or Bil Keane's [ Family Circus ] . For McCloud, sequence and juxtaposition are so integral to the aesthetic effect and functionality of comics that their presence is required. As McCloud notes, "such single panels might be classified as 'comic art' in the sense that they derive part of their visual vocabulary from comics. . . . —but they are not the same thing" [ McCloud ] . Because they uses similar vocabularies, the single panel cartoon, as McCloud prefers to call it, can be encoded using CBML, but the emphasis in the development of CBML is the encoding of comics that juxtapose sequenced images. Also notice that McCloud's definition never mentions text. In my abstract/introduction, I emphasize the integration of image and text in comics, and indeed, the vast majority of comics employ text to a great degree, but text is not a requirement of the medium. For instance, Marvel Comics recently had and event dubbed "'Nuff Said," in which during a single month most of their major titles published an issue that used no caption or dialogue text and relied solely on images and artwork to tell the story. And many comics that do incorporate text will, nonetheless, include long sequences of panels that do not include text. The figure below depicts a full comic page without any narrative or dialogue text.
Of course, one will notice that there is still a lot of text in these panels; however, the text is part of the image, not the superimposed captions or balloons that are more commonly thought of as the "text" of a comic. How to deal with this integrated text, and distinguish it from the other text of the comic, is one of the many challenges faced in developing an XML vocabulary for encoding comics.
McCloud does an excellent job providing an abstract definition of comics. The comic book, however, is much more than "[j]uxtaposed pictorial and other images in deliberate sequence, intended to convey information and/or produce an aesthetic response in the viewer." The comic book is comics plus advertisements, plus fan mail, plus other prose, such as columns from the editor or news items about upcoming comics and comics-related events. A vocabulary that seeks to encode the entire comic book must accommodate the many varied types of content, or sub-documents, that may be found within a comic book. See the images below for examples of the non-comics content that one may find in a comic book:
The motivations behind the creation of CBML are the same as those shared by many other scholarly digitization projects. One motive is preservation. Comic books, especially older ones, are typically printed on cheap, high-acid newsprint. They are in constant danger of decay and irreparable loss. By digitizing these important artworks we preserve them for our own and future generations. A second motivation is access. Many comics are rare, expensive, highly-prized collector's items. While many rare and other comic books are often available in affordable reprints, the reprints, more often than not, fail to reproduce the entire comic book with the all-important context of the original advertisements, indicia, and so on. By digitizing these comics they are made accessible to a far greater number of scholars and students. In many cases, copyright restrictions prohibit one from making these digitized artifacts available for free over the Internet. But even when access is controlled in accordance with applicable laws and agreements, the digitized copy provides for greater access than is provided by a single (or a few) print copies of the original. Multiple students, for instance, may access the same digitized work simultaneously. And scholars and students and other researchers may, with the proper authentication, access a work from their office, dorm room, or home. A third motivation is increased functionality. The digitized version offers functionality not available in the original print version of the comic book. This added functionality includes the ability to perform on the data full-text searches and complex queries facilitated by added and regularized metadata and the rich and detailed encoding added to the original document. If, for instance, one wants to find every appearance of the character Galactus in the first one hundred issues of the [ Fantastic Four ] comic book, the only way one can do this with the print comic books is first to acquire access to the first one hundred issues of the [ Fantastic Four ] and then to read the text and study the artwork on each page of each issue. On the other hand, with the digitized collection containing metadata elements such as thorough character lists, one could quickly find this information using a simple query.
As mentioned in the previous section, one motivation behind the creation of CBML is to provide added value and functionality to the print originals. Much of this added value and functionality is dependant upon the metadata elements and attributes that are included in CBML. Metadata are, of course, data about the data--data that describe the data. In CBML, the data are the original comics, or comic books, or graphic novels. The metadata is, most obviously, the markup that is added to the original comic. The metadata also include added data elements and attributes that are not explicitly available in the original comic book. For example, the splash page of a comic may list Adam Austin as the artist. Nowhere in the comic book does the name Gene Colan appear. But the trained encoder knows that Adam Austin is a pseudonym for the famous comic artist Gene Colan. So the encoder may encode the name as follows:
<name type="person" regularized="Colan, Gene">Adam Austin</name>
Here, the added metadata, which are not explicit in the original, would allow a scholar researching the work of Gene Colan to find comics drawn not just by Gene Colan but also by his pseudonymous identity Adam Austin. This resolution of "true" or legal identities with pseudonymous identities is a function of metadata.
Character lists are another case in which added metadata may be extremely useful. I previously mentioned the case of a researcher searching for all appearances of the character Galactus in selected issues of the [ Fantastic Four ] . But the character may appear in the comic, even though his name never appears in the text. He may be referred to by a nickname; or his image may appear, without any textual reference, as a shadow in the final panel of the comic book, foreshadowing his appearance in the next issue. These appearances would not be uncovered by a full-text search of the original textual data of the comic. However, if such an appearance is noted in a metadata element such as a characterList or in the characters attribute of a panelDescription element, this metadata may be searched to retrieve more accurate information about a character's appearances. And of course super heroes are a common subject in comics, but super heroes go by many names. Spider-Man, for instance, is also know as Spidey, Web-Head, Wallcrawler, and other nicknames. And Spider-Man also has a secret identity and may be referred to as Peter Parker, or his Aunt May may call him Petey. If one is searching for all appearances of or references to Spider-Man or Peter Parker, one may have to search for dozens of different names and epithets. However, if all references to the character Spider-Man, regardless of the variation on his name or identity, are encoded with an IDREF attribute that points to a single authoritative and regularized name with a description of the character and a list of the multiple name and identity variations, one can easily find all the appearances of and references to that character without having to conduct multiple or complex searches--one simply searches for elements containing an IDREF matching the character in question.
The development of CBML began as an effort to encode a single representative instance of a comic book. I chose a fairly standard or traditional comic book from the 1970s by Jack Kirby, one of the towering giants of twentieth-century comics. By encoding this single representative instance of a comic, I have encountered many of the metadata, structural, and vocabulary issues one is likely to encounter in a majority of comics. Then moving past this single instance, I have looked at some more experimental examples in which the structure and layout of panels and text differs somehow from standard practice. These exceptions to the norm have forced me to consider many other possibilities and opened up the DTD/schema to deal with a wider variety of approaches to comics and comics art. As the development of CBML continues, I and others who join the development process will continue to look for challenging examples and modify CBML accordingly so that it will provide an appropriate vocabulary for encoding most, if not all, comics.
In developing CBML I have relied on the many excellent examples provided by other XML vocabularies, particularly the Text Encoding Initiative ( Text Encoding Initiative (TEI) ) Standard Generalized Markup Language (SGML) and XML markup language. The TEI is designed to encode existing texts, include novels and other prose, poems, dramas, and dictionaries. The TEI provides an extensive vocabulary and a documented and standard method for extending and modifying the stock vocabulary. It is widely used in the academic and digital library communities to encode existing scholarly and other texts. The text of a comic book, captions and speeches or thoughts by characters, shares much in common with other literary forms, such as the novel and the drama. For encoding these features of the text, the TEI provides a useful model. Other features of comics, such as the often complex arrangement and sequence of panels, sound effects (e.g. POW!, BAM!, KRASH!), and advertisements are less readily described by the stock TEI vocabulary. Rather than extend the TEI by adding and renaming elements, redefining content models, and so on, which would result in a document type difficult to recognize as having originated from the stock TEI, I have found it preferable to build a new XML vocabulary specific to comics, comic books, and graphic novels, borrowing where appropriate from the TEI and other vocabularies. XHTML forms, for instance, provide a useful model for encoding the many cut-out mail order forms that are found in the advertisements in comic books.
In most cases, the basic structural element of the comic book is the panel. But as noted earlier, the single panel cartoon is not comics or a comic book, but something else. In comics, panels are juxtaposed and have a sequence--sometimes clear, sometimes ambiguous--and are arranged compositionally in groups. So CBML provides elements such as panelGroup and panel. The panelGroup element contains a panels attribute to indicate the total number of panels, and the panel element includes a n (number) attribute to indicate the panel's place in the panelGroup sequence. Below is a sample comic book page. The layout is in a traditional grid pattern; panels are self-contained with no overlapping or sub-grouping:
And here is the same page encoded in the current, pre-release version of CBML:
<div type="story page" unit="page" extent="1">
<pageBreak n="11" />
<panelGroup panels="6">
<panel n="1">
<panelDescription characters="cap anon_man" />
<!--
characters attribute is of type IDREFS
and references character IDs from the
characterList in metadata section
-->
<caption>
<paragraph>
Cap is
<emphasis style="bold">
completely
</emphasis>
absorbed in his find. He
<emphasis style="bold">
ignores
</emphasis>
the danger about him as he toys madly with it...
</paragraph>
</caption>
<balloon speaker="anon_man" type="speech">
<!--
speaker attribute is of type IDREFS
and references character IDs from the
characterList in metadata section. It is
of type IDREFS instead of IDREF since a single
speech may be spoken simultaneously by multiple
characters, e.g., "Avengers Assemble!"
-->
<paragraph>
<emphasis style="bold">
Hey!
</emphasis>
Drop that!
<emphasis style="bold">
I
</emphasis>
want it!
</paragraph>
</balloon>
<balloon speaker="cap" type="speech">
<paragraph>
It's
<emphasis style="bold">
ugly! Very ugly!
</emphasis>
</paragraph>
</balloon>
</panel>
<panel n="2">
<panelDescription characters="cap anon_man" />
<balloon speaker="anon_man" type="speech">
<paragraph>
I
<emphasis style="bold">
don't
</emphasis>
care! If you don't give it to me,
<emphasis style="bold">
I'll--!!
</emphasis>
</paragraph>
</balloon>
<balloon speaker="cap" type="speech">
<paragraph>
It's
<emphasis style="bold">
ugly!
</emphasis>
It ought to be
<emphasis style="bold">
smashed
</emphasis>
to pieces!
<emphasis style="bold">
smashed! smashed!
</emphasis>
</paragraph>
</balloon>
</panel>
<panel n="3">
<panelDescription characters="cap" />
<caption>
<paragraph>
Before the hammer descends on cap, his shield
<emphasis style="bold">
demolishes
</emphasis>
the evil mechanism!
</paragraph>
</caption>
<soundEffect>
KRAK!
</soundEffect>
</panel>
<panel n="4">
<panelDescription characters="cap anon_man" />
<caption>
<paragraph>
The screaming suddenly
<emphasis style="bold">
stops--
</emphasis>
and, in the ensuing silence,
<emphasis style="bold">
both
</emphasis>
men sink
<emphasis style="bold">
slowly
</emphasis>
to the ground...
</paragraph>
</caption>
</panel>
<panel n="5">
<panelDescription characters="cap" />
<caption>
<paragraph>
For a seeming
<emphasis style="bold">
eternity
</emphasis>
Cap seems to drift in an ocean of darkness--and,
when he finally regains consciousness...
</paragraph>
</caption>
<balloon speaker="cap" type="thought">
<paragraph>
W-What
<emphasis style="bold">
happened--?
</emphasis>
I-I can't even remember passing out...
</paragraph>
</balloon>
</panel>
<panel n="6">
<panelDescription characters="cap anon_man" />
<balloon speaker="cap" type="thought">
<paragraph>
I feel as though I've been sitting through
<emphasis style="bold">
re-runs
</emphasis>
of
<emphasis style="bold">
old
</emphasis>
nightmares...
</paragraph>
<paragraph>
Must get my head together--this town is in
<emphasis style="bold">
trouble!
</emphasis>
</paragraph>
</balloon>
</panel>
</panelGroup>
</div>Many comics deviate radically from the simple structural example above. Following are examples of such complex or ambiguous structure which pose additional challenges for the development of a suitable XML vocabulary and for the encoding process. Below is an example of overlapping panels. The sequence of the panels remains clear, but the markup should nonetheless be able to indicate the fact that this overlapping exists.
In this next example, we again see overlapping panels, but the arrangement is more complex and the sequence more ambiguous. As I read it, the entire page is one large panel. The woman on the platform and the man at the bottom middle saying, "Mercy, Madame... Mercy! I tried! I tried!," coexist on the same plane in the same large panel. Overlayed on top of the large full-page panel are three smaller panels, with text balloons that exist outside the bounds of the panels.
In the next example, we see one full-page panel, but within this large panel are different narrative events or scenes. They are "panels" of a sort, but without clearly delineated borders. Should they be described with something like <panel border="none"> or should panel elements allow for something like scene child elements for encoding such a page? These are the type of issues raised as we move forward with the development of CBML and attempt to encode more structurally complex and experimental comics.
In this paper, I have just touched, in an introductory way, on the initial stages of development of CBML. I hope to have shown that CBML is an ambitious, interesting, and valuable project. The issues encountered and lessons learned in the development of CBML can be of great use to others developing XML vocabularies for describing complex documents, especially documents that describe sequential artwork and combinations of visual art and text. CBML remains in the early stages of development, and the hope is that after XML 2002, I will interest others in this project and develop a community of interested parties who will participate in the further development of CBML. The groundwork I have laid should serve as a useful starting point for development by a larger community of interested parties. I will also pursue placing CBML with an appropriate home in the form of a standards body such as Organization for the Advancement of Structured Information Standards (OASIS)http://www.oasis-open.org/.
Along with the development of the XML vocabulary for CBML, I am simultaneously working on a web-based prototype for searching the resulting CBML-encoded comic books and displaying the textual and image data and metadata. My plan is to have this prototype far enough along so I may demo it at XML 2002.
Given the early stages of development, I hope interested parties will regularly visit http://www.cbml.org/ and the conference web site http://www.xmlconference.org/ for updated versions of this paper, the CBML DTD and schema, the search and display prototype, and other news regarding the ongoing development of CBML and related projects.
[ Kirby ] Kirby, Jack (w,p) and Frank Giacoia (i). "The Madbomb Screamer in the Brain!" Captain America v1 #193 (Jan. 1976), Marvel Comics.
[ Steranko, Captain America 110 ] Lee, Stan (w), Jim Steranko (p), and Joe Sinnott (i). "No Longer Alone." Marvel Visionaries: Steranko. Ed. Joe Quesada. New York: Marvel Comics, 2002: 34-54.
![]() ![]() |
Design & Development by deepX Ltd. 2002 |