|
UPU
Approves International Address Standard
April
22, 2004 - Alexandria, VA - The
Universal Postal Union (UPU) international
address standard, International Postal
Address Components and Templates, designated
as UPU S42, has been approved by the
UPU Standards Board in its meeting in
Brussels in February 2004. The S42 standard
was developed in the UPU POST*Code group
under the leadership of Guy Goudet and
in its technical committee led by Ruth
Jones of USPS.
The
standard is based upon a comprehensive
list of name and address elements that
originated in the work of the European
standardization organization CEN, which
has an agreement with the UPU to work
together on postal standards. These
elements define the smallest meaningful
parts of names and addresses. The set
of elements is extended as necessary
to cover additional situations, but
so far has been sufficient to represent
names and addresses in a number of non-European
countries, including the US, taking
account of some terminological differences.
This element list can be used to support
a unified approach to address database
design by providing a single set of
column headers.
A
second major concept within UPU S42
is the address template, which describes
unique combinations and orderings of
elements, or in more general terms,
address types, within a country. Templates
in UPU S42 are described both in a natural
language notation (NLT) and using the
Postal Address Template Description
Language (PATDL), which is defined as
a W3C XML schema. PATDL supports branching
based on field values, business rules,
decision tables, or other defined algorithms.
Templates refer to elements by their
names or by using codes assigned by
the UPU, and can also utilize externally
defined elements or code sets. Templates
for some countries, such as the United
Kingdom, are substantially more complex
than for others, such as the United
States. By using the templates, the
names and addresses can be stored in
a permanently parsed format and reconstituted
when necessary according to the requirements
of a specific situation.
The
third major part of the standard has
to do with rendition, or the production
of addresses on an output medium such
as an address label or a computer display
screen. Included in the standard is
a registry of rendition instructions,
which can be formatting rules for final
presentation, including abbreviation
and prioritization of data elements
when there are constraints on available
space, and upstream procedures designed
to govern the rendition process as a
whole, to decide among alternatives,
or to implement user preferences. A
simple example of rendition instructions
is the formatting of a postal code,
while a more complex example is the
movement of apartment information as
recommended by the USPS to the line
above the street address as an alternative
to abbreviating or shortening the street
name or omitting the apartment information.
It
turns out that within a template an
element such as the UPU "thoroughfare
qualifier" may have multiple occurrences
in different positions, such as pre-directionals
and post-directionals in US addresses,
and other elements such as the UPU "postcode"
need to be divided into parts in order
to be properly rendered, such as the
US ZIP+4 code with its hyphen after
the first five digits. These situations
have in common that they raise issues
of cardinality not dealt with in the
list of elements itself. The POST*Code
group agreed to define element sub-types
in order to handle the issue of cardinality
in both forms by making it possible
to represent any multiplicity or subdivision
of elements in the templates. These
element sub-types are explicitly defined
within the standard as the need for
them is recognized.
Through
surveys and discussions at the UPU,
it has been learned that at least twenty-five
countries have or are developing a delivery
point database. By this is meant a full
definition of the specific addresses
to which deliveries are made, without
resort to summaries, range files, or
other methods that cause loss of information
about whether a certain set of address
elements represents a complete and correct
address. Without such a database, the
technology that the UPU standard facilitates
can only distinguish between addresses
that might be valid and those that are
definitely invalid. But with a delivery
point database, the same technology
can distinguish between the addresses
that are valid and those that are invalid.
Current approaches to address maintenance
typically store composite address lines
and cannot always make those key distinctions
correctly because they require an additional
step of parsing the address elements
and mapping them to the database fields,
which can fail if there are extraneous,
misplaced or ambiguous elements.
The
standard needs development of more templates
before it can be implemented on a worldwide
basis. Currently fifteen countries have
agreed to participate and twelve have
provided mappings of elements and sample
addresses representing the known address
types that involve different orderings
of elements. There are two approaches
to deriving the NLT and PATDL templates
from the inputs provided, which may
be deployed separately or together.
One is to translate directly from the
rules provided by the country, and this
works if those rules are sufficiently
precise and complete. The other is to
generalize upward from the sample addresses
to find a PATDL template that can generate
all the renditions correctly, and this
works if the sample is robust enough.
Actually some combination of deductive
and inductive approaches is needed in
order to ensure that the template is
capable of accomplishing the objective
of properly formatting all valid addresses
for the country. Thereafter any template
may be further elaborated and customized
with options and user preferences. Both
the NLT and PATDL templates will be
published for each country by the UPU
if appropriate approvals have been obtained.
The
standard can be obtained from the UPU
only by ordering the complete set of
UPU standards, which costs 400 Swiss
francs for a one time distribution either
on hard copy or CD-ROM, or 750 Swiss
francs for an initial distribution and
a one year subscription to receive updates
that are normally issued quarterly.
The URL to start with is www.upu.int/publications/en/upu_technical_standards.html.
Understandably, some may prefer that
the standards be made available separately
and that they be directly downloadable,
but at this time neither of these options
is offered by the UPU.
This
reporter has participated directly in
the UPU POST*Code process and is also
the chair of an IDEAlliance work group
developing an implementation of the
UPU S42 standard within the broader
context of business mail. This project,
known as the Address
Data Interchange Specification (ADIS),
facilitates many additional user options
and extensions while supporting all
the elements of UPU S42 and the PATDL
template description language. While
UPU S42 only uses XML to describe templates,
ADIS also allows address data to be
described in XML. The upcoming 2004
version of ADIS is expected to offer
many useful extensions to the published
UPU S42 standard.
Available
within ADIS are the alternatives of
an XML or a relational database architecture,
use for preparation of a specific mailing
or for long term address management,
support for bulk mailings as well as
collections of individual mail pieces,
and the option for fully element based
storage or to carry out a transition
from line-by-line methodology. Features
supported include documentation needed
for qualification for postal rates,
multiple production groups to support
multiple countries with different address
formats within a single campaign, addition
of mail production variables to the
address block, and formatting of ink
jet messages used for marketing purposes.
Application areas that are enabled include
late address changes after the mailing
file has been presorted, support for
combined mailings of different levels
of granularity, and support for sender
identification and the capability to
request ancillary services through mail
piece barcodes.
When
addresses are originally collected,
they are often obtained in a line-by-line
format. It is unlikely that a Web site
would be designed with separate fields
holding several dozen name and address
elements. However, immediately upon
acquisition, ideally while direct feedback
is still possible, the UPU S42 technology
will help accomplish permanent parsing
and validation of addresses. It facilitates
address updating and matching to databases,
and when a mailing is planned, allows
for simpler and quicker de-duplication.
During mail production or rendering
on a computer display, the UPU S42 element
based technology expressed through a
rendition engine, such as ADIS specifies,
enables the final presentation of the
address to achieve the best possible
quality. Through this new methodology,
address quality can approach more closely
to the elusive goal of 100% correctness
that provides greater efficiency for
both postal services and their customers.
For
information on POST*Code or UPU S42
contact Guy Goudet at (+41 31 350 3156)
or Ruth Jones at (+1 901 681 4585).
For information on IDEAlliance, contact
David Steinhardt at (+1 703 837 1066).
For questions or comments on any subject
in this report, contact Joe Lubenow
at (+1 773 478 2249) or lubenow@msn.com.
###
Home
| Events
| Standards
| Membership
| News
| Resources
| About
|