![[x]](../../../icons/eks.png)
Association for Computers and the Humanities,
Association for Computational Linguistics, and
Association for Literary and Linguistic Computing.
1990.
Guidelines for the Encoding
and Interchange of Machine-Readable Texts (TEI P1).
Ed. C. M. Sperberg-McQueen and Lou Burnard.
Chicago, Oxford: Text Encoding Initiative, 1990.
![[x]](../../../icons/eks.png)
Association for Computers and the Humanities,
Association for Computational Linguistics, and
Association for Literary and Linguistic Computing.
1994.
Guidelines for Electronic Text Encoding
and Interchange (TEI P3).
Ed. C. M. Sperberg-McQueen and Lou Burnard.
Chicago, Oxford: Text Encoding Initiative, 1994.
![[x]](../../../icons/eks.png)
Huitfeldt, Claus, and C. M. Sperberg-McQueen.
2003.
TexMECS:
An experimental markup meta-language for complex documents
.
Working paper of the project Markup Languages for Complex
Documents (MLCD), University of Bergen.
January 2001, rev. October 2003.
Available on the Web at
http://decentius.aksis.uib.no/mlcd/2003/Papers/texmecs.html
![[x]](../../../icons/eks.png)
Jagadish, H. V.,
Laks V. S. Lakshmanan,
Monica Scannapieco,
Divesh Srivastava,
and
Nuwee Wiwatwattana.
2004.
Colorful XML: One hierarchy isn't enough
.
Proceedings of the 2004 ACM SIGMOD International
conference on management of data, Paris,
sponsored by the Association
for Computing Machinery Special Interest Group on Management of Data.
New York: ACM Press.
![[x]](../../../icons/eks.png)
Sperberg-McQueen, C. M., and Claus Huitfeldt.
2000.
GODDAG: A Data Structure for Overlapping Hierarchies
,
paper given at Digital Documents: Systems and Principles. 8th
International Conference on Digital Documents and Electronic
Publishing, DDEP 2000, 5th International Workshop on the Principles of
Digital Document Processing, PODDP 2000, Munich, Germany, September
13-15, 2000. Published in
DDEP-PODDP 2000, ed. P. King and E.V. Munson.
Lecture Notes in Computer Science 2023.
Berlin: Springer, 2004, pp. 139-160.
Available on the Web at
http://www.w3.org/People/cmsmcq/2000/poddp2000.html
![[x]](../../../icons/eks.png)
Sperberg-McQueen, C. M., and Claus Huitfeldt.
2008.
Containment and dominance in Goddag structures
,
paper given at the conference
Processing text-technological resources,
Bielefeld, March 13-15, 2008,
organized by the Zentrum für interdisziplinäre Forschung
der Universität Bielefeld.
Slides (but not full text) available on the Web at
http://www.w3.org/People/cmsmcq/2008/bielefeld/slides.html
![[x]](../../../icons/eks.png)
World Wide Web Consortium (W3C).
2008.
XQuery 1.0 and XPath 2.0 Full-Text 1.0
,
ed. Sihem Amer-Yahia et al.
W3C Candidate Recommendation 16 May 2008.
[Cambridge, Sophia-Antipolis, Tokyo]: W3C, 2007.
Available on the Web at
http://www.w3.org/TR/xpath-full-text-10/
Markup Discontinued
Discontinuity in TexMecs, Goddag structures, and rabbit/duck grammars
C. M. Sperberg-McQueen
Member of the technical staff
World Wide Web Consoritum / MIT
Claus Huitfeldt
Associate Professor (førsteamanuensis)
Department of Philophy, University of Bergen
Abstract
That the textual phenomena of interest for markup are not always
hierarchically arranged is well known and widely discussed. Less
frequently discussed is the fact that they are also not always
contiguous, so that the units of our analysis cannot always correspond
to single elements in the document. Various notations for
discontinuous elements exist,
but the mapping from those notations to data structures has not been
well analysed or understood. And as far as we know, there are
no standard mechanisms for validating discontinuous elements.
We propose a data structure (a modification of the Goddag structure) to better
handle discontinuous elements: we relax the rule that every
pair of elements where one contains the other be related by
a path of parent/child links. Parent/child links are then not an
automatic result of containment. We conclude with a brief
sketch of the issues involved in extending current validation
mechanisms to handle discontinuity.
Markup Discontinued
Discontinuity in TexMecs, Goddag structures, and rabbit/duck grammars
Balisage: The Markup Conference 2008
August 12 - 15, 2008
The materials listed below were provided by the speaker as supplements to a
presentation at Balisage. These materials may include the slides or visuals used in the
presentation; supplementary material, such as code samples or a demonstration application;
and/or the paper underlying the presentation (if it has not been provided in XML). These
materials have been zipped for easy download and are identified by a brief description of
the contents. The materials themselves are untouched
, that is, they
have not been tested or edited by Balisage: The Markup Conference or by Mulberry
Technologies, Inc. As such, they are included on this website AS IS
,
i.e., as provided by the speaker, with no warranties, express or otherwise, made by Balisage
or Mulberry.
Slides and Materials