Filters for content checking
- Can work either over single documents
- List all the things tagged as country,
city or email
- Report all acronym elements, noting which
ones appear without an expansion sibling the
first time it's used in the document
- Check metadata, bibliographies and other semi-structured
information sets
- Check any content tagging or tagging subject to tag abuse
- Or (more usefully?) over sets of documents
- Poll a group of gcapaper documents and report
the authors whose bio has no content (with
names/addresses)
then list the bios of the ones you have