paul perry .   blog   |   about   |   notes

Web Annotations

While web annotations are not a new research topic and a few companies wanting to capitalize on this space have already come and gone, enough has changed in the web infrastructure to reconsider this area. Better browsers, new standards, and a set of prototype systems make this a perfect time to consider a second generation of annotation systems.

The Concept

Annotations are a broadly useful mechanism that can support a number of document and database management applications:
  • provide a trace of use
  • third party commentary
  • information sharing
  • information filtering
  • semantic labeling of document content
  • enhanced search

But it continues to be hard to publish to the web. If better tools are provided to information consumers then readers will be able to add commentary, make new connections, interpret content, and otherwise promote an accretion of both structure and content on the web. This process will add new semantics to the web and this new information will be the source for new approaches to searching and filtering of information.

The progression from the collection of annotations of free-text as stupid assertions to 1st order propositional assertions with more semantic binding will evolve the web from where it is today to the vision of a semantic web. Key to this transition is the deployment of annotation servers to particular market segments that have strong demand for more structured queries of web based information.

These market segments are likely to be in knowledge intensive industries such as the consulting, legal, and financial markets.

Past Research and Implementations

As far back as the inception of Memex [Bush45] researchers have postulated the many features and benefits of capturing and incorporating note taking in an automated fashion.

Much human factors research has gone into understanding our current use of annotations [Marsh97], and evaluating the useful features of an annotation system [Ovsi01].

Annotations have even been implemented for the web in Mosaic 1.2 ( and here ), or Mosaic 2.6 beta 1 [Brav], CritLlink [Yee98] and ThirdVoice.

However, the problems with mass adoption have been in browser support for editing a page, overlaying information, and sourcing of markup from multiple servers on the same page. Further, lack of support for bi-directional links and links to parts of a resource to help support links outside of the referenced resources complicated the architectures of early annotation services, forcing a slew of proxy server based approaches with the inherent weaknesses of this design [Vasu99].

But now we have IE5.5, with more than 85% penetration, with some support for XPointer (in MSXML), an RDF based implementation for annotation servers from the Annotea project in the new clients for Annotea.

Companies

There are a pile of companies in and around this space, some of these are squarely in the annotation space (e-quill), others are considered browser accesories, messaging companies (ThirdVoice, Odigo), and context services (Alexa).

The key ones in my mind are:

Other players include:

A bunch have already fallen off the map:

And a number of interesting domains are taken, but not up and running yet:

Reviewing the companies in the best position:

EQuill develops Web-based solutions that enable businesses to communicate and collaborate more effectively with their visual markup tool. You could say they offer a hosted workflow management suite. It's an excellent tool, but the resulting captured documents on their Markup Server don't seem well suited for complex search.

uTOK , the Users' Tree of Knowledge, developed a Web browsing companion that allows users to share knowledge through virtual notes.

Also, Noteclip has a key piece of client technology that seems useful.

Sentius's RichLink IDC paper

The Architecture

Since the release of editing, DHTML, and DOM2 support within IE5.5 and the recent release of the W3C Annotea server in conjunction with all the supporting standards (XPath, RDF, WebDAV, and UDDI), it should now be easier to build and deploy standards based annotation servers within enterprises.

However, we are still early in this space. Companies will have to find their market and differentiate themselves. Technically they will have to solve design problems to meet performance and scalability.

There will be a number of architectual choices that will be determined by the domain that they are applied to. For example:

  1. What context do you capture at annotation creation?
  2. How much do you capture (document and annotation)?
  3. How do you combine annotations?
  4. What kind of search queries do you need to support?
  5. How do you manage authentication and access control permissions?
  6. etc.

An interesting technology to scale annotiation servers is Guha's RDFdb IFF the server is based on RDF. In certain domains however, users will be able to use much more structured ontologies implemented on regular DBMS servers with an XML Schema.

The deployment of annotation servers for the public web still need to solve issues of discovery, scale, and trust.


July 2001 update.

Microsoft's announcement of SmartTags, had the rest of the world up in arms, forcing MS to back off and cacel its release when it must have noticed these meta tags showing up everywhere:

 <meta name="MSSmartTagsPreventParsing" content="TRUE"> 

Microsoft has been aware of the Annotation space since 1997, and has an Annotation Server team in MS Office, but the lack of marketing subtlety of this 800 lbs. gorilla continues to engender a lack of trust.

This leaves the window open on the client, server, and service space. Check TopText.

Turns out that web logs, or blogs, are really the ones annotating everything out there, and now Blogdex is starting to keep track of the top links. Speedle does the same, but is centered around email. None of them really enable the annotation user interface envisioned, and maybe that is irrelevant. More later ...

People

References

  1. Abrams D., Baecker R., Chignell M. Information Archiving with Bookmarks : Personal Web Space Construction and Organization CHI'98, 18-23 Avril 1998, Los Angeles, CA, USA
  2. Bieber, M, Vitali, F., Ashman, H., Balasubramanian, V., and Oinas-Kukkonen, H., Fourth generation hypermedia: some missing links for the World Wide Web , International Journal of Human-Computer Studies 47, 1997, 31-65.
  3. [Brav] Braverman, et al., NCSA Mosaic Group Annotations.
  4. [Bush45] Bush, Vannevar. As We May Think, Atlantic Monthly, 1945.
  5. Causton, Laurie. Web-based tools for document annotation, 1999.
  6. Davis, Jim. Annotation Systems , 1995.
  7. Davis, Jim. CoNote 1994.
  8. Davis J., Huttenlocher D. Shared Annotation for Cooperative Learning , Prog of CSCL'95, pp. 84-88.
  9. Denoue, Laurent, Laurence Vignollet, New ways of using Web annotations, WWW9 poster.
  10. [DeRose99] DeRose, Steven J. 1999. " XML Linking". ESHH.
  11. [DOM2] "W3C Document Object Model Level 2".
  12. [Gins98a] "Annotate: A Web-based Knowledge Management Support System for Document Collections". PhD. NYU 1998.
  13. Gramlich, W.C. Public Annotation Systems 1994.
  14. Heck Rachel M., Sarah M. Luebke, Chad H. Obermark A Survey of Web Annotation Systems Grinnell College.
  15. Heck, Rachel M., and Sarah M. Luebke HyperPass: An Annotation System for the Classroom Grinell College.
  16. Holtman, Koen. Futplex Homepage 1998.
  17. Holtman, Koen. The Futplex System, Proceedings of the ERCIM workshop on CSCW and the Web, 1996.
  18. Hummes J., Karsenty A., Merialdo B., Active Annotations of Web Pages
  19. Khoshafian S., Buckiewicz M., Groupware & Workflow. InterEditions, 1998.
  20. Koivunen, Marja-Riitta, Dan Brickley, Jos Kahan, Eric Prud'Hommeaux, Ralph R. Swick The W3C Collaborative Web Annotation Project ... or how to have fun while building an RDF infrastructure 2000.
  21. Laliberte, D., and Braverman, A., A Protocol for Scalable Group and Public Annotations, NCSA Technical Report, 1997.
  22. Luebke, et al. Annotating the World-Wide Web. Proceeding of EdMedia 1999: World Conference on Educational Multimedia, Hypermedia and Telecommunications (pp.409-414). Seattle, WA. 1998.
  23. [Luke96] Luke, S., Spector L., Rager D. Ontology-Based Knowledge Discovery on the World-Wide Web" , Proceedings of the Workshop on Internet-based Information Systems, AAAI-96 (Portland, Oregon), 1996.
  24. [Marsh97]Marshall , C. C. Annotation: from Paper Books to the Digital Library, Proceedings of Digital Libraries '97. New York: ACM Press 1997.
  25. Marshall , C. C. The Future of Annotation in a Digital (Paper) World. In Successes and Failures of Digital Libraries (Harum and Twidale, eds). Urbana-Champaign: University of Illinois, 2000, pp. 97-117.
  26. Marshall , C. C. Toward an Ecology of Hypertext Annotation , Proceedings of Hypertext 98. New York: ACM Press, 1998b.
  27. Maurer, H. , ed. HyperWave: The Next Generation Web Solution, Addison Wesley, 1996.
  28. Mylonas, E., and Heath, S., Hypertext from the Data Point of View: Paths and Links in the Perseus Project in Hypertexts: Concepts, systems, and applications, A. Rizk, N. Streitz and J. Andre (eds). Cambridge University Press, 1990.
  29. O'Hara, K., Towards a Typology of Reading Goals , Rank Xerox Technical Report EPC-1996-107, 1996.
  30. [Ovsi01] Ovsiannikov I., M. Arbib, and T. McNeill. Annotation Technology (AT) and a System for Online Annotations on the Web, submitted to IEEE Transactions in Knowledge and Data Engineering.
  31. [Ovsi99]Ovsiannikov I., M. Arbib, T. McNeill. Annotation Technology. Int. J. Human-Computer Studies (1999) 50.
  32. [Ovsi98] Ovsiannikov, I., M. Arbib, and T. McNeill. Annotation Software System Design, 1998
  33. Phelps, T. A. and Wilensky, R., Multivalent Annotations ,Proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries, 1997.
  34. Presber, Alexander Knowledge Trees - Web Annotation Scheme
  35. Rhodesm, Bradley J. , Margin Notes: Building a Contextually Aware Associative Memory , in The Proceedings of the International Conference on Intelligent User Interfaces (IUI '00), New Orleans, LA, January 9-12, 2000.
  36. Roscheisen, M., Mogensen, M., Winograd, T., Shared Web Annotations As A Platform for Third-Party Value-Added Information Providers: Architecture, Protocols, and Usage Examples ,Technical Report CSDTR/DLTR Stanford.
  37. Beyond Browsing: Shared Comments, SOAPS, Trails, and On-line Communities, presented at the Third Intenational WWW Conference, 1995.
  38. Roscheisen M., Mogensen C., ComMentor: Scalable Architecture for Shared WWW Annotations as a Platform for Value-Added Providers , Stanford University Technical Report, Palo Alto, CA, USA
  39. Schickler A., Mazer S., Brooks C., Pan-Browser Support for Annotations and Other Meta-Information on the World Wide Web Computer Networks and ISDN Systems , Volume 28, Issues 7-11, p. 1063.
  40. Schilit, B. N., Golovchinsky, G., Price, M. N., Beyond Paper: Supporting Active Reading with Free Form Digital Ink Annotations , Proceedings of ACM CHI 98, 1998.
  41. Sperberg-McQueen, Michael and Lou Burnard. 1999. "6. Simple Links and Cross-References" and "14. Linking, Segmentation, and Alignment", in Guidelines for Electronic Text Encoding and Interchange (TEI P3).
  42. Vasudevan V., Palmer M. Web Annotations: Promises and Pitfalls of Current Web Infrastructure ; HICSS, 1999
  43. [Vasu99] Vasudevan V., Palmer, M. , On Web Annotations: Promises and Pitfalls of Current Web Infrastructure *, 32nd Hawaii International Conference on Systems Sciences, Maui, Hawaii 1999.
  44. [Vasu98] Vasudevan V., Palmer M. Web Annotation Service Object Services and Consulting, Inc., September 15, 1998
  45. [WEBDAV] HTTP Extensions for Distributed Authoring -- WEBDAV RFC 2518, Standards Track, Proposed Standard. February, 1999.
  46. [XLINK]W3C XML Pointer, XML Base and XML Linking
  47. Yan T.W., Jacobsen M., Garcia-Molina H., Dayal U., From User Access Patterns to Dynamic Hypertext Linking Fifth International World Wide Web Conference, May 6-10, 1996, Paris, France.
  48. [Yee98]Yee, Ka-Ping. CritLink: Better Hyperlinks for the WWW , 1998.
  49. [Yee97]Yee, Ka-Ping. M1+ Implementation Proposal , 1997.
  50. Zellweger, P. T., Chang, B., and Mackinlay , J. D., Fluid Links for Informed and Incremental Link Transactions, Proceedings of the Ninth ACM Conference on Hypertext and Hypermedia, 1998. Pp. 50-57.
  51. Zohar, Ron, Web Annotation - An Overview; Dept. of Electrical Engineering, Israel Institute of Technology, 1999