Word Statistics Links

For my What’s New with What’s New presentation for the Boston Regional Library System:

Book Information

  • Amazon: The Spirit Catches You and You Fall Down — The Amazon page for this book is an example of some of the ways that Amazon is playing around with text analysis
  • LibraryThing — This social cataloging site is doing some very interesting work analyzing and comparing its members’ collection data, and collecting and organizing data through their Common Knowledge fielded wiki [Example: John Steinbeck]

Tagging

Tagging is the primary method of organization for many social media sites, including Flickr, LibraryThing, Delicious, and many others. Tags are keywords users assign to their own items, which can also be used to search across the whole system. Because of the lack of a controlled vocabulary or standard cataloging rules, tagging is an imperfect system by design, but the use of natural vocabulary is quick, flexible and powerful.

Collections of tags can be presented in any format, but are often presented as tag clouds. Here are a couple of typical examples:
Flickr | LibraryThing | Delicious

Similar clouds can be made by analyzing the frequency of words used in any piece of text, like the example below.

This is a tag cloud made by uploading the text of the Declaration of Independence to the TagCrowd website

created at TagCrowd.com

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>