What is Bookworm arXiv?

Bookworm arXiv demonstrates a new way of interacting with over 700,000 scientific articles published on arXiv since 1992. The Harvard Cultural Observatory previously collaborated with Google Books on the Google ngrams viewer. Because Bookworm uses openly circulated preprints, it lets you build queries and comparisons across corpora and gives you direct access to the original texts.

What texts does this use?

This site builds on the amazing work of arXiv.org. ArXiv provides open access to over 700,000 e-prints in Physics, Mathematics, Computer Science, Quantitative Biology, Quantitative Finance and Statistics. When you build a corpus, the construction box shows exactly how many texts you are searching.

How does it work?

You can compare different words in the same collection of articles, the same word across multiple collections, or a combination of the two. Enter your search word in the text box and click 'Edit' to change the article collection. Bookworm color-codes your searches as you enter them. Press 'Return' to create a new graph using your options.

The graph displays results: click on a point to see the articles that best match your search terms for that series and month. You can read the article at arXiv by clicking 'Read'.

Advanced Options

Disclaimers and Acknowledgements