Working with text 1
Unstructured text
in Doing digital history
Abstract only
Log-in for full text

The first of two chapters on working with text, this chapter covers the difference between plain text formats and proprietary formats, the pattern-matching technique ‘regular expressions’, the command line as an interface for working with large amounts of text, particularly the grep command. All of the examples work on a specific historical text, a Post Office directory for late nineteenth-century London.

Doing digital history

A beginner’s guide to working with text as data


All Time Past Year Past 30 Days
Abstract Views 115 23 0
Full Text Views 11 5 0
PDF Downloads 7 5 0