[1] It includes language from the UK, the United States, Ireland, Australia, New Zealand, the Caribbean, Canada, India, Singapore, and South Africa.
[2] The sources are writings of all sorts, from "literary novels and specialist journals to everyday newspapers and magazines and from Hansard to the language of blogs, emails, and social media".
[2] This breadth contrasts with similar databases that sample only one specific kind of writing.
[2][3] The digital version of the Oxford English Corpus is formatted in XML and usually analysed with Sketch Engine software.
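As a rough illustration of what working with an XML-formatted corpus document can involve, the following Python sketch parses a small document and counts word forms. The element and attribute names (`doc`, `p`, `id`, `year`) and the sample text are invented for this example and are not the actual markup used by the Oxford English Corpus. The sketch also does not use Sketch Engine, only Python's standard library.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical document: the tag and attribute names here are assumptions
# made for illustration, not the real Oxford English Corpus schema.
SAMPLE = """<doc id="example-1" year="2006">
  <p>The cat sat on the mat.</p>
  <p>The dog sat too.</p>
</doc>"""


def word_counts(xml_text: str) -> Counter:
    """Count lowercased word forms across all <p> elements."""
    root = ET.fromstring(xml_text)
    counts = Counter()
    for para in root.iter("p"):
        # Split on whitespace, then trim surrounding punctuation.
        for token in (para.text or "").split():
            word = token.strip(".,;:!?\"'").lower()
            if word:
                counts[word] += 1
    return counts


counts = word_counts(SAMPLE)
print(counts.most_common(3))
```

A real corpus tool would use a proper tokeniser and lemmatiser rather than whitespace splitting, which is used here only to keep the sketch short.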
[5] Each document in the Oxford English Corpus is accompanied by metadata including: