Catalog Reference : ELRA-WC343
Scottish Corpus of Texts and Speech (SCOTS)
The SCOTS Corpus contains documents in Scottish Standard English and in several varieties of Scots. While Scottish Standard English has a standard written form, Scots does not, so the corpus contains a wide range of spelling variation. An Advanced Search System is currently offered to exploit the corpus's extensive sociolinguistic metadata: the user builds up a search profile by specifying sociolinguistic or textual criteria.
This corpus contains data in both Scottish Standard English and Scots.
The latest version of the corpus includes 936 documents and a total of 2,524,431 words. Genres include informal correspondence, prose fiction and non-fiction, poetry, religious texts and administrative/political texts. A number of new spoken texts (conversations and interviews) are also now available, presented as audio/video files with synchronised orthographic transcriptions.
With transcribed spoken documents, it has been necessary to decide upon conventions in order to make the transcriptions as consistent as possible. Accordingly, where words are clearly Scots forms, rather than Scottish Standard English, the Scots School Dictionary (eds. Iseabail MacLeod and Pauline Cairns, Scottish National Dictionary Association, 1999) has been used as a guide. Where the dictionary offers alternative spellings for a word, then the one closest to the speaker’s pronunciation has been selected.
Contents
written corpus
Number of languages: Monolingual
Language(s): English (Scotland)
Number of tokens: 2,524,431 words
Joint Copyright © 2008 ELRA & ELDA