|Composition of our corpus||Sources (to create your own corpus)|
The following is the number of talks and the number of words for each decade in the corpus:
The 11,000+ General Conference talks were taken from a number of sites online, which had highly-accurate versions of the talks. Not all of these sites are still available.
For users who are interested in creating their own corpus, the best source is probably scriptures.byu.edu (not associated with our corpus), which contains all 1426 talks from 1851-1886 (see list) and all 1823 talks from 1942-1970 (see list).
Two other sites (#1 and #2) also contain all talks from the Journal of Discourses (1850s-1880s). Many talks from the early 1900s through the 1960s can be found in the issues of the Improvement Era, which are available from Google Books and www.archive.org (see sample).
We use these texts under US Fair Use Law. More information...