Many documents can serve as primary sources for linguistic analysis. They include historical texts, news, language archives, corpus resources, and audio and video files.
Rosetta Project: The collection currently contains nearly 100,000 pages of material documenting over 2,500 languages, as well as a growing multimedia collection of modern and historical language recordings.