Skip to Main Content
Libraries
askus Ask us
 

UVic Libraries Data Services

Data Services at UVic Libraries

UVic Libraries manages both licensed and open data resources. We also manage data and statistics packaged specifically for researchers as well as data independently produced by colleagues during the research process. Both types of data can be public or restricted.

If you're looking for historical data, remember that our Special Collections & University Archives steward "data" from generations of researchers across disciplines. Contact them directly to see what is available. Example: environmental & oceanic archives, field notes, etc.

Abacus (Restricted to UVic Users)

Abacus holds UVic Libraries' collection of licensed datasets, including public use microdata (PUMFs) from StatCan censuses, other social and health surveys, public opinion polls, and spatial data for GIS. Access is restricted to UVic users.

Datasets in Abacus include:

  • Statistics Canada's Data Liberation Initiative (DLI)
  • Statistics Canada Open Data, including Public Use Microdata Files (PUMFs)
  • Spatial data from DMTISpatial
  • Inter-University Consortium for Political and Social Research (ICPSR) 
  • Postal Code Conversion Files

Research Data

Borealis (UVic Dataverse Collection)

Borealis, the Canadian Dataverse Repository is a national data repository for research data. The service, supported by UVic Libraries, is free for UVic researchers to deposit their datasets, which are registered with DOIs and are stored in a secure environment on Canadian servers. Researchers can choose to make their datasets available to the public, to specific individuals, or to keep it private.

With Borealis, researchers can search across research data from over 65 Canadian universities.

Lunaris

Lunaris provides a single point of search for research data held in Canadian data repositories, including academic institutions, departments at all levels of government, and research organizations. There are over 80,000 datasets from over 100 Canadian repositories and data collections currently indexed by Lunaris.

UVic Research Data Centre

For access to confidential microdata from Statistics Canada census and surveys, contact the UVic Research Data Centre.


The UVic RDC provides access, for approved projects, to a growing variety of Statistics Canada confidential microdata household, population and workplace files. The microdata used by researchers come primarily from Statistics Canada Survey Master files. Increasingly, the Research Data Centres (RDCs) are repositories of administrative records from a variety of sources including tax, employment insurance, social assistance, and hospitalization records.

The UVic Libraries collects hundreds of websites as part of its web archiving efforts using Archive-It. UVic Libraries can help researchers access a variety of data related to these collections via the to Archive-It's Research Services, including:

WARC

WARC and their predecessor ARC files are the files into which data crawled using Archive-It is stored. Each file may contain multiple digital objects, including HTML, images, and videos. (Note that collection data can consist of both WARC and ARC files depending upon when they were archived through our service. Throughout these guides, the term “WARC files” refers to both WARC and ARC files.)

WAT

WAT stands for Web Archive Transformation, and are composed of key metadata such as provenance/capture information, essential text and link data, and other information.  They are extracted from WARCs for every resource; because WAT files map one-to-one to WARC files, a collection's WARC files will have corresponding WAT files. WAT formats metadata into JavaScript Object Notation (JSON). The benefit is WATs are around 5%-20% the size of corresponding WARCs.

LGA

Longitudinal Graph Analysis files are archival web graph files that include a complete list of what URIs link to what URIs, along with a timestamp, from a collection’s origin through present.  They are ~1% the size of a collection's aggregate WARC files, and deliver as a ZIP container of two files:

ID-Map:

  • Contains one line per each URL in a collection and assigns a UID (unique identifier) to each URL.
  • Each line contains a JSON object with three fields: The URL's UID ("id"), the URL ("url") and the URL in SURT form ("surt_url")

ID-Graph:

  • Each line contains a JSON object with three fields: The URL's UID ("id"), the timestamp associated with the capture of this URL ("timestamp"), and the set of the UIDs of the URLs linked to by this URL at that given timestamp ("outlink_ids")

WANE

Web Archive Named Entities are files that use named-entity recognition tools to generate a list of all the people, places, and organizations mentioned in each URI in a web archive, with a timestamp of when the URI was captured.  The purpose is to link people, places, and organizations to time.  A WANE dataset is generated using the Stanford Named Entity Recognizer software (http://nlp.stanford.edu/software/CRF-NER.shtml) to extract named entities from each textual resource in a collection.  The analyzer uses an English model 3-class classifier to extract names that correspond to recognized Persons, Organizations, and Locations.  WANE files are less than 1% the size of their corresponding WARC files, and are structured as a JSON object per line: URL ("url"), timestamp ("timestamp"), content digest ("digest") and the named entities ("named_entities") containing data arrays of "persons", "organizations", and "locations".

Please contact Corey Davis for more information. 

UVic catalogues or has access to thousands of data sources. To specifically search for datasets in "Library Search" (Primo):

 

1. Enter a search term in the search box as you would for any other resource. 

 

2. Once you are directed to the results page, use the "Refine Results" filter on the left-hand side of the page. Go to "Content Type" and then "Show More"

 

3. Now choose "Datasets" and then click "Apply Filters" (green button)

 

4. You will now see the datasets that have been added to our catalogue (note: not all datasets are catalogued). 

Databases by Subject

 

See "Stats and Data for Business" guide for all resources

IBIS World

Comprehensive collection of Industry Market Research and Industry Risk Ratings.

Passport

Global market analysis software platform, which analyses the industry in countries around the world.

Statista

Global market and consumer statistics on over 80,000 topics from more than 22,500 sources.

Trade Analyzer

Canadian imports, exports, and world trade data. Each table contains historical timeseries data

Conference Board of Canada

Research reports and data related to Corporate Social Responsibility, Economic Trends, Industrial Trends and Public Policy and other topics. 

 

Canadian Institute for Health Information (CIHI)

CIHI collects comparable, pan-Canadian data on different aspects of the health system. View a summary of all the data available by year and jurisdictional coverage (XLSX).

PopData BC

Population Data BC (PopData) provides academic researchers with access to a comprehensive collection of population health data, including health services, education, workplace and environmental data. These data sets include longitudinal, person-specific, de-identified data on BC's 5 million residents. Access is provided by following a data request process.

BC CDC

BCCDC's data dashboards and reports provide statistics on diseases in BC and on the health of our populations and communities. 

Statistics Canada Public Use Microdata Files (PUMFs)

Access to all microdata and documentation from the collection of available Statistics Canada public use microdata files (PUMF). Public use microdata files contain anonymized, non-aggregated data. Using statistical software, the end user can group and manipulate data variables in these files to suit data and research requirements.

Access via Abacus   Search products on StatCan website

Statistics Canada RealTime Remote Access

Real Time Remote Access (RTRA) is an online tabulation tool allowing subscribers to run SAS programs in real time to extract results from masterfile subsets in the form of tables. RTRA system data users do not gain direct access to the microdata and cannot view the content of the microdata file. RTRA data users can calculate frequencies, means, percentiles, percent distribution, proportions, ratios and shares.

CANSIM @ CHASS

CANSIM (Canadian Socio-Economic Information Management System) is the Statistics Canada's computerized database of time series covering a wide variety of social and economic aspects of Canadian life. The use of CANSIM @ CHASS is restricted to current faculty, staff, and students.

Canadian Census Analyzer

The Canadian Census Analyser provides access to select Canadian demographic data from Statistics Canada's Census of Canada.  Included are census aggregated profile tables and census microdata.  This tool allows users to subset files and export data in varied formats including html, text, spreadsheet, and to statistical software such SAS and SPSS. Coverage: 1961 to 2021 censuses. The use of Canadian Census Analyzer @ CHASS is restricted to current faculty, staff, and students.

BC Data Catalogue

The BC Data Catalogue provides the easiest access to government's data holdings, as well as applications and web services. Thousands of the datasets discoverable in the Catalogue are available under the Open Government License - British Columbia.

Ipsos Canadian Public Affairs Database

The Ipsos Canadian Public Affairs Dataverse is a repository of over 60 Ipsos Canada surveys that shed light on Canadian elections, culture, politics, and society. All data is open access, supported by the Laurier Institute for the Study of Public Opinion and Policy

ICPSR

Data archive of more than 250,000 files of research in the social and behavioral sciences. ICPSR collaborates with a number of funders, including U.S. statistical agencies and foundations.

See "Geospatial Data" guide for all resources

GIS

DMTI CanMap Content Suite

CanMap Content Suite contains over 100 unique and rich content layers. Each layer has a unique file and layer name with associated definitions, descriptions, attribution and metadata. All layers, with a few exceptions, are vector data consisting of polygon, polyline, or point geometry representation.

ICI Society Data Sharing Initiative

Dataset includes:

  • ParcelMap BC (PMBC): a complete spatial representation of cadastre in BC
  • ICI Cadastre: cadastral data maintained and provided by Local Government
  • Address: a central and authoritative civic address registry for BC
  • Assessment: data maintained and contributed by BC Assessment
  • Infrastructure: data maintained and contributed by major Utility organizations operating in BC

Remote Sensing

UVic Libraries

Greater Victoria

Satellite, air photo and other remote sensing images and data world

Maps

Our Maps collection contains over 68,000 paper maps on many themes/topics, including biology, culture, economic, geology, history, hydrography, military, streets, social, topography, and transportation.

What are Arts & Humanities Data? 

Arts & Humanities data are as complex and varied as the disciplines that comprise these subjects. Below are three pieces that are useful in thinking about "data" in the Arts and Humanities

 

Our Data Management for Humanists website brings together training materials from recent SSHRC-funded workshops held across Canada to build foundational skills in research data management (RDM) for researchers in the Humanities.

Visual Arts

Books 

Special Collections & University Archives

Special Collections & University Archives contain resources from generations of UVic & local researchers; these include a seaweed inventory project (Alan Austin Fonds); underwater salmon research; B.C. Environmental associations (Mallard Fonds); Transgender archives; various scientific field notes; and more. Contact Special Collections or University Archives to learn more.  

Other Data LibGuides

How do you find "Data" at UVic?


There is a difference between data that can be used FOR research, and data that are produced FROM the research process. Knowing what kind of data you are looking for ("for" or "during") will help you locate the resources you need. 

Data FOR Research (Statistics, Open Data)

  • Many organizations produce data that can be used in the course of scholarship; these may be governmental agencies, health agencies, and corporations. These organizations produce data that you may use to construct arguments in research. These data can be licensed or made openly available on trusted websites; they tend to be peer-reviewed or simply have authority based on their source. See the links below to find these types of data.

Data FROM Research (Research Data)

  • Data resulting FROM research and scholarship, including the data you produce that underlies your publications and funded research projects, are shared through research data repositories typically hosted at home institutions. UVic's Research Data Repository is called UVic Dataverse (Borealis). These data are shared for reuse and support reproducibility and transparency in the research process.

Special Collections & University Archives

Special Collections & University Archives contain resources from generations of UVic & local researchers; these include a seaweed inventory project (Alan Austin Fonds); underwater salmon research; B.C. Environmental associations (Mallard Fonds); various scientific field notes; and more. Contact Special Collections or University Archives to learn more.  

Creative Commons License
This work by The University of Victoria Libraries is licensed under a Creative Commons Attribution 4.0 International License unless otherwise indicated when material has been used from other sources.