Bigquery Data Warehouse

Use SQL Google Cloud Bigquery to search and analyze preloaded omics metadata and integrate with external datasets.

Prerequisites

  • Using Google Cloud Bigquery requires a Google Cloud project.
  • Knowledge of basic SQL syntax, while not required, will be beneficial. Several primers of the SQL language are available online

Objectives

This section is intended to provide working knowledge of:

  • The general content of OmicIDX dataset in Google Cloud Bigquery
  • Bigquery Cloud Console usage and components
  • Basic SQL queries of OmicIDX datasets
  • Location of Bigquery documentation, including client libraries for programmatic access
  • The OmicIDX relational data model
  • How to save the results of an OmicIDX query for external downstream analysis

Outcomes

After working through this section, OmicIDX users will be able to:

  • Access preloaded OmicIDX data in Google Cloud Bigquery
  • Use the Google Cloud Bigquery Console to view OmicIDX data
  • Perform basic SQL queries of OmicIDX datasets
  • Locate Bigquery documentation
  • Describe the cost model and approaches to estimate and minimize costs
  • Export query results for further analysis

Getting set up to use OmicIDX

Establish Bigquery access to the OmicIDX dataset.

Start exploring OmicIDX on the Bigquery web console

Gain familiarity with the Bigquery cloud console when working with the OmicIDX dataset.

Query the OmicIDX Bigquery tables

Use standard SQL queries to explore omics metadata.

Export data and query results

Expand analysis, reporting, and data integration capabilities by exporting data or query results.

Advanced use cases

Advanced use cases.


Last modified January 29, 2020: Lots of additions to docs. (d526bc3)