Firestore to Google BigQuery

Data Integration & ETL

Automatically Export & Sync Firestore Data to Google BigQuery

Firestore
Google BigQuery
Full data sets
No instructions needed
No code
No complicated flow charts
No triggers
Set up in under 3 minutes
Flat pricing starting at $19 /month

Data sets

Uniform JSON Objects
Filterable
Varied JSON Objects
Filterable

Built for Synchronization

This service is designed to help anyone continuously export (i.e. "sync") bulk data from Firestore.

If you only need to export your data one time, check the website of Firestore to see if their service offers a one-time export feature natively. Sometimes a Data Source's native export feature may provide not as much data as its API does, in which case Flatly is the solution.

Additional Information

Flatly uses official API channels to provide an integration with Firestore.
Cloud Firestore is a flexible, scalable database for mobile, web, and server development from Firebase and Google Cloud.
Connection

There are no scripts, downloads or extensions required for this integration. Flatly uses Service Account authorization for Firestore. Bring your own Service Account key and then simply enter a path to your Collection.

Project Data versus Database Data

Flatly syncs Firestore Database data, namely the contents of Documents in specific Collections. It does not sync Project Data like Crashlytics or A/B Testing. Behind the scenes the service first converts Firestore Documents to JSON and then efficiently flattens that JSON into flat files for spreadsheets or storage, or newline JSON for BigQuery.


Data Volume

If you expect your Firestore Collection to contain more than 10 million cells worth of data, do not attempt to sync Firestore to Google Sheets, as Google Sheets is limited to 10 million cells. Instead, use any cloud storage or BigQuery.

Scopes

Flatly's access to Plaid and Plaid's downstream connected financial institutions is read-only access to text. It is limited to account data (words, numbers, dates). It does not include any money-movement, transfer capabilities or account holder profiles/identities.

Plaid alone interfaces directly with financial institutions, caches data from those institutions on Plaid infrastructure, and then makes the appropriate scoped subset of that data available to partners like Flatly using secure connections called client libraries.

Schema Simplicity

Flatly leverages intelligent schema generation that adapts to BigQuery's auto-detection protocols, handling data type inference and column ordering automatically so you never have to manually map fields.

Sanitized Data

Data hygiene is built directly into the ingestion process. Before a single row is written to BigQuery, Flatly validates your data against a generated reference schema. It proactively detects and strips out "schema violators" —inconsistent data types or malformed records that would typically cause a load job to crash. Flatly logs these exclusions in a detailed metadata file, ensuring your pipeline remains resilient and continuous while giving you full transparency into data quality issues.

Performance

Designed for scale and flexibility, the engine processes data in memory-optimized chunks to handle massive datasets efficiently. It offers powerful pre-load controls, allowing you to filter rows based on custom logic (e.g., date ranges, specific values) and allowlist specific keys. This ensures that you only sync high-value data, optimizing your BigQuery storage costs and query performance. Everything is secured via Service Account authentication, providing a stable, production-grade connection for your most critical analytics dashboards.

Looker reads tables from Google BigQuery.

File Formats

NDJSON

Integration Flow

Setup

Step 1

Select Firestore from the Data Source dropdown and Google BigQuery from the Data Destination dropdown.

Step 2

Authorize Firestore with your credentials using the Service Account flow.

Step 3

Authorize Google BigQuery with your credentials using the Service Account flow.

Step 4

Select your Data Set from the 2 Data Sets available using the dropdown.

Step 5

Click Flatten to sync your data.

Walkthrough Video

Extended Data Sources

Get started with your data pipelines

Connect your data with our turnkey data integration solution so you can focus on running your business.