Getting Started With Analytics 2.0

Backfill User Activity History

Beta

The Historical User Activity feature is currently in beta for the Analytics tool.

Background

Existing data pipelines will continue running normally, but will only import new daily activity. To import your full historical backlog into your target database, you must perform this one-time manual backfill.

Things to Consider

Required User Permissions:
- Data Connector Access: A verified connection to your destination database, such as Snowflake, SQL Server, or BigQuery.
Prerequisites:
- CLI Version: Download or upgrade to the latest CLI version.
- Time: The initial historical backfill can take anywhere from minutes to several hours, depending on your tenant size and account age. Schedule this operation during an off-peak window.
- Compute Sizing: Temporarily scale up your data warehouse or compute instance for the initial load, then scale back down when finished. For example, upgrade a Snowflake warehouse from X-Small to Medium/Large.
- Storage Requirements: Ensure the host machine has free disk space of at least twice the compressed size of the target table, because partition files are downloaded locally before being loaded into the database.
- File Limits (macOS/Linux): Raise the shell file descriptor limit in your terminal before running the catch-up script by executing: ulimit -n 65536.
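
The storage and file-limit prerequisites above can be checked before starting the backfill. The following is a minimal sketch, not part of the Analytics Cloud Connector CLI: the function names are hypothetical, the compressed table size is a value you must supply yourself, and the file-descriptor check reads the limit of the Python process it runs in, which is inherited from the shell only when launched from that shell.

```python
import resource  # Unix-only module (macOS/Linux), matching the File Limits prerequisite
import shutil


def required_free_bytes(compressed_table_bytes: int) -> int:
    """Return the free space needed: twice the compressed table size."""
    return 2 * compressed_table_bytes


def has_enough_disk(download_dir: str, compressed_table_bytes: int) -> bool:
    """Check whether the filesystem holding download_dir has enough free space."""
    free = shutil.disk_usage(download_dir).free
    return free >= required_free_bytes(compressed_table_bytes)


def fd_limit_ok(required: int = 65536) -> bool:
    """Check the soft open-file limit of the current process against `required`."""
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    return soft == resource.RLIM_INFINITY or soft >= required


if __name__ == "__main__":
    # Hypothetical value: replace with the compressed size of your target table.
    compressed_size = 5 * 1024**3  # 5 GiB
    print("disk ok:", has_enough_disk(".", compressed_size))
    print("fd limit ok:", fd_limit_ok())
```

If either check reports False, free up disk space or raise the limit with ulimit -n 65536 in the same terminal session before running the script.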

Steps

Log in to your target data destination interface.
Clear the existing watermark for the User Activity table by executing the corresponding command for your environment:
- Snowflake
  - DELETE FROM <YOUR_DB>.<YOUR_SCHEMA>.PROCESSING_LOG WHERE TABLE_NAME = 'user_activity';
- SQL Server
  - DELETE FROM [<your_schema>].[processing_log] WHERE table_name = 'user_activity';
- BigQuery
  - DELETE FROM `<project>.<dataset>.processing_log` WHERE table_name = 'user_activity';
  Note: For cloud storage locations such as ADLS, S3, or Fabric Lakehouse, the processing_log file is stored alongside your data within your managed area. If you cannot locate the file path, check the setup guide for your specific data connector or contact Procore Support for assistance.
Run your Analytics Cloud Connector CLI script via your terminal or scheduling platform:
- Example (replace with the invocation your scheduler uses): python ds_to_snowflake.py --config config.yaml
Monitor your execution console logs. The CLI will indicate destination-aware evaluation by logging:
- APPEND partitioned load for user_activity: N event_date partition(s) in share, M already in Snowflake, K to load.
Confirm that K matches the historical block of missing days you intend to populate.
Once the backfill completes, verify the earliest available records using the following row count query:
- SELECT MIN(event_date), MAX(event_date), COUNT(*) FROM <YOUR_DB>.<YOUR_SCHEMA>.USER_ACTIVITY;
  Note: This row count query is an example for Snowflake. You can modify this query based on your destination database.
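
If you capture the console output in a scheduler, the partition counts in the log line from the monitoring step can be extracted programmatically. The sketch below assumes that N, M, and K appear as plain integers and that the destination name varies by connector. The pattern is derived only from the log format shown above, and the class and function names, as well as the sample numbers, are hypothetical.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Pattern built from the log format in the steps above (assumed, not an official spec).
LOG_PATTERN = re.compile(
    r"APPEND partitioned load for (?P<table>\w+): "
    r"(?P<in_share>\d+) event_date partition\(s\) in share, "
    r"(?P<already>\d+) already in (?P<destination>.+?), "
    r"(?P<to_load>\d+) to load"
)


@dataclass
class PartitionSummary:
    table: str
    in_share: int        # N: partitions available in the share
    already_loaded: int  # M: partitions already in the destination
    destination: str
    to_load: int         # K: partitions this run will load


def parse_partition_log(line: str) -> Optional[PartitionSummary]:
    """Return the counts from a matching log line, or None if it does not match."""
    match = LOG_PATTERN.search(line)
    if match is None:
        return None
    return PartitionSummary(
        table=match["table"],
        in_share=int(match["in_share"]),
        already_loaded=int(match["already"]),
        destination=match["destination"],
        to_load=int(match["to_load"]),
    )


if __name__ == "__main__":
    # Made-up sample line for illustration only.
    sample = (
        "APPEND partitioned load for user_activity: 400 event_date "
        "partition(s) in share, 30 already in Snowflake, 370 to load."
    )
    print(parse_partition_log(sample))
```

Comparing the parsed to_load value with the number of missing days you expect gives an early signal before the load runs to completion.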

Note

The MIN(event_date) value will now reflect your account's earliest global historical user activity rather than a rolling 30-day window.
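
One way to interpret the verification query is to compare the span between MIN(event_date) and MAX(event_date) with the 30-day figure mentioned in the note. This is a rough sketch with hypothetical function names. It assumes you have already fetched the two dates from your destination database, and it only shows that the data reaches further back than 30 days, not that every day in between is present.

```python
from datetime import date


def covered_days(min_event_date: date, max_event_date: date) -> int:
    """Number of calendar days from the earliest to the latest event_date, inclusive."""
    return (max_event_date - min_event_date).days + 1


def extends_beyond_window(
    min_event_date: date, max_event_date: date, window_days: int = 30
) -> bool:
    """True when the date range spans more days than the rolling window."""
    return covered_days(min_event_date, max_event_date) > window_days


if __name__ == "__main__":
    # Made-up dates for illustration; use the values returned by your query.
    earliest, latest = date(2024, 1, 1), date(2024, 3, 1)
    print(covered_days(earliest, latest), extends_beyond_window(earliest, latest))
```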
