The following are some frequently asked questions about Data Connection.
For general information, view our Data Connection documentation.
SNAPSHOT
transactionUnknown
A build was scheduled to run at a given time, but it did not attempt to run.
To troubleshoot, perform the following steps:
Is there another running sync for this dataset and branch? Verify that no other job is running at the same time as it is not possible to run two syncs on the same dataset and branch simultaneously.
Has the schedule been paused? Verify this schedule is not paused on the schedule overview page for this dataset. You can access this page via the Edit in Scheduler view of the sync or in Dataset Preview's Manage Schedules option.
Was the agent of this sync disabled? Navigate to the agent associated with the source of this sync, and verify the agent is not disabled.
SNAPSHOT
transactionA sync ran but led to duplicate rows.
APPEND
type sync instead of a Snapshot
.last_upd_in_appl_ts
is the column that will be unique and always increasing, set that as the column and then select a value that is less than all the other values in this column.What value was used when I ran my incremental sync?
To troubleshoot, perform the following steps:
incrementalMetadata
and verify correctness.My type is something different from what appears on my dataset after it synced.
To troubleshoot, perform the following steps:
TIMESTAMP
, verify if the resultant type is LONG
in Foundry. If it is LONG
, you need to parse the type using your data preparation tool of choice (Code Repositories, Preparation, or another application) to TIMESTAMP
. This is a side effect of many drivers provided by database creators where types are reverted to their safest representations.DECIMAL
and has a different precision than the original database, we recommend casting the number to a specific precision and scale in the query on the database itself, or casting the column to VARCHAR
in the query and re-casting in Foundry.After a query begins running, how do I check its status?
To troubleshoot, perform the following steps:
If the schema of a file or JDBC table changes between incremental APPEND
transactions, your dataset will start failing with complaints of schema mismatches. Data Connection does infer schema for JDBC extracts, only propagates existing schemas for file-based extracts. In this event, you would have to apply the schema inference again if it is the same. If schemas truly have changed between APPEND
transactions, a new dataset is needed for the new schema.
To troubleshoot, perform the following steps:
File-based
If the files are XLSX or CSV tabular data, it may be possible to re-infer schemas on the synced dataset without issue. If this schema matches the previous, the dataset will add the appended rows without issue.
If after inferring schema you still get schema errors (either in Dataset Preview or another application), then this new file needs to instead be synced to a new dataset since it represents a fundamentally different view of a table.
If the dataset was already appended with the new schema, we recommend reaching out to Palantir Support to revert this transaction. Additionally, the syncs of files to the current dataset need to be paused by going to the sync overview page and pausing any schedule associated with this sync.
Subsequent files with new schemas should sync to a different dataset than the original, so we recommend copying the information from the original sync into a new sync, but replacing the target dataset with a different one (annotating in the dataset name the new version).
Additionally, it may be best to delete the original sync to avoid any future schema mismatch errors from occurring and corrupting the existing data in Foundry.
JDBC
account_transactions_v1.0
).account_transactions_v1.1
). This new dataset can then be unioned with the original to contain the full set of data.Unknown
My Bootvisor is stuck in a Unknown
state and won't stop / start.
Contact Palantir Support to check your setup before proceeding. Following the below steps will temporarily prevent syncs from running on this agent. Ensure multiple agents for sources or perform these steps during maintenance windows to prevent downtime.
kill
all JVM processes related to Data Connection../service/bin/init.sh start
.paused
as well.