Yes. Consult the documentation on execution engines for information regarding the Foundry-imposed limits on final result size when using the Spark engine, as well as how to bypass these limits by using the direct read engine.
Reference the following additional topics when working with large datasets:
The connector offers two connection mode options, which are specified when you set up a data connection.
In Import mode, the selected tables and columns are imported into Power BI®. As you create or interact with a visualization, Power BI® uses the imported data. To see underlying data changes since the initial import or the most recent refresh, you must refresh the data either manually or via a scheduled refresh.
In DirectQuery mode, the data is not imported or copied into Power BI®. As you create or interact with a visualization, Power BI® queries Foundry as the underlying data source, so you’re always viewing current data, and you can push back data transformations and filtering into the Foundry layer.
Learn more about the difference between Import and DirectQuery..
Broader usage recommendations
We recommend trying to work in import mode where possible. If the data is large, consider whether it is possible to reduce the size in Foundry before importing, for example by filtering unnecessary rows, dropping columns, or pre-aggregating where possible. This Foundry preprocessing can be performed via Foundry transforms or Contour and configured as an automatic build. Alternatively, you can consider breaking your table import into smaller chunks which are within the size limit threshold, e.g. by importing one dataset for each calendar year.
When working in DirectQuery mode, we recommend keeping the dashboards to relatively few queries and simple interactivity. This approach balances the power of DirectQuery to work on very large datasets on the one hand, with end user expectations around performance on the other.
Finally, consider using a composite model for a “best of both worlds” approach when working with very large data. With the composite model, a report can include both Import and DirectQuery data connections side by side. For example, you can configure an Import connection with pre-aggregated or filtered overview and key statistic data, providing fast loading times when users open the report. In the same report, you can also configure a DirectQuery connection to facilitate deeper dive on-demand analytics.
Reports developed with the Foundry Power BI® integration will automatically respect the report developer’s access rights as configured in Foundry. For example, a developer building a report in Power BI® Desktop will have access only to those datasets or restricted views in Power BI® that he would normally have access to in Foundry.
Published reports, if using data refresh, will use the credentials of the configured data source on the Power BI® Gateway rather than inheriting any credentials of the consumer of the report.
For more information about managing data sources on a gateway, see Manage your data source - import/scheduled refresh in the Microsoft Power BI® documentation.
Many of Palantir’s customers use the Power BI® service to publish and distribute reports more broadly throughout their organizations.
In order to leverage the Foundry integration in these reports, use Power BI® Gateway with the Foundry connector and ODBC driver installed. When creating a connection on the gateway using the Foundry connection type, set the Authentication Method
to Key
and provide a token generated for a service account in Foundry. The gateway will then have connectivity to Foundry, and can be used by reports hosted in Power BI® service.
When accessing Foundry through a Power BI® Gateway with the Foundry connector, token authentication is the only supported authentication method. OAuth authentication in the Power BI service is not yet supported because this functionality is not available for Power BI connectors developed by third parties. If you have a use case that requires OAuth authentication, consider adding feedback to the Power BI Ideas ↗ site, and sharing feedback with your Palantir and Microsoft representatives.
The Foundry Power BI® integration functions with Power BI® Gateway in much the same way you are used to from Power BI® Desktop. Work with your Power BI® administrator to ensure that installation has been completed on the Gateway.
The Foundry Power BI® integration is compatible with Power BI® Dataflows, using the Power Query Online editor.
Before you begin, ensure you have access to an on-premises gateway that is configured with an existing connection to Foundry. Creating a new gateway connection during the Dataflows setup process is currently not supported due to an active Power BI® service issue.
To create a Dataflow that pulls data from Foundry, follow these steps:
https://<subdomain>.palantirfoundry.com/
. Optionally, provide a Dataset RID and Branch.On the Dataflows Connect to data source page you may receive the following error message when trying to create a new on-premises gateway connection:
Unexpected error (Session ID: <UUID>, Region: <REGION>)
There is an active Power BI® service issue that prevents creating the gateway connection during the Dataflows setup process. To work around this, set up a Foundry connection on your gateway before you create the Dataflow. In Power BI® service, you can manage gateway connections in Settings > Manage connections and gateways > Connections > New.
Once you have a working connection configured on your gateway, the Dataflows Connect to data source page will automatically select your gateway when you provide a matching Base URL.
You can think of an authentication token as your own private password that authorizes Power BI® to access Foundry data on your behalf. You can manage existing tokens and create new ones under the Account → Settings → Tokens page as described in the documentation on generating tokens.
If you encounter an error message when trying to authenticate via OAuth, it's possible this feature has not been enabled. Your error message may look similar to the screenshot below. In order to use this feature, Power BI® must be enabled as a third-party application. The managing third-party applications security documentation details how a Foundry administrator can enable this integration.
If OAuth authentication is not yet enabled you can continue to connect via the alternative token-based authentication option.
If you have previously authenticated from Power BI® into Foundry and now want to switch proactively to using token-based authentication, you will first need to clear your existing credentials from Power BI®.
You can do so by following the instructions under Unable to connect to clear your existing credentials and reauthenticate.
If you receive an "Unable to connect" message, it is likely your authentication credentials have expired. Follow the below instructions to clear your existing credentials and configure a new data source.
There might be a problem regarding your authentication credentials, or Power BI® might want to ask for your permissions to run a query. In that case, Power BI® might fail silently and display a blank report page with no option to "Apply Changes". In this case, you should click on "Edit Queries". It can give you options to proceed. If it doesn't, contact your Palantir representative and share the error message.
When selecting a dataset via “Get Data” → “Palantir Foundry”, you may notice that certain tables you have access to in Foundry are not showing up under the folder navigator. This occurs when you have access to the table but not the parent folder structure.
As long as you do have Foundry access to the table, you can still pull it into Power BI® for visualization. Use the optional dialog box on the preceding page to specify the “dataset RID”. The RID is the unique identifier for the dataset and is stable even if you move the dataset. You can locate the correct RID in Foundry by navigating to the desired dataset’s “About” page, clicking on “see more” and copying the RID value.
See more detailed instructions on finding the RID under Guides: Identifying a dataset's RID or filepath in Foundry
In general, imports will take longer to process the larger the table is. Depending on your Power BI® subscription, “Import mode” in Power BI® may be limited to 1GB. The Palantir Foundry connector for Power BI® also sets limits on import size - see Is there a size limit on how much data I can transmit? for more details.
If your table import is failing or slow due to table size, review How do I optimize usage of the connector across Import and DirectQuery modes? for tips on using large datasets with the connector.
If you select a dataset through the table navigator in Power BI®, your data source will rely on the dataset existing at this location. If the dataset is renamed or moved in Foundry this can lead to issues with data refresh, as Power BI® will no longer be able to locate the dataset.
If you anticipate a dataset to move it is advised to configure the data source using the dataset RID. See Why don’t I see my table in the table navigator? for more detail on configuring a data source via RID.
Power BI® and the Power BI® logo are trademarks of the Microsoft group of companies.