In Foundry, to manage large datasets efficiently without duplicating data, you can use the fast_copy_media_item() method on the output when copying items from one media set to another. Copying a reference to the same media blob, rather than duplicating the blob itself, is faster and more efficient than downloading and re-uploading the media item.
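The difference between copying a reference and duplicating a blob can be illustrated with a small in-memory model. This is only a conceptual sketch: BlobStore, ToyMediaSet, and copy_by_reference are hypothetical names invented for the illustration and are not part of Foundry's API, and the sketch does not call fast_copy_media_item() itself.

```python
from dataclasses import dataclass, field


@dataclass
class BlobStore:
    """Hypothetical stand-in for backing storage: blob id -> bytes."""
    blobs: dict = field(default_factory=dict)

    def put(self, data: bytes) -> str:
        # Every call stores a new blob, even if the bytes already exist.
        blob_id = f"blob-{len(self.blobs)}"
        self.blobs[blob_id] = data
        return blob_id


@dataclass
class ToyMediaSet:
    """Hypothetical media set: maps item paths to blob ids in a shared store."""
    store: BlobStore
    items: dict = field(default_factory=dict)

    def upload(self, path: str, data: bytes) -> None:
        # Full upload: write a new blob, then point the path at it.
        self.items[path] = self.store.put(data)

    def download(self, path: str) -> bytes:
        return self.store.blobs[self.items[path]]

    def copy_by_reference(self, source: "ToyMediaSet", path: str) -> None:
        # Reference copy: reuse the source's blob id; no bytes are moved.
        self.items[path] = source.items[path]


store = BlobStore()
source = ToyMediaSet(store)
target_slow = ToyMediaSet(store)
target_fast = ToyMediaSet(store)
source.upload("scan.pdf", b"%PDF-1.7 example bytes")

# Slow path: download and re-upload, which creates a second blob.
target_slow.upload("scan.pdf", source.download("scan.pdf"))
blobs_after_reupload = len(store.blobs)

# Fast path: copy only the reference, so no additional blob is created.
target_fast.copy_by_reference(source, "scan.pdf")
blobs_after_reference_copy = len(store.blobs)
```

In this model the re-upload adds a blob to the store, while the reference copy leaves the blob count unchanged and both media sets point at the same stored bytes.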
Timestamp: April 13, 2024
MediaSet: TooManyItemsUploadedInTransaction error while trying to upload a large number of files to a media set? And how can it be fixed?
The MediaSet: TooManyItemsUploadedInTransaction error occurs because there is a limit of 10,000 files that can be written to a media set in a single transaction. The solutions are:
Timestamp: March 6, 2024
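To make the 10,000-file limit described above concrete, the following sketch splits a list of file paths into batches that each fit within one transaction. It is an illustration under assumptions rather than a documented fix: the batches helper and the example paths are hypothetical, and whether each batch can be written in its own transaction depends on how the upload is set up.

```python
from typing import Iterator, List, Sequence

# Limit stated in the answer above: files per media set transaction.
MAX_ITEMS_PER_TRANSACTION = 10_000


def batches(paths: Sequence[str], size: int = MAX_ITEMS_PER_TRANSACTION) -> Iterator[List[str]]:
    """Yield consecutive slices of `paths`, each with at most `size` entries."""
    if size <= 0:
        raise ValueError("size must be positive")
    for start in range(0, len(paths), size):
        yield list(paths[start:start + size])


# Hypothetical input: 25,000 file paths waiting to be uploaded.
paths = [f"file_{i}.pdf" for i in range(25_000)]
batch_sizes = [len(batch) for batch in batches(paths)]
```

With 25,000 paths, this yields two full batches of 10,000 and a final batch of 5,000, so no single batch exceeds the limit.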
You can read from or write to transactionless media sets by changing the media output argument to media_output=MediaSetOutput(should_snapshot=False). This prevents the system from attempting to snapshot the output, which is not supported for non-transactional media sets. Additionally, for inputs, incremental pipelines should work the same as they do with transactional media sets.
Timestamp: March 18, 2024
Add the media set as an input to the build so that the build token has permission to read from it, then use the media_input.get_media_item_by_path() method to read values from it.
Timestamp: March 6, 2024
Extract Text from PDF pipeline expression with a dataset of files that is not a media set?
This is not possible because the pipeline board only works on items in a media set.
Timestamp: March 6, 2024
The media items in a media reference dataset can be linked to the respective objects from the Capabilities section of the object type in Ontology Manager.
Timestamp: March 6, 2024
Yes, it is possible to write to a media reference property in a function, in the same way that other object properties can be written to in functions. An action can be configured to run this function.
Timestamp: November 1, 2024
When packaging media sets in the marketplace, the 'default' branch is selected. Currently, it is not possible to select a different branch to be packaged. The branch name is preserved during packaging and installing.
Timestamp: April 16, 2024
There is no bulk upload workflow for media items in Workshop at this moment.
Timestamp: April 18, 2024