Getting started
Choose one of the following options to get started with the Unstructured Workflow Endpoint:- Follow the quickstart, which uses the Unstructured Python SDK from a remote hosted Google Collab notebook.
- Start using the Unstructred Python SDK.
- Start using a REST client, such as
curlor Postman.
Quickstart
This quickstart uses the Unstructured Python SDK to call the Unstructured Workflow Endpoint to get your data RAG-ready. The Python code for this quickstart is in a remote hosted Google Collab notebook. Data is processed on Unstructured-hosted compute resources. The requirements are as follows:- A compatible source (input) location that contains your data for Unstructured to process. See the list of supported source types. This quickstart uses an Amazon S3 bucket as the source location. If you use a different source type, you will need to modify the quickstart notebook accordingly.
- For document-based source locations, compatible files in that location. See the list of supported file types. If you do not have any files available, you can download some from the example-docs folder in the
Unstructured-IO/unstructured-ingestrepository in GitHub. - A compatible destination (output) location for Unstructured to put the processed data. See the list of supported destination types. For this quickstart’s destination location, a different folder in the same Amazon S3 bucket as the source location is used. If you use a different destination S3 bucket or a different destination type, you will need to modify the quickstart notebook accordingly.
Sign up, sign in, and get your API key
- Go to https://platform.unstructured.io and use your email address, Google account, or GitHub account to sign up for an Unstructured account (if you do not already have one) and sign into the account at the same time. The Unstructured user interface (UI) appears.
-
Get your Unstructured API key:
a. In the Unstructured UI, click API Keys on the sidebar.
b. Click Generate API Key.
c. Follow the on-screen instructions to finish generating the key.
d. Click the Copy icon next to your new key to add the key to your system’s clipboard. If you lose this key, simply return and click the Copy icon again.
Create and set up the S3 bucket
input represents the
source location. This is where your files to be processed will be stored.
The S3 URI to the source location will be s3://<your-bucket-name>/input.Inside of the same S3 bucket, a folder inside named output represents the destination location. This
is where Unstructured will put the processed data.
The S3 URI to the destination location will be s3://<your-bucket-name>/output.Learn how to create an S3 bucket and set it up for Unstructured. (Do not run the Python SDK code or REST commands at the end of those setup instructions.)Run the quickstart notebook
View the processed data
Unstructured Python SDK
Watch the following 4-minute video to learn how to use the Python SDK to call the Unstructured Workflow Endpoint to create connectors in the Unstructured UI.- Go to https://platform.unstructured.io and use your email address, Google account, or GitHub account to sign up for an Unstructured account (if you do not already have one) and sign into the account at the same time. The Unstructured user interface (UI) appears.
-
Get your Unstructured API key:
a. In the Unstructured UI, click API Keys on the sidebar.
b. Click Generate API Key.
c. Follow the on-screen instructions to finish generating the key.
d. Click the Copy icon next to your new key to add the key to your system’s clipboard. If you lose this key, simply return and click the Copy icon again.
unstructured_client functions for creating, listing, updating,
and deleting connectors, workflows, and jobs in the Unstructured UI all use the Unstructured Workflow Endpoint URL (https://platform.unstructuredapp.io/api/v1) by default. You do not need to
use the server_url parameter to specify this API URL in your Python code for these particular functions.
https://platform.unstructuredapp.io/api/v1 for your
API URL throughout the following examples.To specify an API URL in your code, set the server_url parameter in the UnstructuredClient constructor to the target API URL.- A source connector ingests files or data into Unstructured from a source location.
- A destination connector sends the processed data from Unstructured to a destination location.
- A workflow defines how Unstructured will process the data.
- A job runs a workflow at a specific point in time.
REST endpoints
The Unstructured Workflow Endpoint is callable from a set of Representational State Transfer (REST) endpoints, which you can call through standard REST-enabled utilities, tools, programming languages, packages, and libraries. The examples, shown later on this page and on related pages, describe how to call the Unstructured Workflow Endpoint withcurl and Postman. You can adapt this information as needed for your preferred programming languages and libraries, for example by using the
requests library with Python.
https://platform.unstructuredapp.io. To use the Swagger UI, you must provide your Unstructured API key with each call. To
get this API key, see the quickstart, earlier on this page.curl and Postman
The followingcurl examples use the following environment variables, which you can set as follows:
curl examples and help prevent
you from storing scripts that contain sensitive URLs and API keys in public source code repositories.
To get your Unstructured API key, do the following:
- Go to https://platform.unstructured.io and use your email address, Google account, or GitHub account to sign up for an Unstructured account (if you do not already have one) and sign into the account at the same time. The Unstructured user interface (UI) appears.
-
Get your Unstructured API key:
a. In the Unstructured UI, click API Keys on the sidebar.
b. Click Generate API Key.
c. Follow the on-screen instructions to finish generating the key.
d. Click the Copy icon next to your new key to add the key to your system’s clipboard. If you lose this key, simply return and click the Copy icon again.
- In Postman, on your workspace’s sidebar, click Environments.
- Click Globals.
-
Create two global variables with the following settings:
- Variable:
UNSTRUCTURED_API_URL - Type:
default - Initial value:
https://platform.unstructuredapp.io/api/v1 - Current value:
https://platform.unstructuredapp.io/api/v1
- Variable:
UNSTRUCTURED_API_KEY - Type:
secret - Initial value:
<your-unstructured-api-key> - Current value:
<your-unstructured-api-key>
- Variable:
- Click Save.
- Go to https://platform.unstructured.io and use your email address, Google account, or GitHub account to sign up for an Unstructured account (if you do not already have one) and sign into the account at the same time. The Unstructured user interface (UI) appears.
-
Get your Unstructured API key:
a. In the Unstructured UI, click API Keys on the sidebar.
b. Click Generate API Key.
c. Follow the on-screen instructions to finish generating the key.
d. Click the Copy icon next to your new key to add the key to your system’s clipboard. If you lose this key, simply return and click the Copy icon again.
- A source connector ingests files or data into Unstructured from a source location.
- A destination connector sends the processed data from Unstructured to a destination location.
- A workflow defines how Unstructured will process the data.
- A job runs a workflow at a specific point in time.
Restrictions
The following Unstructured SDKs, tools, and libraries do not work with the Unstructured Workflow Endpoint:- The Unstructured JavaScript/TypeScript SDK
- Local single-file POST requests to the Unstructured Partition Endpoint
- The Unstructured open source Python library
- The Unstructued Ingest CLI
- The Unstructured Ingest Python library
https://api.unstructuredapp.io/general/v0/general (the Unstructured Partition Endpoint URL).
Connectors
You can list, get, create, update, and delete source connectors. You can also list, get, create, update, and delete destination connectors. For general information, see Connectors.List source connectors
To list source connectors, use theUnstructuredClient object’s sources.list_sources function (for the Python SDK) or
the GET method to call the /sources endpoint (for curl or Postman).
To filter the list of source connectors, use the ListSourcesRequest object’s source_type parameter (for the Python SDK)
or the query parameter source_type=<type> (for curl or Postman),
replacing <type> with the source connector type’s unique ID
(for example, for the Amazon S3 source connector type, S3 for the Python SDK or s3 for curl or Postman).
To get this ID, see Sources.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select GET.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
-
To filter the list of source connectors, on the Params tab, enter the following query parameter:
- Key:
source_type, Value:<type>
- Key:
- Click Send.
Get a source connector
To get information about a source connector, use theUnstructuredClient object’s sources.get_source function (for the Python SDK) or
the GET method to call the /sources/<connector-id> endpoint (for curl or Postman), replacing
<connector-id> with the source connector’s unique ID. To get this ID, see List source connectors.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select GET.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
- Click Send.
Create a source connector
To create a source connector, use theUnstructuredClient object’s sources.create_source function (for the Python SDK) or
the POST method to call the /sources endpoint (for curl or Postman).
In the CreateSourceConnector object (for the Python SDK) or
the request body (for curl or Postman),
specify the settings for the connector. For the specific settings to include, which differ by connector, see
Sources.
For the Python SDK, replace <type> with the source connector type’s unique ID (for example, for the Amazon S3 source connector type, S3).
To get this ID, see Sources.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select POST.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json - Key:
content-type, Value,application/json
- Key:
- On the Body tab, select raw and JSON, and specify the settings for the connector.
- Click Send.
Update a source connector
To update information about a source connector, use theUnstructuredClient object’s sources.update_source function (for the Python SDK) or
the PUT method to call the /sources/<connector-id> endpoint (for curl or Postman), replacing
<connector-id> with the source connector’s unique ID. To get this ID, see List source connectors.
In the UpdateSourceConnector object (for the Python SDK) or
the request body (for curl or Postman), specify the settings for the connector. For the specific settings to include, which differ by connector, see
Sources.
For the Python SDK, replace <type> with the source connector type’s unique ID (for example, for the Amazon S3 source connector type, S3).
To get this ID, see Sources.
You must specify all of the settings for the connector, even for settings that are not changing.
You can change any of the connector’s settings except for its name and type.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select PUT.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json - Key:
content-type, Value,application/json
- Key:
- On the Body tab, select raw and JSON, and specify the settings for the connector.
- Click Send.
Delete a source connector
To delete a source connector, use theUnstructuredClient object’s sources.delete_source function (for the Python SDK) or
the DELETE method to call the /sources/<connector-id> endpoint (for curl or Postman), replacing
<connector-id> with the source connector’s unique ID. To get this ID, see List source connectors.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select DELETE.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
- Click Send.
List destination connectors
To list destination connectors, use theUnstructuredClient object’s destinations.list_destinations function (for the Python SDK) or
the GET method to call the /destinations endpoint (for curl or Postman).
To filter the list of destination connectors, use the ListDestinationsRequest object’s destination_type parameter (for the Python SDK) or
the query parameter destination_type=<type> (for curl or Postman),
replacing <type> with the destination connector type’s unique ID
(for example, for the Amazon S3 source connector type, S3 for the Python SDK or s3 for curl or Postman).
To get this ID, see Destinations.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select GET.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
-
To filter the list of destination connectors, on the Params tab, enter the following query parameter:
- Key:
destination_type, Value:<type>
- Key:
- Click Send.
Get a destination connector
To get information about a destination connector, use theUnstructuredClient object’s destinations.get_destination function (for the Python SDK) or
the GET method to call the /destinations/<connector-id> endpoint (for curl or Postman), replacing
<connector-id> with the destination connector’s unique ID. To get this ID, see List destination connectors.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select GET.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
- Click Send.
Create a destination connector
To create a destination connectors, use theUnstructuredClient object’s destinations.create_destination function (for the Python SDK) or
the POST method to call the /destinations endpoint (for curl or Postman).
In the CreateDestinationConnector object (for the Python SDK) or
the request body (for curl or Postman),
specify the settings for the connector. For the specific settings to include, which differ by connector, see
Destinations.
For the Python SDK, replace <type> with the destination connector type’s unique ID (for example, for the Amazon S3 source connector type, S3).
To get this ID, see Destinations.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select POST.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json - Key:
content-type, Value,application/json
- Key:
- On the Body tab, select raw and JSON, and specify the settings for the connector.
- Click Send.
Update a destination connector
To update information about a destination connector, use theUnstructuredClient object’s destinations.update_destination function (for the Python SDK) or
the PUT method to call the /destinations/<connector-id> endpoint (for curl or Postman), replacing
<connector-id> with the destination connector’s unique ID. To get this ID, see List destination connectors.
In the UpdateDestinationConnector object (for the Python SDK) or
the request body (for curl or Postman), specify the settings for the connector. For the specific settings to include, which differ by connector, see
Destinations.
You must specify all of the settings for the connector, even for settings that are not changing.
You can change any of the connector’s settings except for its name and type.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select PUT.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json - Key:
content-type, Value,application/json
- Key:
- On the Body tab, select raw and JSON, and specify the settings for the connector.
- Click Send.
Delete a destination connector
To delete a destination connector, use theUnstructuredClient object’s destinations.delete_destination function (for the Python SDK) or
the DELETE method to call the /destinations/<connector-id> endpoint (for curl or Postman), replacing
<connector-id> with the destination connector’s unique ID. To get this ID, see List destination connectors.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select DELETE.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
- Click Send.
Workflows
You can list, get, create, run, update, and delete workflows. For general information, see Workflows.List workflows
To list workflows, use theUnstructuredClient object’s workflows.list_workflows function (for the Python SDK) or
the GET method to call the /workflows endpoint (for curl or Postman).
To filter the list of workflows, use one or more of the following ListWorkflowsRequest parameters (for the Python SDK) or
query parameters (for curl or Postman):
source_id=<connector-id>, replacing<connector-id>with the source connector’s unique ID. To get this ID, see List source connectors.destination_id=<connector-id>, replacing<connector-id>with the destination connector’s unique ID. To get this ID, see List destination connectors.status=WorkflowState.<status>(for the Python SDK) orstatus=<status>(forcurlor Postman), replacing<status>with one of the following workflow statuses:ACTIVEorINACTIVE(for the Python SDK) oractiveorinactive(forcurlor Postman).
?source_id=<connector-id>&status=<status>.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select GET.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
-
To filter the list of workflows, on the Params tab, enter one or more of the following query parameter:
- By source connector ID: Key:
source_id, Value:<connector-id> - By destination connector ID: Key:
destination_id, Value:<connector-id> - By workflow status: Key:
status, Value:<status>
- By source connector ID: Key:
- Click Send.
Get a workflow
To get information about a workflow, use theUnstructuredClient object’s workflows.get_workflow function (for the Python SDK) or
the GET method to call the /workflows/<workflow-id> endpoint (for curl or Postman), replacing
<workflow-id> with the workflow’s unique ID. To get this ID, see List workflows.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select GET.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
- Click Send.
Create a workflow
To create a workflow, use theUnstructuredClient object’s workflows.create_workflow function (for the Python SDK) or
the POST method to call the /workflows endpoint (for curl or Postman).
In the CreateWorkflow object (for the Python SDK) or
the request body (for curl or Postman),
specify the settings for the workflow. For the specific settings to include, see
Create a workflow.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select POST.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json - Key:
content-type, Value,application/json
- Key:
- On the Body tab, select raw and JSON, and specify the settings for the workflow.
- Click Send.
Run a workflow
To run a workflow manually, use theUnstructuredClient object’s workflows.run_workflow function (for the Python SDK) or
the POST method to call the /workflows/<workflow-id>/run endpoint (for curl or Postman), replacing
<workflow-id> with the workflow’s unique ID. To get this ID, see List workflows.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl (remote source and remote destination)
curl (remote source and remote destination)
curl (local source and local destination)
curl (local source and local destination)
</full/path/to/local/filename.extension>with the full path to the local file to upload.<filename.extension>with the filename of the local file to upload.<local-file-media-type>with the local file’s media type. For a list of available media types, such asapplication/pdf, see Media Types.
--form entries, one per file.Postman (remote source and remote destination)
Postman (remote source and remote destination)
- In the method drop-down list, select POST.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
- Click Send.
Postman (local source and local destination)
Postman (local source and local destination)
- In the method drop-down list, select POST.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
-
On the Body tab, select form-data, and specify the settings for the workflow run:
-
Key:
input_files, File, Value: Click the Value box, then click New file from local machine, and select the file to upload. To upload multiple files, add additionalinput_filesentries after this one, one entry per additional file to upload. -
Key:
filename, Text, Value: Type the name of the file that you just uploaded. To upload multiple files, add additionalfilenameentries after this one, one entry per additional file to upload. Make sure the order of thesefilenameentries matches the order of theinput_filesentries, respectively. -
Key:
type, Text, Value:<local-file-media-type>To upload multiple files, add additionaltypeentries after this one, one entry per additional file to upload. Make sure the order of thesetypeentries matches the order of theinput_filesentries, respectively.
application/pdf, see Media Types. -
Key:
- Click Send.
schedule setting in the request body when you create or update a
workflow. See Create a workflow or Update a workflow.
Update a workflow
To update information about a workflow, use theUnstructuredClient object’s workflows.update_workflow function (for the Python SDK) or
the PUT method to call the /workflows/<workflow-id> endpoint (for curl or Postman), replacing
<workflow-id> with the workflow’s unique ID. To get this ID, see List workflows.
In UpdateWorkflow object (for the Python SDK) or
the request body (for curl or Postman), specify the settings for the workflow. For the specific settings to include, see
Update a workflow.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select PUT.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json - Key:
content-type, Value,application/json
- Key:
- On the Body tab, select raw and JSON, and specify the settings for the workflow.
- Click Send.
Delete a workflow
To delete a workflow, use theUnstructuredClient object’s workflows.delete_workflow function (for the Python SDK) or
the DELETE method to call the /workflows/<workflow-id> endpoint (for curl or Postman), replacing
<workflow-id> with the workflow’s unique ID. To get this ID, see List workflows.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select DELETE.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
- Click Send.
Jobs
You can list, get, and cancel jobs. A job is created automatically whenever a workflow runs on a schedule; see Create a workflow. A job is also created whenever you run a workflow; see Run a workflow. For general information, see Jobs.List jobs
To list jobs, use theUnstructuredClient object’s jobs.list_jobs function (for the Python SDK) or
the GET method to call the /jobs endpoint (for curl or Postman).
To filter the list of jobs, use one or both of the following ListJobsRequest parameters (for the Python SDK) or
query parameters (for curl or Postman):
workflow_id=<workflow-id>, replacing<workflow-id>with the workflow’s unique ID. To get this ID, see List workflows.status=<status>, replacing<status>with one of the following job statuses:completed,failed,im progress,scheduled, andstopped.
curl or Postman, you can specify multiple query parameters as ?workflow_id=<workflow-id>&status=<status>.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select GET.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
-
To filter the list of jobs, on the Params tab, enter one or more of the following query parameter:
- By workflow ID: Key:
workflow_id, Value:<workflow-id> - By job status: Key:
status, Value:<status>
- By workflow ID: Key:
- Click Send.
Get a job
To get information about a job, use theUnstructuredClient object’s jobs.get_job function (for the Python SDK) or
the GET method to call the /jobs/<job-id> endpoint (for curl or Postman), replacing
<job-id> with the job’s unique ID. To get this ID, see List jobs.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select GET.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
- Click Send.
Cancel a job
To cancel a running job, use theUnstructuredClient object’s jobs.cancel_job function (for the Python SDK) or
the POST method to call the /jobs/<job-id>/cancel endpoint (for curl or Postman), replacing
<job-id> with the job’s unique ID. To get this ID, see List jobs.
Python SDK
Python SDK
Python SDK (async)
Python SDK (async)
curl
curl
Postman
Postman
- In the method drop-down list, select POST.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
- Click Send.
Download a processed local file from a job
This applies only to jobs that use a workflow with a local source and a local destination. To download a processed local file from a completed job, useGET to call the /jobs/<job-id>/download endpoint, replacing
<job-id> with the job’s unique ID. To get this ID, see List jobs.
You must also provide Unstructured’s IDs for the file to download and the workflow’s output node. To get these IDs, see Get a job. In the
response:
- Unstructured’s IDs for the file to download and the workflow’s output node are in the
output_node_filesarray. - The ID for the file to download is in the
output_node_filesarray’sfile_idfield. - The ID for the workflow’s output node is in the
output_node_filesarray’snode_idfield.
curl
curl
Postman
Postman
- In the method drop-down list, select GET.
-
In the address box, enter the following URL:
-
On the Headers tab, enter the following headers:
- Key:
unstructured-api-key, Value:{{UNSTRUCTURED_API_KEY}} - Key:
accept, Value:application/json
- Key:
-
On the Params tab, enter the following query parameters:
- Key:
file_id, Value:<file-id> - Key:
node_id, Value:<node-id>
- Key:
- Click Send.

