If you’re new to Unstructured, read this note first.
Before you can create a source connector, you must first sign up for Unstructured and get your Unstructured API key. After you sign up, the Unstructured user interface (UI) appears, which you use to get the key. To learn how, watch this 40-second how-to video.
After you create the source connector, add it along with a destination connector to a workflow. Then run the workflow as a job. To learn how, try out the hands-on Workflow Endpoint quickstart, go directly to the quickstart notebook, or watch the two 4-minute video tutorials for the Unstructured Python SDK.
You can also create source connectors with the Unstructured user interface (UI). Learn how.
If you need help, reach out to the community on Slack, or contact us directly.
You are now ready to start creating a source connector! Keep reading to learn how.
If you are generating an SAS token as shown in the preceding video, be sure to set the following permissions:
- Read and List for reading from the container only.
- Write and List for writing to the container only.
- Read, Write, and List for both reading from and writing to the container.
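The permission rules above can be expressed as a small lookup. This is a minimal sketch only: the helper name sas_permissions is hypothetical and is not part of Unstructured or the Azure SDK. It returns the permission names to select when generating the SAS token.

```python
from typing import List


def sas_permissions(read: bool, write: bool) -> List[str]:
    """Return the SAS permissions to select for the intended use.

    Hypothetical helper that mirrors the three cases listed above.
    List is always included, because it is needed in every case.
    """
    if not (read or write):
        raise ValueError("choose at least one of read or write")
    permissions = []
    if read:
        permissions.append("Read")
    if write:
        permissions.append("Write")
    permissions.append("List")
    return permissions


# A source connector only reads from the container.
print(sas_permissions(read=True, write=False))
```

For a container that serves as both a source and a destination, pass read=True and write=True to get all three permissions.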
- An Azure account. To create one, learn how.
- An Azure Storage account, and a container within that account. Create a storage account. Create a container.
- The Azure Storage remote URL, using the format az://<container-name>/<path/to/file/or/folder/in/container/as/needed>
  For example, if your container is named my-container, and there is a folder in the container named my-folder, the Azure Storage remote URL would be az://my-container/my-folder/.
- An SAS token (recommended), access key, or connection string for the Azure Storage account. Create an SAS token (recommended). Get an access key. Get a connection string.
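To make the remote URL format concrete, the following sketch assembles an az:// URL from a container name and an optional path. The function azure_remote_url is a hypothetical helper written for illustration; it is not provided by Unstructured.

```python
def azure_remote_url(container: str, path: str = "") -> str:
    """Build an Azure Storage remote URL of the form az://<container>/<path>.

    Hypothetical helper: it only joins the two parts and does not check
    that the container or path actually exists.
    """
    container = container.strip("/")
    if not container:
        raise ValueError("container name must not be empty")
    # Drop any leading slash on the path so the URL has a single separator.
    return f"az://{container}/{path.lstrip('/')}"


# Matches the example above: container my-container, folder my-folder.
print(azure_remote_url("my-container", "my-folder/"))
```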
- <name> (required) - A unique name for this connector.
- az://<container-name>/<path/to/file/or/folder> (required) - The Azure Storage remote URL, with the format az://<container-name>/<path/to/file/or/folder/in/container/as/needed>
  For example, if your container is named my-container, and there is a folder in the container named my-folder, the Azure Storage remote URL would be az://my-container/my-folder/.
- <account-name> (required for SAS token authentication and account key authentication) - The Azure Storage account name.
- <sas-token> - For SAS token authentication, the SAS token for the Azure Storage account (required).
- <account-key> - For account key authentication, the key for the Azure Storage account (required).
- <connection-string> - For connection string authentication, the connection string for the Azure Storage account (required).
- For recursive (source connector only), set to true to recursively access files from subfolders within the container. The default is false if not otherwise specified.
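The settings above depend on which authentication method you choose, so it can help to check them before sending a request. The sketch below builds a plain dictionary and validates it against the rules in the list. It makes several assumptions: the function build_azure_source_config is hypothetical, the key names (remote_url, account_name, sas_token, account_key, connection_string, recursive) are illustrative and should be confirmed against the connector's API reference, and the rule that exactly one credential is supplied is an assumption made for this example.

```python
from typing import Optional


def build_azure_source_config(
    remote_url: str,
    account_name: Optional[str] = None,
    sas_token: Optional[str] = None,
    account_key: Optional[str] = None,
    connection_string: Optional[str] = None,
    recursive: bool = False,
) -> dict:
    """Assemble and validate connector settings (hypothetical helper)."""
    if not remote_url.startswith("az://"):
        raise ValueError("remote_url must start with az://")

    credentials = {
        "sas_token": sas_token,
        "account_key": account_key,
        "connection_string": connection_string,
    }
    provided = {key: value for key, value in credentials.items() if value}
    # Assumption for this sketch: supply exactly one credential.
    if len(provided) != 1:
        raise ValueError(
            "provide exactly one of sas_token, account_key, or connection_string"
        )
    # The account name is required for SAS token and account key authentication.
    if "connection_string" not in provided and not account_name:
        raise ValueError(
            "account_name is required with a SAS token or an account key"
        )

    # recursive defaults to False, matching the documented default.
    config = {"remote_url": remote_url, "recursive": recursive}
    if account_name:
        config["account_name"] = account_name
    config.update(provided)
    return config


# Placeholder values only; substitute your own account name and token.
config = build_azure_source_config(
    remote_url="az://my-container/my-folder/",
    account_name="<account-name>",
    sas_token="<sas-token>",
    recursive=True,
)
print(config)
```

Validating up front surfaces a missing account name or a missing credential locally, before the connector is created.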