You can change a workflow’s preconfigured strategy only through Custom workflow settings.
- VLM: For the highest-quality transformation of these file types:
.bmp
,.gif
,.heic
,.jpeg
,.jpg
,.pdf
,.png
,.tiff
, and.webp
. - High Res: For all other supported file types, and for the generation of bounding box coordinates.
- Fast: For text-only documents.
Supported languages
Fast partitioning accepts any text inputs, though automatic language detection of those inputs is restricted to langdetect. High Res partitioning leverages Tesseract OCR. For the list of languages that Tesseract supports, see: Languages/Scripts supported in different versions of Tesseract. Language support for VLM depends on the model used. The list of supported languages for a particular model is maintained by that model’s provider. For the list of languages that each model supports, see the following, where provided:-
Anthropic
- Claude 3.5 Sonnet: Arabic, Bengali, Chinese (Simplified), English, French, German, Hindi, Indonesian, Italian, Japanese, Korean, Portuguese (Brazil), Spanish, Swahili, and Yoruba are mentioned. (Source)
-
OpenAI
- GPT-4o: Arabic, Chinese, English, French, German, Gujarati, Hindi, Italian, Japanese, Korean, Marathi, Persian, Portuguese, Russian, Spanish, Tamil, Telugu, Turkish, Urdu, and Vietnamese are mentioned. (Source)
-
Amazon Bedrock
- Claude 3.5 Sonnet: “English, Spanish, Japanese, and multiple other languages” (Source)
- Claude 3 Opus: “English, Spanish, Japanese, and multiple other languages” (Source)
- Claude 3 Haiku: “English, Spanish, Japanese, and multiple other languages” (Source)
- Claude 3 Sonnet: “English, Spanish, Japanese, and multiple other languages” (Source)
- Amazon Nova Pro: “200+ languages” (Source)
- Amazon Nova Lite: “200+ languages” (Source)
- Meta Llama 3.2 90B Instruct: “English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai” (Source)
- Meta Llama 3.2 11B Instruct: “English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai” (Source)