Skip to main content
transform.pdf-embed, Embed OCR text layer into scanned PDFs/images to make them searchable

Configuration

Configuration goes inside the step’s with: block.
input
string
required
File input - template expression e.g. {{input.document}}
parseResult
string
required
Parse result - template expression e.g. {{steps.parse.output}}
outputFilename
string
Output filename - supports {{filename}} syntax
confidenceThreshold
number
default:"0.7"
Minimum OCR confidence (0-1) to include a word

Output

File ID from files table
Number of pages in the output PDF
Number of words embedded
Extracted text from the document