Obtains the text of every page of a PDF document and returns it as an array of strings, one per page.
1 pdf.reader.getTextFromDocument
<pdf.reader.getTextFromDocument debug='debug'>
<inputreader_pdf /> ?
</pdf.reader.getTextFromDocument>
Attributes

Name | Type | Required | Default | Description
---|---|---|---|---
debug | boolean | | | Enables the debug trace, which shows how the resulting text was obtained.
Arguments

Name | Type | Required | Unique | Nullable | Description
---|---|---|---|---|---
inputreader_pdf | pdf.reader | | | | The PDF reader object to process.
Returns

Type | Description
---|---
Array | An array in which each element contains the text of one page of the document.
Example
Creates the reader object from a PDF file, then iterates over the returned array and prints the text of each page.
<iterator name='m_pagetxt' type='entry'>
  <in>
    <pdf.reader.getTextFromDocument debug='true'>
      <pdf.reader>
        <file type='absolute' name='src_plain.pdf' />
      </pdf.reader>
    </pdf.reader.getTextFromDocument>
  </in>
  <do>
    <println>
      <m_pagetxt />
    </println>
  </do>
</iterator>