java - Encounter images in between the text while parsing pdf using tika-server -
i able fetch images , coordinates pdf using pdfbox. when parse pdf using tika server, text only. how know when image occures can put image after text. using code given in following 1st answer: extract images pdf using pdfbox
i using tika server 1.7 talking data of pdf in parser , using plain text version. want know while parsing, how know image encountered.
i got html output using prasetohtml() @ link https://tika.apache.org/1.10/examples.html still not giving me images present in pdf. nor giving tag.
Comments
Post a Comment