It does extract the corresponding locations, font names, font sizes, etc., for each bit of text. Note that the package cannot recognize text drawn as images because that would require optical character recognition. That’s right, you can even use the command to convert a PDF to HTML or XML! For example, say you want the HTML version of the first and third pages of your PDF, including images.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |