tag:blogger.com,1999:blog-5574479.post4930553356780718800..comments2024-03-29T06:53:23.405+00:00Comments on The IPKat: A Treasure-Trove of TravauxVerónica RodrÃguez Arguijohttp://www.blogger.com/profile/05763207846940036921noreply@blogger.comBlogger5125tag:blogger.com,1999:blog-5574479.post-56487875305587511852011-04-07T23:09:02.882+01:002011-04-07T23:09:02.882+01:00- super, this means that I can now let go of about...- super, this means that I can now let go of about 3' of paper that I obtained 1973f courtesy the Max Planck Institute in Munich and kept ever since. Only once did I really need those documents! But having them gave a nice feeling.<br /><br />George Brock-NannestadAnonymousnoreply@blogger.comtag:blogger.com,1999:blog-5574479.post-7222497272874517032011-04-07T11:58:54.084+01:002011-04-07T11:58:54.084+01:00There are at least two different formats of PDF fi...There are at least two different formats of PDF files. <br /><br />The original PDF encoding saves the file as an image, and all you can do with these is select and save part of the file as an image: searching is not possible. <br /><br />A later implementation involves character recognition and an encoding technique that produces an image file while retaining the individual character information. This format does allow searching by text string and selection and copying of text in word-processable form, as opposed to an image. <br /><br /><br />The EPO's more recent published decisions for example, do allow searching in this way, but all the "Travaux" files that I have dipped into have been in the old PDF format and are not searchable by text string. <br /><br />I believe that there are applications that allow the text in the original PDF Image format to be recognised for word processing, but this functionality is not provided in the standard Adobe Acrobat reader.Ronnoreply@blogger.comtag:blogger.com,1999:blog-5574479.post-23374423557905457592011-04-07T11:56:10.788+01:002011-04-07T11:56:10.788+01:00@AndyH - only PDFs with embedded text data can be ...@AndyH - only PDFs with embedded text data can be searched. These do not have embedded text data; they are purely images.Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-5574479.post-29862569175376579512011-04-07T11:27:28.539+01:002011-04-07T11:27:28.539+01:00.pdf is a flexible file format that allows both gr....pdf is a flexible file format that allows both graphic and text elements. <br /><br />The TP docs i have accessed contain only graphic elements which cannot be searched via the [find] function nor can the text be cut and paste as text. <br /><br />Matt is right, these would need to be passed through optical character reconition (OCR) package to generate the text elements.Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-5574479.post-21008392993443761072011-04-07T01:31:51.434+01:002011-04-07T01:31:51.434+01:00Just a brief note on the last item (Travaux). I...Just a brief note on the last item (Travaux). I'm sure you know that .pdf documents viewed in Adobe Reader or OpenOffice can be searched for text strings using the 'find' option.AndyJnoreply@blogger.com