Thursday, February 27, 2014

pdf-box plugin for greestone 2.85

I have noticed that they are many version of pdf-box plugin available out there. If by mistake you install a wrong one your Librarian Interface may fail to start. If you have greenstone 2.85 installed , you can try the plugin available in the following link. At least at the time of writing of this post, it was working properly.
I you have any further problem, knock me and will happy to help.

Note: You should put the extracted pdf-box in the ...\Greenstone\ext and restart the server.

Now that you've installed the PDFBox extension, this will be available as an option in the plugin's configuration dialog. To turn on the PDFBox extension for any collection you open in GLI, you would go to the Design panel, select Document Plugins from the left and on the right, double click the PDFPlugin (alternatively, select this plugin and click the <Configure Plugin...> below) to open the dialog to configure this plugin. In the Configure Plugin... dialog, scroll down to the section AutoLoadConverters and select the checkbox next to the pdfbox_conversion option. Click OK to close the dialog, switch to the Create panel and rebuild your collection. This time, PDF files will be processed by PDFBox which will extract their text.
Try this feature out on a collection of recent PDF files, by configuring its PDFPlugin with the pdfbox_conversion option turned on.

No comments:

Post a Comment