Saturday, August 02, 2008

PDF Search - Dotnet Program with Source

If you have a collection of many pdf files, this tool will be useful to search the pdf files for a text string in the pdf, pdf to html conversion will be done by this tool first, i built it for learning.

The tool is made with dotnet vb7, and the source code of pdf search is open source, the components used in this tool are not open source.

From SAN Gear - Software Developer and Web Design Resource


The computer should be win98/pentium2 or better, with dotnet framework installed, this is a 20MB install file dotnetfx.exe , it must be in microsoft site or find it at pcworld, tucows, webattack or thru google. Also you need the windows installer files InstMsiA.exe and InstMsiW.exe get them from somewhere.

The program is alpha and may look odd to many, but it works and does its job with a lot of work arounds.

The Source Code is this pdfser_net.zip on this Page as an Attachment. - Web Deuce - DotNet

For this software to work the folder names and pdf file names should not have spaces eg. you cant use 'my documents' folder, make a folder like c:\pdf_books and in that put as many folders as you want nested 3 deep and put your pdf files in them rename all pdf files with spaces if you have a file good book.pdf rename as good_book.pdf.

The command line components used in this program are

pdf to html PDFTOHTML conversion program

grep32 or try

(please note this was done many years back - ananth )

If you are learning like me download the source code, it is not much as i can only write few lines, you are free to use it the way you like.

Copernic Desktop Search (CDS) came soon after this program of mine and later other desktop search. If that's true, i am a inventor ! or at least a Brainstormer.

0 comments: