FiveTech Support Forums

FiveWin / Harbour / xBase community
Board index FiveWin for Harbour/xHarbour Extract information from PDF
Posts: 49
Joined: Thu Dec 22, 2005 12:50 PM
Extract information from PDF
Posted: Tue Mar 29, 2011 11:05 PM

Good evening friends
Does anyone know how to extract certain information from PDF files, via five?

Kleyton

Fwh906

Brazil
Posts: 4840
Joined: Fri Nov 18, 2005 04:52 PM
Re: Extract information from PDF
Posted: Wed Mar 30, 2011 04:24 PM

Kleyton,

PDF files are really not designed to be data sources. As you know PDF's can be incredibly complex with mixtures of graphics and text and lots of configuration data.

If your PDFs are simple and always the same format, then perhaps it would be feasible. With complex PDFs with varying formats it could quickly become a monumental task.

James

FWH 18.05/xHarbour 1.2.3/BCC7/Windows 10
Posts: 49
Joined: Thu Dec 22, 2005 12:50 PM
Re: Extract information from PDF
Posted: Thu Mar 31, 2011 06:15 PM

I indicated that DLL, also works with activex.

http://www.foxitsoftware.com/shopping/c ... ?suggest=0

Kleyton

Fwh906

Brazil
Posts: 883
Joined: Thu Dec 24, 2009 12:46 AM
Re: Extract information from PDF
Posted: Fri Apr 01, 2011 12:59 AM

Maybe if you explain a little better what you are trying to do, someone can help you...

Are you trying to read a PDF file???
Are you trying to edit a PDF file???
Are you trying to fill a PDF form???
Are you trying to extract information about the file???

=====>

Bayron Landaverry
xBasePHP.com
(215)2226600 Philadelphia,PA, USA
MayaBuilders@gMail.com
Guatemala

FWH25.06--Harbour 3.0.0--BCC7.7--UEstudio 10.10
Windows 10

FiveWin, One line of code and it's done...

Posts: 1091
Joined: Thu Nov 17, 2005 11:08 AM
Re: Extract information from PDF
Posted: Fri Apr 01, 2011 07:27 AM

My little implementation in a existing project.
Pdf documents arrive by email in advance daily.
Their contents are transport documents.
Warehouseman open pdf document using Acrobat reader (what else? ) .
(before you import something into a program I still need to check the contents)

Perform CTRL-A and CTRL-C (the content is copied to the clipboard).

In my program click a NEW button that reads from clipboard the contents and
extracts codes, quantities and prices.

Minimum effort maximum efficiency

Probably this is not an answer to your question, is only an input

bye
marco

Marco Boschi
info@marcoboschi.it
Posts: 946
Joined: Thu Oct 06, 2005 07:05 PM
Re: Extract information from PDF
Posted: Fri Apr 01, 2011 07:58 AM

try pdf toolkit , free and easy to use (command line)

I don't know what exactly you want to acheive with pdf but this tool offedrs a lot of possibilities, i use it to merge pdfs...etc

http://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/

Hth

Richard

http://www.cbati.com

Uestudio
Fwh 13.05 Harbour 3.2 MSVC 2013

Continue the discussion