|
-
May 4th, 2009, 08:56 PM
#1
Thread Starter
Hyperactive Member
Alternative to PDFBox - .NET Version
I am trying to read a PDF file line by line using PDFBox.
This is the first time I have ever attempted to do this with a PDF so I am not sure what I was expecting but I thought I would end up with some "mark up", which I could use to parse the lines I was looking for out of the file.
Anyway it didn't work out the way I had planned and I am not sure if that is because I am using the wrong tool or not using the right tool correctly. I managed to extract the text but I didn't really see any mark up that would be usefull to parse the file.
Does anybody have any experiece with extracting test from PDFs. What tool(s) do you recomend?
Last edited by FastEddie; May 5th, 2009 at 06:53 AM.
Posting Permissions
- You may not post new threads
- You may not post replies
- You may not post attachments
- You may not edit your posts
-
Forum Rules
|
Click Here to Expand Forum to Full Width
|