I'm looking for an API that can:
1. Extract text from a PDF file (obviously, OCR)
2. Render a page of a PDF file to an image (a snapshot of the page)
For #1, the output should preferably be in XML or XHTML format.
Any suggestions?
I am using C#, so a .NET API or even a COM API would do.
