Results 1 to 2 of 2

Thread: Converting between file types w/ Java

  1. #1

    Thread Starter
    Member
    Join Date
    Sep 2002
    Posts
    45

    Converting between file types w/ Java

    Hello,

    I am looking for java programs (with source) that do the following :

    Convert Word to text from Java
    Convert HTML to text with correct layout and spacing
    Call Visual Basic or C++ from Java
    ~Found kind of here, still need help with calling vb [L=http://www.javaworld.com/javaworld/javatips/jw-javatip17.html]http://www.javaworld.com/javaworld/javatips/jw-javatip17.html[/L]
    Convert PDF to text from Java
    ~Found on sourceforge.net - pdfbox
    Convert PowerPoint to text from Java

    I am working on a project and we are doing the feasibility analysis of certain features...so that's why I need the open source.

  2. #2
    Kitten CornedBee's Avatar
    Join Date
    Aug 2001
    Location
    In a microchip!
    Posts
    11,594
    Word to text and PowerPoint to text will probably not be out there. Both formats are closed and to actually read them you have to try and reverse-engineer them. That's an extremely tedious process, so it's unlikely that anyone would go through it without a good reason to. The people of OpenOffice did it, for example, but their code is in C++.

    Calling VB from Java...
    In theory, it shouldn't be too hard. Sun offers or offered for free (but I don't think open source) a wrapper that reflects Java Beans into ActiveX - which is what VB classes are in the end. Doing the reverse ought not to be too hard. You would do it with some native modules for Java written in C++, that read a type library and inject the resulting information into the virtual machine. I can give you some hints there.

    As for calling C++ from Java, you only have to write a wrapper for the C++ classes in JNI.

    Converting HTML to text requires an HTML parser and a formatter. How hard this is is merely a question of how sophisticated you want it to be. Should it interpret CSS? Should it basically act as a text browser outputting into a text file?

    Oh, and the tag for links is URL, not L.
    All the buzzt
    CornedBee

    "Writing specifications is like writing a novel. Writing code is like writing poetry."
    - Anonymous, published by Raymond Chen

    Don't PM me with your problems, I scan most of the forums daily. If you do PM me, I will not answer your question.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  



Click Here to Expand Forum to Full Width