Hello and welcome to our community! Is this your first visit?
Register
Enjoy an ad free experience by logging in. Not a member yet? Register.
Results 1 to 2 of 2
  1. #1
    New to the CF scene
    Join Date
    Apr 2008
    Posts
    1
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Extracting Tabular Data from .doc file

    Is it possible to Extract the tabular fields from the .doc file if yes
    please let me know with which API it possible and with a sample example.


    Regards

    Praveen

  • #2
    Regular Coder Aradon's Avatar
    Join Date
    Jun 2005
    Location
    USA
    Posts
    734
    Thanks
    0
    Thanked 20 Times in 19 Posts
    There really isn't much out there for this sort of functionality, mostly because you're trying to parse a microsoft product with a non-microsoft language.

    Apache has an API listed here:

    http://poi.apache.org/

    As well as an API for use. So that should at least get you started. Of course, if that doesn't work you may want to try to convert the word document into an easier format (such as RTF), and then parse it.
    "To iterate is human, to recurse divine." -L. Peter Deutsch


  •  

    Posting Permissions

    • You may not post new threads
    • You may not post replies
    • You may not post attachments
    • You may not edit your posts
    •