Hello and welcome to our community! Is this your first visit?
Register
Enjoy an ad free experience by logging in. Not a member yet? Register.
Results 1 to 3 of 3
  1. #1
    New to the CF scene
    Join Date
    Jul 2002
    Location
    Athens, Greece
    Posts
    5
    Thanks
    0
    Thanked 0 Times in 0 Posts

    taking XML data via raw HTML?

    hiya, i was wonderingif there is a way to take and format data using XML, but data that does not come, to you properly, data that just sits on a website (ex, weather data on bbc.co.uk, or sillilar) and is HTML?

  • #2
    Moderator
    Join Date
    May 2002
    Location
    Hayward, CA
    Posts
    1,453
    Thanks
    1
    Thanked 21 Times in 19 Posts
    I don't quite follow you. Can you give me an example of the sort of source code you're talking about?

    Which is the content the user calls on and which is the content you want the page to automatically call on?
    "The first step to confirming there is a bug in someone else's work is confirming there are no bugs in your own."
    June 30, 2001
    author, Verbosio prototype XML Editor
    author, JavaScript Developer's Dictionary
    https://alexvincent.us/blog

  • #3
    Regular Coder
    Join Date
    Jun 2002
    Posts
    185
    Thanks
    0
    Thanked 0 Times in 0 Posts
    You can if the page you're inputting is valid XHTML or some very simple, well-formed HTML. But if there are any unclosed tags or other errors your parser will likely error out. I don't know of any XML parser that will clean up a document for you.

    You could also try writing your own parser to clean up bad HTML but that's bound to get complicated considering just how bad HTML can be and still appear readable on a browser.

    I would caution that you need to be careful scrapping data from other's web sites. You could be infringing on their copywrites or terms of use so be sure to ask permission first.


  •  

    Posting Permissions

    • You may not post new threads
    • You may not post replies
    • You may not post attachments
    • You may not edit your posts
    •