  1. #1
    New to the CF scene
    Join Date
    Dec 2012
    Thanked 0 Times in 0 Posts

    Urgent. Web page content retrieval

    I'm doing my final-year project on web page summarization.
    I need help from you all. My project summarizes the pages linked from a Google results page and shows a short abstract of each page in place of the snippet below each link in a Google search.
    To start, I have to retrieve the contents of all 10 of those links. I tried web crawlers and html2txt software for this, but every attempt failed.
    Please, could someone guide me on retrieving the contents of the web pages linked from a Google search? When I used crawlers, they retrieved everything, including all the hyperlinks on the search results page.
    I mean only the 10 links that Google returns in response to the query. Please help me; I have only 3 months left to complete my project.
    Last edited by Surya0192; 01-02-2013 at 04:25 PM. Reason: wrong title

  2. #2
    Senior Coder alykins's Avatar
    Join Date
    Apr 2011
    Thanked 210 Times in 209 Posts
    It's worrisome to me that you are in your final year and did not think of this... Here is a head start: look at the source of the delivered page o.O

    If you have Chrome, you can even use Inspect Element and drill through. Methinks if you are truly in your final year, this tip and screenshot should be more than enough for you. Also, this has nothing to do with ASP.NET (at least as posted).
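
    To make the "look at the source" tip concrete, here is a minimal sketch in Python (standard library only) of pulling just the result links out of a saved results page and skipping the navigation links. The markup is an assumption: it pretends each result title is an `<a>` inside `<h3 class="r">`, and `SAMPLE_HTML`, `ResultLinkParser`, and `extract_result_links` are hypothetical names made up for illustration. Confirm the real tag and class with Inspect Element before relying on it, since the actual page structure may differ and can change.

    ```python
    from html.parser import HTMLParser


    class ResultLinkParser(HTMLParser):
        """Collect href values of <a> tags nested inside <h3 class="..."> headings."""

        def __init__(self, heading_class="r"):
            super().__init__()
            # Assumed class name of the result heading; verify it in the page source.
            self.heading_class = heading_class
            self.in_result_heading = False
            self.links = []

        def handle_starttag(self, tag, attrs):
            attrs = dict(attrs)
            classes = (attrs.get("class") or "").split()
            if tag == "h3" and self.heading_class in classes:
                self.in_result_heading = True
            elif tag == "a" and self.in_result_heading and attrs.get("href"):
                self.links.append(attrs["href"])

        def handle_endtag(self, tag):
            if tag == "h3":
                self.in_result_heading = False


    def extract_result_links(html, heading_class="r"):
        """Return the result links found in html, in document order."""
        parser = ResultLinkParser(heading_class)
        parser.feed(html)
        return parser.links


    # Made-up stand-in for a saved results page: one navigation link, two results.
    SAMPLE_HTML = """
    <div id="nav"><a href="/images">Images</a></div>
    <ol>
      <li><h3 class="r"><a href="http://example.com/a">First result</a></h3></li>
      <li><h3 class="r"><a href="http://example.com/b">Second result</a></h3></li>
    </ol>
    """

    if __name__ == "__main__":
        for link in extract_result_links(SAMPLE_HTML):
            print(link)
    ```

    Each extracted link can then be fetched and reduced to text as a separate step; this sketch only covers picking out the result links.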
    [Attached thumbnail: help.jpg]


