Go Back   CodingForums.com > :: Server side development > ASP.NET

Before you post, read our: Rules & Posting Guidelines

Reply
 
Thread Tools Rate Thread
Enjoy an ad free experience by logging in. Not a member yet? Register.
Old 01-02-2013, 03:23 PM   PM User | #1
Surya0192
New to the CF scene

 
Join Date: Dec 2012
Posts: 3
Thanks: 2
Thanked 0 Times in 0 Posts
Surya0192 is an unknown quantity at this point
Urgent. Web page content retrieval

Im doing my final year project on Web page summarization.
I need an from u ppl... My project is all about summarizing the google page links and provide a short abstract of those links in the place of snippets below each link when we go for google search.
To start this ive to retrieve the contents of all those 10 links... For this i tried using web crawlers , html2txt s/w's but all ended in failure..
Please someone guide me to retrieve the contents in the web page links given by google search. Whe i used crawlers it retrieved all the contents.. like all the hyper links from that search result page.
Im talking about those 10 links alone which are returned by the google engine in return to our query... Please help me.. Still ive oly 3months to complete my project

Last edited by Surya0192; 01-02-2013 at 03:25 PM.. Reason: wrong title
Surya0192 is offline   Reply With Quote
Old 01-03-2013, 01:15 PM   PM User | #2
alykins
Senior Coder

 
alykins's Avatar
 
Join Date: Apr 2011
Posts: 1,608
Thanks: 37
Thanked 183 Times in 182 Posts
alykins will become famous soon enough
It's worrysome to me that you are in your final year and did not think of this.... here is a head start. Look at the source of the delivered page o.O

If you have chrome you can even use inspect element and drill through. Me thinks if you are truely in your final year this tip and screen shot should be more than enough for you. Also this has nothing to do with ASP.NET (at least as posted)
Attached Thumbnails
Click image for larger version

Name:	help.jpg
Views:	33
Size:	46.4 KB
ID:	11832  
__________________

I code C hash-tag .Net
Reference: W3C W3CWiki .Net Lib
Validate: html CSS
Debug: Chrome FireFox IE
alykins is offline   Reply With Quote
Users who have thanked alykins for this post:
Surya0192 (01-15-2013)
Reply

Bookmarks

Tags
asp.net, web page retrieval

Jump To Top of Thread


Thread Tools
Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT +1. The time now is 02:41 PM.


Advertisement
Log in to turn off these ads.