...

View Full Version : Goodbye to Google



tfit
03-25-2010, 05:12 AM
It seems as of today Google goes the Yahoo way of returning search ressults which means they insert their ip/name in the url. If you use the searchengine with a browser nothing changes, but programmatically it's a disaster. So goodbye Google for me.

bazz
03-25-2010, 05:30 AM
Can you explain some more? When I search, the results show only the url of the website returned.

bazz

tfit
03-25-2010, 05:49 AM
Can you explain some more? When I search, the results show only the url of the website returned.
bazz
My normal searchstring fetches 100 results like search?q=searchterm+filetype:html&num=100 for that I use windows/total commander. And I used to get a clean url. Now i get something like

http://74.125.77.132/search?q=cache:d78awKjp2p4J:http://www.w3schools.com/DOM/dom_nodes.asp+textnode&hl=&ct=clnk

for what used to be http://www.w3schools.com/DOM/dom_nodes.asp.
But like i said; if you use a browser nothing changes, but getting the urls from the result is too annoying to spend time on

torham
03-30-2010, 08:17 PM
What is the reason they are doing that; is it for a better way of tracking user behavior, or is there any other explanation?

Apostropartheid
03-30-2010, 08:39 PM
It looks like something went wrong and Google is returning the cached version of the site instead of the true URL.

eu4ria
03-30-2010, 10:53 PM
So if it is not some weird temporary quirk, why not just use a regular expression to strip out the first part of the url to get the part that you really want? So just grab everything after the 3rd colon in the example above.

tfit
04-01-2010, 10:03 PM
So if it is not some weird temporary quirk, why not just use a regular expression to strip out the first part of the url to get the part that you really want? So just grab everything after the 3rd colon in the example above.

The tool surf that i use gives a file list.txt and from that i build my data file. I I already strip out everything besides A-Za-z. Furthermore you have to do some screenscraping to find out if things won't break in the future.



EZ Archive Ads Plugin for vBulletin Copyright 2006 Computer Help Forum