Did the Wayback Machine break?
Beau Schwabe
Posts: 6,566
I was looking for something and needed to use the Wayback Machine (http://archive.org/web/), but I can't get anything except "Page cannot be crawled or displayed due to robots.txt" everywhere I try to go. Even my own website that I had in the early '90s isn't there anymore... I get the same message.
Bummer !!
Comments
http://www.ionet.net/~bschwabe/index.html.
This is the message I keep getting ....
-Phil
IOW, keep out!
-Phil
http://blog.archive.org/2013/10/25/reader-privacy-at-the-internet-archive/
I seem to get a 500 Internal Server Error when I try your link. Hopefully that helps you in some way.
As others have mentioned, your site now displays a 500 server error.
The behaviour is mentioned in the Wikipedia article: Source for this statement is probably the Internet Archive FAQ
This really points up the risk of using personal pages provided by an ISP or a school site for anything you want to keep, because you have less than zero control. It should all be considered transient. Google offers free pages, and they are highly unlikely to ever apply a global robots exclusion. If you don't want to pay for hosting, there are also freebie blog hosts you can use where your blog is a subdomain, in which case each subdomain can have its own robots.txt file.
The Web was designed for pulling info, but more and more of the new bits are for pushing ads and tracking and controlling the end users. I'm becoming more disconnected all the time.
I just added a NO TRACK robots.txt on my site a few months ago .......... Let's see if my site is still there on WBM....... Granted, there is a way to add an exclusion to the file to let the Wayback Machine do its thing.
Well, I guess it's not as retroactive as we think it is .....
I can see my stuff back to 05
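For anyone wanting to try the exclusion idea mentioned above, here is a rough sketch of how you could check a robots.txt before uploading it. It assumes the Wayback Machine's crawler identifies itself as ia_archiver (that is the name I have seen used, but check the Internet Archive's own FAQ), and example.com plus the rules themselves are made up for illustration. It uses Python's standard urllib.robotparser to see who is allowed in.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block every crawler except the one named
# "ia_archiver" (assumed here to be the Wayback Machine's user-agent).
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow:

User-agent: *
Disallow: /
"""


def can_fetch(agent, url, robots_txt=ROBOTS_TXT):
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    page = "http://example.com/index.html"  # placeholder URL
    for agent in ("ia_archiver", "SomeOtherBot"):
        verdict = "allowed" if can_fetch(agent, page) else "blocked"
        print(f"{agent}: {verdict}")
```

An empty Disallow line means "nothing is disallowed" for that agent, which is why the first group lets the archiver through while the catch-all group blocks everyone else. This only tells you what the file says; whether a given crawler honors it is up to the crawler.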