msnbot/2.0b and robots.txt

It looks like the new MSN spider, msnbot/2.0b, is having some trouble obeying robots.txt. Until they fix it, the bot can be blocked in Apache with:
 SetEnvIfNoCase User-Agent "^msnbot/2\.0b" banned_by_now
 ...
 <Directory "/srv/www/htdocs/">
    Order allow,deny
    Allow from all
    Deny from env=banned_by_now
 </Directory>
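
On Apache 2.4 and later, Order, Allow and Deny are deprecated and only available through mod_access_compat. A rough equivalent with the newer Require directives might look like the sketch below. It assumes mod_setenvif and mod_authz_core are loaded, and it reuses the same directory path and variable name as above; adjust both to your setup.

 # Flag requests whose User-Agent starts with msnbot/2.0b (case-insensitive)
 SetEnvIfNoCase User-Agent "^msnbot/2\.0b" banned_by_now
 <Directory "/srv/www/htdocs/">
     <RequireAll>
         # Allow everyone...
         Require all granted
         # ...except requests flagged above
         Require not env banned_by_now
     </RequireAll>
 </Directory>

The negated Require has to sit inside a RequireAll block, since a "Require not" line cannot grant access on its own.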

