# Sample robots.txt file (make sure the filename is ALL LOWERCASE on Linux/Unix systems) # This says to apply these settings to ALL search engine spiders/crawlers User-agent: * # These settings will keep spiders from indexing your unwanted pages # This assumes that your OSC install is in your web site's ROOT directory # ie: http://www.tablesud.fr/languedoc/index.php <- Use if this brings up your OSC main page Disallow: /languedoc/account.php Disallow: /languedoc/advanced_search.php Disallow: /languedoc/checkout_shipping.php Disallow: /languedoc/create_account.php Disallow: /languedoc/login.php Disallow: /languedoc/logoff.php Disallow: /languedoc/password_forgotten.php Disallow: /languedoc/popup_image.php Disallow: /languedoc/shopping_cart.php Disallow: /account.php Disallow: /advanced_search.php Disallow: /checkout_shipping.php Disallow: /create_account.php Disallow: /login.php Disallow: /logoff.php Disallow: /password_forgotten.php Disallow: /popup_image.php Disallow: /shopping_cart.php # Feel free to add any other pages on your site that you don't want to be indexed by # the search engines. # PLEASE NOTE: Any pages that you list here should be secured by other means if you # don't want people to be able to view them, as some malicious users will look at a # IF YOU DO NOT WISH TO HAVE THE GOOGLE IMAGE BOT SCAN YOUR DOMAIN FOR IMAGES # THEN YOU CAN INCLUDE THE FOLLOWING IN YOUR ROBOTS FILE. # I FOUND THAT MY BANDWIDTH USAGE DROPPED BY A MASSIVE AMOUNT AFTER I GOT RID # OF THE GOOGLE IMAGE BOT. ALL I HAD WAS IMAGE HUNTERS STEALING PRODUCT SHOTS # AND NOT EVEN BROWSING THE SITE. User-agent: Googlebot-Image Disallow: /