# robots.txt for http://www.fh-krems.ac.at, http://www.imc-krems.ac.at/ # Normal robots.txt body is purely substring match only # We exclude lots of general purpose forms which are available in various mount points of the site # and internal image bank which is hidden in the navigation tree in any case User-agent: * Disallow: set_language Disallow: login_form Disallow: sendto_form Disallow: /bilder Disallow: /images Disallow: /downloads-de Disallow: /downloads-en Disallow: /links Disallow: /awstats Disallow: /Members Disallow: /author Disallow: /newsletter Allow: / # Googlebot allows regex in its syntax # Block all URLs including query strings (? pattern) - contentish objects expose query string only for actions or status reports which # might confuse search results. # This will also block ?set_language User-Agent: Googlebot Disallow: /*?* Disallow: /*sendto_form$ Disallow: /*folder_factories$ # Allow Adsense bot on entire site User-agent: Mediapartners-Google* Disallow: Allow: /*