Set up robots.txt on lists servers

This sets up a robots.txt on our lists servers. To start this file
prevents SEMrush bot from indexing our lists as that has been causing
lists.openstack.org to OOM with many listinfo processes started by
Apache.

We've avoided this OOM by manually configuring this robots.txt. Other
things we have ruled out are bup and input email causes qrunner's to
grow unexpectedly large. Fairly confident this bot is the trigger.

Note this fixes testing by adding 'hieradata' to set listpassword var.

Depends-On: https://review.opendev.org/724389
Change-Id: Id4f6739a8cf6a01f9796fa54c86ba1af3e31fecf
This commit is contained in:
Clark Boylan
2020-04-29 08:24:50 -07:00
committed by Clark Boylan
parent fffd971c38
commit eeac5467c3
4 changed files with 13 additions and 0 deletions

View File

@@ -0,0 +1,2 @@
User-agent: SemrushBot
Disallow: /

View File

@@ -8,6 +8,15 @@ class openstack_project::lists(
multihost => true,
}
file { '/var/www/robots.txt':
ensure => present,
source => 'puppet:///modules/openstack_project/mailman/robots.txt',
owner => 'root',
group => 'root',
replace => true,
mode => '0444',
}
# Disable inactive admins
user::virtual::disable { 'oubiwann': }
user::virtual::disable { 'rockstar': }

View File

@@ -67,6 +67,7 @@
- host_vars/etherpad01.opendev.org.yaml
- host_vars/letsencrypt01.opendev.org.yaml
- host_vars/letsencrypt02.opendev.org.yaml
- host_vars/lists.openstack.org.yaml
- host_vars/gitea99.opendev.org.yaml
- host_vars/mirror01.openafs.provider.opendev.org.yaml
- host_vars/mirror-update01.opendev.org.yaml

View File

@@ -0,0 +1 @@
listpassword: notarealpassword