Jeremy Stanley 79103e1a35 Update our Gitea robots.txt from gitea.com's
We've experienced some runaway growth of Gitea archive cache files
on one of our backends, which according to upstream is often caused
by web crawlers indexing the archive URLs. They recommended updating
our robots.txt to the current state of https://gitea.com/robots.txt
in order to help mitigate the issue.

I've left commented out everything we had expressly commented out
before, along with anything that looks similar, on the assumption
that the same reasons still apply.

After some discussion on IRC, we also decided it would make sense to
disallow /avatars and /user/* as gitea.com does.
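
As a rough way to sanity-check rules like the /avatars and /user/*
disallows, the sketch below feeds a small sample robots.txt to Python's
standard urllib.robotparser. The sample rules, the is_allowed helper and
the "ExampleBot" agent name are all hypothetical and are not taken from
the actual file in this change. The stdlib parser matches paths by plain
prefix and does not expand "*", so the sample uses "/user/" in place of
"/user/*".

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration only; not the real file contents.
# "/user/" stands in for "/user/*" because the stdlib parser does
# prefix matching and does not treat "*" in a path as a wildcard.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /avatars
Disallow: /user/
"""


def is_allowed(path, agent="ExampleBot"):
    """Return True if the sample rules let `agent` fetch `path`."""
    parser = RobotFileParser()
    parser.parse(SAMPLE_ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    # Print the verdict for a few example paths (paths are made up).
    for path in ("/avatars/abc123", "/user/settings", "/some-org/some-repo"):
        print(path, "allowed" if is_allowed(path) else "disallowed")
```

Real crawlers may interpret wildcards differently from the stdlib
parser, so this only checks the prefix-style rules.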

Change-Id: I2b43b89de08c9a9d170e1ecbd14b1e6336fd2c84
2024-01-05 17:14:20 +00:00