2021-04-04T14:10:44 cboltz: would you point 1 of the old mailing list urls to mailman3?
2021-04-04T14:10:57 I will do testing of rewriting with nginx
2021-04-04T14:11:14 (I don't care which one tbh)
2021-04-04T14:11:36 I guess opensuse-test is a good candidate ;-)
2021-04-04T14:11:44 yeah
2021-04-04T14:12:41 I need to find some space on this vm because this file has over 300M and we barely manage to fit with the existing stuff
2021-04-04T14:13:00 haproxy config updated
2021-04-04T14:13:17 thank you!
2021-04-04T14:13:27 sounds like you should request a bigger disk
2021-04-04T14:14:12 we do have 120GB disk in addition to the base one
2021-04-04T14:14:20 but that's taken up by xapian index
2021-04-04T14:43:49 during normal workhours it is possible to resize any disks or add more without downtime
2021-04-04T15:21:03 cboltz: https://lists.opensuse.org/opensuse-test/2006-07/msg00000.html
2021-04-04T15:21:47 so redirects from the mails are covered, now the lists
2021-04-04T15:21:56 :-)
2021-04-04T15:24:27 nginx isn't particularly happy about this tbh, it doesn't like big hashmaps like this >:D
2021-04-04T15:24:53 I'm not too surprised ;-)
2021-04-04T15:25:13 what exactly does "not particularly happy" mean in practice?
2021-04-04T15:27:50 it means I had to bump the default settings 16 or so times before it stopped complaining
2021-04-04T15:29:24 ;-)
2021-04-04T15:30:19 also restarting nginx kills mailman webui
2021-04-04T15:30:40 oh, "great"
2021-04-04T15:30:59 why is that?
2021-04-04T15:31:22 are you running it within nginx' python support?
2021-04-04T15:35:05 it's its own process so it's through uwsgi_pass
2021-04-04T15:35:18 yeah, we're using nginx's uwsgi runner
2021-04-04T15:35:48 (using uwsgi_pass means nginx is the controller process, as opposed to how pagure runs with simple proxy pass to gunicorn)
2021-04-04T15:48:44 i think we package the external uwsgi runners too
2021-04-04T15:49:05 it is even in the distro
2021-04-04T15:49:08 JFYI
2021-04-04T15:49:42 uwsgi-python3
2021-04-04T15:50:03 yeah, I'm not sure why mailman is configured to use uwsgi through nginx itself
2021-04-04T15:50:30 it also makes mailman slower, because the worker pool is in nginx rather than separately
2021-04-04T15:59:33 I mean, all the config is in salt if you want to modify it
2021-04-04T16:44:02 lcp: I'm not sure if your lists.rewritemap is correct, especially (?<month>/[0-9]{2})
2021-04-04T16:44:15 the old archives use YYYY-MM, not YYYY/MM
2021-04-04T16:44:26 uh, you are right
2021-04-04T16:45:00 I am more worried about https://lists.opensuse.org/opensuse-test/ redirecting to users@l.o.o?
2021-04-04T16:45:06 how? why?
2021-04-04T16:46:03 wild guess: add a $ to your regexes
2021-04-04T16:46:48 that's probably a good idea
2021-04-04T16:47:48 I don't exactly know where to add $ tbh, because I also wanna capture the random html files from mhonarc
2021-04-04T16:49:35 how about `"~/opensuse-test(?<year>/[0-9]{4})?-?(?<month>[0-9]{2})?.*?$" "/archives/list/test@lists.opensuse.org$year/$month";`
2021-04-04T16:50:55 .*$ is the same as not having $ ;-)
2021-04-04T16:51:43 that's true
2021-04-04T16:52:04 I could do .*?(.html)?$ instead I guess?
2021-04-04T16:53:07 maybe I should do more than 1 file for redirects, this is not gonna be very readable
2021-04-04T16:53:31 that's still all optional, and would allow .* _or_ .html
2021-04-04T16:53:48 maybe enforce a / after the date, or use (/.*)? ?
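Editor's note on the 16:50:55 remark that `.*$` is the same as having no `$`: a minimal sketch, assuming Python's `re` module is a close enough stand-in for the PCRE engine nginx uses for `map` regexes. The three sample paths are illustrative; the third one is hypothetical and only there to show an unwanted prefix match.

```python
import re

# Python's `re` stands in for nginx's PCRE here (an assumption for illustration).
unanchored = re.compile(r"/opensuse-test")
dot_star_anchored = re.compile(r"/opensuse-test.*$")
# A stricter variant: only an optional trailing slash may follow the list name.
strict = re.compile(r"/opensuse-test/?$")

paths = [
    "/opensuse-test/",
    "/opensuse-test/2006-07/msg00000.html",
    "/opensuse-testing/anything",  # hypothetical path, not from the log
]


def matching(pattern, candidates):
    """Return the candidates the pattern matches anywhere (like an unanchored map regex)."""
    return [p for p in candidates if pattern.search(p)]


# `.*` swallows everything up to the end, so the trailing `$` never rejects anything.
same_set = matching(unanchored, paths) == matching(dot_star_anchored, paths)
strict_set = matching(strict, paths)
print(same_set, strict_set)
```

The `$` only starts to constrain the match once the part before it can no longer absorb arbitrary text, which is where the discussion heads next.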
2021-04-04T16:54:32 I will just go on baloo and check what those html files were
2021-04-04T16:56:10 `(|threads.html|date.html|all.html|author.html)`
2021-04-04T16:56:50 nitpicking: \.html ;-)
2021-04-04T16:57:02 good call
2021-04-04T16:57:19 well, I doubt there would be accidental matches for the . in this case
2021-04-04T16:58:11 `"~/opensuse-test(?<year>/[0-9]{4})?-?(?<month>[0-9]{2})?/(|threads\.html|date\.html|all\.html|author\.html)$" "/archives/list/test@lists.opensuse.org$year/$month";`
2021-04-04T16:59:01 looks good
2021-04-04T17:03:06 curl tells me it works, firefox has the redirect cached which makes sense considering it's 301
2021-04-04T17:03:39 or is 302 permanent
2021-04-04T17:03:44 it doesn't matter >:D
2021-04-04T17:04:04 301 is permanent
2021-04-04T17:04:11 ah
2021-04-04T17:04:30 there are technically 2 redirects missing though
2021-04-04T17:04:40 /stats and /search
2021-04-04T17:05:59 /search already goes 404, are you sure about that?
2021-04-04T17:06:10 for /stats, I'd redirect that to /
2021-04-04T17:06:45 it was cgi-bin/search maybe?
2021-04-04T17:07:11 /cgi-bin/search.cgi
2021-04-04T17:07:20 right
2021-04-04T17:07:41 but that requires rewriting parameters >:D
2021-04-04T17:08:01 I'd just redirect it to /
2021-04-04T17:08:06 maybe ignore the parameters and just redirect to search
2021-04-04T17:08:45 no objections ;-)
2021-04-04T17:08:45 why not /archives/search
2021-04-04T17:09:34 https://lists.opensuse.org/archives/search -> "Sorry but your query looks empty."
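Editor's note on the regex agreed at 16:58:11: a rough sketch of how that pattern maps old archive paths to new ones, assuming Python's `re` behaves like nginx's PCRE for this pattern. Python spells named groups `(?P<name>...)`, and the group names `year` and `month` are inferred from the `$year/$month` replacement. `redirect_target` is a hypothetical helper, not part of the actual configuration.

```python
import re

# Python translation of the map regex from the log; only the named-group
# syntax differs from the nginx original.
OLD_ARCHIVE = re.compile(
    r"/opensuse-test(?P<year>/[0-9]{4})?-?(?P<month>[0-9]{2})?"
    r"/(|threads\.html|date\.html|all\.html|author\.html)$"
)
TARGET_PREFIX = "/archives/list/test@lists.opensuse.org"


def redirect_target(path):
    """Return the new archive path for an old list or month index, else None."""
    m = OLD_ARCHIVE.search(path)
    if m is None:
        return None
    # Groups that did not participate are None in Python; treat them as empty,
    # which is how an unset variable would expand in the replacement string.
    year = m.group("year") or ""
    month = m.group("month") or ""
    return f"{TARGET_PREFIX}{year}/{month}"


print(redirect_target("/opensuse-test/"))
print(redirect_target("/opensuse-test/2006-07/threads.html"))
print(redirect_target("/opensuse-test/2006-07/msg00000.html"))
```

Individual message URLs such as `msg00000.html` deliberately fall through here, since the log says those are already covered by the separate per-mail redirects.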
2021-04-04T17:09:40 which is not really surprising
2021-04-04T17:09:52 because it is
2021-04-04T17:09:59 yes ;-)
2021-04-04T17:10:20 now the question is if it makes sense to redirect people to this error message, or if / would be more helpful
2021-04-04T17:16:26 just checked the access log - /cgi-bin/search has been accessed 260 times in March
2021-04-04T17:16:48 150 of them have a lists.o.o referrer, and 100 have an empty referrer
2021-04-04T17:17:09 so I guess when the redirects for all lists are in place, /cgi-bin/search won't see many accesses anymore
2021-04-04T17:18:00 where did the last 10 come from
2021-04-04T17:20:51 something like https://lists.opensuse.org/opensuse-factory/ - with various lists, sometimes also from a specific message
2021-04-04T17:21:11 so basically people use the search field on the old archive pages
2021-04-04T17:21:15 ah
2021-04-04T17:21:45 they can use the search engine on the new archive pages too >:D
2021-04-04T17:21:59 right
2021-04-04T17:22:10 oh
2021-04-04T17:22:21 I forgot about redirects to the archives
2021-04-04T17:22:31 as in mboxes
2021-04-04T17:22:47 should be easy to add ;-)
2021-04-04T17:22:49 and rss feed
2021-04-04T17:23:04 to that regex? >:D
2021-04-04T17:23:09 I doubt it
2021-04-04T17:23:39 you can make it a separate set of regexes if that's easier ;-)
2021-04-04T17:24:07 also just wondering - would it make sense to make the trailing / optional?
2021-04-04T17:24:18 this sounds like you want me to actually try using the current regex >:D
2021-04-04T17:24:39 in the source of the redirect?
2021-04-04T17:27:04 I don't think so? since the current archives enforce trailing slash and otherwise 404 out
2021-04-04T17:27:39 https://lists.opensuse.org/opensuse-project - apache will redirect you to https://lists.opensuse.org/opensuse-project/
2021-04-04T17:27:54 oh, I guess it depends then
2021-04-04T17:27:54 I guess replacing /(|threads\.html|date\.html|all\.html|author\.html)$ with (|/|/threads\.html|/date\.html|/all\.html|/author\.html)$ should work
2021-04-04T17:29:28 it's probably the usual directory redirect
2021-04-04T17:30:14 yeah, maybe
2021-04-04T17:52:05 did `"/opensuse-test/opensuse-test-(?<year>/[0-9]{4})-(?<month>[0-9]{2})\.mbox(\.gz)?" "/test@lists.opensuse.org/export/test@lists.opensuse.org-$year-$month\.mbox\.gz";` and `"/opensuse-test/mailinglist.rss" "/archives/list/test@lists.opensuse.org/feed/";`
2021-04-04T18:10:39 for mbox, / should be just
2021-04-04T18:12:36 and unless you add a $, (\.gz)? is superfluous ;-)
2021-04-04T18:24:37 I should add $ tbh
2021-04-04T18:25:55 cboltz: I managed to crush ci with the size of this mr in gitlab
2021-04-04T18:34:37 I guess nobody expected a 340 MB config file ;-)
2021-04-04T18:38:29 it's not even finished yet though >:D
2021-04-04T18:38:48 we are still missing around 2400 mails there
2021-04-04T18:38:55 just wondering - does it really make sense to have a file of this size in git?
2021-04-04T18:39:11 or would packaging it (and limiting salt to touch'ing it) be better?
2021-04-04T18:40:25 good question
2021-04-04T19:09:12 the problem I see is that this file will make our git repo much bigger, and that it will make the CI much slower (or completely break it)
2021-04-04T19:10:00 so even if managing this file outside git is in theory worse, it's probably better in practice
2021-04-04T19:48:35 lcp: how did you manage to cut back the disk space consumed on mailman3?
2021-04-04T19:49:50 38% is way better than 98% :-)
2021-04-04T19:51:15 I don't really know? I didn't touch xapian at all
2021-04-04T19:51:52 honestly, we should try running index per list or for a selected period of time so it's not doing everything at once
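Editor's note, going back to the 18:12:36 remark that `(\.gz)?` is superfluous without a `$`: a small sketch, again assuming Python's `re` as a stand-in for nginx's PCRE. The date part is simplified to plain digit classes instead of the named groups from the log, and the `.mbox.bak` path is a hypothetical example of a request that should not match.

```python
import re

# Simplified source pattern for the mbox redirect (assumption: no named groups).
BASE = r"/opensuse-test/opensuse-test-[0-9]{4}-[0-9]{2}\.mbox"

with_optional_gz = re.compile(BASE + r"(\.gz)?")
without_gz = re.compile(BASE)
anchored = re.compile(BASE + r"(\.gz)?$")

paths = [
    "/opensuse-test/opensuse-test-2006-07.mbox",
    "/opensuse-test/opensuse-test-2006-07.mbox.gz",
    "/opensuse-test/opensuse-test-2006-07.mbox.bak",  # hypothetical, should not redirect
]


def accepted(pattern):
    """Return the sample paths the pattern matches anywhere in the string."""
    return [p for p in paths if pattern.search(p)]


# Without `$`, the optional suffix changes nothing: both patterns accept all three paths.
print(accepted(with_optional_gz) == accepted(without_gz))
# With `$`, only `.mbox` and `.mbox.gz` are accepted.
print(accepted(anchored))
```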