2019-10-29T00:05:34 *** malcolmlewis has joined #opensuse-admin 2019-10-29T02:20:25 *** boombatower has quit IRC (Quit: Konversation terminated!) 2019-10-29T03:46:50 *** okurz_ has joined #opensuse-admin 2019-10-29T03:48:37 *** okurz has quit IRC (Ping timeout: 240 seconds) 2019-10-29T03:48:37 *** okurz_ is now known as okurz 2019-10-29T06:55:47 *** maxlin has joined #opensuse-admin 2019-10-29T07:17:44 *** marxin has quit IRC (Quit: Leaving) 2019-10-29T07:23:14 *** marxin has joined #opensuse-admin 2019-10-29T07:30:24 *** moozaad has joined #opensuse-admin 2019-10-29T07:48:47 *** petracvv has quit IRC (Ping timeout: 268 seconds) 2019-10-29T07:50:32 *** petracvv has joined #opensuse-admin 2019-10-29T08:17:10 *** jadamek has joined #opensuse-admin 2019-10-29T09:28:25 *** ldevulder has quit IRC (Quit: Leaving) 2019-10-29T10:00:39 *** petracvv has quit IRC (Ping timeout: 264 seconds) 2019-10-29T10:02:10 *** petracvv has joined #opensuse-admin 2019-10-29T10:17:49 *** ldevulder has joined #opensuse-admin 2019-10-29T10:29:02 *** ldevulder has quit IRC (Quit: Leaving) 2019-10-29T11:08:39 *** ldevulder has joined #opensuse-admin 2019-10-29T12:31:25 *** boombatower has joined #opensuse-admin 2019-10-29T13:30:08 *** boombatower has quit IRC (Remote host closed the connection) 2019-10-29T13:30:30 *** boombatower has joined #opensuse-admin 2019-10-29T14:02:37 *** boombatower has quit IRC (Remote host closed the connection) 2019-10-29T14:03:32 *** boombatower has joined #opensuse-admin 2019-10-29T17:46:17 *** a-865k has quit IRC (Ping timeout: 240 seconds) 2019-10-29T17:58:14 *** cboltz has joined #opensuse-admin 2019-10-29T19:42:11 progress.o.o is down - does someone know what's going on, and how to fix it? 2019-10-29T19:43:36 it looks like there's no redmine process running 2019-10-29T19:44:05 rcredmine restart results in a green "done", but still no redmine process 2019-10-29T20:18:01 *** moozaad has quit IRC (Quit: Konversation terminated!) 2019-10-29T20:38:27 Hey cboltz 2019-10-29T20:38:38 did anyone touched this machine for maintenance or anything like that? 2019-10-29T20:38:49 kbabioch: maybe you? 2019-10-29T20:39:14 nope, i didn't do anything to this machine :-) 2019-10-29T20:42:24 cboltz: just to be sure I am in the correct machine, it does have this IP addr 192.168.47.8 ? 2019-10-29T20:45:26 hummm I dont think so 2019-10-29T20:50:56 no, it's redmine.infra.o.o with IP 192.168.47.29 2019-10-29T20:51:17 .8 is the _future_ progress.o.o 2019-10-29T20:51:55 I am not on sudoers file :-( 2019-10-29T20:52:00 can you restart sssd ? 2019-10-29T20:52:23 doesn't help, it's an old SLE 11 2019-10-29T20:52:32 it is not salted? 2019-10-29T20:52:40 ow, let me see if I have the password in teampass 2019-10-29T20:52:48 it's in the heroes pass 2019-10-29T20:53:03 and "su" works ;-) 2019-10-29T20:55:11 last time someone touched this machine was 30/09 2019-10-29T20:55:25 is there any chance that this redmine is not working since then? 2019-10-29T20:56:20 it serves progress.o.o which worked until (at least) yesterday 2019-10-29T20:57:11 humm.. ok 2019-10-29T20:58:13 it has the abuild-online-update package installed 2019-10-29T20:58:36 *maybe* it did an upgrade last night and because of that it is not working? 2019-10-29T20:59:35 we have something from today on /var/log/zypper.log 2019-10-29T21:01:10 do you agree with me that the first thing we should do is remove abuild-online-update from this machine? (Karol and I have removed from mostly all machines already) 2019-10-29T21:26:31 progress.o.o definitely worked still today. Many teams are relying on it, I guess it's down no longer than 4h 2019-10-29T21:28:36 Would be awesome if someone could look into it 2019-10-29T21:38:38 klein: I would not know why you should remove the auto-update. I don't believe in manual updating if this is what you would propose :) anyway, "the first thing to do" is to fix that the redmine instance is down 2019-10-29T21:44:10 ok, I dont like to have auto-updates on things that can breake like this, I had a look in to the server, and I found nothing wrong with the machine itself 2019-10-29T21:44:20 and, sadly I have no experience with redmine 2019-10-29T21:46:15 Same as with all other server systems, take a look in logfiles :) sorry, can't take a look myself right now 2019-10-29T21:47:38 *** cboltz has quit IRC (Ping timeout: 240 seconds) 2019-10-29T21:48:56 *** cboltz_ has joined #opensuse-admin 2019-10-29T21:51:01 we have no redmine or unicorn process runing, but there are files in /srv/www/vhosts/redmine/tmp/pids 2019-10-29T21:52:33 do these files change if you (try to) restart redmine? 2019-10-29T21:55:47 hmm, given the timestamps, at least unicorn.pid gets updated (it's just 4 minutes old) 2019-10-29T21:56:47 I tried to restart the service 2019-10-29T21:57:11 and yes, it creates a new pid file, but the process probably dies after start 2019-10-29T21:57:29 yes, looks so 2019-10-29T21:57:41 any idea if it writes a useful log anywhere? 2019-10-29T21:58:11 in /var/log/redmine/unicorn.log, it says "check stderr" 2019-10-29T21:58:19 but I have no idea where stderr ends up 2019-10-29T21:59:47 yeah, I saw that, and tried to run what init.d runs: startproc -u redmine -g redmine -l /var/log/redmine/unicorn.log /usr/bin/unicorn_rails -c /srv/www/vhosts/redmine/config/unicorn.rb -E production -D 2019-10-29T21:59:59 but got nothing 2019-10-29T22:00:34 let me try running "/usr/bin/unicorn_rails -c /srv/www/vhosts/redmine/config/unicorn.rb -E production -D" as user "redmine" to see what happens 2019-10-29T22:00:52 redmine@redmine:~> /usr/bin/unicorn_rails -c /srv/www/vhosts/redmine/config/unicorn.rb -E production -D 2019-10-29T22:00:55 master failed to start, check stderr log for details 2019-10-29T22:01:04 great... very usefull :-( 2019-10-29T22:01:13 indeed :-/ 2019-10-29T22:01:54 hah... without -D it shows something 2019-10-29T22:02:25 big ruby stack trace, do we have any paste on opensuse ? 2019-10-29T22:02:33 *pastebin 2019-10-29T22:02:34 yes, paste.o.o 2019-10-29T22:03:20 https://paste.opensuse.org/b3f5b14d 2019-10-29T22:03:25 as a sidenote - I can recommend a printout of the opensuse.org zone as bed lecture ;-) 2019-10-29T22:04:07 good idea 2019-10-29T22:04:21 anyway, I can see mysql connection errors, maybe that is our problem? 2019-10-29T22:05:12 yes, looks like a ssl problem with it 2019-10-29T22:05:36 expired certificate on MySQL server maybe? 2019-10-29T22:05:41 do you know if there was an upgrade on the mysql server(s)? 2019-10-29T22:06:59 no idea, need to search wich one redmine is using 2019-10-29T22:07:49 mysql.infra.opensuse.org (if I get the config file right) 2019-10-29T22:08:08 and there's also a mention of ssl certificates 2019-10-29T22:08:36 yeah, I was reading the same config file :-) 2019-10-29T22:09:04 maybe we can change the config file to use mysql without SSL and see what happens? 2019-10-29T22:09:52 mysql runs on anna? 2019-10-29T22:10:13 anna is the proxy, and forwards it to somewhere[tm] 2019-10-29T22:10:25 openssl x509 -in /dev/stdin -text -noout < /etc/redmine/ssl/redmine.infra.opensuse.org.crt 2019-10-29T22:10:34 Not After : Nov 1 12:50:38 2019 GMT 2019-10-29T22:11:05 so it's close to the expire date, but not there yet 2019-10-29T22:11:30 so if there is any cert prolem, it is on anna, because it is where redmine is connecting, right? 2019-10-29T22:13:24 AFAIK anna does the forwarding with haproxy - I'm not sure if it does SSL termination or blindly forwards whatever it gets 2019-10-29T22:18:59 *** jadamek has quit IRC (Quit: Leaving) 2019-10-29T22:19:00 I don't see anything certificate-related in the mysql section of haproxy.cfg, therefore it probably just forwards whatever comes in 2019-10-29T22:21:41 BTW: you mentioned zypper.log has something from today, but /v/l/zypp/history has the last change 2019-10-08 2019-10-29T22:34:32 I just tried with the "mysql" command (which needs an interesting[tm] command line when using client certs) 2019-10-29T22:34:44 it says ERROR 2026 (HY000): SSL connection error: error:00000001:lib(0):func(0):reason(1) 2019-10-29T22:35:50 BTW: use screen -x if you want to see what I tried ;-) 2019-10-29T22:38:27 "at least" we now know that mysql is the problem 2019-10-29T22:40:05 searching for the error message points to https://bugs.mysql.com/bug.php?id=75311 2019-10-29T22:40:52 which lists cipher suites mismatch as a possible reason, and a revoked certificate as another option 2019-10-29T22:41:48 sorry, I lost my connection 2019-10-29T22:42:15 ok, so, can I talk with karol in the morning and maybe solve this? He might know more than us about it 2019-10-29T22:42:43 mysql.infra.o.o gets forwarded to 172.16.42.4 (tarzan?) which AFAIK is a mysql cluster running in the SUSE network 2019-10-29T22:42:52 (which also means I don't have access to it) 2019-10-29T22:43:35 I'd guess someone updated it, and now it's incompatible with the old SLE11 openssl we have on the redmine server 2019-10-29T22:43:46 yeah... thats where we need Karol :-)... I may have access but, in 1,5 month in SUSE I haven't touched everything (yet) 2019-10-29T22:44:48 I'd be surprised if you touched everything in the next 6 months ;-) 2019-10-29T22:45:39 so talking to kbabioch sounds like a good idea 2019-10-29T22:46:55 and yes, we had an upgrade on the mysql/mariadb/galera/whatever cluster last week 2019-10-29T22:47:43 that update might explain it, but then - why did progress.o.o survive until today? 2019-10-29T22:47:59 have no freakin idea :-) 2019-10-29T22:48:12 and, about the 6months, is that a challenge? hahaha 2019-10-29T22:48:17 do you know if someone worked on the mysql config yesterday or today? 2019-10-29T22:48:36 maybe someone thought "oh, we shouldn't allow SSLv3 anymore"? 2019-10-29T22:48:56 I don't think anyone has touched it, but I can ask tomorrow for everyone in the EngInfra Team 2019-10-29T22:50:27 please tell me what you find out ;-) 2019-10-29T22:52:12 sure... and for today, thats all folks ;-) 2019-10-29T22:52:41 good night ;-) 2019-10-29T23:24:08 *** cboltz_ has quit IRC ()