2018-01-03T00:24:53 *** cboltz has quit IRC 2018-01-03T01:06:19 *** katnip has quit IRC 2018-01-03T01:06:19 *** katnip has joined #opensuse-admin 2018-01-03T01:29:11 *** fvogt has quit IRC 2018-01-03T02:07:29 *** plinnell has quit IRC 2018-01-03T02:17:30 *** plinnell has joined #opensuse-admin 2018-01-03T03:01:30 *** Son_Goku has quit IRC 2018-01-03T03:40:27 *** okurz has quit IRC 2018-01-03T03:42:50 *** okurz has joined #opensuse-admin 2018-01-03T03:44:41 *** nicolasbock has quit IRC 2018-01-03T04:03:27 *** Son_Goku has joined #opensuse-admin 2018-01-03T05:16:42 *** tigerfoot has quit IRC 2018-01-03T05:57:54 *** plinnell has quit IRC 2018-01-03T06:00:56 *** plinnell has joined #opensuse-admin 2018-01-03T08:15:33 *** kollex has joined #opensuse-admin 2018-01-03T08:22:57 *** ldevulder has quit IRC 2018-01-03T08:33:44 *** ldevulder has joined #opensuse-admin 2018-01-03T08:50:10 *** sven15 has joined #opensuse-admin 2018-01-03T08:54:29 *** asmorodskyi has joined #opensuse-admin 2018-01-03T09:00:25 *** mcaj has joined #opensuse-admin 2018-01-03T10:00:10 tampakrap: Hi . So I am able to reach monitor.infra.o.o but I need sudo there or supybot user password ( I mean linux user ) 2018-01-03T10:12:05 asmorodskyi: try sudo su - supybot 2018-01-03T10:12:10 and use your freeipa password 2018-01-03T10:13:16 tampakrap: ah this part was missing for me . so I should use my freeipa password 2018-01-03T10:13:42 as I connected via ssh key I was not sure which password I have there ... 2018-01-03T10:16:46 asmorodskyi: I just set up the sudo rule, so it wouldn't work either way :) 2018-01-03T10:16:54 let me know if it works please so I can submit it also to salt 2018-01-03T10:20:17 tampakrap: I am in 2018-01-03T10:22:32 *** heroes-bot has joined #opensuse-admin 2018-01-03T10:23:37 *** heroes-bot has joined #opensuse-admin 2018-01-03T10:24:07 good 2018-01-03T10:25:02 tampakrap: funny to add user I need to shutdown supybot , but I can't :) looks like some demon starting new instance everytime I am killing current one :\ 2018-01-03T10:27:54 https://gitlab.infra.opensuse.org/infra/salt/merge_requests/135 2018-01-03T10:30:34 tampakrap: any idea how to shutdown bot for a while ? 2018-01-03T10:31:55 you need sudo access to the service commands I suppose 2018-01-03T10:34:17 asmorodskyi: sudo rcsupybot {status,restart,stop} 2018-01-03T10:37:13 tampakrap: Sorry, user asmorodskyi is not allowed to execute '/usr/sbin/rcsupybot status' as root on monitor 2018-01-03T10:37:49 try again, it was a typo 2018-01-03T10:38:19 tampakrap: now it works 2018-01-03T10:40:02 tampakrap: but sudo rcsupybot stop does not have any effect 2018-01-03T10:41:01 any error message? 2018-01-03T10:42:04 tampakrap: output is "redirecting to systemctl stop supybot.service" 2018-01-03T10:42:42 but I have no permission to execute "sudo systemctl stop supybot.service" so might be permission issue still 2018-01-03T10:42:44 that's it? 2018-01-03T10:42:53 yap 2018-01-03T10:42:57 okay let's try 2018-01-03T10:43:15 sudo rcsupybot stop 2018-01-03T10:43:17 now you do 2018-01-03T10:43:17 redirecting to systemctl stop supybot.service 2018-01-03T10:43:19 asmorodskyi@monitor:/home/asmorodskyi> sudo systemctl stop supybot.service 2018-01-03T10:43:21 Sorry, user asmorodskyi is not allowed to execute '/usr/bin/systemctl stop supybot.service' as root on monitor. 2018-01-03T10:44:46 sudo rcsupybot status 2018-01-03T10:44:48 Checking for service SUPYBOT running 2018-01-03T10:44:50 ● supybot.service - LSB: Supybot daemon 2018-01-03T10:44:52 Loaded: loaded (/etc/init.d/supybot; bad; vendor preset: disabled) 2018-01-03T10:44:54 Active: inactive (dead) 2018-01-03T10:44:59 but process is still running in a fact 2018-01-03T10:45:25 I suspect that bot was started not in a usual way ... 2018-01-03T10:46:13 maybe 2018-01-03T10:46:18 try to kill the running process 2018-01-03T10:46:30 I did it 2018-01-03T10:46:47 but it get back in same second 2018-01-03T10:46:52 with another PID 2018-01-03T10:47:08 so something keep it alive 2018-01-03T10:49:47 *** heroes-bot has joined #opensuse-admin 2018-01-03T10:49:50 ps aux | grep supybot 2018-01-03T10:49:52 supybot 19556 0.7 0.9 233212 55004 ? Ssl 10:47 0:00 /usr/bin/python /usr/bin/supybot /home/supybot/supybot/heroes-bot.conf 2018-01-03T10:49:54 root 20501 0.0 0.1 129348 7732 pts/1 S 10:48 0:00 sudo su - supybot 2018-01-03T10:49:56 root 20502 0.0 0.0 106920 5340 pts/1 S 10:48 0:00 su - supybot 2018-01-03T10:49:58 supybot 20503 0.3 0.1 24512 6564 pts/1 S 10:48 0:00 -bash 2018-01-03T10:50:00 supybot 20654 0.0 0.0 55472 4056 pts/1 R+ 10:49 0:00 ps aux 2018-01-03T10:50:02 supybot 20655 0.0 0.0 10540 1632 pts/1 S+ 10:49 0:00 grep --color=auto supybot 2018-01-03T10:50:04 supybot@monitor:~> kill -9 19556 2018-01-03T10:50:06 supybot@monitor:~> ps aux | grep supybot 2018-01-03T10:50:08 root 20501 0.0 0.1 129348 7732 pts/1 S 10:48 0:00 sudo su - supybot 2018-01-03T10:50:10 root 20502 0.0 0.0 106920 5340 pts/1 S 10:48 0:00 su - supybot 2018-01-03T10:50:12 supybot 20503 0.1 0.1 24512 6572 pts/1 S 10:48 0:00 -bash 2018-01-03T10:50:14 supybot 20971 34.7 0.9 232952 54716 ? Ssl 10:49 0:00 /usr/bin/python /usr/bin/supybot /home/supybot/supybot/heroes-bot.conf 2018-01-03T10:50:16 supybot 20994 0.0 0.0 55472 4168 pts/1 R+ 10:49 0:00 ps aux 2018-01-03T10:50:18 supybot 20995 0.0 0.0 10540 1628 pts/1 S+ 10:49 0:00 grep --color=auto supybot 2018-01-03T10:54:32 yep I see it as well 2018-01-03T10:55:47 *** heroes-bot has joined #opensuse-admin 2018-01-03T10:57:28 systemctl says it is off now 2018-01-03T10:57:43 although the process is still running 2018-01-03T11:01:09 so we have a real hero bot , it servives everything including "kill -9" :) 2018-01-03T11:11:01 asmorodskyi: I could reboot the machine with the supybot service not enabled and see what happens 2018-01-03T11:11:04 sounds fine? 2018-01-03T11:19:15 *** cthugha has joined #opensuse-admin 2018-01-03T11:20:05 *** ldevulder has quit IRC 2018-01-03T11:29:57 *** cboltz has joined #opensuse-admin 2018-01-03T11:29:58 *** cboltz has joined #opensuse-admin 2018-01-03T11:36:13 *** nicolasbock has joined #opensuse-admin 2018-01-03T11:40:35 sorry I was at lunch 2018-01-03T11:40:48 yes , sounds fine 2018-01-03T11:42:32 *** heroes-bot has joined #opensuse-admin 2018-01-03T11:43:14 asmorodskyi: rebooted, the service is still there 2018-01-03T11:44:59 I mean the process 2018-01-03T11:47:07 remove execution flag from /usr/bin/supybot ? 2018-01-03T11:47:14 and kill process again 2018-01-03T11:48:08 tampakrap: can I try ? 2018-01-03T11:49:52 sure 2018-01-03T11:51:07 tampakrap: file owned by root , can you please try ? 2018-01-03T11:52:15 *** cthugha has quit IRC 2018-01-03T14:37:23 *** heroes-bot has joined #opensuse-admin 2018-01-03T14:37:57 tampakrap: ah , restart solve issue :) 2018-01-03T14:42:24 tampakrap: and finally bot recognizes me !!! yahoo ! 2018-01-03T14:45:30 *** asmorodskyi_ has joined #opensuse-admin 2018-01-03T14:46:48 so it really needed to run without the --daemon 2018-01-03T14:46:56 *** asmorodskyi_ has joined #opensuse-admin 2018-01-03T15:15:51 cboltz: so I'll reject all the fate tickets then 2018-01-03T15:23:18 I can do that - better spend the time to get me started with breaking haproxy ;-) 2018-01-03T15:24:08 well the config is huge but it is quite straightforward 2018-01-03T15:24:34 global and defaults sections are clear 2018-01-03T15:24:54 the listen * sections are various interfaces that we listen to, to which ports etc 2018-01-03T15:25:09 and it also specifies where is the ssl cert for the ssl ports 2018-01-03T15:25:34 let's start with a boring question - haporxy hosts are anna and elsa AFAIK 2018-01-03T15:25:38 yes 2018-01-03T15:25:41 on which one do I need to change the config? 2018-01-03T15:25:57 the configs are synced via csync2 2018-01-03T15:26:06 they are not in salt and they should be :) 2018-01-03T15:26:15 so whichever you change, it will get synced to the other 2018-01-03T15:26:25 ok 2018-01-03T15:26:32 but to see the changes, you need to edit the one that has the vrrp 2018-01-03T15:26:48 you can see the VRRPs on the keepalived config 2018-01-03T15:27:06 and you can see which machine has the ips right now with the usual "ip a" command 2018-01-03T15:27:35 if you check the keepalive configs, you'll see that anna is master and elsa is slave, so chances are that you need to touch anna 2018-01-03T15:27:37 clear so far? 2018-01-03T15:27:41 did I already mention that I know nothing about keepalived? ;-) 2018-01-03T15:27:50 so - what exactly do I need to check? 2018-01-03T15:28:15 so keepalived is doing a ping let's say between two machines 2018-01-03T15:28:32 and it says that the one with the biggest priority will take the listed virtual IPs 2018-01-03T15:28:50 so anna is master and has the virtual IPs 2018-01-03T15:29:02 if anna is down, then elsa will take over the virtual IPs 2018-01-03T15:29:05 clear? 2018-01-03T15:29:48 yes 2018-01-03T15:30:03 on the keepalive config you can see who is the master, and which are the virtual IPs 2018-01-03T15:30:20 ah, found it :-) 2018-01-03T15:30:30 this one is also in salt 2018-01-03T15:31:06 yes, I notied the "managed by salt" headers in keepalived.conf 2018-01-03T15:31:32 okay go to the haproxy config now 2018-01-03T15:31:49 ok, I'm in /etc/haproxy on anna 2018-01-03T15:32:16 check the sections that I explained already 2018-01-03T15:32:18 looks like there's a .git, and a quite big "git diff" 2018-01-03T15:32:56 this git was probably done by me when we were setting up the provo haproxy 2018-01-03T15:32:58 I'll remove it 2018-01-03T15:33:19 ah no it existed from before 2018-01-03T15:33:49 anyway, I'm removing it 2018-01-03T15:33:50 it's dangerous 2018-01-03T15:34:00 I'd say even a cron.daily that does a git commit -m 'daily haproxy config' could be useful if someone breaks something ;-) 2018-01-03T15:34:11 why do you think it's dangerous? 2018-01-03T15:34:23 i'd prefer to move them to salt instead 2018-01-03T15:34:48 because I tend to do "git checkout ." in wrong windows :) 2018-01-03T15:35:04 ;-) 2018-01-03T15:35:22 so, check the haproxy config 2018-01-03T15:36:30 oh, "only" ~900 lines? nice[tm] 2018-01-03T15:36:46 don't worry, it's straightforward 2018-01-03T15:38:21 the first lesson is probably "skip the first 150 lines, the interesting stuff starts at "frontend http-ext-in"? 2018-01-03T15:38:30 (at least if I don't want to add new IPs) 2018-01-03T15:38:52 so the listen * sections I explained before what they are 2018-01-03T15:39:00 then we have frontend and backend 2018-01-03T15:39:25 the frontend is a bind to an IP address and port 2018-01-03T15:39:33 separate frontend for each interface 2018-01-03T15:40:17 they take the http request and they try to match them to an acl rule 2018-01-03T15:40:19 acl is_static hdr(host) -i static.opensuse.org 2018-01-03T15:40:54 this means that http requests coming to the ext-in interface (incoming connections to the external interface) 2018-01-03T15:41:02 are getting the is_static acl rule 2018-01-03T15:41:04 understood? 2018-01-03T15:41:19 I think so ;-) 2018-01-03T15:41:56 then it assigns each acl rule to a backend 2018-01-03T15:41:58 use_backend static if is_static 2018-01-03T15:42:22 and then you go to the backend section, and you see that the backend consists of one or more servers that take the request 2018-01-03T15:43:10 so it is: interface -> IP -> domain -> acl -> select backend -> select server 2018-01-03T15:43:49 the backend could have more than one server, because in case one is not online, it goes to the other 2018-01-03T15:43:55 looks like enough levels of indirection ;-) 2018-01-03T15:44:29 (there are probably reasons for that, but it still makes things slightly more interesting) 2018-01-03T15:44:48 it can also do http requests to all the backend members every X seconds to a specific URL to see if they reply or not 2018-01-03T15:44:55 as an example: option httpchk HEAD /robots.txt HTTP/1.1\r\nHost:\ conference.opensuse.org 2018-01-03T15:45:37 nice, makes sense to avoid broken servers 2018-01-03T15:45:57 hmm, what about 2018-01-03T15:46:01 redirect code 301 prefix https://cz.opensuse.org if is_dewiki 2018-01-03T15:46:13 "if is_dewiki" doesn't look correct... 2018-01-03T15:46:50 and, more interesting, cz.opensuse.org gives me download.o.o 2018-01-03T15:47:17 it is indeed not correct, I'll leave it as an excercise to you to find the correct one 2018-01-03T15:47:25 it gives you download.o.o because it failed to find a backend 2018-01-03T15:47:33 and it goes to the default, which is download.o.o 2018-01-03T15:47:41 ## fallback 2018-01-03T15:47:43 default_backend download 2018-01-03T15:48:15 now after you do a change, you need to do: haproxy -c -f /etc/haproxy/haproxy.cfg 2018-01-03T15:48:20 to check if the config is correct 2018-01-03T15:48:27 and also only reload, don't restart 2018-01-03T15:48:35 restart will drop existing connections 2018-01-03T15:48:47 we restart only in special cases (eg nothing else worked) 2018-01-03T15:48:51 any objections against putting this command into /etc/haproxy/check_config.sh ? 2018-01-03T15:49:02 no objections 2018-01-03T15:49:13 well, better to put it in /usr/local/bin 2018-01-03T15:49:28 and even better in salt 2018-01-03T15:50:50 well, sometimes I like to have such little helper scripts next to the config files 2018-01-03T15:50:58 that avoids the need to remember the name ;-) 2018-01-03T15:51:17 put a symlink then 2018-01-03T15:51:55 ok, good idea 2018-01-03T15:54:20 redirect code 301 prefix https://cs.opensuse.org if is_mediawiki_cz 2018-01-03T15:54:58 looks much better than redirecting to cz (that's what people already typed), and if is_mediawiki_cz makes more sense than is_dewiki 2018-01-03T16:00:08 looks like the redirects I just created work :-) 2018-01-03T16:00:31 diff -u haproxy.cfg-2018-01-03 haproxy.cfg if you want to check my changes 2018-01-03T16:01:03 checking 2018-01-03T16:01:44 *** asmorodskyi has quit IRC 2018-01-03T16:02:12 they are fine 2018-01-03T16:02:26 :-) 2018-01-03T16:02:48 sounds like I can delete the backup file 2018-01-03T16:04:04 ah, grep is quite useful to understand the config, for example grep '^[a-z]\|wiki_de' haproxy.cfg 2018-01-03T16:04:49 (much better than scrolling up to the section header each time ;-) 2018-01-03T16:05:29 yep 2018-01-03T16:06:08 * cboltz puts that grep into another little script 2018-01-03T16:19:39 how often does csync2 run? 2018-01-03T16:19:46 let's see if I can limit the tickets in 5 pages before I go home 2018-01-03T16:19:50 it looks like elsa still has the old haproxy.cfg 2018-01-03T16:19:56 I don't remember, just put the stuff into salt :) 2018-01-03T16:22:54 I'm afraid that's a quite big task for haproxy (unless we do a simpe file.managed) 2018-01-03T16:23:04 yes 2018-01-03T16:23:29 bonus points for a test that runs haproxy -c -f 2018-01-03T16:25:08 it seems haproxy.service already does this (with an additional -q) in ExecStartPre and ExecReload 2018-01-03T16:25:42 I think it stops the service eitherway 2018-01-03T16:25:48 and I also think it doesn't do that on reload 2018-01-03T16:25:53 right, if you use "restart" you are lost 2018-01-03T16:26:15 for reload it should work in theory - but I'm not too keen to test it in production ;-) 2018-01-03T16:26:25 it doesn't 2018-01-03T16:26:46 test it on elsa 2018-01-03T16:27:35 this is still valid? https://progress.opensuse.org/issues/20210 2018-01-03T16:28:04 *** kollex has left #opensuse-admin 2018-01-03T16:28:15 good question 2018-01-03T16:28:29 the wiki cookies are now restricted to each wiki's subdomain 2018-01-03T16:28:49 but I would be surprised if something on paste.o.o was fixed without updating the ticket 2018-01-03T16:29:10 so the bug in paste.o.o probably still exists, it's "just" hidden 2018-01-03T16:31:28 *** mcaj has quit IRC 2018-01-03T16:38:50 just tested - "systemctl reload haproxy" with a broken config file results in 2018-01-03T16:38:55 Job for haproxy.service failed because the control process exited with error code. See "systemctl status haproxy.service" and "journalctl -xe" for details. 2018-01-03T16:39:03 but haproxy is still running 2018-01-03T16:39:13 so reload is probably safe 2018-01-03T16:39:27 cool 2018-01-03T16:39:33 BTW: haproxy.cfg on elsa is from Dec 7 2018-01-03T16:39:54 so csync didn't do anything for a month... 2018-01-03T16:39:59 it was changed after updates then, because I am totally sure last time I tried it long time ago it didn't 2018-01-03T16:40:17 so csync is not run automatically which is cool, more reasons to move the configs to salt :) 2018-01-03T16:40:38 lol 2018-01-03T16:41:04 I agree on the long term, but you'll hate yourself for saying that if elsa has to take over *now* 2018-01-03T16:44:27 well, move them to salt now then :P 2018-01-03T16:48:26 the list of files mentined in csync2.cfg is a bit ;-) longer than what I want to handle ;-) 2018-01-03T16:52:21 start with the ones you want to handle :) 2018-01-03T16:56:06 tell that to the other AppArmor devs - they already miss me ;-) 2018-01-03T16:57:08 besides that - I didn't setup all those unsalted services on anna and elsa *eg* 2018-01-03T17:08:49 PROBLEM: memcached data stored on riesling.infra.opensuse.org - MEMCACHE CRITICAL: bytes is 246335264 200000000 - memcached 1.4.33 on 127.0.0.1:11211, up 14 days 21 hours ; See https://monitor.opensuse.org/icinga/cgi-bin/extinfo.cgi?type=2&host=riesling.infra.opensuse.org&service=memcached%20data%20stored 2018-01-03T17:09:30 looks like the monitoring noticed the changed memcached parameters ;-) 2018-01-03T17:11:15 *** sven15 has quit IRC 2018-01-03T17:12:12 * tampakrap goes home 2018-01-03T17:14:04 *** malcolmlewis has quit IRC 2018-01-03T17:21:28 *** malcolmlewis has joined #opensuse-admin 2018-01-03T17:21:28 *** malcolmlewis has joined #opensuse-admin 2018-01-03T17:31:48 tampakrap: was the due date 01/01/5000 on https://progress.opensuse.org/issues/2392 and the two related tickets intentional? 2018-01-03T18:00:49 no 2018-01-03T18:04:30 then fix it ;-) 2018-01-03T18:05:46 BTW: you might like https://github.com/saltstack-formulas/haproxy-formula - but migrating our config to pillar data is probably a PITA 2018-01-03T18:06:13 (just to give you an impression - the jinja template for haproxy.cfg has 585 lines) 2018-01-03T19:36:13 PROBLEM: HTTP on conference.infra.opensuse.org - HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 922 bytes in 6.021 second response time ; See https://monitor.opensuse.org/icinga/cgi-bin/extinfo.cgi?type=2&host=conference.infra.opensuse.org&service=HTTP 2018-01-03T19:41:35 PROBLEM: HTTP progress on redmine.infra.opensuse.org - HTTP CRITICAL: HTTP/1.1 500 Internal Server Error - 968 bytes in 0.011 second response time ; See https://monitor.opensuse.org/icinga/cgi-bin/extinfo.cgi?type=2&host=redmine.infra.opensuse.org&service=HTTP%20progress 2018-01-03T19:48:12 cboltz: ^^ that's you? 2018-01-03T19:48:35 no 2018-01-03T19:49:33 okay I'll check 2018-01-03T19:53:03 postgresql seems to be the issue 2018-01-03T19:53:43 sorry, mysql 2018-01-03T20:03:01 PROBLEM: HAProxy on elsa.infra.opensuse.org - HAPROXY CRITICAL - Active service galera2 is DOWN on galera proxy ! Active service galera3 is DOWN on galera proxy ! Active service redmine is DOWN on redmine proxy ! ; See https://monitor.opensuse.org/icinga/cgi-bin/extinfo.cgi?type=2&host=elsa.infra.opensuse.org&service=HAProxy 2018-01-03T20:03:02 PROBLEM: HAProxy on anna.infra.opensuse.org - HAPROXY CRITICAL - Active service galera2 is DOWN on galera proxy ! Active service galera3 is DOWN on galera proxy ! Active service redmine is DOWN on redmine proxy ! ; See https://monitor.opensuse.org/icinga/cgi-bin/extinfo.cgi?type=2&host=anna.infra.opensuse.org&service=HAProxy 2018-01-03T20:08:52 *** cboltz_ has joined #opensuse-admin 2018-01-03T20:08:52 *** cboltz_ has joined #opensuse-admin 2018-01-03T20:10:49 *** cboltz has quit IRC 2018-01-03T20:12:29 *** cboltz_ has quit IRC 2018-01-03T20:12:52 *** cboltz has joined #opensuse-admin 2018-01-03T20:12:52 *** cboltz has joined #opensuse-admin 2018-01-03T20:13:35 I am checking it now with darix, it seems that logrotate run at the same time on all three nodes and it crashed all dbs AND reset them 2018-01-03T20:13:57 we have recent backups though 2018-01-03T20:14:17 sounds interesting[tm] :-( 2018-01-03T20:14:56 it sucks a lot yeah 2018-01-03T20:40:03 PROBLEM: HAProxy on mufasa.infra.opensuse.org - HAPROXY CRITICAL - Active service narwal4 is DOWN on static proxy ! Active service riesling is DOWN on riesling proxy ! ; See https://monitor.opensuse.org/icinga/cgi-bin/extinfo.cgi?type=2&host=mufasa.infra.opensuse.org&service=HAProxy 2018-01-03T20:44:08 *** tigerfoot has joined #opensuse-admin 2018-01-03T21:01:35 RECOVERY: HTTP progress on redmine.infra.opensuse.org - HTTP OK: HTTP/1.1 200 OK - 8460 bytes in 0.104 second response time ; See https://monitor.opensuse.org/icinga/cgi-bin/extinfo.cgi?type=2&host=redmine.infra.opensuse.org&service=HTTP%20progress 2018-01-03T21:06:07 RECOVERY: HTTP on conference.infra.opensuse.org - HTTP OK: HTTP/1.1 200 OK - 24126 bytes in 0.450 second response time ; See https://monitor.opensuse.org/icinga/cgi-bin/extinfo.cgi?type=2&host=conference.infra.opensuse.org&service=HTTP 2018-01-03T21:17:41 restored, darix saved the day once again 2018-01-03T21:31:31 thanks! 2018-01-03T21:32:02 124 open tickets, so it looks like you managed to get the latest version :-) 2018-01-03T21:32:22 yep I verified that as well, no data loss fortunately 2018-01-03T21:32:31 I'm writing a postmortem now 2018-01-03T21:32:41 how's it goin today? 2018-01-03T21:32:52 I'm looking forward to read it! 2018-01-03T21:33:15 looks busy 2018-01-03T21:37:37 indeed, we just had a strange database failure, and from tampakrap's quick summary it sounded like the data was destroyed on all cluster nodes 2018-01-03T21:37:52 luckily he and darix somehow managed to restore everything :-) 2018-01-03T21:38:11 and I'm really looking forward to read about that "somehow" 2018-01-03T21:40:36 cboltz: the data was not destroyed on the master node 2018-01-03T21:40:45 it was destroyed on one of the slaves 2018-01-03T21:41:14 ah, that makes it less critical 2018-01-03T21:49:51 that's good 2018-01-03T21:50:15 do you use mysql? 2018-01-03T21:50:44 i mean for everything 2018-01-03T21:51:46 depends on the service - some use postgresql, some mysql (actually mariadb) 2018-01-03T21:52:26 ahh i see 2018-01-03T21:54:18 sent a msg 2018-01-03T21:58:11 just saw an update to mariadb in zypper dup 2018-01-03T23:00:31 *** tigerfoot has quit IRC