Pavel Bushmelev

Cloud Operations Manager at ATTRAQT group

No wonders. Even at scale.

Day 2 - 28th Nov 13:30-14:20 Hall 3.1 #J2D Advanced Advanced Pavel Bushmelev, Alexey Ustenko

So, your system is well designed and thoroughly tested. It should run in production smoothly, shouldn’t it? If only it did! Alas, incidents happen.

Sometimes the reason is as simple (and embarrassing) as an edge case leading to a NullPointerException. But sometimes it’s really hard to understand the issue. You cannot believe it can ever happen! Yet it does. And after long hours of drilling in, you finally have an “aha! moment”:
A well known, widely used library gets stuck in…
An entire JVM crashes just because of…
It appears to be an explosive combination of your Cloud provider policies and…
A JVM setting, tweaked a few years ago…

No spoilers! Come and listen to four real stories of how “wonders” (not to say “nightmares”) which occurred in the production systems got investigated and how they have driven design or process improvements in our company.

Slides