Blogs (1) >>
ICSE 2019
Sat 25 - Fri 31 May 2019 Montreal, QC, Canada

Distributed systems often face transient errors and localized component degradation and failure. Verifying that the overall system remains healthy in the face of such failures is challenging. At Netflix, we have built a platform for automatically generating and executing chaos experiments, which check how well the production system can handle component failures and slowdowns. This paper describes the platform and our experiences operating it.

Wed 29 May

icse-2019-Paper-Presentations
11:00 - 12:30: Papers - Controlled Experiments of Production Software at St-Denis / Notre-Dame
Chair(s): Yvonne DittrichIT University of Copenhagen, Denmark
icse-2019-Software-Engineering-in-Practice11:00 - 11:20
Talk
Aleksander FabijanMicrosoft, Pavel DmitrievOutreach.io, Helena Holmström OlssonMalmö University, Jan BoschChalmers University of Technology, Sweden, Lukas VermeerBooking.com, Dylan LewisIntuit
icse-2019-Software-Engineering-in-Practice11:20 - 11:40
Talk
Tong XiaMicrosoft, Sumit BhardwajMicrosoft, Pavel DmitrievOutreach.io, Aleksander FabijanMicrosoft
icse-2019-Software-Engineering-in-Practice11:40 - 12:00
Talk
Paul Luo LiMicrosoft, Pavel DmitrievOutreach.io, Huibin Mary HuMicrosoft, Xiaoyu ChaiMicrosoft, Zoran DimovMicrosoft, Brandon PaddockMicrosoft, Ying LiMicrosoft, Alex KirshenbaumMicrosoft, Irina NiculescuMicrosoft, Taj ThoresenMicrosoft
icse-2019-Software-Engineering-in-Practice12:00 - 12:20
Talk
Ali BasiriNetflix, Lorin HochsteinNetflix, Nora JonesNetflix, Haley TuckerNetflix
Pre-print
icse-2019-Paper-Presentations12:20 - 12:30
Talk