技术控

    今日:126| 主题:49390
收藏本版 (1)
最新软件应用技术尽在掌握

[其他] Early detection of configuration errors to reduce failure damage

[复制链接]
花心戒了 发表于 2016-11-30 02:36:46
44 4

立即注册CoLaBug.com会员,免费获得投稿人的专业资料,享用更多功能,玩转个人品牌!

您需要 登录 才可以下载或查看,没有帐号?立即注册

x
Early detection of configuration errors to reduce failure damage Xu et al, OSDI ’16
   Here’s one of those wonderful papers that you can read in the morning, and be taking advantage of the results the same afternoon! Remember the ‘ Simple testing can prevent most critical failures ‘ paper from OSDI’14 that we looked at last month? In that paper we learned that trivial mistakes in error handling, which are easy to test for, accounted for a vast majority of catastrophic production incidents. Well, as soon as you’ve got your error / exception handlers sorted out, you might want to read today’s paper to discover another class of easy-to-test for bugs that are also disproportionately responsible for nasty production failures.
   Facebook’s ‘ Holistic configuration management ‘ paper stresses the importance of version control and testing for configuration, as configuration errors are a major source of site errors. Xu et al. study configuration parameters in the wild, focusing especially on those associated with reliability, availability, and serviceability (RAS) features. What they find is that very often configuration values are not tested as part of system initialization. The program runs along happily until reaching a point (say for example, it needs to failover) where it needs to read some configuration for the first time, and then it blows up – typically when you most need it.
   So here’s the short takeaway test all of your configuration settings as part of system initialization and fail-fast if there’s a problem . Do that, and you’ll cut out another big source of major production errors. The paper itself is in two parts: the first part (where I’ll focus most of my attention in this short write-up) is an analysis of these latent configuration errors in real code bases; the second part introduces a tool called PCheck, which if your program is written in C or Java can even find latent configuration usage and automatically write tests for you!
  Latent configuration errors can result in severe failures, as they are often associated with configurations used to control critical situations such as fail-over, error handling, backup, load balancing, mirroring, etc… Their detection or exposure is often too late to limit the failure damage.
   In a study of real-world configuration issues in the the products of COMP-A, “major storage company in the US,” with footnote “we are required to keep the company and its products anonymous,” it turns out that 75% of all high severity  configuration-related errors are caused by latent configuration errors. It may well be that the authors are required to keep COMP-A anonymous, but I couldn’t help noticing the author affiliations printed in big type on the front page. A more than fair chance that company is NetApp I would say!
  The authors also studied a number of real-world open-source systems (see table below), and inspected usage of all of their RAS-related configuration parameters.
   
Early detection of configuration errors to reduce failure damage-1 (management,Facebook,advantage,afternoon,mistakes)

  They looked at how many of those parameters were explicity checked vs simply being used when first required, yielding the results below:

Early detection of configuration errors to reduce failure damage-2 (management,Facebook,advantage,afternoon,mistakes)

  Many of the studied RAS parameters do not have any special code for checking the correctness of their settings. Instead, the correctness is verified (implicitly) when the parameters’ values are actually used in operations such as a file open call.
  Here’s an example of a real-world latent configuration error in MapReduce:
1234下一页
友荐云推荐




上一篇:JetBrains在国内举办开发者日 首席布道师Hadi Hariri向中国开发者介绍Kotlin ...
下一篇:Is Standard Java Logging Dead? Log4j vs Log4j2 vs Logback vs java.util.logging
酷辣虫提示酷辣虫禁止发表任何与中华人民共和国法律有抵触的内容!所有内容由用户发布,并不代表酷辣虫的观点,酷辣虫无法对用户发布内容真实性提供任何的保证,请自行验证并承担风险与后果。如您有版权、违规等问题,请通过"联系我们"或"违规举报"告知我们处理。

zyxin 发表于 2016-11-30 04:42:16
围观 围观 沙发在哪里!!!
回复 支持 反对

使用道具 举报

lopa2380 发表于 7 天前
确实不错,顶先
回复 支持 反对

使用道具 举报

yang5209050 发表于 7 天前
人是铁,范是钢,一天不装憋得慌
回复 支持 反对

使用道具 举报

彼此错过 发表于 6 天前
我看着大家顶!
回复 支持 反对

使用道具 举报

*滑动验证:
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

我要投稿

推荐阅读

扫码访问 @iTTTTT瑞翔 的微博
回页顶回复上一篇下一篇回列表手机版
手机版/CoLaBug.com ( 粤ICP备05003221号 | 文网文[2010]257号 )|网站地图 酷辣虫

© 2001-2016 Comsenz Inc. Design: Dean. DiscuzFans.

返回顶部 返回列表