System Reliability: Why Boards Care More About Trust Than Uptime
Corporate boards are increasingly viewing essential systems – payroll, payment processing, and healthcare infrastructure – not as targets for achieving high uptime, but as non-negotiable obligations, a shift in perspective that is driving a renewed focus on risk management and resilience.
The change, according to industry observers, stems from a growing recognition that disruptions to these core functions carry significant legal and financial repercussions. Boards are less concerned with percentage-based reliability targets and more focused on potential negligence claims and breaches of fiduciary duty. This governance stance demands a more robust approach to system stability, moving beyond simply aiming for high availability to ensuring uninterrupted operation.
This evolving expectation is prompting companies to undertake what are often termed “reliability resets” – comprehensive overhauls of their systems designed to harden defenses and proactively identify vulnerabilities. Salesforce, for example, previously conducted a “Trust Reset” to address systemic weaknesses. These resets typically involve implementing chaos engineering practices, deliberately introducing failures into systems to expose weaknesses before they manifest in production environments, and refining incident response procedures.
However, technical improvements alone are insufficient, experts caution. Organizations can execute technically sound reliability programs and still suffer erosion of customer trust following a major incident. The key, they argue, is to anchor the mission around protecting people – the finish-users who rely on these systems – rather than solely focusing on service level agreements (SLAs).
The emphasis on human impact is particularly relevant in sectors like healthcare, where system failures can have life-or-death consequences. Recent labor disputes highlight the fragility of these systems. Workers at NEON, a healthcare services provider, are preparing to sue the company over unpaid wages, as reported by the Cleveland Scene, adding another layer of complexity to the challenges facing the industry.
Similar pressures are emerging in other sectors. Rail unions, representing workers at Canadian Pacific Kansas City (CPKC), have invoked federal mediation in a dispute over pay and benefits for employees of the Dakota, Minnesota & Eastern (DM&E) railroad, according to the IAM Union. This dispute underscores the broader trend of workers demanding greater security and fair compensation, particularly in essential services.
The San Francisco Unified School District remains closed due to a historic teachers’ strike centered on pay and healthcare, as reported by ABC7 San Francisco. This situation demonstrates the critical link between adequate compensation, benefits, and the provision of essential public services.
Experts also point to systemic issues within the healthcare system itself, citing concerns about escalating costs and a lack of patient-centered care. A recent analysis by Your Local Epidemiologist identified five key areas where the system has become “utterly insane,” suggesting a need for fundamental reform.
While technical solutions are vital, the shift in board-level thinking suggests a broader recognition that system reliability is fundamentally about protecting revenue and, more importantly, safeguarding the well-being of those who depend on these critical services. The focus on governance and accountability is likely to intensify as the consequences of system failures become increasingly severe.
