Tagged in

erasure coding

Rubrik -  - How Rubrik Protects Against System Failures for Always-Accessible Data


How Rubrik Protects Against System Failures for Always-Accessible Data

There are few things that are certain in our universe, but encountering a failure of an IT system is certainly one of them. Whether the system itself fails or experiences an external problem such as a power failure or natural disaster, everyone has or will experience a system failure during their career. As technology has evolved, so have the mechanisms to handle and recover from those failures.  Rubrik is designed to safeguard against these failures to ensure your data is always accessible and recoverable. We talked about our fault-tolerant technology in a previous post on erasure coding. In this blog post, I’ll review a few other key components of Rubrik Cloud Data Management (CDM) and how they address different failure scenarios. Progression of Resilience For backup and recovery solutions, there should be a progression of resilience that extends from data storage all the way through the stack to the services managing that data. Since these solutions are often the last line of defense against a disaster such as a ransomware attack, it is paramount to ensure your data is safe and secure. Let’s take a look at how Rubrik CDM does that: Rubrik CDM was engineered with an immutable file…
Rubrik -  - Erasure Coding or: How Rubrik Doubled the Capacity of Your Cluster


Erasure Coding or: How Rubrik Doubled the Capacity of Your Cluster

At Rubrik, we’re big believers in data protection. But until we’re able to take consistent snapshots of our brain state and upload them to the promised hierarchical neural interconnect, we’re going to focus on backing up the more traditional machines — the ones whose smooth functioning will enable this cause. Any complete backup solution needs a distributed, scalable, fault-tolerant file system. Rubrik’s is Atlas, which made the switch from triple mirrored encoding to a Reed Solomon encoding scheme during our Firefly release. To help you understand the motivation behind this change, this post introduces erasure coding and compares the two methods. What is Erasure Coding? Suppose we want to store a piece of data on a fault-tolerant and distributed file system. In this case, the loss of any single drive should not result in data loss. The only way to achieve fault tolerance is through redundancy, which refers to storing extra information about the data across different drives to allow for its complete recovery in the event of a failure. The more redundancy we add, the greater the fault tolerance. However, the cost of redundancy is increased storage overhead. Every file system needs to make this tradeoff between availability and overhead. At Rubrik, the…