Data Domain

Data Domain Technology

Technology

Simplifying Storage Management with Core Technological Advantages

Data Domain deduplication storage systems have transformed disk backup, network-based disaster recovery and remote office data protection. Data Domain deduplication storage systems include unique technological advantages, such as:

Global Compression, highly efficient deduplication and compression technology, radically changing the storage economics.

Simplifying Data Protection with Core Technology Advantages
Data Domain’s Global Compression technology combines an exceptionally efficient, high performance inline deduplication technology with a local compression technique. The deduplication technology reduces data down to its raw essentials by pooling redundant patterns within a file, across files, and even within a block, and stores only unique data segments. Local compression scans the unique data sequence across a local small window of comparison like a tape drive.

Key benefits of Global Compression include:

  • Maximum data reduction, averaging 20x data reduction over time, which minimizes the physical storage capacity required for storing backup images.
  • Extended retention with petabytes (PB) of protection storage, Data Domain systems offer up to 28 PB of useable capacity in a very small footprint.
  • Economy of tape by storing multiple months of backup images for less than $0.35/GB. This extended retention also optimizes performance of both backup and recoveries without making any tradeoffs.
  • Network-efficent replication by transferring only unique data segments, reducing bandwidth requirements by as much as 99%.

Data Invulnerability Architecture, providing unprecedented levels of data integrity, data verification, and self-healing capabilities. DOWNLOAD WHITE PAPER

What happens when your data protection storage system fails? Data Domain's Data Invulnerability Architecture lays out the industry’s best defense against data integrity issues, by providing unprecedented levels of data protection, data verification, and self-healing capabilities that are unavailable in conventional disk or tape systems.

Key areas of data integrity protection include:

  • End-to-End Verification at Backup Time

> Data is read after it is written and verified that it is the correct data and that it is reachable through the file system to disk.
> Most restores happen within a day or two of backups. Systems that verify/correct data integrity slowly over time will be too late for most recoveries.

  •  Fault Avoidance and Containment

> New data never overwrites good data.
> Fewer complex data structures.
> NVRAM for fast, safe restart.
> No partial stripe writes.

  • Continuous Fault Detection and Healing

> Data Domain RAID 6 provides double disk failure protection and read error correction.
> On-the-fly error detection and correction.
> Scrubbing to find and repair grown defects on the disk before they can become a problem.

  • File System Recoverability

> Data is written in a self-describing format. If necessary the file system can be recreated by scanning the log and rebuilding it from the metadata stored with the data.

Backups are the data store of last resort. Data Domain’s Data Invulnerability Architecture provides extra levels of data integrity protection to detect faults and repair them to ensure backup data or recovery is not at risk.

Replication Technology, enabling automatic, secure transfer of compressed data over the wide area network (WAN) with 99% bandwidth reduction.

Simplifying Data Protection with Core Technological Advantages
Replication as a data protection technique was once reserved for a very small set of mission-critical application data, certainly not for backup data. Data Domain has changed all that with deduplication storage.

Data Domain replication technology transfers only the deduplicated and compressed unique changes across any IP network requiring a tiny fraction of the bandwidth, time and cost, compared to traditional replication methods.

Data Domain replication technology provides the following benefits:

  • Network-efficient replication for DR provides network-based data protection by securely and automatically replicating backup data to a secure offsite location.
  • Remote office data protection enables network-efficient replication of backup data from multiple branch office to a central hub or data center.
  • Tape consolidation eliminates the need to duplicate backup data at each remote office and leaves tape infrastructure only at a central hub with IT staff.

Stream-Informed Segment Layout (SISL) Architecture allows the continued scaling of CPUs to add direct benefit to system throughput scalability.DOWNLOAD WHITE PAPER

Deduplication Throughput Scales with CPU Performance
One of the core technologies pioneered by Data Domain is its Stream Informed Segment Layout (SISL) Scaling Architecture, which allows the continued advancement of CPU performance to add direct benefit to system throughput scalability.

Other deduplication technologies require additional disk drives or “spindles” to achieve the throughput speeds needed for efficient deduplication. Ironically, these other hybrid technologies that mandate the use of more disk drives require more storage, time and cost to achieve a similar, yet fundamentally inferior result.

Data Domain SISL technology provides many unique advantages:

  • 99% of duplicate data segments are identified in RAM, inline, before storing to disk.
  • Block data transfers with related segments and fingerprints are stored together, so large groups are written or read at once.
  • Efficient disk access minimizes disk seeks to enable increased performance and minimizes the number of large capacity, cost-efficient SATA disks needed to deliver high throughput.
  • Minimal spindle count reduces the amount of total physical storage needed, along with associated storage management.

In SISL, Data Domain has developed a proven architecture that uses deduplication to achieve high throughput with economical storage hardware. Over time, this will allow the continued scaling of CPUs to add direct benefit to system scalability in the form of additional throughput while minimizing the storage footprint.