Understanding the Power of Data De-Duplication

White Paper Published By: Diligent Technologies

Data de-duplication has the power to revolutionize the data protection process by significantly reducing capacity requirements. Plagued by media hype and vendor FUD, this message can be easily lost. This paper serves as a primer on data de-duplication.



Tags : 
deduplication, diligent, diligent technologies, storage, data protection, data deduplication

Diligent Technologies
Published:  Apr 11, 2008
Type:  White Paper
Length:  5 pages

DATA PROTECTION
BRIEF
Understanding the Power of Data De-Duplication
Date: October, 2007
Author: Heidi Biggar, Analyst
Abstract: Data de-duplication has the power to revolutionize the data protection process by significantly reducing capacity requirements. Plagued by media hype and vendor FUD, this message can be easily lost. This paper serves as a primer on data de-duplication.
Easing the Pain of Backup
Primary storage volumes may be growing at a rate of 30% or more each year, but it is often secondary storage volumes that are causing organizations the greatest pain, devouring IT resources (both manpower and technology) along the way. For years, organizations could do little, if anything, to control this problem. Today, users have new options. Capacity optimized protection (COP) technologies that attack the secondary storage "capacity bloat" at its roots have emerged. Data de-duplication is one example.
FIGURE 1. DETERRENTS TO DISK-BASED BACKUP
What factors do you believe would prevent your organization from replacing enterprise tape libraries with large-scale near-line disk solutions? (Percent of respondents, N = 94, multiple responses accepted)
Cost of new disk-based solution 74%
Lack of mature products available 47%
Too much investment in existing tape infrastructure 46%
Lack of staff resources to evaluate, select and implement solutions 34%
Concerns with reliability of low-cost disk technologies (i.e., SATA) 32%
Lack of media portability 27%
Current leasing agreement or depreciation cycle on tape infrastructure 26%
Concerns about solution's ability to ensure regulatory compliance (e.g., WORM capability, off-site data ...) 24%
Believe it will take additional staff to manage 16%
Concerned that disk-based solutions are difficult to scale 16%
Source: Enterprise Strategy Group, 2007
From a very high level, data de-duplication enables organizations to reduce back-end capacity requirements by minimizing the amount of redundant data that is ultimately written to disk backup targets. The actual amount of data reduction can vary significantly from organization to organization or from application to application, depending on the granularity of the data de-duplication technology being used (i.e., whether the de-duping is done at the file, block or byte level) or the type of data being de-duped (e.g., Word .doc, .mpeg file or .dbf file). However, ESG has found that, on average, 10x to 20x reduction is realistic and greater than 40x reduction is definitely achievable.
Copyright © 2007, The Enterprise Strategy Group, Inc. All Rights Reserved.
At these rates, data de-duplication has the power to change the economics of disk backup, making disk backup a much more affordable (and compelling) alternative to tape, and even eliminate the long-time cost delta between tape and disk (see Figure 1). Factor in the operational efficiencies of not having to move, store and manage redundant data as well as not having to deal with management headaches common with tape, and you've got a very compelling story in favor of disk backup.
For these reasons and others, we believe data de-duplication is one of this decade's most important (hence, most talked-about) new technologies. It has the power to revolutionize data protection from both a technology and an end-user adoption standpoint by simply making disk-based backup and recovery, as well as remote replication, much more efficient than it is today.
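To make the idea of "minimizing the amount of redundant data that is ultimately written to disk" more concrete, the following is a minimal sketch of block-level de-duplication. It is an illustration only, not the method of any particular vendor: the fixed 4096-byte block size, the use of SHA-256 digests to identify blocks, and the function names are all assumptions made for this example. Real products may use variable-size chunking or byte-level differencing instead.

```python
import hashlib


def dedupe_blocks(data: bytes, block_size: int = 4096):
    """Split data into fixed-size blocks and keep each unique block once.

    Assumption for illustration: blocks are identified by their SHA-256
    digest, and block boundaries fall at fixed offsets.
    """
    store = {}   # digest -> block bytes (each unique block stored once)
    recipe = []  # ordered list of digests needed to rebuild the stream
    for offset in range(0, len(data), block_size):
        block = data[offset:offset + block_size]
        digest = hashlib.sha256(block).hexdigest()
        store.setdefault(digest, block)  # redundant blocks are not re-stored
        recipe.append(digest)
    return store, recipe


def restore(store, recipe) -> bytes:
    """Rebuild the original byte stream from the block store and recipe."""
    return b"".join(store[digest] for digest in recipe)


def reduction_ratio(data: bytes, store) -> float:
    """Logical size divided by the bytes actually stored (e.g. 10.0 for 10x)."""
    stored = sum(len(block) for block in store.values())
    return len(data) / stored if stored else 0.0
```

In this sketch, a backup stream made of many repeated blocks is reduced to a small store of unique blocks plus a recipe of digests, and `restore` shows that the original data is still fully recoverable. The ratio depends entirely on how much redundancy the input contains, which mirrors the point above that results vary by data type and granularity.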
The Benefits of De-Dupe
Data de-duplication has several significant, and immediate, benefits for users:
• It can lower disk costs. Just consider the ability to store 20 TB of backup data on 1 TB of disk. The cost savings are significant, not only in terms of actual disk costs, but also in terms of power and cooling. Fewer disks mean lower power and cooling costs.
• It allows users to store more data on fewer disks for longer periods of time. While the actual capacity reduction will vary from organization to organization depending on a number of variables (e.g., the type of data that is being backed up, the change rate and the frequency of the backup), de-duplication will reduce back-end capacity requirements significantly. Organizations can use this "newfound" space to 1) protect other backup data (i.e., data that wasn't previously protected by disk) or 2) lengthen the retention periods of the data that is backed up to disk...
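The arithmetic behind the "20 TB on 1 TB of disk" example can be sketched as follows. The function names are hypothetical and introduced only for illustration; the sketch assumes a single, uniform reduction ratio applied to all backup data, which is a simplification of how real environments behave.

```python
def disk_required_tb(backup_tb: float, ratio: float) -> float:
    """Physical disk needed for a given logical backup size and reduction ratio."""
    if ratio <= 0:
        raise ValueError("reduction ratio must be positive")
    return backup_tb / ratio


def capacity_saved_fraction(ratio: float) -> float:
    """Fraction of back-end capacity avoided at a given reduction ratio."""
    if ratio <= 0:
        raise ValueError("reduction ratio must be positive")
    return 1 - 1 / ratio


# 20 TB of backup data at a 20x reduction ratio needs 1 TB of disk.
print(disk_required_tb(20, 20))  # 1.0
```

Under the same assumption, a 10x ratio avoids roughly 90% of back-end capacity and a 20x ratio roughly 95%, so the gain from each further increase in ratio shrinks even though the headline number grows.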
