Data center management

Last updated

Data center management is the collection of tasks performed by those responsible for managing ongoing operation of a data center. [1] This includes Business service management and planning for the future.

Contents

Historically, "data center management" was seen as something performed by employees, with the help of tools collectively called data center-infrastructure management (DCIM) tools. [2]

Both for in-house operation and outsourcing, service-level agreements must be managed to ensure data-availability. [3]

Competition

Data center management is a growing major topic for a growing list of large companies who both compete and cooperate, including: Dell, [4] Google, [5] HP, [6] IBM, [5] Intel [6] and Yahoo. [6]

Hardware/software vendors who are willing to live with coopetition [7] [8] are working on projects such as "The Distributed Management Task Force" (DMTF) [9] with a goal of learning to "more effectively manage mixed Linux, Windows and cloud environments."

With the DMTF a decade old, the list of companies is growing, and also includes companies much smaller than IBM, Microsoft, et al. [10]

Focus

Among the topics currently being explored are: [11] scalability, securing data center networks, disaster recovery, government restrictions. [12]

Another major area is the cost of downtime regarding customer dissatisfaction & business loss, [13] and also the "astonishing" yet hidden cost and effect regarding personnel & productivity. [14]

Business-service management

Business-service management (BSM) treats IT as part of the larger enterprise strategy, [15] and helps fill the gap between business and IT. [16]

IBM notes that major problems often happen in the grey areas, particularly due to errors in the interfaces, and focuses on critical failures. Sufficient redundancy should allow failures in non-critical areas to protect the business from being affected. [16] BSM, which is positioned above IT Service Management (ITSM), promotes a customer-centric and business-focused approach to service management, aligning business objectives with IT or ICT from strategy through to operations. Tools that help BSM include a modeling language, [17] and a common dashboard, which together allow data center personnel to see problems before business customers do. [18]

Newer developments

Remote data center management [19] allows offsite experts to watch for situations needing their timely intervention at a lower cost than having such staff be onsite 24/7/365.

While some requirements for on-site hardware have been reduced, [20] spending in other hardware areas such as UPS may have to increase. [21]

Data center asset management

Data center asset management (also referred to as inventory management) [22] is the set of business practices that join financial, contractual and inventory functions to support life cycle management and strategic decision making for the IT environment. Assets include all elements of software and hardware that are found in the business environment. [23]

IT asset management generally uses automation to manage the discovery of assets [24] so inventory can be compared to license entitlements. Full business management of IT assets requires a repository of multiple types of information about the asset, as well as integration with other systems such as supply chain, help desk, procurement and HR systems and ITSM.

Hardware asset management

Hardware asset management entails the management of the physical components of computers and computer networks, from acquisition through disposal. [24] Common business practices include request and approval process, procurement management, life cycle management, redeployment and disposal management. A key component is capturing the financial information about the hardware life cycle which aids the organization in making business decisions based on meaningful and measurable financial objectives.

Software asset management

Software Asset Management is a similar process, focusing on software assets, including licenses. Standards for this aspect of data center management are part of ISO/IEC 19770.

Data center infrastructure management

Data center-infrastructure management (DCIM) is the integration [25] of information technology (IT) and facility management disciplines [26] to centralize monitoring, management and intelligent capacity planning of a data center's critical systems. Achieved through the implementation of specialized software, hardware and sensors, DCIM enables common, real-time monitoring and management platform for all interdependent systems across IT and facility infrastructures.

DCIM products can help data center managers identify and eliminate sources of risk [27] and improve availability of critical IT systems. They can also be used to identify interdependencies between facility and IT infrastructures to alert the facility manager to gaps in system redundancy, and provide dynamic, holistic benchmarks on power consumption and efficiency to measure the effectiveness of "green IT" initiatives. [28] [29]

Important data center metrics include those regarding energy efficiency and use of servers, storage, and staff. In too many cases, disk capacity is vastly underused and servers run at 20% use or less. [30] More effective automation tools can also improve the number of servers or virtual machines that a single admin can handle.

DCIM providers are increasingly linking with computational fluid dynamics providers to predict complex airflow patterns in the data center. The CFD component is necessary to quantify the impact of planned future changes on cooling resilience, capacity and efficiency. [31]

Operations

Operations ("ops") NOC-Architel.jpg
Operations ("ops")

Information technology operations, or IT operations (ITOps), are the set of all processes and services managed by IT staff [32] for use by internal or external clients. The term refers to the application of operations management to the technology used to run the business.

Operations work can include responding to support tickets generated for maintenance work or customer issues. [33] Some operations teams provide on-call support, responding to incidents outside of normal business hours. [33]

As lights out [34] operations increased, less of the staff are located near corporate headquarters. [35] [36] Gartner defines IT operations as "the people and management processes associated with IT service management to deliver the right set of services at the right quality and at competitive costs for customers." [37]

Technical support

A library help desk HelpDeskLibrary.jpg
A library help desk

Technical support (often shortened to tech support) refers to services. Within a corporation, these are also known as help desks [38] often arrange their technical support structure as a three-tier (plus two) system: [39]

The extra tiers are: [39]

Access to varying levels of support for products and services to in-house employees and corporate customers, providing information and troubleshooting [40] is via various channels such as toll-free numbers, [41] websites, instant messaging, or email.

Help desk professionalism

An ITIL-compliant help desk is usually a part of a bigger service desk unit, which is part of ITSM. [42]

As the incoming phone calls are random in nature, help desk agent schedules are often maintained using an Erlang C calculation. Companies with custom application software may also have an applications team who are responsible for the development of in-house software. The help desk may assign to the applications team such problems as finding software bugs. Requests for new features or information about the capabilities of in-house software that come through the help desk are also assigned to applications groups. The help desk staff and supporting IT staff may not all work from the same location. With remote access applications, technicians are able to solve many help desk issues from another work location or their home office. While there is still a need for on-site support to effectively collaborate on some issues, remote support provides greater flexibility.

AT&T Mobility previously provided technical support for some of its mobile phones through Device Support Centers. Attdevicesupportcenter.jpg
AT&T Mobility previously provided technical support for some of its mobile phones through Device Support Centers.

Some companies and organizations provide discussion boards for users of their products to interact; such forums allow companies to reduce their support costs [43] without losing the benefit of customer feedback.

Some fee-based service companies charge for premium technical support services. [44]

Outsourcing technical support

Many organizations relocated their technical support departments or call centers to countries or regions with lower costs. Dell was amongst the first companies to outsource their technical support and customer service departments to India in 2001, but then reshored. [45] There has also been a growth in companies specializing in providing technical support to other organizations. These are often referred to as MSPs (Managed Service Providers). [46]

For businesses needing to provide technical support, outsourcing allows them to maintain a high availability of service. Such need may result from peaks in call volumes during the day, periods of high activity due to introduction of new products or maintenance service packs, or the requirement to provide customers with a high level of service at a low cost to the business. It allows businesses to use specialized personnel whose technical knowledge base and experience may exceed the scope of the business, thus providing a higher level of technical support to their employees.

Scams

A common scam typically involves a cold caller claiming to be from a technical support department of a company like Microsoft. Such cold calls are often made from call centers based in India to users in English-speaking countries, although increasingly these scams operate within the same country. The scammer will instruct the user to download a remote desktop program and once connected, use social engineering techniques that typically involve Windows components to persuade the victim that they need to pay for the computer to be fixed and then proceeds to steal money from the victim's credit card. [47]

Preventive maintenance

Preventive maintenance (or preventative [48] maintenance (PM)) is ongoing scheduled [49] inspection [50] intended to detect and correct incipient failures either before they occur or before they develop into major problems such as downtime.

Managing the capacity of a data center

Capacity of a datacenter - Life Cycle Capacity of a datacenter - Life Cycle.jpg
Capacity of a datacenter – Life Cycle

With the increasing use of "the cloud" and what has been called "the Era of Infinite Capacity", [51] there is still a need for professional Data Center Capacity Planners. [52]

There is a need to know what will be needed, and when. [53] Data must continually be collected regarding usage of power/energy, computing power, data storage and networking/telecommunications. Plans must include awareness of cooling and space requirements.

Sometimes analysis of this data, and comparison to industry norms, can be outsourced. [53] The balance for the need to focus more on data collection [54] or analysis depends on current use levels: prior to 50%, the focus can stay more on data collection. Beyond 75%, the focus must shift to analysis, in preparation for upgrades, replacements and expansions. The data center is a resource in its own right. [55]

Top data centers and service providers

According to Cloudscene's Leaderboard for Q1 2018, data center operators are ranked "based on both data center density (total operated data centers)", as well as "the number of listed service providers in the facility". Cloud service providers are ranked based on "connectivity (the total number of PoPs) for the region." Chosen from a pool of more than 6,000 providers, the rankings are as follows: [56]

Q1, 2018 Top Data Center Operators Worldwide
RankNorth AmericaEMEAOceaniaAsia
1 Equinix Equinix Equinix Equinix
2 Digital Realty Interxion NextDC Global Switch
3 CoreSite Telehouse Vocus Communications NTT Communications
4 Zayo Digital Realty Global SwitchGPX Global Systems
5 Level 3 Communications Global SwitchYourDC ST Telemedia Global Data Centres
6 Cologix Level 3 Communications Macquarie Telecom Netmagic Solutions
7CyxteraitconiciseekAIMS
8TierPoint Colt Technology Services Interactive Digital Realty
9Netrality Properties Nikhef Datacom Telstra
10QTS Realty Trust Orange Business Services Data Centre LimitedOneAsia Network
Q1, 2018 Top Service Providers Worldwide
RankNorth AmericaEMEAOceaniaAsia
1 Zayo Colt Technology Services Telstra Colt Technology Services
2 Level 3 Communications EuNetworks Vocus Communications PCCW Solutions
3 Verizon Cogent Communications PIPE Networks Tata Communications
4 Crown Castle Zayo Optus PCCW Global
5 AT&T Level 3 Communications NextGen Group Telstra
6 Cogent Communications BT AAPT NTT Communications
7 CenturyLink Interoute MegaportSuperloop
8 XO Communications Verizon SuperloopZenlayer
9 Comcast Orange Business Services Zencross Connect China Telecom
10 TW Telecom NL-IX Uecomm Singtel

See also

Related Research Articles

In telecommunications, provisioning involves the process of preparing and equipping a network to allow it to provide new services to its users. In National Security/Emergency Preparedness telecommunications services, "provisioning" equates to "initiation" and includes altering the state of an existing priority service or capability.

<span class="mw-page-title-main">Technical support</span> Maintenance service of electronic consumers

Technical support, also known as tech support, is a call centre type customer service provided by companies to advise and assist registered users with issues concerning their technical products. Traditionally done on the phone, technical support can now be conducted online or through chat. At present, most large and mid-size companies have outsourced their tech support operations. Many companies provide discussion boards for users of their products to interact; such forums allow companies to reduce their support costs without losing the benefit of customer feedback.

A service-level agreement (SLA) is an agreement between a service provider and a customer. Particular aspects of the service – quality, availability, responsibilities – are agreed between the service provider and the service user. The most common component of an SLA is that the services should be provided to the customer as agreed upon in the contract. As an example, Internet service providers and telcos will commonly include service level agreements within the terms of their contracts with customers to define the level(s) of service being sold in plain language terms. In this case, the SLA will typically have a technical definition of mean time between failures (MTBF), mean time to repair or mean time to recovery (MTTR); identifying which party is responsible for reporting faults or paying fees; responsibility for various data rates; throughput; jitter; or similar measurable details.

NetApp, Inc. is an American data infrastructure company that provides unified data storage, integrated data services, and cloud operations (CloudOps) solutions to enterprise customers. The company is based in Cork City, Ireland. It has ranked in the Fortune 500 from 2012 to 2021. Founded in 1992 with an initial public offering in 1995, NetApp offers cloud data services for management of applications and data both online and physically.

Information technology service management (ITSM) are the activities performed by an organization to design, build, deliver, operate and control IT services offered to customers.

<span class="mw-page-title-main">Cloud computing</span> Form of shared internet-based computing

Cloud computing is the on-demand availability of computer system resources, especially data storage and computing power, without direct active management by the user. Large clouds often have functions distributed over multiple locations, each of which is a data center. Cloud computing relies on sharing of resources to achieve coherence and typically uses a pay-as-you-go model, which can help in reducing capital expenses but may also lead to unexpected operating expenses for users.

Dynamic Infrastructure is an information technology concept related to the design of data centers, whereby the underlying hardware and software can respond dynamically and more efficiently to changing levels of demand. In other words, data center assets such as storage and processing power can be provisioned to meet surges in user's needs. The concept has also been referred to as Infrastructure 2.0 and Next Generation Data Center.

<span class="mw-page-title-main">Compellent Technologies</span> Computer data storage

Compellent Technologies, Inc., was an American manufacturer of enterprise computer data storage systems that provided block-level storage resources to small and medium sized IT infrastructures. The company was founded in 2002 and headquartered in Eden Prairie, Minnesota. Compellent's flagship product, Storage Center, is a storage area network (SAN) system that combines a standards-based hardware platform and a suite of virtualized storage management applications, including automated tiered storage through a proprietary process called "DataProgression", thin provisioning and replication. The company developed software and products aimed at mid-size enterprises and sold through a channel network of independent providers and resellers. Dell acquired the company in February 2011, after which it was briefly a subsidiary known as Dell Compellent.

Cloud communications are Internet-based voice and data communications where telecommunications applications, switching and storage are hosted by a third-party outside of the organization using them, and they are accessed over the public Internet. Cloud services is a broad term, referring primarily to data-center-hosted services that are run and accessed over an Internet infrastructure. Until recently, these services have been data-centric, but with the evolution of VoIP, voice has become part of the cloud phenomenon. Cloud telephony refers specifically to voice services and more specifically the replacement of conventional business telephone equipment, such as a private branch exchange (PBX), with third-party VoIP service.

In cloud computing, a carrier cloud is a class of cloud that integrates wide area networks (WAN) and other attributes of communications service providers’ carrier-grade networks to enable the deployment of highly-complex applications in the cloud. In contrast, classic cloud computing focuses on the data center and does not address the network connecting data centers and cloud users. This may result in unpredictable response times and security issues when business-critical data are transferred over the Internet.

Data center services encompass all of the services and facility-related components or activities that support the implementation, maintenance, operation, and enhancement of a data center, which is an environment that provides processing, storage, networking, management and the distribution of data within an enterprise.

<span class="mw-page-title-main">Virtual Computing Environment</span> American computer hardware brand

Virtual Computing Environment Company (VCE) was a division of EMC Corporation that manufactured converged infrastructure appliances for enterprise environments. Founded in 2009 under the name Acadia, it was originally a joint venture between EMC and Cisco Systems, with additional investments by Intel and EMC subsidiary VMware. EMC acquired a 90% controlling stake in VCE from Cisco in October 2014, giving it majority ownership. VCE ended in 2016 after an internal division realignment, followed by the sale of EMC to Dell.

<span class="mw-page-title-main">Dell Software</span> Former software division of Dell, Inc.

Dell Software was a former division of Dell with headquarters in Round Rock, Texas, United States. Dell Software was created by merging various acquisitions by Dell Inc., the third-largest maker of PCs and now a privately held company, to build out its software offerings for data center and cloud management, information management, mobile workforce management, security and data protection for organizations of all sizes.

Cloud computing is used by most people every day, but there are issues that limit its widespread adoption. It is one of the fast developing area that can instantly supply extensible services by using internet with the help of hardware and software virtualization. Cloud computing biggest advantage is flexible lease and release of resources as per the requirement of the user. Its other advantages include efficiency, compensating the costs in operations and management. It curtails down the high prices of hardware and software

Cloud management is the management of cloud computing products and services.

Samanage, now part of SolarWinds, an enterprise service-desk and IT asset-management provider, had its headquarters in Cary, North Carolina. The company's flagship product, Samanage, operates as a multi-tenant, Software-as-a-Service (SaaS) system for IT and enterprise service management. As of November 2018, Samanage had more than 1,800 customers in 50+ countries.

Robotic process automation (RPA) is a form of business process automation that is based on software robots (bots) or artificial intelligence (AI) agents. RPA should not be confused with artificial intelligence as it is based on automotive technology following a predefined workflow. It is sometimes referred to as software robotics.

"X as a service" is a phrasal template for any business model in which a product use is offered as a subscription-based service rather than as an artifact owned and maintained by the customer. Originating from the software as a service concept that appeared in the 2010s with the advent of cloud computing, the template has expanded to numerous offerings in the field of information technology and beyond it. The term XaaS can mean "anything as a service".

Absolute Software Corporation is an American-Canadian company that provides products and services in the fields of endpoint security and zero trust security. It was publicly traded company on the Toronto Stock Exchange (TSX) and Nasdaq until it was acquired by Crosspoint Capital Partners in July 2023.

A secure access service edge (SASE) is technology used to deliver wide area network (WAN) and security controls as a cloud computing service directly to the source of connection rather than a data center. It uses cloud and edge computing technologies to reduce the latency that results from backhauling all WAN traffic over long distances to one or a few corporate data centers, due to the increased movement off-premises of dispersed users and their applications. This also helps organizations support dispersed users.

References

  1. "What is Data Center Management?". Sunbird DCIM. Archived from the original on December 14, 2021.
  2. Ann Bednarz (May 24, 2018). "Data-center management: What does DMaaS deliver that DCIM doesn't?". Network World. Archived from the original on December 9, 2023.
  3. Taylor, Christine (April 17, 2017). "Cloud Computing and Service Level Agreements (SLAs)". Datamation . Archived from the original on October 13, 2022.
  4. Hardy, Quentin (August 21, 2012). "Dell Makes Moves to Survive in Cloud-Centric World". The New York Times . Archived from the original on January 16, 2023.
  5. 1 2 Lohr, Steve (October 8, 2007). "Google and I.B.M. Join in 'Cloud Computing' Research" . The New York Times. Archived from the original on October 24, 2023.
  6. 1 2 3 Perez, Juan Carlos (July 29, 2008). "Yahoo, Intel and HP Form Cloud Computing Labs". The New York Times. IDG News Service. Archived from the original on January 16, 2023.
  7. "Coinages That Last" . The New York Times . August 9, 2003. Archived from the original on January 17, 2023. Many buzzwords, like coopetition and thought-leading,
  8. Tedeschi, Bob (November 7, 2005). "The Online Travel Landscape Is Getting Crowded" . The New York Times . Archived from the original on January 17, 2023. ... which Internet analysts love to call "coopetition."
  9. Fontana, John (October 27, 2008). "Meeting virtualization management challenges". The New York Times. IDG. Archived from the original on January 17, 2023.
  10. The Times article mentions "a crop of next-tier vendors, start-ups and open source players."
  11. "Data Center Management". The Data Center Journal. Retrieved October 28, 2018.
  12. Matt Hancock (October 26, 2018). "Power struggle". Computer Weekly. a row is brewing over an EU plan to curb datacentre energy use
  13. "Flights Cancelled for more than 75,000 passengers". Reuters. May 29, 2017.
  14. David Gewirtz (May 30, 2017). "The astonishing hidden and personal costs of IT downtime (and how predictive analytics might help)". ZDNet .
  15. "What is business service management (BSM)?".
  16. 1 2 Jenko Gaviglia. "Business Service Management" (PDF). IBM.com. Archived from the original (PDF) on December 25, 2018.
  17. Ghose, A. K.; Lê, L. S.; Hoesch-Klohe, K.; Morrison, E. (2011). "The Business Service Representation Language: A Preliminary Report". Towards a Service-Based Internet. ServiceWave 2010 Workshops. Lecture Notes in Computer Science. Vol. 6569. pp. 145–152. doi:10.1007/978-3-642-22760-8_16. ISBN   978-3-642-22759-2.
  18. Bednarz (June 2010). "Targeting hybrid IT environments". Computerworld .
  19. "Remote Data Center Management".
  20. Quentin Hardy (November 17, 2012). "Hard Times Could Create a Tech Boom".
  21. "In 2014, Proactive UPS Maintenance is Essential for all Data Center Managers" (PDF). UPS-redundant configurations, providing backups for backups that have their own backups.
  22. or IT asset management (ITAM)
  23. "IT Asset Management (ITAM)". Gartner. May 18, 2013. Retrieved January 15, 2019.
  24. 1 2 Brent Bowers (March 13, 2008). "It's Easy, and Expensive, to Forget About Old Equipment". The New York Times .
  25. "Tracking All the Data: Data Center Infrastructure Management ..." ECmag. (DCIM) software enables ... integration ...
  26. "Data Center Infrastructure Management – Data Center Handbook".
  27. "Measure and manage the risk inherent in your IT infrastructure". Network World . August 13, 2010.
  28. Tom Coughlin (September 9, 2018). "Green Computing And Storage". Forbes .
  29. "Mission: Green Computing" by Supermicro Introduces Total Cost". The New York Times . August 20, 2018. Archived from the original on November 30, 2018. Retrieved December 3, 2018.
  30. "Measuring Data Center Efficiency: Easier Said Than Done". Dell.com. Archived from the original on October 27, 2010. Retrieved June 25, 2012.
  31. "Computational-Fluid-Dynamic (CFD) Analysis | Gartner IT Glossary". gartner.com. Retrieved August 27, 2014.
  32. "Computer Operators".
  33. 1 2 Site Reliability Engineering: How Google Runs Production Systems. O'Reilly. 2016. ISBN   978-1-491-92912-4.
  34. "Premier 100 Q&A: HP's CIO sees 'lights-out' data centers". Informationweek . March 6, 2006.
  35. "From Manhattan to Montvale". The New York Times . April 20, 1986.
  36. Ashlee Vance (December 8, 2008). "Dell Sees Double With Data Center in a Container". The New York Times .
  37. "IT Operations – Gartner IT Glossary". gartner.com. February 8, 2012.
  38. Quentin Hardy (October 30, 2012). "An Update for the Corporate Help Desk". The New York Times .
  39. 1 2 Joe Hertvik (July 7, 2016). "IT Support Levels Clearly Explained: L1, L2, L3, and More".
  40. "IT Help Desks Not Just For Large Enterprises". Archived from the original on July 18, 2012. Retrieved May 12, 2019.
  41. "Students – Information technology – Calvin College" (PDF). Calvin College. Retrieved March 23, 2018.
  42. "Help Desk vs Service Desk vs ITSM".
  43. "How to Use Online Forums". Inc. April 22, 2010.
  44. "Technical support for the neighbours". BBC News. March 28, 2005. Retrieved March 6, 2008.
  45. Dell moves outsourced jobs back to U.S. shores
  46. Berkley, Susan; Maggie Klenke. "Call Centre Trends". The Great Voice Company. Retrieved May 2, 2008.
  47. Arthur, Charles (July 18, 2012). "Virus phone scam being run from call centres in India". Guardian. Retrieved March 31, 2014.
  48. Ben Zimmer (April 18, 2010). "Wellness". The New York Times . Complaints about preventative go back to the late 18th century ... ("Oxford English Dictionary dates preventive to 1626 and preventative to 1655) ..preventive has won"
  49. "What is Preventive Maintenance?". MicroMain.com.
  50. "What is preventive maintenance?". BusinessDictionary.com. Archived from the original on November 19, 2018. Retrieved November 18, 2018.
  51. Samir Mehra (September 11, 2018). "Capacity Planning in the Era of Infinite Capacity".
  52. "Data Center Capacity Planner Jobs, Employment". indeed.com (job search). 293 Data Center Capacity Planner jobs available on Indeed.com
  53. 1 2 Thomas A. Limoncelli; Strata R. Chalup; Christina J. Hogan. "Room to grow: Tips for data center capacity planning". Computerworld .
  54. since this consumes both computing and storage resources
  55. J Xu; M Zhao; J Fortes; R Carpenter (2007). "On the use of fuzzy modeling in virtualized data center management". Fourth International Conference on Autonomic Computing (ICAC'07). p. 25. doi:10.1109/ICAC.2007.28. ISBN   978-0-7695-2779-6. S2CID   16153431.
  56. "Cloudscene Rankings: Top Data Centers & Service Providers Worldwide". Cloudscene. Archived from the original on October 26, 2018. Retrieved November 9, 2018.