Runbook

Last updated
Runbook workflow Automation workflow map.png
Runbook workflow

In a computer system or network, a runbook is a compilation of routine procedures and operations that the system administrator or operator carries out. System administrators in IT departments and NOCs use runbooks as a reference.

Contents

Runbooks can be in either electronic or in physical book form. Typically, a runbook contains procedures to begin, stop, supervise, and debug the system. It may also describe procedures for handling special requests and contingencies. An effective runbook allows other operators, with prerequisite expertise, to effectively manage and troubleshoot a system.

Through runbook automation, [1] these processes can be carried out using software tools in a predetermined manner. In addition to automating IT specific processes, the results of the runbook can be presented on-screen back to the user or Service Desk engineer. [2] Multiple runbooks can be linked together using a Decision Tree to provide users with interactive troubleshooting and guided procedures. [3]

Runbook applied to operations

Operational runbooks may be tied to ITIL incidents to allow repeatable processes supporting specific aspects of the service catalog. [4] The runbook is typically divided into routine automated processes and routine manual processes. The runbook catalog begins with an index of processes covered and may be broken down in outline form to align the processes to the major elements they support in the service catalog. A runbook is a compilation of routine procedures and operations that the system administrator or operator carries out.

Computer operator documentation

The documentation for running a task, [5] whether in electronic or paper form, is called a runbook. [6] Sometimes written as "run book," it may supplement bullet-pointed steps with error messages (and what to do) and flowcharts. [7]

Runbook automation

Runbook automation (RBA) [8] is the ability to define, build, orchestrate, manage, and report on workflows that support system and network operational processes. Areas of a business ideal for IT automation are Operations Teams, Service Desk, Network Operations Center's (NOC's), Cloud Operations, Integrations, and Automation Center of Excellence (CoE).

A runbook workflow can potentially interact with all types of infrastructure elements, such as applications, databases, and hardware - using a variety of communication methods such as command-line interfaces (CLI), HTTP REST and SOAP API's, SSH sessions, scripts, utilities, and code libraries.

According to Gartner, the growth of RBA has coincided with the need for IT operations executives to enhance IT operations efficiency measures—including reducing mean time to repair (MTTR), increasing mean time between failures (MTBF), and automating the provisioning of IT resources. In addition, it is necessary to have the mechanisms to implement best practices (for example, implement and manage IT operations processes in line with the ITIL), increase the effectiveness of IT personnel (for example, automate repetitive tasks associated with IT operations processes), and have the tools to report on how well the processes are executed in line with established policies and service levels. Patents have been granted for various aspects of producing, improving, and using runbooks. [9]

Some tools can incorporate a front-end or presentation layer (to the runbook) so that the results of the automated tasks are presented on-screen, sometimes even with red/amber/green traffic lights to visualise if a specific task completed successfully, or failed to execute correctly. More advanced runbook automation platforms incorporate dashboards, analytics and audit trails for regulatory reporting.

See also

Related Research Articles

Information technology service management (ITSM) are the activities performed by an organization to design, build, deliver, operate and control information technology (IT) services offered to customers.

Business process automation (BPA), also known as business automation,distinguished from Business Process Management (BPM), is the technology-enabled automation of business processes. It can help a business in simplicity, to increase digital transformation, increase service quality, improve service delivery, or contain costs. BPA consists of integrating applications, restructuring labor resources, and using software applications throughout the organization. Robotic process automation is an emerging field within BPA.

Microsoft Servers is a discontinued brand that encompasses Microsoft software products for server computers. This includes the Windows Server editions of the Microsoft Windows operating system, as well as products targeted at the wider business market. Microsoft has since replaced this brand with Microsoft Azure, Microsoft 365 and Windows 365.

Operations, administration, and management or operations, administration, and maintenance are the processes, activities, tools, and standards involved with operating, administering, managing and maintaining any system. This commonly applies to telecommunication, computer networks, and computer hardware.

Operations architecture allows the ongoing support and management of IT services infrastructure of an enterprise. The IT infrastructure of an enterprise will typically comprise many different systems and platforms, often in different geographic locations. Operations architecture ensures that these systems perform as expected by centrally unifying the control of operational procedures and automating the execution of operational tasks. It also reports the performance of the IT infrastructure. The implementation of an operations architecture consists of a dedicated set of tools and processes which are designed to provide centralisation and automation.

Database administration is the function of managing and maintaining database management systems (DBMS) software. Mainstream DBMS software such as Oracle, IBM Db2 and Microsoft SQL Server need ongoing management. As such, corporations that use DBMS software often hire specialized information technology personnel called database administrators or DBAs.

Business process management (BPM) is the discipline in which people use various methods to discover, model, analyze, measure, improve, optimize, and automate business processes. Any combination of methods used to manage a company's business processes is BPM. Processes can be structured and repeatable or unstructured and variable. Though not required, enabling technologies are often used with BPM.

CMDBuild is an open source web enterprise environment to configure custom applications for asset management.

Application service automation is the field where the operations needed to deploy and service data center applications are automated in order to centrally and accurately control application change.

<span class="mw-page-title-main">Release management</span> Process of software building

Release management is the process of managing, planning, scheduling and controlling a software build through different stages and environments; it includes testing and deploying software releases.

Continuous testing is the process of executing automated tests as part of the software delivery pipeline to obtain immediate feedback on the business risks associated with a software release candidate. Continuous testing was originally proposed as a way of reducing waiting time for feedback to developers by introducing development environment-triggered tests as well as more traditional developer/tester-triggered tests.

Collaborative workflow is the convergence of social software with service management (workflow) software. As the definition implies, collaborative workflow is derived from both workflow software and social software such as chat, instant messaging, and document collaboration.

Marketing automation refers to software platforms and technologies designed for marketing departments and organizations to more effectively market on multiple channels online and automate repetitive tasks.

Omnitracker is a proprietary business process platform developed by Omninet GmbH in Germany. Omnitracker is developed as modular software for medium and large companies. It is used to track, control and evaluate relevant business processes. The main area of application is the support of ITIL compliant processes, which manages requests, incidents/ errors, customer inquiries from the moment of creation to completion. Omnitracker received certifications of the Federal Association of IT-Mittelstand and Pink Elephant that covers nine processes.

BlackLine Systems, Inc., is an American enterprise software company that develops cloud-based services designed to automate and control the entire financial close process. The Los Angeles–based company has 11 offices worldwide.

IT process automation (ITPA) is a series of processes which facilitate the orchestration and integration of tools, people and processes through automated workflows. ITPA software applications can be programmed to perform any repeatable pattern, task or business workflow that was once handled manually by humans.

Data center management is the collection of tasks performed by those responsible for managing ongoing operation of a data center. This includes Business service management and planning for the future.

Creatio is a global vendor of software to automate workflows and a CRM with no-code development.

Intelligent automation (IA), or alternately intelligent process automation, is a software term that refers to a combination of artificial intelligence (AI) and robotic process automation (RPA). Companies use intelligent automation to cut costs and streamline tasks by using artificial-intelligence-powered robotic software to mitigate repetitive tasks. As it accumulates data, the system learns in an effort to improve its efficiency. Intelligent automation applications consist of but are not limited to, pattern analysis, data assembly, and classification. The term is similar to hyperautomation, a concept identified by research group Gartner as being one of the top technology trends of 2020.

Security orchestration, automation and response (SOAR) is a group of cybersecurity technologies that allow organizations to respond to some incidents automatically. It collects inputs monitored by the security operations team such as alerts from the SIEM system, TIP, and other security technologies and helps define, prioritize, and drive standardized incident response activities.

References

  1. IBM has a software product called "IBM Runbook Automation." "IBM Runbook Automation". IBM . Archived from the original on 14 June 2019. IBM Runbook Automation helps DevOps and IT Operations management teams to simplify and automate repetitive tasks.
  2. "Service Desk Automation". Automate the collection of key troubleshooting information into an easy-to-read, color-coded dashboard to quickly find where the issue likely resides.
  3. "Interactive troubleshooting and guided procedures". SMEs can document and design conditional logic that guides less-experienced agents through best practice procedures that pull in pre-built automations that overcome common hurdles related to system access and permissions.
  4. Don Krapohl. "An Integrated Approach to Organizational Transformation". AugmentedIntel. Don Krapohl. Retrieved 1 May 2013.
  5. "job"
  6. "Operator documentation". Computerworld . November 13, 1978. p. 38. What the operators need and want is a runbook for each application
  7. "Precollated Run Book". Computerworld . May 16, 1983. p. 95.
  8. "RBA : Run Book Automation".
  9. "Automating the production of runbook workflows".

Further reading