Qsub

Last updated

qsub is an IEEE Std 1003.1-2008 Unix command for submitting jobs to a job scheduler, usually in cluster or grid computing. The qsub command is used to submit jobs to Slurm Workload Manager, to TORQUE, and to Oracle Grid Engine; HTCondor calls it condor_qsub. [1]

In computing, a job is a unit of work or unit of execution. A component of a job is called a task or a step. As a unit of execution, a job may be concretely identified with a single process, which may in turn have subprocesses which perform the tasks or steps that comprise the work of the job; or with a process group; or with an abstract reference to a process or process group, as in Unix job control.

A job scheduler is a computer application for controlling unattended background program execution of jobs. This is commonly called batch scheduling, as execution of non-interactive jobs is often called batch processing, though traditional job and batch are distinguished and contrasted; see that page for details. Other synonyms include batch system, distributed resource management system (DRMS), distributed resource manager (DRM), and, commonly today, workload automation (WLA). The data structure of jobs to run is known as the job queue.

Grid computing is the use of widely distributed computer resources to reach a common goal. A computing grid can be thought of as a distributed system with non-interactive workloads that involve a large number of files. Grid computing is distinguished from conventional high-performance computing systems such as cluster computing in that grid computers have each node set to perform a different task/application. Grid computers also tend to be more heterogeneous and geographically dispersed than cluster computers. Although a single grid can be dedicated to a particular application, commonly a grid is used for a variety of purposes. Grids are often constructed with general-purpose grid middleware software libraries. Grid sizes can be quite large.

Related Research Articles

The Globus Toolkit is an open-source toolkit for grid computing developed and provided by the Globus Alliance. On 25 May 2017 it was announced that the open source support for the project would be discontinued in January 2018, due to a lack of financial support for that work. The Globus service continues to be available to the research community under a freemium approach, designed to sustain the software, with most features freely available but some restricted to subscribers.

The Berkeley printing system is one of several standard architectures for printing on the Unix platform. It originated in 2.10BSD, and is used in BSD derivatives such as FreeBSD, NetBSD, OpenBSD, and DragonFly BSD. A system running this print architecture could traditionally be identified by the use of the user command lpr as the primary interface to the print system, as opposed to the System V printing system lp command.

Oracle Grid Engine, previously known as Sun Grid Engine (SGE), CODINE or GRD, was a grid computing computer cluster software system, acquired as part of a purchase of Gridware, then improved and supported by Sun Microsystems and later Oracle. There have been open source versions and multiple commercial versions of this technology, initially from Sun, later from Oracle and then from Univa Corporation.

HTCondor is an open-source high-throughput computing software framework for coarse-grained distributed parallelization of computationally intensive tasks. It can be used to manage workload on a dedicated cluster of computers, or to farm out work to idle desktop computers – so-called cycle scavenging. HTCondor runs on Linux, Unix, Mac OS X, FreeBSD, and Microsoft Windows operating systems. HTCondor can integrate both dedicated resources and non-dedicated desktop machines into one computing environment.

Job Submission Description Language is an extensible XML specification from the Global Grid Forum for the description of simple tasks to non-interactive computer execution systems. Currently at version 1.0, the specification focuses on the description of computational task submissions to traditional high-performance computer systems like batch schedulers.

Geograph Britain and Ireland freely accessible online community for geographically located photographs of Great Britain and Ireland

Geograph Britain and Ireland is a web-based project, initiated in March 2005, to create a freely accessible archive of geographically located photographs of Great Britain and Ireland. Photographs in the Geograph collection are chosen to illustrate significant or typical features of each 1 km × 1 km (100 ha) grid square in the Ordnance Survey National Grid and the Irish national grid reference system. There are 331,957 such grid squares containing at least some land. Each page uses a Geo microformat.

DRMAA or Distributed Resource Management Application API is a high-level Open Grid Forum API specification for the submission and control of jobs to a Distributed Resource Management (DRM) system, such as a Cluster or Grid computing infrastructure. The scope of the API covers all the high level functionality required for applications to submit, control, and monitor jobs on execution resources in the DRM system.

In system software, a job queue, is a data structure maintained by job scheduler software containing jobs to run.

GridWay is an open-source meta-scheduling technology that enables large-scale, secure, reliable and efficient sharing of computing resources, managed by different Distributed Resource Management Systems (DRMS), such as SGE, HTCondor, PBS or LSF, within a single organization or scattered across several administrative domains. To this end, GridWay supports several Grid middlewares.

Sun Cloud was an on-demand Cloud computing service operated by Sun Microsystems prior to its acquisition by Oracle Corporation. The Sun Cloud Compute Utility provided access to a substantial computing resource over the Internet for US$1 per CPU-hour. It was launched as Sun Grid in March 2006. It was based on and supported open source technologies such as Solaris 10, Sun Grid Engine, and the Java platform.

The VHF/UHF Century Club, or VUCC, is an award granted by the American Radio Relay League. The award is given to those amateur radio operators who successfully complete amateur radio communications with other amateur radio stations in a specified number of Maidenhead grid locators. The Maidenhead locator system divides the Earth's surface area into a grid of locators that are each 1° latitude by 2° longitude in size. For the 50 MHz and 144 MHz awards, the minimum number of grid locators is 100, the number from which the award derives its name.

The Simple API for Grid Applications (SAGA) is a family of related standards specified by the Open Grid Forum to define an application programming interface (API) for common distributed computing functionality.

Globus resource allocation manager is a software component of the Globus Toolkit that can locate, submit, monitor, and cancel jobs on Grid computing resources. It provides reliable operation, stateful monitoring, credential management, and file staging.

Univa

Univa is a privately held software company that develops workload management products to manage applications, services and containers. Univa's software-defined computing infrastructure and workload orchestration solutions enable companies to scale computer resources across on-premises, cloud computing, and hybrid systems. The company is based in Chicago, with offices in Canada and Germany.

The National E-Infrastructure Service (NES), formerly the National Grid Service, was an organisation for UK academics and researchers from 2004 through 2011. It was funded by two governmental bodies, Engineering and Physical Sciences Research Council (EPSRC) and the Joint Information Systems Committee (JISC).

Xgrid

Xgrid is a proprietary program and distributed computing protocol developed by the Advanced Computation Group subdivision of Apple Inc that allows networked computers to contribute to a single task.

The Open Science Grid Consortium is an organization that administers a worldwide grid of technological resources called the Open Science Grid, which facilitates distributed computing for scientific research. Founded in 2004, the consortium is composed of service and resource providers, researchers from universities and national laboratories, as well as computing centers across the United States. Members independently own and manage the resources which make up the distributed facility, and consortium agreements provide the framework for technological and organizational integration.

gLite

gLite is a middleware computer software project for grid computing used by the CERN LHC experiments and other scientific domains. It was implemented by collaborative efforts of more than 80 people in 12 different academic and industrial research centers in Europe. gLite provides a framework for building applications tapping into distributed computing and storage resources across the Internet. The gLite services were adopted by more than 250 computing centres and used by more than 15000 researchers in Europe and around the world.

Slurm Workload Manager free and open-source job scheduler for Linux and similar computers

The Slurm Workload Manager, or Slurm, is a free and open-source job scheduler for Linux and Unix-like kernels, used by many of the world's supercomputers and computer clusters.

The Generic Grid-Grid (3G) Bridge is an open-source core job bridging component between different grid infrastructures. Its development started in 2008 within the CancerGrid and EDGeS projects. The aim was to create a generic bridge component that can be used in different grid interoperability scenarios. The 3G Bridge used within the EDGeS project that provides the core component of the Service Grid - Desktop Grid interoperability solution. 3G Bridge helps to connect user communities of different grid systems. For example, communities working on parameter sweep problems and using service grid infrastructures can migrate their applications to the more adequate desktop grid platform using the 3G Bridge technology, resulting in an accelerated research.

References

  1. https://packages.debian.org/file:qsub