Oracle Grid Engine

Last updated
Oracle Grid Engine
Developer(s) Oracle Corporation (formerly Sun Microsystems) in association with the community
Stable release
6.2u8; see the information on forks in the first section for sources for recent versions of the technology / October 1, 2012 (2012-10-01)
Operating system Cross-platform
Type Grid computing
License SISSL
Website www.oracle.com/technetwork/oem/grid-engine-166852.html

Oracle Grid Engine, [1] previously known as Sun Grid Engine (SGE), CODINE (Computing in Distributed Networked Environments) or GRD (Global Resource Director), [2] was a grid computing computer cluster software system (otherwise known as a batch-queuing system), acquired as part of a purchase of Gridware, [3] then improved and supported by Sun Microsystems and later Oracle. There have been open source versions and multiple commercial versions of this technology, initially from Sun, later from Oracle and then from Univa Corporation.

Contents

On October 22, 2013 Univa announced it acquired the intellectual property and trademarks for the Grid Engine technology and that Univa will take over support. [4] Univa has since evolved the Grid Engine technology, e.g. improving scalability as demonstrated by a 1 million core cluster in Amazon Web Services (AWS) announced on June 24, 2018. [5]

The original Grid Engine open-source project website closed in 2010, but versions of the technology are still available under its original Sun Industry Standards Source License (SISSL). [6] Those projects were forked from the original project code and are known as Son of Grid Engine, [7] Open Grid Scheduler [8] and Univa Grid Engine. [9]

Grid Engine is typically used on a computer farm or high-performance computing (HPC) cluster and is responsible for accepting, scheduling, dispatching, and managing the remote and distributed execution of large numbers of standalone, parallel or interactive user jobs. It also manages and schedules the allocation of distributed resources such as processors, memory, disk space, and software licenses.

Grid Engine used to be the foundation of the Sun Grid utility computing system, made available over the Internet in the United States in 2006, [10] later becoming available in many other countries and having been an early version of a public cloud computing facility predating AWS, for instance.

History

In 2000, Sun acquired Gridware a privately owned commercial vendor of advanced computing resource management software with offices in San Jose, Calif., and Regensburg, Germany. [11] Later that year, Sun offered a free version of Gridware for Solaris and Linux, and renamed the product Sun Grid Engine.

In 2001, Sun made the source code available, [12] and adopted the open source development model. Ports for Mac OS X and *BSD were contributed by the non-Sun open source developers. [13]

In 2010, after the purchase of Sun by Oracle, the Grid Engine 6.2 update 6 source code was not included with the binaries, and changes were not put back to the project's source repository. [14] In response to this, the Grid Engine community started the Open Grid Scheduler project to continue to develop and maintain a free implementation of Grid Engine. [15] [16] [17]

On January 18, 2011, it was announced that Univa had recruited several principal engineers from the former Sun Grid Engine team and that Univa would be developing their own forked version of Grid Engine. The newly announced Univa Grid Engine did include commercial support and would compete with the official version of Oracle Grid Engine. [18] [19] [20]

On Oct 22, 2013 Univa has announced that it had acquired the intellectual property and trademarks pertaining to the Grid Engine technology and that Univa will take over support for Oracle Grid Engine customers. [21]

In September 2020, Altair Engineering, a global technology company providing solutions in data analytics, product development, and high-performance computing (HPC) acquired Univa. [22]

Cluster architecture

A screenshot of the xml-qstat web interface in 2007 Xml-qstat.jpg
A screenshot of the xml-qstat web interface in 2007

A typical Grid Engine cluster consists of a master host and one or more execution hosts. Multiple shadow masters can also be configured as hot spares, which take over the role of the master when the original master host crashes. [23]

Support and training

Univa is providing commercial support and training for Univa Grid Engine and Oracle Grid Engine. Below is a description of some of the historic options.

Sun provided support contracts for the commercial version of Grid Engine on most UNIX platforms and Windows. [24] Professional services, consulting, training, and support were provided by Sun Partners. [25] Sun partners with Georgetown University to deliver Grid Engine administration classes. [26] The Bioteam runs short SGE training workshops that are 1 or 2 days long. [27]

Users obtained community support on the Grid Engine mailing lists. [28] Grid Engine Workshops were held in 2002, 2003, 2007, 2009, and 2012 in Regensburg, Germany. [29]

Other Grid Engine based products

The below contains historic information and some of the products and solutions are no longer available:

See also

Related Research Articles

<span class="mw-page-title-main">MySQL</span> SQL database engine software

MySQL is an open-source relational database management system (RDBMS). Its name is a combination of "My", the name of co-founder Michael Widenius's daughter My, and "SQL", the abbreviation for Structured Query Language. A relational database organizes data into one or more data tables in which data may be related to each other; these relations help structure the data. SQL is a language programmers use to create, modify and extract data from the relational database, as well as control user access to the database. In addition to relational databases and SQL, an RDBMS like MySQL works with an operating system to implement a relational database in a computer's storage system, manages users, allows for network access and facilitates testing database integrity and creation of backups.

<span class="mw-page-title-main">Sun Microsystems</span> American computer company, 1982–2010

Sun Microsystems, Inc. was an American technology company that sold computers, computer components, software, and information technology services and created the Java programming language, the Solaris operating system, ZFS, the Network File System (NFS), and SPARC microprocessors. Sun contributed significantly to the evolution of several key computing technologies, among them Unix, RISC processors, thin client computing, and virtualized computing. Notable Sun acquisitions include Cray Business Systems Division, Storagetek, and Innotek GmbH, creators of VirtualBox. Sun was founded on February 24, 1982. At its height, the Sun headquarters were in Santa Clara, California, on the former west campus of the Agnews Developmental Center.

<span class="mw-page-title-main">Oracle Solaris</span> Unix operating system originally developed by Sun Microsystems

Solaris is a proprietary Unix operating system originally developed by Sun Microsystems. After the Sun acquisition by Oracle in 2010, it was renamed Oracle Solaris.

<span class="mw-page-title-main">Beowulf cluster</span> Type of computing cluster

A Beowulf cluster is a computer cluster of what are normally identical, commodity-grade computers networked into a small local area network with libraries and programs installed which allow processing to be shared among them. The result is a high-performance parallel computing cluster from inexpensive personal computer hardware.

The Globus Toolkit is an open-source toolkit for grid computing developed and provided by the Globus Alliance. On 25 May 2017 it was announced that the open source support for the project would be discontinued in January 2018, due to a lack of financial support for that work. The Globus service continues to be available to the research community under a freemium approach, designed to sustain the software, with most features freely available but some restricted to subscribers.

<span class="mw-page-title-main">DTrace</span> Dynamic tracing framework for kernel and applications

DTrace is a comprehensive dynamic tracing framework originally created by Sun Microsystems for troubleshooting kernel and application problems on production systems in real time. Originally developed for Solaris, it has since been released under the free Common Development and Distribution License (CDDL) in OpenSolaris and its descendant illumos, and has been ported to several other Unix-like systems.

Lustre is a type of parallel distributed file system, generally used for large-scale cluster computing. The name Lustre is a portmanteau word derived from Linux and cluster. Lustre file system software is available under the GNU General Public License and provides high performance file systems for computer clusters ranging in size from small workgroup clusters to large-scale, multi-site systems. Since June 2005, Lustre has consistently been used by at least half of the top ten, and more than 60 of the top 100 fastest supercomputers in the world, including the world's No. 1 ranked TOP500 supercomputer in June 2020, Fugaku, as well as previous top supercomputers such as Titan and Sequoia.

HotSpot, released as Java HotSpot Performance Engine, is a Java virtual machine for desktop and server computers, developed by Sun Microsystems and now maintained and distributed by Oracle Corporation. It features improved performance via methods such as just-in-time compilation and adaptive optimization.

<span class="mw-page-title-main">United Devices</span> A privately held, commercial volunteer computing company

United Devices, Inc. was a privately held, commercial volunteer computing company that focused on the use of grid computing to manage high-performance computing systems and enterprise cluster management. Its products and services allowed users to "allocate workloads to computers and devices throughout enterprises, aggregating computing power that would normally go unused." It operated under the name Univa UD for a time, after merging with Univa on September 17, 2007.

<span class="mw-page-title-main">Platform Computing</span> Software company in Canada

Platform Computing was a privately held software company primarily known for its job scheduling product, Load Sharing Facility (LSF). It was founded in 1992 in Toronto, Ontario, Canada and headquartered in Markham, Ontario with 11 branch offices across the United States, Europe and Asia.

The Terascale Open-source Resource and QUEue Manager (TORQUE) is a distributed resource manager providing control over batch jobs and distributed compute nodes. TORQUE can integrate with the non-commercial Maui Cluster Scheduler or the commercial Moab Workload Manager to improve overall utilization, scheduling and administration on a cluster.

Distributed Resource Management Application API (DRMAA) is a high-level Open Grid Forum (OGF) API specification for the submission and control of jobs to a distributed resource management (DRM) system, such as a cluster or grid computing infrastructure. The scope of the API covers all the high level functionality required for applications to submit, control, and monitor jobs on execution resources in the DRM system.

Rocks Cluster Distribution is a Linux distribution intended for high-performance computing (HPC) clusters. It was started by National Partnership for Advanced Computational Infrastructure and the San Diego Supercomputer Center (SDSC) in 2000. It was initially funded in part by an NSF grant (2000–07), but was funded by the follow-up NSF grant through 2011.

<span class="mw-page-title-main">Univa</span> Software company

Univa was a software company that developed workload management and cloud management products for compute-intensive applications in the data center and across public, private, and hybrid clouds, before being acquired by Altair Engineering in September 2020.

<span class="mw-page-title-main">Cloud computing</span> Form of shared Internet-based computing

Cloud computing is the on-demand availability of computer system resources, especially data storage and computing power, without direct active management by the user. Large clouds often have functions distributed over multiple locations, each location being a data center. Cloud computing relies on sharing of resources to achieve coherence and typically uses a "pay-as-you-go" model, which can help in reducing capital expenses but may also lead to unexpected operating expenses for users.

<span class="mw-page-title-main">Acquisition of Sun Microsystems by Oracle Corporation</span> Agreement announced in 2009 and completed in 2010

The acquisition of Sun Microsystems by Oracle Corporation was completed on January 27, 2010. After the acquisition was completed, Oracle, only a software vendor prior to the merger, owned Sun's hardware product lines, such as SPARC Enterprise, as well as Sun's software product lines, including the Java programming language.

Techila Distributed Computing Engine is a commercial grid computing software product. It speeds up simulation, analysis and other computational applications by enabling scalability across the IT resources in user's on-premises data center and in the user's own cloud account. Techila Distributed Computing Engine is developed and licensed by Techila Technologies Ltd, a privately held company headquartered in Tampere, Finland. The product is also available as an on-demand solution in Google Cloud Launcher, the online marketplace created and operated by Google. According to IDC, the solution enables organizations to create HPC infrastructure without the major capital investments and operating expenses required by new HPC hardware.

Univa Grid Engine (UGE) is a batch-queuing system, forked from Sun Grid Engine (SGE). The software schedules resources in a data center applying user-configurable policies to help improve resource sharing and throughput by maximizing resource utilization. The product can be deployed to run on-premises, using IaaS cloud computing or in a hybrid cloud environment.

Open Scalable File Systems, Inc. (OpenSFS) is a nonprofit organization promoting the Lustre file system. OpenSFS was founded in 2010 to ensure Lustre remains vendor-neutral, open, and free.

<span class="mw-page-title-main">Singularity (software)</span> Free, cross-platform and open-source computer program

Singularity is a free and open-source computer program that performs operating-system-level virtualization also known as containerization.

References

  1. "Oracle Grid Engine". Oracle Corporation. 2010-05-30.
  2. "A Little History Lesson". Sun Microsystems. 2006-06-23.
  3. "Sun snaps up software company Gridware - CNET".
  4. "Univa Completes Acquisition of Grid Engine Assets, Becoming the Sole Commercial Provider of Grid Engine Software". Univa Corporation. 2013-10-22.
  5. "Univa Demonstrates Extreme Scale Automation by Deploying More Than One Million Cores in a Single Univa Grid Engine Cluster using AWS". Univa. 2018-06-24. Retrieved June 24, 2018.
  6. "The Grid Engine Source License". Sun MicroSystems. 2010-06-04. Archived from the original on 2013-11-12. Retrieved 2013-10-23.
  7. 1 2 "Son of Grid Engine". University of Liverpool. Archived from the original on 26 August 2021.
  8. "Open Grid Scheduler". The Open Grid Scheduler Community.
  9. "Univa Grid Engine". Univa.
  10. "World's First Utility Grid Comes Alive on the Internet". Sun Microsystems. 2006-03-22.
  11. "Gridware's resource management software increases efficiency and productivity in compute-intensive technical computing environments". Sun Microsystems. 2000-07-24.
  12. "Sun Microsystems makes SUN GRID ENGINE software available to open source community". Sun Microsystems. 2001-07-23.
  13. "Porting HPC Tools to FreeBSD". 2010-05-14.
  14. Eadline, Douglas. "Grid Engine: Running on All Four Cylinders » ADMIN Magazine". ADMIN Magazine. Retrieved 2021-03-05.
  15. Open Grid Scheduler
  16. Eadline, Ph.D., Douglas (September 1, 2010). "The State of Oracle/Sun Grid Engine". Linux Magazine. Archived from the original on September 4, 2010. Retrieved 2011-01-18.{{cite web}}: CS1 maint: unfit URL (link)
  17. Templeton, Daniel (2010-12-23). "Changes for a Bright Future at Oracle" . Retrieved 2011-01-19.
  18. "Univa Acquires Grid Engine Expertise" (Press release). Univa. 2011-01-18. Archived from the original on 2011-01-21. Retrieved 2011-01-18.
  19. Feldman, Michael (2011-01-18). "Univa Rescues Grid Engine From Oracle". HPCwire. Archived from the original on 2011-01-24. Retrieved 2011-01-18.
  20. Morgan, Timothy Prickett (2011-01-18). "Univa forks Oracle's Sun Grid Engine". The Register. Retrieved 2011-01-18.
  21. "Univa Completes Acquisition of Grid Engine Assets, Becoming the Sole Commercial Provider of Grid Engine Software". Univa Corporation. 2013-10-22.
  22. "Altair Acquires Univa". September 14, 2020.
  23. "How to Install the Shadow Master Host". Sun Grid Engine 6.2u3 blog. August 27, 2009. Archived from the original on October 2, 2011. Retrieved October 6, 2011.
  24. "Sun Store Grid Engine Entitlement Purchase". Sun Microsystems. Archived from the original on 2006-11-13. Retrieved 2008-03-03.
  25. "Sun Grid Engine 6 Partners". Sun Microsystems. Retrieved 2007-12-14.
  26. "Advanced Sun Grid Engine Configuration and Administration Class". Sun Microsystems. Retrieved 2007-12-14.
  27. "Training". The Bioteam Inc. Retrieved 2008-03-24.
  28. "Grid Engine Mail Lists". Sun Microsystems. Archived from the original on December 25, 2008. Retrieved October 6, 2011.
  29. "Grid Engine Workshops". Sun Microsystems. Archived from the original on March 8, 2009. Retrieved October 6, 2011.
  30. "Sun Compute Cluster Solution". Sun Microsystems.
  31. "Some Grid Engine". Michigan Neuroscience Institute, University of Michigan.