SEG-Y

Last updated
SEG-Y
Seg-y picture.gif
An image of seismic line that was originally encoded in the SEG-Y format.
Filename extension .sgy, .segy
Developed bySEG Technical Standards Committee
Initial release1975;49 years ago (1975)
Latest release
rev 2.1
October 2023;6 months ago (2023-10)
Type of format Reflection seismology data
Extended fromSEG "Ex"
Website https://library.seg.org/pb-assets/technical-standards/seg_y_rev1-1686080991247.pdf

The SEG-Y (sometimes SEG Y) file format is one of several standards developed by the Society of Exploration Geophysicists (SEG) for storing geophysical data. It is an open standard, and is controlled by the SEG Technical Standards Committee, a non-profit organization.

Contents

History

The format was originally developed in 1973 to store single-line seismic reflection digital data on magnetic tapes. The specification was published in 1975. [1]

The format and its name evolved from the SEG "Ex" or Exchange Tape Format. [1] [2] However, since its release, there have been significant advancements in geophysical data acquisition, such as 3-dimensional seismic techniques and high speed, high capacity recording.

The most recent revision of the SEG-Y format was published in 2023, named the rev 2.1 specification. [3] It still features certain legacies of the original format (referred as rev 0), such as an optional SEG-Y tape label, the main 3200 byte textual EBCDIC character encoded tape header and a 400 byte binary header.

Notice about SEG-Y rev 2.1 released in November 2023:Hagelund, Rune; Stewart A. Levin & Shawn New, eds. (2023). SEG-Y_r2.1: seg_y_rev2-1_nov2023-announcement format (PDF). Tulsa, OK: Society of Exploration Geophysicists.</ref>


Data structure

SEGY file byte stream structure.svg

This image shows the byte stream structure of a SEG-Y file, with rev 1 Extended Textual File Header records.

Since the first SEG-Y standard was published, many companies dealing with seismic data have produced variants of the SEG-Y standard which have run contrary to the aims of defining a standard for universal interchange, thus generally causing confusion and delay when data received by a company in expected SEG-Y format turns out to be a variant of that format. Initially, many of these derived from the fact that the format was based on the de facto standard of using IBM computers for digital processing where character data was coded in EBCDIC and number data in IBM Floating Point, whereas processing systems in use quickly evolved based on ASCII character and IEEE number representations.

Even before the SEG-Y standard was agreed and published, earlier seismic data format standards published by the SEG such as SEG-A, SEG-B and SEG-C were modified by seismic acquisition companies, though not to the wide extent that the SEG-Y format has been modified by seismic acquisition and processing companies and oil companies using their own inhouse software.

As magnetic tape technology developed, the original SEG-Y format using individual small data blocks for each distinct seismic trace became very inefficient in terms of tape performance so the first and subsequent revisions allowed for larger tape data blocks containing many individual traces.

There have been many suggestions for including different kinds of metadata within the standard over the years, since when the standard was first proposed the processes of acquisition and processing were technologically much simpler.

For example geographical positioning information either real world or relative wasn't stored in the trace header at acquisition or final processing whereas it is routine today.

However, the relative simplicity of the SEG-Y format has meant that it has worked well for interchange of seismic data allowing anyone to read and understand data recorded in the 1970s on half inch magnetic tape as easily as reading it on modern tape media or from a disk file, as long as the standard has been adhered to.

While SEG-Y format data is still stored on magnetic tape as a permanent archive, SEG-Y data is increasingly stored as disk files for online and near-line ease of access, and later format revisions allow for character and number data to be stored in native system representations as ASCII and IEEE.

See also

Related Research Articles

<span class="mw-page-title-main">ASCII</span> American character encoding standard

ASCII, abbreviated from American Standard Code for Information Interchange, is a character encoding standard for electronic communication. ASCII codes represent text in computers, telecommunications equipment, and other devices. Because of technical limitations of computer systems at the time it was invented, ASCII has just 128 code points, of which only 95 are printable characters, which severely limited its scope. Modern computer systems have evolved to use Unicode, which has millions of code points, but the first 128 of these are the same as the ASCII set.

Extended Binary Coded Decimal Interchange Code is an eight-bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems. It descended from the code used with punched cards and the corresponding six-bit binary-coded decimal code used with most of IBM's computer peripherals of the late 1950s and early 1960s. It is supported by various non-IBM platforms, such as Fujitsu-Siemens' BS2000/OSD, OS-IV, MSP, and MSP-EX, the SDS Sigma series, Unisys VS/9, Unisys MCP and ICL VME.

Resource Interchange File Format (RIFF) is a generic file container format for storing data in tagged chunks. It is primarily used for audio and video, though it can be used for arbitrary data.

In computing, tar is a computer software utility for collecting many files into one archive file, often referred to as a tarball, for distribution or backup purposes. The name is derived from "tape archive", as it was originally developed to write data to sequential I/O devices with no file system of their own, such as devices that use magnetic tape. The archive data sets created by tar contain various file system parameters, such as name, timestamps, ownership, file-access permissions, and directory organization. POSIX abandoned tar in favor of pax, yet tar sees continued widespread use.

uuencoding is a form of binary-to-text encoding that originated in the Unix programs uuencode and uudecode written by Mary Ann Horton at the University of California, Berkeley in 1980, for encoding binary data for transmission in email systems.

<span class="mw-page-title-main">Reflection seismology</span> Explore subsurface properties with seismology

Reflection seismology is a method of exploration geophysics that uses the principles of seismology to estimate the properties of the Earth's subsurface from reflected seismic waves. The method requires a controlled seismic source of energy, such as dynamite or Tovex blast, a specialized air gun or a seismic vibrator. Reflection seismology is similar to sonar and echolocation.

<span class="mw-page-title-main">Society of Exploration Geophysicists</span> Nonprofit geoscience organization

The Society of Exploration Geophysicists (SEG) is a learned society dedicated to promoting the science and education of exploration geophysics in particular and geophysics in general. The Society fosters the expert and ethical practice of geophysics in the exploration and development of natural resources, in characterizing the near-surface, and in mitigating earth hazards. As of November 2019, SEG has more than 14,000 members working in more than 114 countries. SEG was founded in 1930 in Houston, Texas but its business office has been headquartered in Tulsa, Oklahoma since the mid-1940s. While most SEG members are involved in exploration for petroleum, SEG members also are involved in application of geophysics methods to mineral exploration as well as environmental and engineering problems, archaeology, and other scientific endeavors. SEG publishes The Leading Edge (TLE), a monthly professional magazine, Geophysics, a peer-reviewed archival publication, and Interpretation, a peer-reviewed journal co-published by SEG and the American Association of Petroleum Geologists.

<span class="mw-page-title-main">STL (file format)</span> File format for stereolithography applications

STL is a file format native to the stereolithography CAD software created by 3D Systems. Chuck Hull, the inventor of stereolithography and 3D Systems’ founder, reports that the file extension is an abbreviation for stereolithography.

<span class="mw-page-title-main">IBM Displaywriter System</span> 1980 office desktop computer

The IBM 6580 Displaywriter System is a 16-bit microcomputer that was marketed and sold by IBM's Office Products Division primarily as a word processor. Announced on June 17, 1980 and effectively withdrawn from marketing on July 2, 1986, the system was sold with a 5 MHz Intel 8086, 128 KB to 448 KB of RAM, a swivel-mounted monochrome CRT monitor, a detached keyboard, a detached 8" floppy disk drive enclosure with one or two drives, and a detached daisy wheel printer, or Selectric typewriter printer. The primary operating system for the Displaywriter is IBM's internally developed word processing software titled "Textpack", but UCSD p-System, CP/M-86, and MS-DOS were also offered by IBM, Digital Research, and CompuSystems, respectively.

Nigel Allister Anstey is a British geophysicist who has made major contributions to seismic exploration, which are the foundations for many of the techniques used in today's oil and gas exploration. Anstey's contributions impact every major area of seismic exploration -– from seismic acquisition to seismic processing to interpretation to research. He is the holder of over 50 multinational patents. He is best known by many geoscientists for distilling the geophysical concepts of the seismic method into non-mathematical teachings for seismic interpreters.

Intel hexadecimal object file format, Intel hex format or Intellec Hex is a file format that conveys binary information in ASCII text form, making it possible to store on non-binary media such as paper tape, punch cards, etc., to display on text terminals or be printed on line-oriented printers. The format is commonly used for programming microcontrollers, EPROMs, and other types of programmable logic devices and hardware emulators. In a typical application, a compiler or assembler converts a program's source code to machine code and outputs it into a object or executable file in hexadecimal format. In some applications, the Intel hex format is also used as a container format holding packets of stream data. Common file extensions used for the resulting files are .HEX or .H86. The HEX file is then read by a programmer to write the machine code into a PROM or is transferred to the target system for loading and execution. There are various tools to convert files between hexadecimal and binary format, and vice versa.

Jon F. Claerbout is an American geophysicist and seismologist. He is the Cecil Green Professor Emeritus of Geophysics at Stanford University. Since the later half of the 20th century, he has been a leading researcher and pioneered the use of computers in processing and filtering seismic exploration data, eventually developing the field of time series analysis and seismic interferometry, modelling the propagation of seismic waves.

Seismic migration is the process by which seismic events are geometrically re-located in either space or time to the location the event occurred in the subsurface rather than the location that it was recorded at the surface, thereby creating a more accurate image of the subsurface. This process is necessary to overcome the limitations of geophysical methods imposed by areas of complex geology, such as: faults, salt bodies, folding, etc.

Andrew S. Long is an Australian geophysicist. He has a PhD in geophysics (1996) from the University of Western Australia, and a post-doctoral term at Stanford University. He is a leader in the application of geophysical technologies to exploration for oil and gas in marine areas, and has written and presented several papers at the Society of Exploration Geophysicists (SEG), the European Association of Geoscientists and Engineers (EAGE), the Australian Society of Exploration Geophysicists (ASEG), the Australian Petroleum Production & Exploration Association (APPEA) and many other international conventions and journals.

A National Data Repository (NDR) is a data bank that seeks to preserve and promote a country's natural resources data, particularly data related to the petroleum exploration and production (E&P) sector.

Shell Processing Support (SPS) is a set of formatted files used for exchanging Land 3D seismic data. They can also be used for 2D.

Öz Yılmaz is the Chief Technology Officer of GeoTomo LLC and the founder of Anatolian Geophysical. He is the author of Seismic Data Processing and Seismic Data Analysis, the principle reference volumes for the seismic processing industry.

Robert E. Sheriff was an American geophysicist best known for writing the comprehensive geophysical reference, Encyclopedic Dictionary of Exploration Geophysics. His main research interests included the seismic detailing of reservoirs, in 3-D seismic interpretation and seismic stratigraphy, and practical applications of geophysical methods. Hua-Wei Zhou, Department Chair of the Department of Earth and Atmospheric Sciences, said about Sheriff: “…a giant figure in the world of exploration geophysics… When I think about Bob, a number of key words pop up in my mind: kindness, honesty, hardworking, seeking perfection, generosity and wisdom.”

References

  1. 1 2 Barry, K.M.; Cavers, D.A.; Kneale, C.W. (1975). "Recommended standards for digital tape formats" (PDF). Geophysics. 40 (2): 344–352. Bibcode:1975Geop...40..344B. doi:10.1190/1.1440530.
  2. Northwood, E.J.; Weisinger, R.C.; Bradley, J.J. (1967). "Recommended standards for digital tape formats" (PDF). Geophysics. 32 (6): 1073–1084.
  3. Hagelund, Rune; Stewart A. Levin & Shawn New, eds. (2023). SEG-Y_r2.1: SEG-Y revision 2.1 Data Exchange format (PDF). Tulsa, OK: Society of Exploration Geophysicists.