In computer hypertext, a URI fragment is a string of characters that refers to a resource that is subordinate to another, primary resource. The primary resource is identified by a Uniform Resource Identifier (URI), and the fragment identifier points to the subordinate resource.
The fragment identifier introduced by a hash mark #
is the optional last part of a URL for a document. It is typically used to identify a portion of that document. The generic syntax is specified in RFC 3986. [1] The hash mark separator in URIs is not part of the fragment identifier.
In URIs, a hash mark #
introduces the optional fragment near the end of the URL. The generic RFC 3986 syntax for URIs also allows an optional query part introduced by a question mark ?
. In URIs with a query and a fragment, the fragment follows the query. Query parts depend on the URI scheme and are evaluated by the server—e.g., http:
supports queries unlike ftp:
. Fragments depend on the document MIME type and are evaluated by the client (web browser). Clients are not supposed to send URI fragments to servers when they retrieve a document. [1] [2]
A URI ending with #
is permitted by the generic syntax and is a kind of empty fragment. In MIME document types such as text/html
or any XML type, empty identifiers to match this syntactically legal construct are not permitted. Web browsers typically display the top of the document for an empty fragment.
The fragment identifier functions differently to the rest of the URI: its processing is exclusively client-sided with no participation from the web server, though the server typically helps to determine the MIME type, and the MIME type determines the processing of fragments. When an agent (such as a web browser) requests a web resource from a web server, the agent sends the URI to the server, but does not send the fragment. Instead, the agent waits for the server to send the resource, and then the agent processes the resource according to the document type and fragment value. [3]
In an HTML web page, the agent will look for an anchor identified with an HTML tag that includes an id=
or name=
attribute equal to the fragment identifier.
text/html
pages such as http://www.example.org/foo.html#bar
the fragment refers to the element with id="bar"
. :target
CSS pseudoclass; Wikipedia uses this to highlight the selected reference. Notably CSS display: block
can be used to show content only if it is the target, and otherwise hidden by display: none
.name
attribute of the <a>
element served the same purpose, but is now obsolete in favor of the id
attribute, which can be applied to any element. [4] xml:id
or similar id
attributes follow the Name
-syntax and begin with a letter, underscore, or colon. Notably they cannot begin with a digit or hyphen. [5] xml:id
is one of the few generic XML attributes, e.g., xml:lang
, which can be used without explicitly declaring a namespace. [6] In XHTML id
has to be used, because XHTML was specified before xml:id
existed.http://www.example.org/foo.xml#xpointer(//Rube)
refers to all XML elements named "Rube" in the document identified by the URI http://www.example.org/foo.xml. An XPointer processor, given that URI, would obtain a representation of the document (such as by requesting it from the Internet) and would return a representation of the document's "Rube" elements.http://www.w3.org/2004/02/skos/core#broader
identifies the concept "broader" in SKOS Core vocabulary, but it does not refer to a specific part of the resource identified by http://www.w3.org/2004/02/skos/core
, a complete RDF file in which semantics of this specific concept is declared, along with other concepts in the same vocabulary. text/plain
documents RFC 5147 specifies a fragment identifier for the character and line positions and ranges within the document using the keywords "char
" and "line
". Browser support seems lacking. [7] The following example identifies lines 11 through 20 of a text document: http://example.com/document.txt#line=10,20
text/csv
documents, RFC 7111 specifies a fragment identifier as a selector for rows, columns, and cells using the keywords "row
" , "col
", and "cell
", for example: http://example.com/data.csv#row=4
– Selects the 4th row.http://example.com/data.csv#col=2
– Selects 2nd column.http://example.com/data.csv#row=5-7
– Selects three consecutive rows starting with 5th row.http://example.com/data.csv#row=5-*
– Selects all rows starting with 5th row.http://example.com/data.csv#cell=4,1-6,2
– Selects a region that starts at the 4th row and the 1st column and ends at the 6th row and the 2nd column.t
and xywh
. Therefore, one can use the following media fragments URI in the src
attribute of the audio
or video
HTML5 element: http://example.com/foo.mp4#t=10,20
http://example.com/bar.webm#t=40,80&xywh=160,120,320,240
#01h25m30s
to start playing at the specified position, [9] and YouTube uses similar code such as #t=3m25s
. [10] location.hash
– JavaScript can be also used with other document types. With the rise of AJAX, some websites use fragment identifiers to emulate the back button behavior of browsers for page changes that do not require a reload, or to emulate subpages. #/foo
could be used to extract the value from a key-value pair in a document beginning with { "foo": ["bar", "baz"], ... }
application/pdf
documents PDF viewers recognize a number of fragment identifiers. [13] [14] For instance, a URL ending in .pdf#page=35
will cause most readers to open the PDF and scroll to page 35. Several other parameters are possible, including #nameddest=
(similar to HTML anchors), #search="word1 word2"
, #zoom=
, etc. Multiple parameters can be combined with ampersands: http://example.org/doc.pdf#view=fitb&nameddest=Chapter3
.viewBox()
, preserveAspectRatio()
, and transform()
. [15] Several proposals have been made for fragment identifiers for use with plain text documents (which cannot store anchor metadata), or to refer to locations within HTML documents in which the author has not used anchor tags:
#:~:text=foo
will cause the browser to search for foo
, highlight the matching text, and scroll to it. Besides the start and end, the snippet can also specify a context: text that must precede or follow foo
but will not be highlighted (example that uses #:~:text=night-,vision
to find 'vision' preceded by 'night' ).https://pypi.python.org ... zodbbrowser-0.3.1.tar.gz#md5=38dc89f294b24691d3f0d893ed3c119c
!
. It was used in a now-deprecated approach to index dynamic single-page applications. An exclamation mark is illegal in HTML4, XHTML, and XML identifiers, granting certain degree of separation from that functionality. However, it is allowed in HTML5. [22] http://example.com/page?query#!state
#!
with ?_escaped_fragment_=
[23] history.pushState()
[25] method. [26] #!s!search terms
. Adding a number after the s (#!s10!
) indicates that the browser should search for the nth occurrence of the search term. A negative number (#!s-3!
) starts searching backwards from the end of the document. A Greasemonkey script is available to add this functionality to compatible browsers. [27] http://example.com/index.html#!s3!search terms
match
". [28] They also describe a prototype implementation as an extension for the Firefox browser. For example, the following would find the case-insensitive text "RFC" anywhere in the document: http://example.com/document.txt#match=[rR][fF][cC]
words
" is the first proposal in this scheme. [29] The following example would search a document for the first occurrence of the string "some context for a search term" and then highlight the words "search term": http://example.com/index.html#:words:some-context-for-a-(search-term)
#FWS+C
, where F is the length of the first word (up to five characters), W is the first word itself, S is the length of the selected text and C is a 32-bit CRC of the selected text. [32] They implemented a variant of this scheme as an extension for the Firefox browser, [33] using the form #LFWS+C
, where L is the length of the fragment itself, in two hex digits. Linking to the word "Fragment" using the implemented variant would yield: http://example.com/index.html#115Fragm8+-52f89c4c
A document type definition (DTD) is a specification file that contains set of markup declarations that define a document type for an SGML-family markup language. The DTD specification file can be used to validate documents.
Hypertext Markup Language (HTML) is the standard markup language for documents designed to be displayed in a web browser. It defines the content and structure of web content. It is often assisted by technologies such as Cascading Style Sheets (CSS) and scripting languages such as JavaScript.
A Uniform Resource Identifier (URI), formerly Universal Resource Identifier, is a unique sequence of characters that identifies an abstract or physical resource, such as resources on a webpage, mail address, phone number, books, real-world objects such as people and places, concepts. URIs are used to identify anything described using the Resource Description Framework (RDF), for example, concepts that are part of an ontology defined using the Web Ontology Language (OWL), and people who are described using the Friend of a Friend vocabulary would each have an individual URI.
In computing, a hyperlink, or simply a link, is a digital reference to data that the user can follow or be guided to by clicking or tapping. A hyperlink points to a whole document or to a specific element within a document. Hypertext is text with hyperlinks. The text that is linked from is known as anchor text. A software system that is used for viewing and creating hypertext is a hypertext system, and to create a hyperlink is to hyperlink. A user following hyperlinks is said to navigate or browse the hypertext.
An HTML element is a type of HTML document component, one of several types of HTML nodes. The first used version of HTML was written by Tim Berners-Lee in 1993 and there have since been many versions of HTML. The current de facto standard is governed by the industry group WHATWG and is known as the HTML Living Standard.
A Uniform Resource Name (URN) is a Uniform Resource Identifier (URI) that uses the urn
scheme. URNs are globally unique persistent identifiers assigned within defined namespaces so they will be available for a long period of time, even after the resource which they identify ceases to exist or becomes unavailable. URNs cannot be used to directly locate an item and need not be resolvable, as they are simply templates that another parser may use to find an item.
Web standards are the formal, non-proprietary standards and other technical specifications that define and describe aspects of the World Wide Web. In recent years, the term has been more frequently associated with the trend of endorsing a set of standardized best practices for building web sites, and a philosophy of web design and development that includes those methods.
A query string is a part of a uniform resource locator (URL) that assigns values to specified parameters. A query string commonly includes fields added to a base URL by a Web browser or other client application, for example as part of an HTML document, choosing the appearance of a page, or jumping to positions in multimedia content.
Digest access authentication is one of the agreed-upon methods a web server can use to negotiate credentials, such as username or password, with a user's web browser. This can be used to confirm the identity of a user before sending sensitive information, such as online banking transaction history. It applies a hash function to the username and password before sending them over the network. In contrast, basic access authentication uses the easily reversible Base64 encoding instead of hashing, making it non-secure unless used in conjunction with TLS.
The data URI scheme is a uniform resource identifier (URI) scheme that provides a way to include data in-line in Web pages as if they were external resources. It is a form of file literal or here document. This technique allows normally separate elements such as images and style sheets to be fetched in a single Hypertext Transfer Protocol (HTTP) request, which may be more efficient than multiple HTTP requests, and used by several browser extensions to package images as well as other multimedia content in a single HTML file for page saving. As of 2024, data URIs are fully supported by all major browsers.
URL encoding, officially known as percent-encoding, is a method to encode arbitrary data in a uniform resource identifier (URI) using only the US-ASCII characters legal within a URI. Although it is known as URL encoding, it is also used more generally within the main Uniform Resource Identifier (URI) set, which includes both Uniform Resource Locator (URL) and Uniform Resource Name (URN). Consequently, it is also used in the preparation of data of the application/x-www-form-urlencoded
media type, as is often used in the submission of HTML form data in HTTP requests.
A web resource is any identifiable resource present on or connected to the World Wide Web. Resources are identified using Uniform Resource Identifiers (URIs). In the Semantic Web, web resources and their semantic properties are described using the Resource Description Framework (RDF).
RDFa or Resource Description Framework in Attributes is a W3C Recommendation that adds a set of attribute-level extensions to HTML, XHTML and various XML-based document types for embedding rich metadata within Web documents. The Resource Description Framework (RDF) data-model mapping enables its use for embedding RDF subject-predicate-object expressions within XHTML documents. It also enables the extraction of RDF model triples by compliant user agents.
URI normalization is the process by which URIs are modified and standardized in a consistent manner. The goal of the normalization process is to transform a URI into a normalized URI so it is possible to determine if two syntactically different URIs may be equivalent.
In information and communications technology, a media type, content type or MIME type is a two-part identifier for file formats and format contents. Their purpose is comparable to filename extensions and uniform type identifiers, in that they identify the intended data format. They are mainly used by technologies underpinning the Internet, and also used on Linux desktop systems.
A Formal Public Identifier (FPI) is a short piece of text with a particular structure that may be used to uniquely identify a product, specification or document. FPIs were introduced as part of Standard Generalized Markup Language (SGML), and serve particular purposes in formats historically derived from SGML. Some of their most common uses are as part of document type declarations (DOCTYPEs) and document type definitions (DTDs) in SGML, XML and historically HTML, but they are also used in the vCard and iCalendar file formats to identify the software product which generated the file.
A single-page application (SPA) is a web application or website that interacts with the user by dynamically rewriting the current web page with new data from the web server, instead of the default method of loading entire new pages. The goal is faster transitions that make the website feel more like a native app.
Extensible HyperText Markup Language (XHTML) is part of the family of XML markup languages which mirrors or extends versions of the widely used HyperText Markup Language (HTML), the language in which Web pages are formulated.
A document type declaration, or DOCTYPE, is an instruction that associates a particular XML or SGML document with a document type definition (DTD). In the serialized form of the document, it manifests as a short string of markup that conforms to a particular syntax.
A uniform resource locator (URL), colloquially known as an address on the Web, is a reference to a resource that specifies its location on a computer network and a mechanism for retrieving it. A URL is a specific type of Uniform Resource Identifier (URI), although many people use the two terms interchangeably. URLs occur most commonly to reference web pages (HTTP/HTTPS) but are also used for file transfer (FTP), email (mailto), database access (JDBC), and many other applications.
The target URI excludes the reference's fragment component, if any, since fragment identifiers are reserved for client-side processing
{{cite web}}
: CS1 maint: multiple names: authors list (link)Pypi has the habit to append an md5 fragment to its egg urls, we'll use it to check the already present distribution files in the cache