Internet traffic

Last updated
Global Internet Traffic as of 2018

Internet traffic is the flow of data within the entire Internet, or in certain network links of its constituent networks. Common traffic measurements are total volume, in units of multiples of the byte, or as transmission rates in bytes per certain time units.

Contents

As the topology of the Internet is not hierarchical, no single point of measurement is possible for total Internet traffic. Traffic data may be obtained from the Tier 1 network providers' peering points for indications of volume and growth. However, Such data excludes traffic that remains within a single service provider's network and traffic that crosses private peering points.

As of December 2022 almost half (48%) of mobile Internet traffic is in India and China, while North America and Europe have about a quarter. [1] . However, mobile traffic remains a minority of total internet traffic.

Traffic sources

File sharing constitutes a fraction of Internet traffic. [2] The prevalent technology for file sharing is the BitTorrent protocol, which is a peer-to-peer (P2P) system mediated through indexing sites that provide resource directories. According to a Sandvine Research in 2013, Bit Torrent's share of Internet traffic decreased by 20% to 7.4% overall, reduced from 31% in 2008. [3]

As of 2023, roughly 65% of all internet traffic came from video sites, [4] up from 51% in 2016. [5]

Traffic management

Internet Connectivity Distribution & Core.svg

Internet traffic management, also known as application traffic management. The Internet does not employ any formally centralized facilities for traffic management. Its progenitor networks, especially the ARPANET established an early backbone infrastructure which carried traffic between major interchange centers for traffic, resulting in a tiered, hierarchical system of internet service providers (ISPs) within which the tier 1 networks provided traffic exchange through settlement-free peering and routing of traffic to lower-level tiers of ISPs. The dynamic growth of the worldwide network resulted in ever-increasing interconnections at all peering levels of the Internet, so a robust system was developed that could mediate link failures, bottlenecks, and other congestion at many levels.[ citation needed ]

Economic traffic management (ETM) is the term that is sometimes used to point out the opportunities for seeding as a practice that caters to contribution within peer-to-peer file sharing and the distribution of content in the digital world in general. [6]

Internet use tax

A planned tax on Internet use in Hungary introduced a 150-forint (US$0.62, €0.47) tax per gigabyte of data traffic, in a move intended to reduce Internet traffic and also assist companies to offset corporate income tax against the new levy. [7] Hungary achieved 1.15 billion gigabytes in 2013 and another 18 million gigabytes accumulated by mobile devices. This would have resulted in extra revenue of 175 billion forints under the new tax based on the consultancy firm eNet. [7]

According to Yahoo News, economy minister Mihály Varga defended the move saying "the tax was fair as it reflected a shift by consumers to the Internet away from phone lines" and that "150 forints on each transferred gigabyte of data – was needed to plug holes in the 2015 budget of one of the EU's most indebted nations". [8]

Some people argue that the new plan on Internet tax would prove disadvantageous to the country's economic development, limit access to information and hinder the freedom of expression. [9] Approximately 36,000 people have signed up to take part in an event on Facebook to be held outside the Economy Ministry to protest against the possible tax. [8]

In 1998, the United States enacted the Internet Tax Freedom Act (ITFA) to prevent the imposition of direct taxes on internet usage and online activities such as emails, internet access, bit tax, and bandwidth tax. [10] [11] Initially, this law placed a 10-year moratorium on such taxes, which was later extended multiple times and made permanent in 2016. The ITFA's goal was to protect consumers and support the growth of internet traffic by prohibiting recurring and discriminatory taxes that could hinder internet adoption and usage. As a result, ITFA has played a crucial role in promoting the digital economy and safeguarding consumer interests. According to Pew Research Center, as of 2024, approximately 93% of Americans use the internet, with platforms like YouTube and Facebook being highly popular. [12] [13] [14] [15] Additionally, 90% of U.S. households subscribed to high-speed internet services by 2021. [16] [17] Although the ITFA provides protection against direct internet taxes, ongoing debates about internet regulation and governance continue to shape the landscape of internet traffic and usage in the United States.

Traffic classification

Traffic classification describes the methods of classifying traffic by observing features passively in the traffic and line with particular classification goals. There might be some that only have a vulgar classification goal. For example, whether it is bulk transfer, peer-to-peer file-sharing, or transaction-orientated. Some others will set a finer-grained classification goal, for instance, the exact number of applications represented by the traffic. Traffic features included port number, application payload, temporal, packet size, and the characteristic of the traffic. There is a vast range of methods to allocate Internet traffic including exact traffic, for instance, port (computer networking) number, payload, heuristic, or statistical machine learning.

Accurate network traffic classification is elementary to quite a few Internet activities, from security monitoring to accounting and from the quality of service to providing operators with useful forecasts for long-term provisioning. Yet, classification schemes are extremely complex to operate accurately due to the shortage of available knowledge of the network. For example, the packet header-related information is always insufficient to allow for a precise methodology.

Bayesian analysis techniques

Work [18] involving supervised machine learning to classify network traffic. Data are hand-classified (based upon flow content) to one of a number of categories. A combination of data set (hand-assigned) category and descriptions of the classified flows (such as flow length, port numbers, time between consecutive flows) are used to train the classifier. To give a better insight of the technique itself, initial assumptions are made as well as applying two other techniques in reality. One is to improve the quality and separation of the input of information leading to an increase in accuracy of the Naive Bayes classifier technique.

The basis of categorizing work is to classify the type of Internet traffic; this is done by putting common groups of applications into different categories, e.g., "normal" versus "malicious", or more complex definitions, e.g., the identification of specific applications or specific Transmission Control Protocol (TCP) implementations. [19] Adapted from Logg et al. [20]

Survey

Traffic classification is a major component of automated intrusion detection systems. [21] [22] They are used to identify patterns as well as an indication of network resources for priority customers, or to identify customer use of network resources that in some way contravenes the operator's terms of service. Generally deployed Internet Protocol (IP) traffic classification techniques are based approximately on a direct inspection of each packet's contents at some point on the network. Source address, port and destination address are included in successive IP packets with similar if not the same 5-tuple of protocol type. ort are considered to belong to a flow whose controlling application we wish to determine. Simple classification infers the controlling application's identity by assuming that most applications consistently use well-known TCP or UDP port numbers. Even though, many candidates are increasingly using unpredictable port numbers. As a result, more sophisticated classification techniques infer application types by looking for application-specific data within the TCP or User Datagram Protocol (UDP) payloads. [23]

Global Internet traffic

Aggregating from multiple sources and applying usage and bitrate assumptions, Cisco, a major network systems company, has published the following historical Internet Protocol (IP) and Internet traffic figures: [24]

Global Internet traffic by year
 
Year
IP traffic
(PB/month)
Fixed Internet traffic
(PB/month)
Mobile Internet traffic
(PB/month)
19900.0010.001n/a
19910.0020.002n/a
19920.0050.004n/a
19930.01  0.01  n/a
19940.02  0.02  n/a
19950.18  0.17  n/a
19961.9    1.8    n/a
19975.4    5.0    n/a
199812      11      n/a
199928      26      n/a
200084      75      n/a
2001197      175      n/a
2002405      356      n/a
2003784      681      n/a
20041,477      1,267      n/a
20052,426      2,055      0.9   
20063,992      3,339      4      
20076,430      5,219      15      
2008 [25] 10,174      8,140      33      
2009 [26] 14,686      10,942      91      
2010 [27] 20,151      14,955      237      
2011 [28] 30,734      23,288      597      
2012 [29] [30] 43,570      31,339      885      
2013 [31] 51,168      34,952      1,480      
2014 [32] 59,848      39,909      2,514      
2015 [33] 72,521      49,494      3,685      
2016 [34] 96,054      65,942      7,201      
2017 [35] 122,000      85,000      12,000      

"Fixed Internet traffic" refers perhaps to traffic from residential and commercial subscribers to ISPs, cable companies, and other service providers. "Mobile Internet traffic" refers perhaps to backhaul traffic from cellphone towers and providers. The overall "Internet traffic" figures, which can be 30% higher than the sum of the other two, perhaps factors in traffic in the core of the national backbone, whereas the other figures seem to be derived principally from the network periphery.

Cisco also publishes 5-year projections.

Predicted global Internet traffic by year [35]
 
Year
Fixed Internet traffic
(EB/month)
Mobile Internet traffic
(EB/month)
201810719
201913729
202017441
202121957
202227377

Internet backbone traffic in the United States

The following data for the Internet backbone in the US comes from the Minnesota Internet Traffic Studies (MINTS): [36]

US Internet backbone traffic by year
YearData (TB/month)
19901
19912
19924
19938
199416
1995n/a
19961,500
19972,5004,000
19985,0008,000
199910,00016,000
200020,00035,000
200140,00070,000
200280,000140,000
2003n/a
2004n/a
2005n/a
2006450,000800,000
2007750,0001,250,000
20081,200,0001,800,000
20091,900,0002,400,000
20102,600,0003,100,000
20113,400,0004,100,000

The Cisco data can be seven times higher than the Minnesota Internet Traffic Studies (MINTS) data not only because the Cisco figures are estimates for the global—not just the domestic US—Internet, but also because Cisco counts "general IP traffic (thus including closed networks that are not truly part of the Internet, but use IP, the Internet Protocol, such as the IPTV services of various telecom firms)". [37] The MINTS estimate of US national backbone traffic for 2004, which may be interpolated as 200 petabytes/month, is a plausible three-fold multiple of the traffic of the US's largest backbone carrier, Level(3) Inc., which claims an average traffic level of 60 petabytes/month. [38]

Edholm's law

In the past Internet bandwidth in telecommunications networks doubled every 18 months, an observation expressed as Edholm's law. [39] This follows the advances in semiconductor technology, such as metal-oxide-silicon (MOS) scaling, exemplified by the MOSFET transistor, which has shown similar scaling described by Moore's law. In the 1980s, fiber-optical technology using laser light as information carriers accelerated the transmission speed and bandwidth of telecommunication circuits. This has led to the bandwidths of communication networks achieving terabit per second transmission speeds. [40]

See also

Related Research Articles

<span class="mw-page-title-main">History of the Internet</span>

The history of the Internet has its origin in the efforts of scientists and engineers to build and interconnect computer networks. The Internet Protocol Suite, the set of rules used to communicate between networks and devices on the Internet, arose from research and development in the United States and involved international collaboration, particularly with researchers in the United Kingdom and France.

<span class="mw-page-title-main">OSI model</span> Model of communication of seven abstraction layers

The Open Systems Interconnection (OSI) model is a reference model from the International Organization for Standardization (ISO) that "provides a common basis for the coordination of standards development for the purpose of systems interconnection." In the OSI reference model, the communications between systems are split into seven different abstraction layers: Physical, Data Link, Network, Transport, Session, Presentation, and Application.

Quality of service (QoS) is the description or measurement of the overall performance of a service, such as a telephony or computer network, or a cloud computing service, particularly the performance seen by the users of the network. To quantitatively measure quality of service, several related aspects of the network service are often considered, such as packet loss, bit rate, throughput, transmission delay, availability, jitter, etc.

<span class="mw-page-title-main">Router (computing)</span> Device that forwards data packets between computer networks

A router is a computer and networking device that forwards data packets between computer networks, including internetworks such as the global Internet.

<span class="mw-page-title-main">Frame Relay</span> Wide area network technology

Frame Relay is a standardized wide area network (WAN) technology that specifies the physical and data link layers of digital telecommunications channels using a packet switching methodology. Originally designed for transport across Integrated Services Digital Network (ISDN) infrastructure, it may be used today in the context of many other network interfaces.

<span class="mw-page-title-main">Denial-of-service attack</span> Type of cyber-attack

In computing, a denial-of-service attack is a cyber-attack in which the perpetrator seeks to make a machine or network resource unavailable to its intended users by temporarily or indefinitely disrupting services of a host connected to a network. Denial of service is typically accomplished by flooding the targeted machine or resource with superfluous requests in an attempt to overload systems and prevent some or all legitimate requests from being fulfilled. The range of attacks varies widely, spanning from inundating a server with millions of requests to slow its performance, overwhelming a server with a substantial amount of invalid data, to submitting requests with an illegitimate IP address.

<span class="mw-page-title-main">Metropolitan area network</span> Computer network serving a populated area

A metropolitan area network (MAN) is a computer network that interconnects users with computer resources in a geographic region of the size of a metropolitan area. The term MAN is applied to the interconnection of local area networks (LANs) in a city into a single larger network which may then also offer efficient connection to a wide area network. The term is also used to describe the interconnection of several LANs in a metropolitan area through the use of point-to-point connections between them.

The Routing Information Protocol (RIP) is one of the oldest distance-vector routing protocols which employs the hop count as a routing metric. RIP prevents routing loops by implementing a limit on the number of hops allowed in a path from source to destination. The largest number of hops allowed for RIP is 15, which limits the size of networks that RIP can support.

<span class="mw-page-title-main">Telecommunications network</span> Network for communications over distance

A telecommunications network is a group of nodes interconnected by telecommunications links that are used to exchange messages between the nodes. The links may use a variety of technologies based on the methodologies of circuit switching, message switching, or packet switching, to pass messages and signals.

<span class="mw-page-title-main">Internet backbone</span> Vital infrastructure of the networks of the Internet

The Internet backbone is the principal data routes between large, strategically interconnected computer networks and core routers of the Internet. These data routes are hosted by commercial, government, academic and other high-capacity network centers as well as the Internet exchange points and network access points, which exchange Internet traffic internationally. Internet service providers (ISPs) participate in Internet backbone traffic through privately negotiated interconnection agreements, primarily governed by the principle of settlement-free peering.

<span class="mw-page-title-main">VLAN</span> Network communications domain that is isolated at the data link layer

A virtual local area network (VLAN) is any broadcast domain that is partitioned and isolated in a computer network at the data link layer. In this context, virtual refers to a physical object recreated and altered by additional logic, within the local area network. Basically, a VLAN behaves like a virtual switch or network link that can share the same physical structure with other VLANs while staying logically separate from them. VLANs work by applying tags to network frames and handling these tags in networking systems, in effect creating the appearance and functionality of network traffic that, while on a single physical network, behaves as if it were split between separate networks. In this way, VLANs can keep network applications separate despite being connected to the same physical network, and without requiring multiple sets of cabling and networking devices to be deployed.

Traffic shaping is a bandwidth management technique used on computer networks which delays some or all datagrams to bring them into compliance with a desired traffic profile. Traffic shaping is used to optimize or guarantee performance, improve latency, or increase usable bandwidth for some kinds of packets by delaying other kinds. It is often confused with traffic policing, the distinct but related practice of packet dropping and packet marking.

Deep packet inspection (DPI) is a type of data processing that inspects in detail the data being sent over a computer network, and may take actions such as alerting, blocking, re-routing, or logging it accordingly. Deep packet inspection is often used for baselining application behavior, analyzing network usage, troubleshooting network performance, ensuring that data is in the correct format, checking for malicious code, eavesdropping, and internet censorship, among other purposes. There are multiple headers for IP packets; network equipment only needs to use the first of these for normal operation, but use of the second header is normally considered to be shallow packet inspection despite this definition.

Bandwidth throttling consists in the limitation of the communication speed, of the ingoing (received) or outgoing (sent) data in a network node or in a network device such as computers and mobile phones.

The next-generation network (NGN) is a body of key architectural changes in telecommunication core and access networks. The general idea behind the NGN is that one network transports all information and services by encapsulating these into IP packets, similar to those used on the Internet. NGNs are commonly built around the Internet Protocol, and therefore the term all IP is also sometimes used to describe the transformation of formerly telephone-centric networks toward NGN.

<span class="mw-page-title-main">Link aggregation</span> Using multiple network connections in parallel to increase capacity and reliability

In computer networking, link aggregation is the combining of multiple network connections in parallel by any of several methods. Link aggregation increases total throughput beyond what a single connection could sustain, and provides redundancy where all but one of the physical links may fail without losing connectivity. A link aggregation group (LAG) is the combined collection of physical ports.

Optical networking is a means of communication that uses signals encoded in light to transmit information in various types of telecommunications networks. These include limited range local-area networks (LAN) or wide area networks (WANs), which cross metropolitan and regional areas as well as long-distance national, international and transoceanic networks. It is a form of optical communication that relies on optical amplifiers, lasers or LEDs and wavelength-division multiplexing (WDM) to transmit large quantities of data, generally across fiber-optic cables. Because it is capable of achieving extremely high bandwidth, it is an enabling technology for the Internet and telecommunication networks that transmit the vast majority of all human and machine-to-machine information.

WAN optimization is a collection of techniques for improving data transfer across wide area networks (WANs). In 2008, the WAN optimization market was estimated to be $1 billion, and was to grow to $4.4 billion by 2014 according to Gartner, a technology research firm. In 2015 Gartner estimated the WAN optimization market to be a $1.1 billion market.

Peer-to-peer caching is a computer network traffic management technology used by Internet Service Providers (ISPs) to accelerate content delivered over peer-to-peer (P2P) networks while reducing related bandwidth costs.

Traffic classification is an automated process which categorises computer network traffic according to various parameters into a number of traffic classes. Each resulting traffic class can be treated differently in order to differentiate the service implied for the data generator or consumer.

References

  1. Kar, Ayushi (2022-12-04). "End of American internet, India-China contribute to 50% of world's data traffic". www.thehindubusinessline.com. Retrieved 2022-12-24.
  2. "Data volume of global file sharing traffic from 2013 until 2018". Statista. 2014. Retrieved 18 October 2014.
  3. Paul Resenikoff (12 November 2013). "File-Sharing Now Accounts for Less Than 10% of US Internet Traffic..." Retrieved 18 October 2014.
  4. "In 2022, 65% of all internet traffic came from video sites". 20 January 2023.
  5. "An explosion of online video could triple bandwidth consumption again in the next five years". 8 June 2017.
  6. Despotovic, Z., Hossfeld, T., Kellerer, W., Lehrieder, F., Oechsner, S., Michel, M. (2011). Mitigating Unfairness In Locality-Aware Peer-To-Peer Networks. International Journal of Network Management
  7. 1 2 Marton Dunai (2014). "Hungary plans new tax on Internet traffic, public calls for rally". Archived from the original on December 20, 2015.
  8. 1 2 "Anger mounts in Hungary over internet tax". Yahoo News. 25 October 2014. Retrieved 18 October 2014.
  9. Margit Feher (2014). "Public outrage mounts against hunger's plan to tax internet use" . Retrieved 18 October 2014.
  10. "The Internet Tax Freedom Act: In Brief" (PDF). FAS Project on Government Secrecy. Retrieved 2024-10-26.
  11. "- INTERNET TAX ISSUES". www.govinfo.gov. Retrieved 2024-10-26.
  12. Gottfried, Jeffrey (2024-01-31). "Americans' Social Media Use". Pew Research Center. Retrieved 2024-10-26.
  13. "Internet, Broadband Fact Sheet". Pew Research Center. 2024-01-31. Retrieved 2024-10-26.
  14. Jeffrey, Gottfried (2024-01-31). "traffic user". Pew Research Center. Retrieved 2024-10-26.
  15. "Social Media Fact Sheet". Pew Research Center. 2024-01-31. Retrieved 2024-10-26.
  16. "92 Percent of U.S. Households Get an Internet Service at Home". Benton Foundation. 2023-12-11. Retrieved 2024-10-26.
  17. Buckley, Sean (2023-12-20). "LRG: 92% of households can get residential internet service". Lightwave. Retrieved 2024-10-26.
  18. Denis Zuev (2013). "Internet traffic classification using bayesian analysis technique" (PDF). Retrieved 18 October 2014.
  19. J.Padhye; S.Floyd (June 2001). "Identifying the TCP Behavior of Web Servers". In Proceedings of SIGCOMM 2011, San Diego, CA.
  20. C.Logg; L.Cottrell (2003). "SLAC National Accelerator Laboratory". Archived from the original on June 13, 2008. Retrieved 21 October 2014.
  21. Bro intrusion detection system – Bro overview, http://bro-ids.org, as of August 14, 2007.
  22. V. Paxson, 'Bro: A system for detecting network intruders in real-time,' Computer Networks, no.31 (23-24), pp. 2435-2463, 1999
  23. S. Sen., O. Spats check, and D. Wang, 'Accurate, scalable in network identification of P2P traffic using application signatures,' in WWW2004, New York, NY, US, May 2004.
  24. "Visual Networking Index", Cisco Systems
  25. Cisco, "Cisco Visual Networking Index: Forecast and Methodology, 2008–2013" (PDF), 9 June 2009. Retrieved 13 June 2016
  26. Cisco, "Cisco Visual Networking Index: Forecast and Methodology, 2009–2014" (PDF), 2 June 2010. Retrieved 13 June 2016
  27. Cisco, "Cisco Visual Networking Index: Forecast and Methodology, 2010–2015" (PDF), 1 June 2011. Retrieved 13 June 2016
  28. Cisco, "Cisco Visual Networking Index: Forecast and Methodology, 2011–2016 Archived 2020-08-09 at the Wayback Machine " (PDF), 30 May 2012. Retrieved 13 June 2016
  29. Cisco, "Cisco Visual Networking Index: Global Mobile Data Traffic Forecast Update, 2012–2017 Archived 2016-08-12 at the Wayback Machine " (PDF), 2 Feb 2013. Retrieved 13 June 2016
  30. Cisco, "Cisco Visual Networking Index: Forecast and Methodology, 2012–2017" (PDF), 29 May 2013. Retrieved from archive.org, 28 Aug 2016
  31. Cisco, "Cisco Visual Networking Index: Forecast and Methodology, 2013–2018" (PDF), 10 Jun 2014. Retrieved from archive.org, 28 Aug 2016
  32. Cisco, "Cisco Visual Networking Index: Forecast and Methodology, 2014–2019" (PDF), 27 May 2015. Retrieved from archive.org, 28 Aug 2016
  33. Cisco, "Cisco Visual Networking Index:Forecast and Methodology, 2015–2020" (PDF) 6 June 2016. Retrieved 13 June 2016
  34. Cisco, "Cisco Visual Networking Index:Forecast and Methodology, 2016–2021" (PDF) 6 June 2017. Retrieved 14 August 2017
  35. 1 2 Cisco, "Cisco Visual Networking Index:Forecast and Trends, 2017–2022" (PDF) 28 November 2018. Retrieved 9 January 2019
  36. Minnesota Internet Traffic Studies (MINTS) Archived 2017-12-28 at the Wayback Machine , University of Minnesota
  37. "MINTS - Minnesota Internet Traffic Studies" . Retrieved 16 April 2017.
  38. 2004 Annual Report, Level(3), April 2005, p.1
  39. Cherry, Steven (2004). "Edholm's law of bandwidth". IEEE Spectrum. 41 (7): 58–60. doi:10.1109/MSPEC.2004.1309810. S2CID   27580722.
  40. Jindal, R. P. (2009). "From millibits to terabits per second and beyond - over 60 years of innovation". 2009 2nd International Workshop on Electron Devices and Semiconductor Technology. pp. 1–6. doi:10.1109/EDST.2009.5166093. ISBN   978-1-4244-3831-0. S2CID   25112828.

Further reading