Reliability of National Internet Segments

Qrator Labs
HackerNoon.com
Published in
5 min readJul 11, 2017

--

The connectivity of Internet at the network layer is a result of interaction between autonomous systems (AS), and it is more stable the more alternatives routes between ASNs there exist, which is basic fault tolerance principle. This research shows how outage of single, though significant AS affects the global connectivity of the region.

Global connectivity of any AS is based on its routes to set of Tier-1 ISPs. Tier-1s are transnational and transcontinental providers that provide connectivity service at world scale. If there are no active routes between AS and Tier-1s — this AS has no global connectivity.

Let’s suppose that a given AS is experiencing significant network degradation. We want to find out the answer to the following question: “What percentage of other AS in this region would lose its connectivity with the Tier-1 operators and therefore the global availability?”

Why are we modeling such a situation? Strictly speaking, when the modern Internet was emerging it was supposed that every AS would have at least two upstreams (higher level internet service providers) which would guarantee fault tolerance in case one of them fails. However, in reality, do failures of big transit operators ever occur? The answer is yes, and rather often. So if someone has not suffered yet, it is time to remind Murphy’s Law: “Anything that can go wrong, does.”

To model such scenario, we made following steps:

  1. For every autonomous system in the world, we create all alternative routes to the Tier-1 operators with the help of AS relation model of the Qrator.Radar project;
  2. Using the Maxmind geodatabase, we matched countries with every AS’ address space;
  3. For every AS we made a normalization of the geodata to avoid a situation when the AS has a degenerate presence in the region. A good example of a region with such issue is Hong Kong: there are hundreds of members at the largest Asian Internet Exchange HKIX — which have at the same time zero presence at Hong Kong Internet segment itself.
  4. After, we evaluate the effect of a possible failure of a given AS on other ASNs and, therefore, countries.
  5. For each country, we find the autonomous system that affects the biggest percentage of other ASNs in given region.

Here you can see the top-20 of most failure-tolerant regions in 2017 and the updated 2016 results.

Comparison between the 2016 and 2017 results of the network reliability survey

As you can see, there are some changes from year to year. However, with the tenths of a percent difference, there are not many significant variations in the top-20 stable regions. Speaking about the top fault tolerant countries where a single major AS shutdown is affecting less than 10% of the region’s autonomous systems (there are 29 such countries) — all these countries have diversified IP-transit service market with lots of alternative routes.

Also, we want to highlight a significant influence of the AS 174, belonging to Cogent, on several regions: France, Great Britain, United States, Ireland just among top-20 countries. This means that issues in the AS174 could lead to problems even in several neighboring regions. Though outage of Cogent would not result in total unavailability because we are speaking about diversified and highly developed national segments.

Also, there are no surprises in Russia. Major internet service provider is still the Rostelecom, but even the failure of one would not lead to more than 5,73% of networks losing global unavailability among Russian ISPs. At the same, Russia’s failover percentage, which is slightly better than last year, in 2017 provides only the 13th position in the top-20 of fault tolerant regions. Within Russia, there is almost no Tier-1 ISP presence. However, the transit market is represented by a lineup of big and middle sized Tier-2 networks, which are granting such high stability and failover percentage.

Does the country’s biggest ISP always influence the regional reliability more than everyone else? Our calculations show that this is not always true. For example, in Germany, the biggest ISP is Deutsche Telecom, but when speaking about the connection reliability in the region, an outage of AS 8881, belonging to Versatel, would influence on the largest number of German ASNs. We believe that such trend of growing importance of Tier-2 ISPs would prevail in the nearest future.

Also, while speaking about trends, we could not mention the fact that the average “instability” in 2017 is 41%, which is 1,6% less than in 2016.

National segments reliability map, from dark green (top reliability) to dark red (bottom reliability). Yellow is 25%

The most significant improvements in 2017 were achieved within the emerging Africa region. The failover capability of such regions like the Gambia and Liberia has grown significantly, almost by 40%. However, the movement towards fault tolerance of the world regions could not be named “unidirectional.” As an example, we could call the Jamaica, and the grown dependency on the stability of the single ISP, from 34% in 2016 up to 91% in 2017. External connectivity of this country almost completely relies on the AS 23520 (Columbus Networks) reliability.

Countries with 99% dependency on a single ISP

At the bottom of our rating, there are some regions, that could experience total (up to 100% of lost global connectivity) unavailability in the situation of a single ISP failure. So from the former Soviet Union countries, we are speaking only about Uzbekistan, that is taking position number 235 of our rating with the 99,94% unavailability in the case of AS 28910 failure.

Besides already mentioned countries Cuba and South Sudan are probably the most known to a reader countries, with more than 99% dependency from a single ISP. North Korea has 92%. Surprisingly, Monaco has a bad connectivity (the only red dot in Europe), while another small sovereign state of Luxembourg, is closing the top-20 regions with the maximum fault tolerance.

With the growing cybersecurity risks and many news about attacks on the Internet infrastructure, it is the right time for the governments, private and public companies, as well as common users to think about their position. Twice, if the region you are living in is characterized by a 50% instability, but being honest a 25% failover rate is enough to get in trouble with the availability and connectivity in case of a severe attack on a big, nation-wide services providers, as the DNS service. Do not forget, that the outer world also would be separated from the services and data located within the troubled region, in the case of total unavailability.

The results of our survey make it evident that the ISP market built upon competition is, in the end, much more stable and failure tolerant, concerning issues within or outside a specific region. Vice-versa, a single AS failure could lead to network unavailability for a significant portion of users from a country or even a larger region.

We would be happy to answer you comments and questions on this survey, fell free to ask!

--

--

Qrator Labs
HackerNoon.com

Empowering businesses with high network availability and clients loyalty through cloud-based network security solutions