Microservices Expo Authors: Yeshim Deniz, Pat Romanski, Elizabeth White, Liz McMillan, Zakia Bouachraoui

Related Topics: @CloudExpo, Java IoT, Microservices Expo, Cloud Security, @DXWorldExpo, SDN Journal

@CloudExpo: Blog Post

Traffic Advisory: Your Packets May Be Delayed

The past few years have seen a dramatic improvement in the latency in network switches

The past few years have seen a dramatic improvement in the latency in network switches. Single ASIC based switches can all pretty much switch packets in less than a microsecond. Current 10GE switching silicon provides anywhere from 300 to 800 nanoseconds, specialized silicon shaves that to less than 200 nanoseconds when limiting the amount of searching that needs to be done by reducing the size of lookup tables. Even other solutions play some smart tricks by providing forwarding hints for intermediate switches make those lookups take less than 50 nanoseconds.


Modular switches inherently have a higher latency. Line cards on modular switches typically have multiple ASICs, those ASICs are connected through a single or multi stage fabric. Each step takes time, resulting in latencies varying from around a microsecond when a packet stays on the same ASIC, to possibly 5-15 microseconds when a packet needs to travel through the fabric and back.

The speediest of ASICs achieve these low numbers by employing cut through switching. Cut through switching allows the ASIC to start transmitting a packet when enough of the header has been received to make a forwarding decision. The ASIC does not wait for the entire packet to be received (the more traditional store-and-forward mechanism), within the first few 100 bytes the forwarding decision has been made, and that same header (modified or not) is being transmitted out the destination port. It's somewhat odd to think that through, but the first bits of a packet may be received by the destination system before the last bits have left the first switch in the network.

Cut through switching comes with quite a few "buts". Most switches can only deploy cut through switching when the source and destination port are the same speed. 10GE in and 40GE out or vice versa is rarely supported and the ASIC will automatically switch to store-and-forward for those packets. For good reason. If a packet comes at you at 40GE rates, you simply cannot transmit it out a 10GE interface, that interface is not fast enough. In the reverse direction speed is not the issue, but if you were to employ cut through switching, for the duration of that packet your 40GE interface effectively runs at 10GE with lots of pauses in between pieces of a packet (figuratively speaking).

In addition, when the destination port has another packet being transmitted or in the queue, a new packet cannot be sent cut through. When another packet is ahead of you, you need to wait. And you may need to wait for quite a while. We often forget that it takes 1.2 microseconds to transmit a 1500 byte packet on a 10GE interface, more than 7 microseconds for a jumbo packet. When the destination port is being paused due to Data Center Bridging Priority Flow Control (PFC), the packet will be queued for store and forward. And make sure you add an extra 3 microseconds for 10GBASE-TX.

Datacenters are on a path to fewer layers of switching. Spine and leaf networks are being pitched as the best performing, low cost solution for dense networks. If you carefully examine the specs and pitches of some of the newer spine switches, you will notice that all of them make a case for deep buffers. Deep buffers assume that this switch needs to manage congestion by buffering packets, why else would you design expensive and power hungry buffer memory into those switches. Buffering and low latency don't go well together. If your spine and leaf network has nothing much to do, you may well see latency numbers of only a few microseconds or better. If the spine layer needs to buffer your packet, this number can jump up quickly to 10s of microseconds. And those large buffers seem to suggest it will.

There certainly are applications that are very sensitive to latency. Financial institution low latency trading networks are the example always used, and there are High Performance Computing environments with database, RDMI or similar applications that benefit from really low latency. Engineering the traffic in such a way that none of the low latency disruptive events described above happen is hard. Really hard. Extremely hard if there is a lot of traffic. Or a lot of endpoints. Networks that are specifically designed to aggregate and distribute (spine and leaf) will be more prone to these latency increasing scenarios. Creating a network with the ability to create isolated direct paths between switches that serve low latency applications is much more likely to avoid these. And even if the absolute latency is not the lowest, consistent latency with little jitter will certainly help the performance of adaptive mechanisms like TCP.

For the vast majority of applications in a typical enterprise datacenter, or a public cloud provider, the difference between 200 nanosecond switch latency and 1 microseconds is not measurably different in terms of user or application performance. The set of applications that behave noticably different at a few microseconds vs 10s of microseconds end to end latency is probably larger. I argue that you will get better results by carefully engineering the traffic for the applications that do care about low latency and low jitter. Make sure they get the bandwidth they need. Make sure they do not clash in the network with data hungry applications. Affinitize your network. It may just give you the latency and jitter performance you need.

The post Traffic advisory: your packets may be delayed appeared first on Plexxi.

Read the original blog entry...

More Stories By Marten Terpstra

Marten Terpstra is a Product Management Director at Plexxi Inc. Marten has extensive knowledge of the architecture, design, deployment and management of enterprise and carrier networks.

Microservices Articles
Consumer-driven contracts are an essential part of a mature microservice testing portfolio enabling independent service deployments. In this presentation we'll provide an overview of the tools, patterns and pain points we've seen when implementing contract testing in large development organizations.
Containers and Kubernetes allow for code portability across on-premise VMs, bare metal, or multiple cloud provider environments. Yet, despite this portability promise, developers may include configuration and application definitions that constrain or even eliminate application portability. In this session we'll describe best practices for "configuration as code" in a Kubernetes environment. We will demonstrate how a properly constructed containerized app can be deployed to both Amazon and Azure ...
In his general session at 19th Cloud Expo, Manish Dixit, VP of Product and Engineering at Dice, discussed how Dice leverages data insights and tools to help both tech professionals and recruiters better understand how skills relate to each other and which skills are in high demand using interactive visualizations and salary indicator tools to maximize earning potential. Manish Dixit is VP of Product and Engineering at Dice. As the leader of the Product, Engineering and Data Sciences team at D...
In his session at 20th Cloud Expo, Scott Davis, CTO of Embotics, discussed how automation can provide the dynamic management required to cost-effectively deliver microservices and container solutions at scale. He also discussed how flexible automation is the key to effectively bridging and seamlessly coordinating both IT and developer needs for component orchestration across disparate clouds – an increasingly important requirement at today’s multi-cloud enterprise.
SYS-CON Events announced today that DatacenterDynamics has been named “Media Sponsor” of SYS-CON's 18th International Cloud Expo, which will take place on June 7–9, 2016, at the Javits Center in New York City, NY. DatacenterDynamics is a brand of DCD Group, a global B2B media and publishing company that develops products to help senior professionals in the world's most ICT dependent organizations make risk-based infrastructure and capacity decisions.
Most DevOps journeys involve several phases of maturity. Research shows that the inflection point where organizations begin to see maximum value is when they implement tight integration deploying their code to their infrastructure. Success at this level is the last barrier to at-will deployment. Storage, for instance, is more capable than where we read and write data. In his session at @DevOpsSummit at 20th Cloud Expo, Josh Atwell, a Developer Advocate for NetApp, will discuss the role and value...
DevOpsSummit New York 2018, colocated with CloudEXPO | DXWorldEXPO New York 2018 will be held November 11-13, 2018, in New York City. Digital Transformation (DX) is a major focus with the introduction of DXWorldEXPO within the program. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term.
CloudEXPO New York 2018, colocated with DXWorldEXPO New York 2018 will be held November 11-13, 2018, in New York City and will bring together Cloud Computing, FinTech and Blockchain, Digital Transformation, Big Data, Internet of Things, DevOps, AI, Machine Learning and WebRTC to one location.
Enterprise architects are increasingly adopting multi-cloud strategies as they seek to utilize existing data center assets, leverage the advantages of cloud computing and avoid cloud vendor lock-in. This requires a globally aware traffic management strategy that can monitor infrastructure health across data centers and end-user experience globally, while responding to control changes and system specification at the speed of today’s DevOps teams. In his session at 20th Cloud Expo, Josh Gray, Chie...
Discussions of cloud computing have evolved in recent years from a focus on specific types of cloud, to a world of hybrid cloud, and to a world dominated by the APIs that make today's multi-cloud environments and hybrid clouds possible. In this Power Panel at 17th Cloud Expo, moderated by Conference Chair Roger Strukhoff, panelists addressed the importance of customers being able to use the specific technologies they need, through environments and ecosystems that expose their APIs to make true ...