You are on page 1of 35

TRANSFORMING COMMUNICATIONS

COMMUNICATIONS INFRASTRUCTURE DIVISION

Intel Data Plane Development Kit (Intel DPDK) Overview Packet Processing on Intel Architecture

December 2012

Legal Disclaimer
INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. A "Mission Critical Application" is any application in which failure of the Intel Product could result, directly or indirectly, in personal injury or death. SHOULD YOU PURCHASE OR USE INTEL'S PRODUCTS FOR ANY SUCH MISSION CRITICAL APPLICATION, YOU SHALL INDEMNIFY AND HOLD INTEL AND ITS SUBSIDIARIES, SUBCONTRACTORS AND AFFILIATES, AND THE DIRECTORS, OFFICERS, AND EMPLOYEES OF EACH, HARMLESS AGAINST ALL CLAIMS COSTS, DAMAGES, AND EXPENSES AND REASONABLE ATTORNEYS' FEES ARISING OUT OF, DIRECTLY OR INDIRECTLY, ANY CLAIM OF PRODUCT LIABILITY, PERSONAL INJURY, OR DEATH ARISING IN ANY WAY OUT OF SUCH MISSION CRITICAL APPLICATION, WHETHER OR NOT INTEL OR ITS SUBCONTRACTOR WAS NEGLIGENT IN THE DESIGN, MANUFACTURE, OR WARNING OF THE INTEL PRODUCT OR ANY OF ITS PARTS. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined". Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. The information here is subject to change without notice. Do not finalize a design with this information. The products described in this document may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order. Copies of documents which have an order number and are referenced in this document, or other Intel literature, may be obtained by calling 1-800-548-4725, or go to: http://www.intel.com/design/literature.htm All information provided related to future Intel products and plans is preliminary and subject to change at any time, without notice. All dates provided are subject to change without notice. Intel may make changes to specifications and product descriptions at any time, without notice. Intel product plans in this presentation do not constitute Intel plan of record product roadmaps. Please contact your Intel representative to obtain Intels current plan of record product roadmaps. Intel product plans in this presentation do not con stitute Intel plan of record product roadmaps. Celeron, Intel, Intel logo, Intel Core, Intel Inside, Intel Inside logo, Intel. Leap ahead., Intel. Leap ahead. logo, Intel NetBurst, Intel SpeedStep, Intel XScale, Itanium, Pentium, Pentium Inside, VTune, Xeon, and Xeon Inside are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Intel Active Management Technology requires the platform to have an Intel AMT-enabled chipset, network hardware and software, as well as connection with a power source and a corporate network connection. With regard to notebooks, Intel AMT may not be available or certain capabilities may be limited over a host OS-based VPN or when connecting wirelessly, on battery power, sleeping, hibernating or powered off. For more information, see http://www.intel.com/technology/iamt. Enhanced Intel SpeedStep Technology for specified units of this processor available Q2/06. See the Processor Spec Finder at http://processorfinder.intel.com or contact your Intel representative for more information 64-bit computing on Intel architecture requires a computer system with a processor, chipset, BIOS, operating system, device drivers and applications enabled for Intel 64 architecture. Performance will vary depending on your hardware and software configurations. Consult with your system vendor for more information. No computer system can provide absolute security under all conditions. Intel Trusted Execution Technology is a security technology under development by Intel and requires for operation a computer system with Intel Virtualization Technology, an Intel Trusted Execution Technology-enabled processor, chipset, BIOS, Authenticated Code Modules, and an Intel or other compatible measured virtual machine monitor. In addition, Intel Trusted Execution Technology requires the system to contain a TPMv1.2 as defined by the Trusted Computing Group and specific software for some uses. See http://www.intel.com/technology/security/ for more information. Hyper-Threading Technology (HT Technology) requires a computer system with an Intel Pentium 4 Processor supporting HT Technology and an HT Technology-enabled chipset, BIOS, and operating system. Performance will vary depending on the specific hardware and software you use. See www.intel.com/products/ht/hyperthreading_more.htm for more information including details on which processors support HT Technology. Intel Virtualization Technology requires a computer system with an enabled Intel processor, BIOS, virtual machine monitor (VMM) and, for some uses, certain platform software enabled for it. Functionality, performance or other benefits will vary depending on hardware and software configurations and may require a BIOS update. Software applications may not be compatible with all operating systems. Please check with your application vendor. Intel AES-NI requires a computer system with an AES-NI enabled processor, as well as non-Intel software to execute the instructions in the correct sequence. AES-NI is available on select Intel processors. For availability, consult your reseller or system manufacturer. For more information, see Intel Advanced Encryption Standard Instructions (AES-NI) * Other names and brands may be claimed as the property of others. Other vendors are listed by Intel as a convenience to Intel's general customer base, but Intel does not make any representations or warranties whatsoever regarding quality, reliability, functionality, or compatibility of these devices. This list and/or these devices may be subject to change without notice. Copyright 2012, Intel Corporation. All rights reserved.

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Agenda 1. Intels Packet Processing Motivation and Value Proposition 2. Overview of Intel DPDK

3. Intel DPDK Performance Benchmarks 4. Lead Ecosystem Offerings


5. Intel DPDK Website and Collateral 6. Summary

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

CID Mission TRANSFORMING COMMUNICATIONS


ACCESS NETWORKS EDGE/CORE NETWORKS ENTERPRISE NETWORKS

+
Intel Architecture Ecosystem + Standards
WIRELESS BASE STATION WIRELESS INFRASTRUCTURE INTELLIGENT EDGE ROUTERS AND SWITCHES

MEDIA PROCESSING

NETWORK APPLIANCES

NETWORK SECURITY

>10 Years Serving Communications and Networking Segments


4

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

4:1 Workload Consolidation Assets


Unleashed by Multi-Core IA and Software
2011 Application Processing Control Processing 2012 2013

Intel Xeon Processor C3500/C5500


Intel QuickAssist Software Library

Intel Communications Chipset Series 89xx for Cryptography, Compression

Next Generation

Packet Processing
Signal Processing

NPU/ASIC

Intel Data Plane Development Kit

DSP

DSP

Intel Signal Processing Development Kit

One Instruction Set Architecture

One Tool Suite

Multiple Opportunities

TRANSFORMING COMMUNICATIONS

Intels Data Plane Value Proposition


Datacom/Telecom convergence increasing Data Plane processing requirements exponentially TCO concerns leading customers to seek single architecture design topto-bottom

Intel addresses TCO and TTM concerns with single architecture, multiworkload IA capability, allied to an industry-leading beat-rate of process and uArchitectural advancements (Tick-Tock Model)

Optimized Data Plane Software solutions will help unleash IA platform potential

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

The Engineering Problem Statement


10Gbps Line Rate
16,000,000 14,000,000

From a CPU perspective:


A Last-level-cache (L3) hit on Intel Xeon processor 5500 is ~40 cycles L3 miss, memory read is ~70ns (201 cycles at 3 GHz) This problem gets much worse at 40 Gb and 100 Gb rates

Packets per second

12,000,000 10,000,000 8,000,000 6,000,000 4,000,000 2,000,000

Intel Silicon and Software advances are proactively addressing this problem statement, making high performance Packet Processing on IA a reality!
PPS

0 136 640
100 172 208 244 280 316 352 388 424 460 496 532 568 604 676 712 748 784 820 856 892 928 964

1252

1000

1036

1072

1108

1144

1180

1216

1288

1324

1360

1396

1432

1468

Packet Size

Network Infrastructure Packet Sizes


Packet Size 10G Packets/second Packet arrival rate 64 bytes 14.88 Million each way 67.2 ns

Typical Server Packet Sizes


Packet Size 10G Packets/second 1024 bytes 1.2 Million each way

Packet arrival rate


2 GHz Clock cycles 3 Ghz Clock cycles

835 ns
1670 cycles 2505 cycles

2 GHz Clock cycles


3 Ghz Clock cycles

135 cycles
201 cycles

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

1504

64

Ongoing Silicon Architectural Enhancements


Packet Processing Enhancements: Pipeline Depth Direct Cache Access Integration of Memory Controller Integration of High Bandwidth PCIe Gen3 New AVX Extensions Intel Virtualization Technology (Intel VT) Intel Data Direct I/O Technology (Intel DDIO)
Intel Core 2 Microarchitecture
Hyper-Threading, Smart Cache, QuickPath, Integrated Memory Controller, SSE2/SSE3/SSE4 Instructions
Multi-Core Introduction Advanced Smart Cache Wide Dynamic Execution SSE2/SSE3, Power Management

Intel Xeon processor E52600 Microarchitecture

Intel Core i5 / i7 Microarchitecture


Enhanced Intel Core Microarchitecture 8C, 6C, 4C, 2C Product Choices 1S, 2S, 4S Configurations Intel Hyper-Threading Technology Integrated Memory Controller(s) Integrated High BW PCIe gen3 2 QPI Links for 2S Configurations Up to 20MB of L3 Cache

Intel Pentium 4 Processor Extreme Edition

Supporting Hyper-Threading Technology

Netburst microarchitecture Intel Extended Memory 64 Technology, Hyperthreading

Improved Packet Processing Capability

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

IA Performance over the Years


IPv4 Layer 3 Forwarding on an IA Platform
90

Introduction of Integrated PCIe* Controller

80

70

Introduction of Integrated Memory Controller + Intel DPDK

80 Mpps DPDK R1.1

60

50

40

30

20

10

Mpps

2006 2007 2008 DP Intel Xeon DP Intel Xeon DP Intel Xeon Processor LV Processor E5345 Processor E5410 2 x 2 Core 2 x 4 Core 2 x 4 Core 2.0 GHz, 667 MHz FSB 2.33 GHz, 1333 MHz FSB 2.33 GHz, 1333 MHz FSB

2009 2S Intel Xeon processor E5540 2 x 4 Core 2.53 GHz

2010 2S Intel Xeon processor E5645 2 x 6 Core 2.40 GHz

2011 1S Intel Xeon Processor E5-2600 (B0 stepping) 1 x 8 Core 2.0 GHz PCIe* Gen2 Performance

Standard off-the-shelf IA platform can deliver huge performance. Performance jump can be attributed to Core, Memory architecture (iMC) + Intel DPDK
9

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Evolution of Data Plane Support


Integrated Memory Controller SSE 4.2 128-bit SSE CRC32 Faster Cache Data Direct IO 1 GB pages Faster AES instructions

Software & Hardware performance enhancements over the next 2-3 years

Core

Increasing Cache Sizes Increasing Core Count

PCIe* Gen 2

Integrated PCI-E (Latency) PCIe* Gen 3

40G Networking silicon (Intel)

Platform

Increasing I/O bandwidth


Increasing memory bandwidth

Acceleration

AES-NI instruction

Intel Communications Chipset Series 89xx for Cryptography, Compression

Next Gen Chipset for Cryptography, Compression

2010/2011 Intel Xeon processors E5540/E5640

2012/2013 Intel Xeon processor E5-2600

2013/14 Next Tick/Tock Processors

1010

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Management, Control & Data Plane Environments


Typically more Data Plane elements than Management/Control Plane
Ecosystem or Data Proprietary Plane

Packet Processing Packet Processing Processor Switch Switch Processor Packet Processing Packet Processing

Intel Provided

Intel DPDK

Control Plane Data Plane

Processor Blade Control Plane

Packet Processing Blade Data Plane

Rack Mount Server, Enterprise Servers


Typically have Control and Data Plane on same board

AdvancedTCA*
Typically Control and Data Plane on different boards

Value Proposition is Consolidation of DP + CP


Customers have come up with solutions to add additional 3rd party boards for Data Plane IA can fill both roles

Value Proposition single blade for multiple purposes


Normally many more Packet Processing Blades IA allows a single Blade Architecture for both now

Value proposition of Intel DPDK is workload consolidation: Provides framework and performance for NPU workloads on IA cores
1111

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Agenda 1. Intels Packet Processing Motivation and Value Proposition 2. Overview of Intel DPDK

3. Intel DPDK Performance Benchmarks 4. Lead Ecosystem Offerings


5. Intel DPDK Website and Collateral 6. Summary

1212

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Goals of this Overview Section


Look into the Intel DPDK architecture and see how its designed to squeeze the best packet processing performance out of an IA-based platform Be able to articulate the most common performance bottlenecks for packet processing software on IA Understand the optimization tricks to remove them

1313

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

The Intel DPDK Philosophy


Intel DPDK Fundamentals
Implements a run to completion model or pipeline model No scheduler - all devices accessed by polling Supports 32-bit and 64-bit with/without NUMA Scales from Intel Atom to Intel Xeon processors Number of Cores and Processors not limited Optimal packet allocation across DRAM channels

Control Plane

Data Plane

Must run on any IA CPU


From Intel Atom processor to the latest Intel Xeon processor family Essential to the IA value proposition

Provide software examples that address common network performance deficits


Best practices for software architecture Tips for data structure design and storage Help the compiler generate optimum code Address the challenges of achieving 80 Mpps per CPU Socket

Focus on the fast-path


Sending large number of packets to the Linux Kernel /GPOS will bog the system down

1414

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Intel Data Plane Development Kit (Intel DPDK)


Intel DPDK embeds optimizations for the IA platform: - Data Plane Libraries and Optimized NIC Drivers in Linux
User Space
Queue & Buffer Management, Packet Flow Classification, Poll-Mode NIC Drivers (1/10GbE), and more! Simple API Interface, Uses standard tool chain (gcc/icc, gdb, profiling tools)

Intel DPDK Libraries


Buffer Management Queue/Ring Functions

Customer Application

Packet Flow Classification


NIC Poll Mode Library

Customer Application
Customer Application

- Run-time Environment
Low overhead, run-to-completion model optimized for fastest possible data plane performance

Environment Abstraction Layer

- Environment Abstraction Layer and Boot Code


Primarily platform-specific boot guidelines and initialization code, eases application porting effort

User Space Kernel Space


Environment Abstraction Layer

- BSD-licensed & source downloadable from Intel and leading ecopartners


Provided under a very flexible BSD licensing model
Offered as a free, unsupported standalone solution by Intel or as part of commercial solutions and offerings from leading ecopartners

Linux Kernel

Platform Hardware

The Intel DPDK is a great starting point for customers and the industry in general delivering breakthrough packet processing performance.
1515

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Intel DPDK Libraries and Drivers


Memory Manager: Responsible for allocating pools of objects in memory. A pool is created in huge page memory space and uses a ring to store free objects. It also provides an alignment helper to ensure that objects are padded to spread them equally on all DRAM channels. Buffer Manager: Reduces by a significant amount the time the operating system spends allocating and de-allocating buffers. The Intel DPDK pre-allocates fixed size buffers which are stored in memory pools. Queue Manager:: Implements safe lockless queues, instead of using spinlocks, that allow different software components to process packets, while avoiding unnecessary wait times. Flow Classification: Provides an efficient mechanism which incorporates Intel Streaming SIMD Extensions (Intel SSE) to produce a hash based on tuple information so that packets may be placed into flows quickly for processing, thus greatly improving throughput. Poll Mode Drivers: The Intel DPDK includes Poll Mode Drivers for 1 GbE and 10 GbE Ethernet* controllers which are designed to work without asynchronous, interrupt-based signaling mechanisms, which greatly speeds up the packet pipeline.

1616

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Component Overviews
EAL Memory Management Overview Queue/Ring Overview Buffer Management Overview Flow Classification Overview

1717

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Memory Usage
Basic unit for runtime object allocation is the memory zone Zones contain rings, pools, LPM routing tables, or any other performance-critical structures Always backed by Huge Page (2 MB/1 GB page) memory
Ring:
Ring:
Memory Pool: mbuf_pool Memory Zone: MP_mbuf_pool

RX_RING_0

TX_RING_0

Memory Zone:
RG_RX_RING_0

Memory Zone:
RG_TX_RING_0

Memory Segment 0

Memory Segment 1

Memory Segment N

2MB page

2MB page

2MB page

2MB 2MB page page

2MB 2MB page page

2MB 2MB page page

2MB 2MB page page

2MB 2MB page page

Physically contiguous memory


1818

Physically contiguous memory

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Queue/Ring Management API


Effectively a FIFO implementation in software
Lockless implementations for single or multi-producer, single or multi- consumer enqueue/dequeue Supports bulk enqueue/dequeue to support packet-bunching Implements high & low watermark thresholds for back-pressure/flow control

Essential to optimizing throughput


Used to decouple stages of a pipeline

1919

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

The Buffer Management API (mempool)


The Buffer Manager allocates memory from the EAL and creates pools with fixed element sizes.
Typical usage is packet buffers, descriptor ring buffers, etc. Intent is to speed up runtime allocation/deallocation Does not support runtime resizing of pools

Memory Pool
Pkt Buffers (60K 2K buffers) Events (2K 100B buffers) Events (2K 100B buffers)

Multi-producer/multi-consumer safe
Pools are based on Intel DPDK rings so are multiproducer and multi-consumer safe No locking; use CAS instructions Pools can also be used in multi-process environments

Rings for Cached buffers


Data Plane

Processor 0

Optimized for performance


Cache alignment Per core buffer caches for each buffer pool so that allocation/freeing can be done without using shared variables Bulk allocation/freeing support

Data Plane

Data Plane

Data Plane

Intel DPDK C1

Intel DPDK C2

Intel DPDK C3

Intel DPDK C4

10G

10G

2020

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Flow Classification API

The Intel DPDK provides a Flow Classification API

Not expecting every customer to use it more of a showcase to demonstrate how to do optimization for IA Classification is something that is very customer-specific Each customer/segment has different needs
Router implementations typically use longest-prefix-match Security implementations need to identify individual flows and can use flow classification

The Flow classification API is designed to take advantage of current and future hardware-based flow classification capabilities
Intel 82599 10GbE Ethernet Controller implements flow classification (limited in number of flows) Future Chipsets are expected to implement an extensive classifier

2121

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

30,000 ft Overview of Packet Flow


1. Initialization

Initialization

Initialize memory zones and pools Initialize devices and device queues Start the packet forwarding application
Poll devices RX queues and receive packets in bursts Allocate new RX buffers from per queue memory pools to stuff into descriptors Transmit the received packets from RX Free the buffers used to store the packets

Polling

2. Packet Reception (RX)


RX
Packets to send

3. Packet Transmission (TX)


TX

2222

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Overcoming The Challenge of Achieving 80 Mpps (and More) Per CPU Socket

Memory and PCIe* access is really, really slow compared to CPU operation
Process a bunch of packets (e.g. 4 packets at a time) to minimize external memory and PCIe bandwidth. Avoid read-modify-write transactions in favour of single write, and multiple reads in favour of single read.

Data doesnt seem to be near the CPU when it needs it (and so it waits)
For memory access, use HW or SW controlled prefetching and align data structures to cache line size (64 Byte) to minimize external memory and PCIe* bandwidth, as all external memory accesses are in cache line increments; for PCIe access, use Direct Data IO (available on Intel Xeon processor E5 Product Family) to read data directly into cache.

The system cant keep up with the amount of interrupts for packet Rx
Switch from an interrupt-driven network device driver to a polled-mode driver.

Access to shared data structures is a bottleneck in the application


Figure out clever access schemes that reduce the amount of sharing (e.g. use lockless queues for message passing as semaphores/spinlocks are costly).

The out-of-the-box Linux* Scheduler causes too much overhead to task switch
Bind a single software thread to a logical core. Use CPU core isolation and thread affinities for 1:1 mapping of SW threads to HW threads.

Intel PTU Tool indicates that page tables are constantly evicted (D-TLB Thrashing)
Use 2MB or 1G Huge Pages in Linux* to reduce TLB misses.

The Challenge can be Overcome with Smart Programming and Hardware assists!!
2323

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

PCIe* connectivity and core usage


Using run-to-completion or pipeline software models NUMA
Physical Core 0 Linux* Control Plane
Pkt Pkt

Pool Caches Queue/Rings Buffers

NUMA

Processor 0

QPI
PCIe
Pkt Pkt

Pool Caches Queue/Rings Buffers

Processor 1
10 GbE Physical Rx Core 0 Tx Intel DPDK Physical Core 1 Intel DPDK

PMD Packet I/O Flow Classification

10 GbE

PCIe

Physical Core 1 Intel DPDK

PMD Packet I/O Packet work

Rx Tx

PCIe

10 GbE
Pkt

App A

App B

App C

Pkt

Physical PMD Packet I/O Core 2 Flow work Intel DPDK


Physical PMD Packet I/O Core 3 Flow Classification Intel DPDK App A, B, C Physical PMD Packet I/O Core 4 Flow Classification Intel DPDK App A, B, C Physical PMD Packet I/O Core 5 Flow Classification Intel DPDK App A, B, C

Rx Tx

10 GbE

Physical Core 2 Intel DPDK Physical Core 3 Intel DPDK

PCIe PCIe

App A

App B

App C

Rx Tx
Pkt Pkt

Rx Tx

10 GbE

RSS Mode

Physical Core 4 Intel DPDK


Physical Core 5 Intel DPDK

Rx Tx

Run to Completion model I/O and Application workload can be handled on a single core I/O can be scaled over multiple cores

Pipeline model I/O application disperses packets to other cores Application work performed on other cores

2424

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Tools for Optimizing Intel DPDK

Intel VTune Amplifier XE profiles performance

Intel Performance Tuning Utility (Intel PTU) offers specific tuning advice

Download both tools at whatif.intel.com


2525

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Agenda 1. Intels Packet Processing Motivation and Value Proposition 2. Overview of Intel DPDK

3. Intel DPDK Performance Benchmarks 4. Lead Ecosystem Offerings


5. Intel DPDK Website and Collateral 6. Summary

2626

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Performance Going Forward


IPv4 Layer 3 Forwarding on an IA Platform
250

200

150

Introduction of Integrated Memory Controller + Intel DPDK

Introduction of Integrated PCIe* Controller

Testing with PCIe* Gen3 expected to be higher

164.9 Mpps DPDK R1.2

100

50

93 Mpps DPDK R1.2

Mpps

2009 2S Intel Xeon processor E5540 2 x 4 Core 2.53 GHz

2010 2S Intel Xeon processor E5645 2 x 6 Core 2.40 GHz

2011 1S Intel Xeon E5-2658 processors C1 Stepping 1 x 8 Core 2.1 GHz PCIe Gen2 Performance

2012 2S Intel Xeon E5-2658 processors C1 Stepping 2 x 8 Core 2.1 GHz PCIe Gen2 Performance

2012 2S Intel Xeon E5-2658 processors C1 Stepping 2 x 8 Core 2.1 GHz PCIe Gen3 Performance (Estimate ONLY)

Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.

PCIe* Gen 3 will offer better performance and more options


In the case of IPv4 Layer 3 forwarding, we are still I/O limited i.e. cores capability is not maxed out!!
2727

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Intel DPDK Native and Virtualized Forwarding Performance

2828

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Agenda 1. Intels Packet Processing Motivation and Value Proposition 2. Overview of Intel DPDK

3. Intel DPDK Performance Benchmarks 4. Lead Ecosystem Offerings


5. Intel DPDK Website and Collateral 6. Summary

2929

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Intel DPDK Go To Market Options


Intel DPDK free, unsupported standalone package for integration with proprietary customer stacks
Data Plane Applications

Dispatch Loop Services Memory & Buffer API PMD

Intel DPDK
Core 1 - n

Ecosystem Provided: Intel DPDK Integrated into Commercial Solutions and Intel DPDK services offerings For more information about ecosystem solutions, visit www.intel.com/go/dpdk

Intel Provided: Intel DPDK Free, Unsupported Package

3030

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Agenda 1. Intels Packet Processing Motivation and Value Proposition 2. Overview of Intel DPDK

3. Intel DPDK Performance Benchmarks 4. Lead Ecosystem Offerings


5. Intel DPDK Website and Collateral 6. Summary

3131

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Intel DPDK

www.intel.com/go/dpdk

Your One-Stop-Shop for: Documentation and articles, white papers, pod casts Ecosystem information and articles

Examples

See the Video: Intel Data Plane Development Kit (Intel DPDK). Found on EDC site at www.intel.com/go/dpdk under Video: Intel Data Plane Development Kit (Intel DPDK)

3232

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Agenda 1. Intels Packet Processing Motivation and Value Proposition 2. Overview of Intel DPDK

3. Intel DPDK Performance Benchmarks 4. Lead Ecosystem Offerings


5. Intel DPDK Website and Collateral 6. Summary

3333

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

Summary

Intel DPDK enables multi-workload/single architecture potential by making IA extremely competitive for packet processing workloads

Distribution of enabling software under flexible and cost-free licensing model enabling maximum customer usability
Fully featured and supported IA Data Plane software solutions via Intels lead Ecosystem partners

3434

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

3535

Intel Restricted Secret

TRANSFORMING COMMUNICATIONS TRANSFORMING COMMUNICATIONS

You might also like