RARE routing platform - CERN Indico

36
RARE routing platform Potential usage in data-centres network www.geant.org LOUI Frédéric GÉANT/RENATER – RARE technical leader WLCG Data Center network architecture 7-8 th 2021 Public MATE Csaba GÉANT/KIFU – RARE/freeRtr lead core developer

Transcript of RARE routing platform - CERN Indico

RARE routing platformPotential usage in data-centres network

www.geant.org

LOUI FrédéricGÉANT/RENATER – RARE technical leader

WLCG Data Center network architecture7-8th 2021Public

MATE CsabaGÉANT/KIFU – RARE/freeRtr lead core developer

2 www.geant.org

• funded project• Control plane software• Programmable dataplane• Interface them and the result is …

• Feature rich routing platform• various hardware line rate• Flexible, DIY “hackable/extensible” router• Control plane independence

RARE project : Group focus

3 www.geant.org

Why RARE now?• Starting from early 2010:

• Several valuable Open Source control plane usage besides well know commercial vendor

• Starting from 2020:• Dataplane solution reached maturity ready to implement production grade use case

• NOS emergence

• Hypervisor Technology convergence

4 www.geant.org

RARE use casesBackbone

LSR/P

MANDC

MANIXP

COREAGGREGATIONACCESSCPE / SOHO

AggregationPE /MAN

BackboneLER/PE

Backbone component (i.e. BGP RR)

5 www.geant.org

Anatomy of a typical R&E worldwide research project #1

High speed Research & Education Network

Research project’sInstruments

Data processingcomputing center

Eyeballs

6 www.geant.org

Anatomy of a typical R&E worldwide research project #2

7 www.geant.org

RARE is for everyone• Routing (CP+DP) platform solution

• Open Platform• Programmable

• RARE for Research and Education connectivity• Emerging NREN• Or not …

• RARE for content provider DCI• IaaS owned by NREN• IaaS owned by International Global Research project

• RARE for end user institution• Primary/Secondary schools• University campus• MAN network for Regional network

• RARE for International Global research project connectivity• Network research• Science research

8

RARE latest news (Month 29 of 48)

• RARE p4 targetsbmv2 software switch

Programmable Ethernet ASIC on WEDGE-BF100-32X

under study

• RARE p4 discussion emulation targets

TOFINO

9

RARE “target” development

Code / Algorithmvalidation

(Learning reference)

Code portHardware validation

(Core backbone use cases)

DPDK Code port validation

(Access layer)

10 www.geant.org

11

RARE testing framework : ~ 2300 features = 2300 tests

12

RARE testing framework: Dataplane tests ~300 tests

And more features !

13 www.geant.org

RARE P4 switch

10GE link

PAR0001

FRA0001

AMS0001 POZ0001

BUD0001

2/0

1/0

1/0

2/0

1/0

2/0

1/0

2/0

3/0 VLAN 2200

VLAN 3068

VLAN 1916

x/y VLAN 701

3/0

x/y

x/y

VLAN x

VLAN 101

AMS0101

x/y

44:A8:42:09:D3:AC

VLAN 702

LON0301 (DTN)

7c:fe:90:9e:89:20

3/0VLAN 1213

VLAN 101

HAM0101

44:A8:42:09:D3:AC

VLAN 702

PAR0301 (DTN)

7c:fe:90:9e:89:d0

x/y

VLAN 766x/y

GVA0001x/x VLAN 559

VLAN 10

VLAN 1955

VLAN 20

4/0

100GE link

BUD1001

POZ0201

14 www.geant.org

RARE validation designs: P4 LAB network management via *!(Network Management as a Service)

P4 Testbed OOBMVPLS or EVPN

European Testbed

NMaaS

micro-services

Internet

Network Management as a Service:https://nmaas.euhttps://wiki.geant.org/display/NMAAS

15 www.geant.org

P4 LAB network management via (Network Management as a Service)

Network Management as a Service:https://nmaas.euhttps://wiki.geant.org/display/NMAAS

16 www.geant.org

Monitoring at node level! (Prometheus agent)

17 www.geant.org

Monitoring at node level! (Grafana dashboard)

https://grafana.com/grafana/dashboards?search=freeRouter

18 www.geant.org

Key take-away – We are ready to roll into production

• Automated testing• 3rd party testing via Spirent usage • (thanks PSNC@WB team)

• P4 profile calibration for only• currently in operation SOHO• Production deployment

• Work in progress production deployment

TOFINO

19 www.geant.org

Let’s get practical … and present you

actual real life use cases !

20 www.geant.org…

Upstream Service ProviderProviding inter-DCI

Virtual gateway

Spine

Leaf

ToR

Practical use case #001 100GE DC network

21 www.geant.org

Practical use case #001a ToR

• 2x100GE uplink toward LEAF nodes• 30x100GE server blade connection• Each server powered by DPDK NIC WEDGE-100BF-32X

22 www.geant.org

Practical use case #001b BNG / Virtual Gateway DC

• DPDK and P4 dataplaneèsuitable for CAMPUS / EDGE BACKBONE router

• nx1GE, nx10GE, nx100GE BF-2556X-1T

TOFINO

23 www.geant.org

Practical use case #001c LSR or SPINE router

• P4 dataplane fits perfectly pure LSR/SPINE router • NNI: 4 directions with (8x100GE) bundle

WEDGE-100BF-32X

TOFINO

24 www.geant.org

Practical use case #001d LER or LEAF router

• P4 dataplane fits perfectly pure LER/LEAF use case • NNI: EST/WEST direction @ (8x100GE) bundle• UNI: 16x100GE left for end user connection! WEDGE-100BF-32X

25 www.geant.org

Practical use case #001e high performance BGP RR

• Recycling old/new server?• cluster using BGP as CNI network plugin• Taking advantage of server « huge » amount of RAM• No need specific high performance dataplane

26 www.geant.org

Practical use case #001f « small PE/L3 ToR »

Ideal for aggregation• 2x10GE or 2x100GE NIC server side• 2x10g+48x1g or 1x100g+48x1/10g switch

27 www.geant.org

Practical use case #002 SOHO router

• DPDK flavor ideal for CPE• nx1GE • nx10GE small MAN ideal for small campus• Couple of 100GE (Depending on server generation)

28 www.geant.org

Practical use case #003 100GE Private Peering node

• High resilient Packet core• 2 direction @ 400Gb / 1,6 Tbps

• User ports connection• 24 ports left for 2x12 redundant Private peering• 1:3 ratio with redundant scenario

C1 C-2

C5

C4 C3

C8

C7RR1 RR2

PPN1

…eBGP peers

29 www.geant.org

Practical use case #xxx The sky is the limit

• Automation integration• IXP with MPLS core• ToR router combined to BGP aware network plugin• Spine/Leaf DC router• Global BGP monitoring for your entire BGP domain• Global IGP guard for your entire IGP domain• BGP flowspec aware anti DDOS• AAA servers (TACACS, RADIUS)• …

30 www.geant.org

Key take-away – Final words – RARE vision

• Open Network programming opportunity• R&E small institution• R&E global project (100GE is real, 400GE just landed)

• Opportunity to define Node/Flow Network Monitoring • Scaling new NMS (horizontal scaling with K8s)• Streaming Telemetry• INTèRethink how Network Management is handled

• Opportunity to integrate existing automation initiatives

31 www.geant.org

Acknowledgements …

32 www.geant.org

Useful links

• ProjectfreeRtr control plane’s home: freertr.netmore information on dataplanes: rare.freertr.netProject members’ journey: blog.freertr.netFreeRtr configuration guide: docs.freertr.net

• ContactFor daring RARE/freeRtr users: [email protected] RARE/freeRtr JEDI developer wanabee: [email protected] RARE/freeRtr supporters

IRC@DN42 #freertr@rare_freerouter

33 www.geant.org

Useful links: Source code!!!!!

34 www.geant.org

Looking ahead: Finalize transition to production

Extend HCL:new TOFINO based hardware supportnew DPDK release

New target:TOFINO2DPU

SmartNICFPGA

New idea:PolkaP42VPPT4P4S ELTELeverage Nix paradigm

And more …

35 www.geant.org

Last word: Worldwide End to end dynamic path ?

+As a CNI plugin

Workers node

Thank you

www.geant.org

Any questions?

© GÉANT Association on behalf of the GN4 Phase 3 project (GN4-3).The research leading to these results has received funding fromthe European Union’s Horizon 2020 research and innovation programme under Grant Agreement No. 856726 (GN4-3).