sonic-net · mgheorghe · Aug 8, 2022 · Aug 8, 2022 · Aug 10, 2022 · Aug 16, 2022
diff --git a/test/confgen/.gitignore b/test/confgen/.gitignore
@@ -0,0 +1,8 @@
+*.bkp
+*.log
+__pycache__/
+*.pyc
+*.json
+
+
+!sample_dash_conf.json
diff --git a/test/confgen/README.md b/test/confgen/README.md
@@ -0,0 +1,142 @@
+# Overview
+The intention of the dash config generator is to provide a way to build a large scale dash config. and to provide some insight into the actual values used in the test.
+
+The dash config format is close to [dash-reference-config-example.md](../../../documentation/gnmi/design/dash-reference-config-example.md) with small changes.
+
+This is not yet a config that can be deployed, but intends to be morphed into one as DASH standards get ratified and implemented
+
+## Features
+* Generate complex DASH configurations
+* Custom input parameters for scale and other options.
+* Output to file or stdio
+* Select output format: `JSON` [future: `yaml`?]
+* Generate all config (uber-generator) or just selected items (e.g. `aclgroups`)
+* Potential to create custom apps to transform streaming data e.g into device API calls w/o intermediate text rendering
+## High-level Diagram
+
+![confgen-hld-diag](images/confgen-hld-diag.svg)
+
+## Design
+The uber-generator `generate.d.py` instantiates sub-generators and produces a composite output stream which can be rendered into text files (JSON) or sent to stdout for custom pipelines.
+
+The uber-generator and sub-generators all derive from a base-class `ConfBase`. They all share a common `main` progam with CLI command-line options, which allows them to be used independently yet consistently.
+
+The generators produce Python data structures which can be rendered into output text (e.g. JSON) or used to feed custom applications such as a saithrift API driver, to directly configure a device. Likewise a custom API driver can be developed for vendor-specific APIs.
+
+Default parameters allow easy operations with no complex input. All parameters can be selectively overridden via cmd-line, input file or both.
+## Confgen Applications
+Two anticipated applications (see Figure below):
+* Generate a configuration file, e.g. JSON, and use this to feed downstream tools such as a DUT configuration utility.
+* Use the output of the config data stream generators to perform on-the-fly DUT configuration without intermediate JSON file rendering; also configure traffic-generators using data in the config info itself.
+
+![confgen-apps](images/confgen-apps.svg)
+
+## Sample CLI Usage
+This may not be current; check latest for actual content.
+```
+$ ./generate.d.py -h
+usage: generate.d.py [-h] [-f {json}] [-c {dict,list}] [-d] [-m] [-M "MSG"] [-P "{PARAMS}"] [-p PARAM_FILE]
+                     [-o OFILE]
+
+Generate DASH Configs
+
+optional arguments:
+  -h, --help            show this help message and exit
+  -f {json}, --format {json}
+                        Config output format.
+  -c {dict,list}, --content {dict,list}
+                        Emit dictionary (with inner lists), or list items only
+  -d, --dump-params     Just dump parameters (defaults with user-defined merged in
+  -m, --meta            Include metadata in output (only if "-c dict" used)
+  -M "MSG", --msg "MSG"
+                        Put MSG in metadata (only if "-m" used)
+  -P "{PARAMS}", --set-params "{PARAMS}"
+                        supply parameters as a dict, partial is OK; defaults and file-provided (-p)
+  -p PARAM_FILE, --param-file PARAM_FILE
+                        use parameter dict from file, partial is OK; overrides defaults
+  -o OFILE, --output OFILE
+                        Output file (default: standard output)
+
+Usage:
+=========
+./generate.d.py                - generate output to stdout using uber-generator
+./generate.d.py -o tmp.json    - generate output to file tmp.json
+./generate.d.py -o /dev/null   - generate output to nowhere (good for testing)
+./generate.d.py -c list        - generate just the list items w/o parent dictionary
+dashgen/aclgroups.py [options] - run one sub-generator, e.g. acls, routetables, etc.
+                               - many different subgenerators available, support same options as uber-generator
+
+Passing parameters. Provided as Python dict, see dflt_params.py for available items
+================
+./generate.d.py -d                          - display default parameters and quit
+./generate.d.py -d -P PARAMS                - override given parameters, display and quit; see dflt_params.py for template
+./generate.d.py -d -p PARAM_FILE            - override parameters in file; display only
+./generate.d.py -d -p PARAM_FILE -P PARAMS  - override params from file, then override params from cmdline; display only
+./generate.d.py -p PARAM_FILE -P PARAMS     - override params from file, then override params from cmdline, generate output
+
+Examples:
+./generate.d.py -d -p params_small.py -P "{'ENI_COUNT': 16}"  - use params_small.py but override ENI_COUNT; display params
+./generate.d.py -p params_hero.py -o tmp.json                 - generate full "hero test" scale config as json file
+dashgen/vpcmappingtypes.py -m -M "Kewl Config!"               - generate dict of vpcmappingtypes, include meta with message            
+```
+
+# TODO
+* Reconcile the param dicts vs. param attributes obtained via Munch, use of scalar variables inside performance-heavy loops etc. There is a tradeoff between elegance, expressiveness and performance.
+# IDEAS/Wish-List
+* Expose yaml format, need to work on streaming output (bulk output was working, but slow).
+* Use logger instead of print to stderr
+* logging levels -v, -vv, -vvv etc., otherwise silent on stderr
+* -O, --optimize flags for speed or memory (for speed - expand lists in-memory and use orjson serializer, like original code)
+* Use nested generators inside each sub-generator, instead of nested loops, to reduce in-memory usage; may require enhancing JSON output streaming to use recursion etc.
+# Logic
+ACLs and Routes should not be summarized.
+
+```
+we start with the `ENI_COUNT`
+for each eni we allocate a `MAC_L_START` and `IP_L_START`
+  when moving to next ENI we increment the mac by `ENI_MAC_STEP` and the ip by `IP_STEP4`
+  each eni has `ACL_TABLE_COUNT` inbound and `ACL_TABLE_COUNT` outbound NSGs
+
+  ACLs:
+    each NSG has `ACL_RULES_NSG` acl rules 
+    each acl rule has `IP_PER_ACL_RULE` ip prefixes
+    the acl rules priorities are alternating allow and deny
+    odd/even ip's are allocated to allow/deny rules
+    no ips from inbound are repeated in outbound rules or other way around  except a last rule in the last table of each direction that will allow the traffic to flow
+
+  Static VxLAN map:
+    not all ips will have a map entry only the first `IP_MAPPED_PER_ACL_RULE` ips from each acl rule will have a ip/mac map entry.
+
+  Routing:
+    all allow ips will have a route as well as some deny ips.
+    route allocation is controlled by `IP_ROUTE_DIVIDER_PER_ACL_RULE`
+    the ips of each acl will be divided in groups of `IP_ROUTE_DIVIDER_PER_ACL_RULE`
+    there will be routes for all but one ip in each group in such a way to prevent route summarization.
+
+    lets say `IP_ROUTE_DIVIDER_PER_ACL_RULE` is 8
+    we will have a route for:
+        1.128.0.0/30
+        1.128.0.4/31
+        1.128.0.7/32
+        and then it repeats
+        1.128.0.8/30......
+    this way there is no route for 1.128.0.6
+```
+
+# Scale
+
+| 8 ENI Scenario  |  required |   generated config number |
+| ----------------| --------- |---------------------------|
+| ENI's/VPorts    | 8         | ENI_COUNT                            |
+| NSGs            | 48        | ENI_COUNT * 2[^1] * ACL_TABLE_COUNT  |
+| ACL rules       | 48000     | NSG * ACL_RULES_NSG                  |
+| Prefixes        | 9.6M      | ACL * IP_PER_ACL_RULE                |
+| Mapping Table   | 2M        | ACL * IP_MAPPED_PER_ACL_RULE         |
+| Routes          | 1.6M      | ACL * (IP_PER_ACL_RULE / IP_ROUTE_DIVIDER_PER_ACL_RULE) *  log(IP_ROUTE_DIVIDER_PER_ACL_RULE, 2)  |
+
+# Sample (low scale)
+[sample_dash_conf.json](sample_dash_conf.json)
+
+
+[^1]: we multiply by 2 because we have inbound and outbound
+
diff --git a/test/confgen/dashgen/__init__.py b/test/confgen/dashgen/__init__.py
@@ -0,0 +1,11 @@
+import dashgen.confbase
+import dashgen.confutils
+import dashgen.dflt_params
+import dashgen.enis
+import dashgen.aclgroups
+import dashgen.prefixtags
+import dashgen.vpcs
+import dashgen.vpcmappings
+import dashgen.vpcmappingtypes
+import dashgen.routetables
+import dashgen.routingappliances
diff --git a/test/confgen/dashgen/aclgroups.py b/test/confgen/dashgen/aclgroups.py
@@ -0,0 +1,105 @@
+#!/usr/bin/python3
+
+from dashgen.confbase import *
+from dashgen.confutils import *
+from copy import deepcopy
+import sys
+class AclGroups(ConfBase):
+
+    def __init__(self, params={}):
+        super().__init__('acl-groups', params)
+
+    def items(self):
+        self.numYields = 0
+        print('  Generating %s...' % self.dictname, file=sys.stderr)
+        p=self.params
+        cp=self.cooked_params
+        IP_STEP1=cp.IP_STEP1
+        IP_STEP2=cp.IP_STEP2
+        IP_STEP3=cp.IP_STEP3
+        IP_STEP4=cp.IP_STEP4
+        IP_STEPE=cp.IP_STEPE
+        IP_R_START=cp.IP_R_START
+        IP_L_START=cp.IP_L_START
+        ACL_TABLE_COUNT=p.ACL_TABLE_COUNT
+        ACL_RULES_NSG=p.ACL_RULES_NSG
+        IP_PER_ACL_RULE=p.IP_PER_ACL_RULE
+
+        for eni_index in range(1, p.ENI_COUNT + 1):
+            local_ip = IP_L_START + (eni_index - 1) * IP_STEP4
+            l_ip_ac = deepcopy(str(local_ip)+"/32")
+
+            for table_index in range(1, (ACL_TABLE_COUNT*2+1)):
+                table_id = eni_index * 1000 + table_index
+
+                rules = []
+                rappend = rules.append
+                for ip_index in range(1, (ACL_RULES_NSG+1), 2):
+                    rule_id_a = table_id * 10 * ACL_RULES_NSG + ip_index
+                    remote_ip_a = IP_R_START + (eni_index - 1) * IP_STEP4 + (
+                        table_index - 1) * 4 * IP_STEP3 + (ip_index - 1) * IP_STEP2
+
+                    ip_list_a = [str(remote_ip_a + expanded_index * IP_STEPE)+"/32" for expanded_index in range(0, IP_PER_ACL_RULE)]
+                    ip_list_a.append(l_ip_ac)
+
+                    rule_a = {
+                        "priority": ip_index,
+                        "action": "allow",
+                        "terminating": False,
+                        "src_addrs": ip_list_a[:],
+                        "dst_addrs":  ip_list_a[:],
+                    }
+                    rappend(rule_a)
+                    rule_id_d = rule_id_a + 1
+                    remote_ip_d = remote_ip_a + IP_STEP1
+
+                    ip_list_d = [str(remote_ip_d + expanded_index * IP_STEPE)+"/32" for expanded_index in range(0, IP_PER_ACL_RULE)]
+                    ip_list_d.append(l_ip_ac)
+
+                    rule_d = {
+                        "priority": ip_index+1,
+                        "action": "deny",
+                        "terminating": True,
+                        "src_addrs": ip_list_d[:],
+                        "dst_addrs":  ip_list_d[:],
+                    }
+                    rappend(rule_d)
+
+                # add as last rule in last table from ingress and egress an allow rule for all the ip's from egress and ingress
+                if ((table_index - 1) % 3) == 2:
+                    rule_id_a = table_id * 10 *ACL_RULES_NSG + ip_index
+                    all_ipsA = IP_R_START + (eni_index - 1) * IP_STEP4 + (table_index % 6) * 4 * IP_STEP3
+                    all_ipsB = all_ipsA + 1 * 4 * IP_STEP3
+                    all_ipsC = all_ipsA + 2 * 4 * IP_STEP3
+
+                    ip_list_all = [
+                        l_ip_ac,
+                        str(all_ipsA)+"/14",
+                        str(all_ipsB)+"/14",
+                        str(all_ipsC)+"/14",
+                    ]
+
+                    rule_allow_all = {
+                        "priority": ip_index+2,
+                        "action": "allow",
+                        "terminating": "true",
+                        "src_addrs": ip_list_all[:],
+                        "dst_addrs":  ip_list_all[:],
+                    }
+                    rappend(rule_allow_all)
+
+                acl_group = deepcopy(
+                    {
+                        "ACL-GROUP:ENI:%d:TABLE:%d" % (eni_index, table_id): {
+                            "acl-group-id": "acl-group-%d" % table_id,
+                            "ip_version": "IPv4",
+                            "rules": rules
+                        }
+                    }
+                )
+                self.numYields+=1
+                yield acl_group
+
+if __name__ == "__main__":
+    conf=AclGroups()
+    common_main(conf)
diff --git a/test/confgen/dashgen/confbase.py b/test/confgen/dashgen/confbase.py
@@ -0,0 +1,91 @@
+import ipaddress
+from abc import ABC, abstractmethod
+from copy import deepcopy
+from datetime import datetime
+
+import macaddress
+from munch import DefaultMunch
+
+from dashgen.dflt_params import *
+
+ipp = ipaddress.ip_address
+macM = macaddress.MAC
+
+
+class ConfBase(ABC):
+
+    def __init__(self, name='base', params={}):
+        self.dictname = name
+        self.dflt_params = deepcopy(dflt_params)
+        self.cooked_params = {}
+        self.mergeParams(params)
+        self.numYields = 0
+
+    def mergeParams(self, params):
+        # Merge provided params into/onto defaults
+        self.params_dict = deepcopy(self.dflt_params)
+        self.params_dict.update(params)
+
+        # make scalar attributes for speed & brevity (compared to dict)
+        # https://stackoverflow.com/questions/1305532/how-to-convert-a-nested-python-dict-to-object
+        self.cookParams()
+        self.params = DefaultMunch.fromDict(self.params_dict)
+        # print ('%s: self.params=' % self.dictname, self.params)
+        self.cooked_params = DefaultMunch.fromDict(self.cooked_params_dict)
+        # print ("cooked_params = ", self.cooked_params)
+
+    def cookParams(self):
+        self.cooked_params_dict = {}
+        for ip in [
+            'IP_STEP1',
+            'IP_STEP2',
+            'IP_STEP3',
+            'IP_STEP4',
+            'IP_STEPE'
+        ]:
+            self.cooked_params_dict[ip] = int(ipp(self.params_dict[ip]))
+        for ip in [
+            'IP_L_START',
+            'IP_R_START',
+            'PAL',
+            'PAR'
+        ]:
+            self.cooked_params_dict[ip] = ipp(self.params_dict[ip])
+        for mac in [
+            'MAC_L_START',
+            'MAC_R_START'
+        ]:
+            self.cooked_params_dict[mac] = macM(self.params_dict[mac])
+
+    @abstractmethod
+    def items(self):
+        pass
+
+    # expensive - runs generator
+    def itemCount(self):
+        return len(self.items())
+
+    def itemsGenerated(self):
+        """ Last count of # yields, reset each time at begining"""
+        return self.num_yields
+
+    def dictName(self):
+        return self.dictname
+
+    def toDict(self):
+        return {self.dictname: list(self.items())}
+
+    def getParams(self):
+        return self.params_dict
+
+    def getMeta(self, message=''):
+        """Generate metadata. FOr reference, could also add e.g. data to help drive tests"""
+        return {'meta': {
+            'tstamp': datetime.now().strftime("%m/%d/%Y, %H:%M:%S"),
+            'msg': message,
+            'params': self.getParams()
+        }
+        }
+
+    def pretty(self):
+        pprint.pprint(self.toDict())