Nephron: Fix host traffic aggregations

Description

When calculating the "Other" part that is not covered by the topK hosts the total volume that is used to determine that other part has to be doubled.

Assume the following flows reported for an exporter/interface:

src	dst	direction	bytes
h1	h2	in	b1
h1	h2	out	b2
h1	h3	in	b3
h1	h3	out	b4

The total volume for that exporter/interface is:

in	out
b1+b3	b2+b4

When deriving host aggregations the observed traffic is assigned to both, the src and dst hosts. In case of the example flows the derived host aggregations are:

host	in	out
h1	b1+b3	b2+b4
h2	b1	b2
h3	b3	b4

When the host aggregations are summed up the total is:

in	out
2*(b1+b3)	2*(b2+b4)

In case of a top-1 aggregation the result including the "Other" part is:

host	in	out
h1	b1+b3	b2+b4
Other	2*(b1+b3) - (b1+b3) = b1+b3	2*(b2+b4) - (b2 + b4) = b2+b4

In case of a top-2 aggregation (assuming b1+b2 > b3+b4) the result including the "Other" part is:

host	in	out
h1	b1+b3	b2+b4
h2	b1	b2
Other	2*(b1+b3) - (b1+b3) - b1 = b3	2*(b2+b4) - (b2 + b4) - b2 = b4

h3. Alternatives

There are two alternatives how traffic volumes could be assigned to hosts:

ingress bytes and egress bytes could be divided by two and assigned to both hosts as before
ingress bytes could be assigned to the source host and egress bytes to the destination host

The first alternative has the advantage that host volumes would sum up to the total exporter/interface volume. The disadvantage is that for each host the ingress/egress bytes now are only half of the value reported in flows.

The second alternative also has the advantage that host volumes sum up to the total exporter/interface volume. The ingress / egress numbers separate the traffic when a host was the source host and when a host was the target host. The disadvantage is that the current aggregation scheme that first calculates aggregations for conversations must be changed. Conversations do not distinguish between src and dst host. Therefore they can not be used as the base for deriving host aggregations.

Acceptance / Success Criteria

None

Attachments

Lucidchart Diagrams

Activity

Stefan Wachter September 1, 2021 at 3:56 PM

For now, the Cortex Flow Deep Dive Dashboard doubles the total bytes before subtracting the summed bytes for topK hosts. This must be adapted when we change the aggregation for hosts.

Details
Assignee
Stefan Wachter
Reporter
Stefan Wachter
Labels
FRC
HB Backlog Status
Reviewed
FRC Grooming Date
Mar 08, 2022
Story Points
13
Priority
Minor

PagerDuty

Created August 25, 2021 at 10:31 AM

Updated March 8, 2022 at 8:58 PM

Nephron: Fix host traffic aggregations

Description

Acceptance / Success Criteria

Attachments

Lucidchart Diagrams

Activity

Stefan Wachter September 1, 2021 at 3:56 PM

DetailsAssigneeStefan WachterStefan WachterReporterStefan WachterStefan WachterLabelsFRCHB Backlog StatusReviewedFRC Grooming DateMar 08, 2022Story Points13PriorityMinor

Details

Assignee

Reporter

Labels

HB Backlog Status

FRC Grooming Date

Story Points

Priority

PagerDutyPagerDuty Incident

PagerDuty

Details
Assignee
Stefan Wachter
Reporter
Stefan Wachter
Labels
FRC
HB Backlog Status
Reviewed
FRC Grooming Date
Mar 08, 2022
Story Points
13
Priority
Minor

PagerDuty