Create Elastic Agent enrolment tokens in the operator #5846
Conversation
run/e2e-tests tags=agent
Jenkins test this please
LGTM! One thing I'd like to clarify is what happens if the user sets the `FLEET_SERVER_POLICY_ID` env var separately from `policyID`. Should we document the precedence?
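For illustration, one possible precedence rule as a minimal sketch; `resolvePolicyID` and its inputs are hypothetical names, not actual operator code:

```go
package main

import "fmt"

// resolvePolicyID sketches one possible precedence rule: an explicit
// FLEET_SERVER_POLICY_ID env var on the Pod template wins over the
// operator-derived policyID from the Agent spec. Hypothetical helper,
// shown only to make the precedence question concrete.
func resolvePolicyID(podEnv map[string]string, specPolicyID string) string {
	if v, ok := podEnv["FLEET_SERVER_POLICY_ID"]; ok && v != "" {
		return v // user-provided env var takes precedence
	}
	return specPolicyID
}

func main() {
	env := map[string]string{"FLEET_SERVER_POLICY_ID": "user-policy"}
	fmt.Println(resolvePolicyID(env, "eck-policy")) // prints "user-policy"
}
```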
```go
if kb.Status.Health != commonv1.GreenHealth {
	return false, nil // requeue
}
return true, nil
```
I realise this does not work well with external KibanaRefs. I will look into making an actual HTTP request instead here.
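For illustration, a minimal sketch of such a direct check, assuming Kibana's `GET /api/status` endpoint and its 7.x response shape (`status.overall.state`); `isKibanaHealthy` is a hypothetical name and auth/TLS setup is left out:

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/http"
)

// isKibanaHealthy probes Kibana directly via GET /api/status instead of
// relying on the Kibana resource status, which is not available for
// external KibanaRefs. This models the 7.x response shape
// (status.overall.state == "green"); 8.x reports status.overall.level
// instead. Auth and TLS setup are omitted for brevity.
func isKibanaHealthy(kbURL string) (bool, error) {
	resp, err := http.Get(kbURL + "/api/status")
	if err != nil {
		return false, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return false, nil // not ready yet: requeue
	}
	var s struct {
		Status struct {
			Overall struct {
				State string `json:"state"`
			} `json:"overall"`
		} `json:"status"`
	}
	if err := json.NewDecoder(resp.Body).Decode(&s); err != nil {
		return false, err
	}
	return s.Status.Overall.State == "green", nil
}

func main() {
	healthy, err := isKibanaHealthy("http://localhost:5601")
	fmt.Println(healthy, err)
}
```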
It turns out this is a rabbit hole; making external refs work is more complicated. Other things are required: communication between Fleet Server and Elasticsearch needs to use a service token, otherwise Agent will try to talk to Kibana to generate one. I also ran into some weird connectivity issues where Agent seemed to connect to an IP directly without a Host header and failed to connect through the ESS proxy. I am still looking into that, but I might punt external ref support to another PR, if it makes sense at all.
run/e2e-tests tags=agent
@elasticmachine, run elasticsearch-ci/docs
Fixes #5779

See issue for detail on the approach taken. But to repeat the gist: do not expose highly privileged Kibana credentials to Elastic Agent Pods. Instead make the operator do all the Kibana API interactions with a more limited user (possible as of 8.1) and expose only the enrolment token to the Elastic Agent, as would also be the case on a typical bare metal installation.

Tradeoffs:

- Agent rollout needs to wait until Kibana is available (but in the old system the Agents would just crash loop until that was the case, so I consider this actually an improvement).
- The operator needs to stay up to date with Fleet API development. So far the relevant APIs have been fairly stable, with one API being renamed for consistency.
- Each Kibana association creates a Kibana user. Technically we could use a single user to do all the API interactions with Kibana, but that would have broken out of the association mechanism, and it seemed convenient and correct to keep using it.
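For concreteness, a rough sketch of what the operator-side token creation could look like as a plain HTTP call against the Kibana Fleet API. The endpoint path shown (`/api/fleet/enrollment_api_keys`) and the `item.api_key` response field follow the 8.x naming from memory and may differ per version (this is the API renamed for consistency mentioned above); `createEnrollmentToken` is a hypothetical name, and auth/TLS wiring is omitted:

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
)

// createEnrollmentToken sketches operator-side enrolment token creation for
// a given agent policy. Endpoint path and response fields are best-effort
// and version-dependent; auth and TLS setup are omitted for brevity.
func createEnrollmentToken(kbURL, policyID string) (string, error) {
	body, err := json.Marshal(map[string]string{"policy_id": policyID})
	if err != nil {
		return "", err
	}
	req, err := http.NewRequest(http.MethodPost, kbURL+"/api/fleet/enrollment_api_keys", bytes.NewReader(body))
	if err != nil {
		return "", err
	}
	req.Header.Set("Content-Type", "application/json")
	req.Header.Set("kbn-xsrf", "true") // Kibana rejects mutating API calls without this header
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return "", fmt.Errorf("enrollment token creation failed: %s", resp.Status)
	}
	var out struct {
		Item struct {
			APIKey string `json:"api_key"`
		} `json:"item"`
	}
	if err := json.NewDecoder(resp.Body).Decode(&out); err != nil {
		return "", err
	}
	return out.Item.APIKey, nil
}

func main() {
	token, err := createEnrollmentToken("http://localhost:5601", "my-policy-id")
	fmt.Println(token, err)
}
```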
Some additional choices I made that may be of interest to reviewers: I am not reusing the default enrolment token but creating a new one. There is no particularly strong reason for that. It allows us to clean the tokens up if something goes wrong, and it also allows us to run without explicitly calling `/api/fleet/setup` (which creates the default tokens). In practice I ended up calling the setup API anyway, because we cannot rely on users configuring Kibana with pre-defined policies in `kibana.yml` (as all our recipes do). So if there are strong feelings against creating our own tokens, we can revisit this choice.

I hope I did not miss anything fundamental. I did manual tests with `7.14` (the first version of Fleet we support), `7.17` and a recent `8.x`.
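For reference, the setup call mentioned above could be sketched like this, assuming a plain `POST /api/fleet/setup` request (Kibana requires the `kbn-xsrf` header); `callFleetSetup` is a hypothetical name and auth/TLS wiring is again omitted:

```go
package main

import (
	"fmt"
	"net/http"
)

// callFleetSetup triggers Fleet setup (POST /api/fleet/setup), which
// initialises Fleet and creates the default policies and tokens.
// Auth and TLS setup are omitted for brevity.
func callFleetSetup(kbURL string) error {
	req, err := http.NewRequest(http.MethodPost, kbURL+"/api/fleet/setup", nil)
	if err != nil {
		return err
	}
	req.Header.Set("kbn-xsrf", "true")
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return fmt.Errorf("fleet setup failed: %s", resp.Status)
	}
	return nil
}

func main() {
	fmt.Println(callFleetSetup("http://localhost:5601"))
}
```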