sidecar: Add /api/v1/flush endpoint #7359

Open
wants to merge 4 commits into main

Conversation

Nashluffy

  • I added a CHANGELOG entry for this change.
  • Change is not relevant to the end user.

Changes

Adds a sidecar API with one endpoint, /api/v1/flush, which calls the TSDB snapshot endpoint on the Prometheus instance and then uploads all blocks from the snapshot that are not already present in object storage.

There are a few issues that explain the motivation:

Essentially, if this is the last time the sidecar will be running (i.e. the cluster is being deleted, a shard is being removed, etc.), then without some flushing mechanism you permanently lose up to two hours of data.
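For concreteness, here is a minimal sketch of the flow described under Changes, assuming Prometheus's admin snapshot API (POST /api/v1/admin/tsdb/snapshot, which requires --web.enable-admin-api) and the SetDirectoryToSync/Sync shipper methods this PR adds; the promURL/dataDir parameters and error handling are illustrative only, not the PR's actual code:

```go
package flushsketch

import (
	"context"
	"encoding/json"
	"fmt"
	"net/http"
	"path/filepath"
)

// syncer is the slice of the Thanos shipper this sketch relies on;
// SetDirectoryToSync is the method added by this PR.
type syncer interface {
	SetDirectoryToSync(dir string)
	Sync(ctx context.Context) (uploaded int, err error)
}

// flush asks Prometheus to snapshot its TSDB and then ships the snapshot
// directory, returning the number of blocks uploaded.
func flush(ctx context.Context, promURL, dataDir string, sh syncer) (int, error) {
	req, err := http.NewRequestWithContext(ctx, http.MethodPost, promURL+"/api/v1/admin/tsdb/snapshot", nil)
	if err != nil {
		return 0, err
	}
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return 0, fmt.Errorf("snapshot request: %w", err)
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return 0, fmt.Errorf("snapshot request: unexpected status %s", resp.Status)
	}

	// The snapshot API returns the snapshot directory name, e.g.
	// {"status":"success","data":{"name":"20240515T084400Z-..."}}.
	var out struct {
		Data struct {
			Name string `json:"name"`
		} `json:"data"`
	}
	if err := json.NewDecoder(resp.Body).Decode(&out); err != nil {
		return 0, fmt.Errorf("decode snapshot response: %w", err)
	}

	// Snapshots land under <data-dir>/snapshots/<name>; upload any blocks
	// from there that are not already present in object storage.
	sh.SetDirectoryToSync(filepath.Join(dataDir, "snapshots", out.Data.Name))
	return sh.Sync(ctx)
}
```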

Verification

Besides the unit tests, running Prometheus locally and calling the endpoint works as expected.

Signed-off-by: mluffman <nashluffman@gmail.com>
@Nashluffy Nashluffy marked this pull request as ready for review May 15, 2024 08:44
BlocksUploaded int `json:"blocksUploaded"`
}

func (s *SidecarAPI) flush(r *http.Request) (interface{}, []error, *api.ApiError, func()) {
Member

Doesn't /api/v1/snapshot just create hardlinks to existing blocks? https://github.com/prometheus/prometheus/blob/main/tsdb/block.go#L687-L696. You will have to somehow filter out the non-head block here. I'm not sure how to do that.

Alternatively, Prometheus could have a new parameter to take a snapshot of the head only.
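One hypothetical way to do the filtering mentioned above, purely as a sketch and not something this PR implements, would be to keep only the snapshot block directories whose ULIDs do not already exist in the live Prometheus data directory, i.e. the block(s) cut from the head during the snapshot call:

```go
package flushsketch

import (
	"os"
	"path/filepath"
)

// headOnlyBlocks returns snapshot block dirs that have no counterpart in the
// live data dir, dropping the hardlinked copies of already-persisted blocks.
func headOnlyBlocks(dataDir, snapshotDir string) ([]string, error) {
	existing := map[string]struct{}{}
	entries, err := os.ReadDir(dataDir)
	if err != nil {
		return nil, err
	}
	for _, e := range entries {
		if e.IsDir() {
			existing[e.Name()] = struct{}{}
		}
	}

	snapEntries, err := os.ReadDir(snapshotDir)
	if err != nil {
		return nil, err
	}
	var headBlocks []string
	for _, e := range snapEntries {
		if _, ok := existing[e.Name()]; e.IsDir() && !ok {
			headBlocks = append(headBlocks, filepath.Join(snapshotDir, e.Name()))
		}
	}
	return headBlocks, nil
}
```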

Author

shipper.Sync only uploads blocks that don't already exist in the bucket
https://github.com/thanos-io/thanos/blob/main/pkg/shipper/shipper.go#L284-L289
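For illustration only, that "skip blocks that already exist" behavior boils down to a check like the sketch below, written against the thanos-io/objstore Bucket interface; the real shipper also tracks what it has already uploaded in its thanos.shipper.json meta file:

```go
package flushsketch

import (
	"context"
	"path"

	"github.com/thanos-io/objstore"
)

// blocksToUpload filters out block ULIDs whose meta.json already exists in
// the bucket, so only missing blocks would get uploaded.
func blocksToUpload(ctx context.Context, bkt objstore.Bucket, blockIDs []string) ([]string, error) {
	var missing []string
	for _, id := range blockIDs {
		ok, err := bkt.Exists(ctx, path.Join(id, "meta.json"))
		if err != nil {
			return nil, err
		}
		if !ok {
			missing = append(missing, id)
		}
	}
	return missing, nil
}
```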

snapshotDir := s.dataDir + "/" + dir

s.shipper.SetDirectoryToSync(snapshotDir)
uploaded, err := s.shipper.Sync(r.Context())
Member

Prometheus might produce a block in the middle of the snapshot call. I think you will have to disable the other syncer before doing anything.
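If the worry is two shippers driving Sync at the same time, one hypothetical mitigation (a sketch only, not part of this PR) would be to serialize the periodic uploader and the flush handler on a shared lock; this does nothing about overlapping blocks:

```go
package flushsketch

import (
	"context"
	"sync"
)

// guardedSync serializes callers so only one of them drives the shipper at a
// time; the periodic uploader and the flush handler would both go through it.
type guardedSync struct {
	mu sync.Mutex
}

func (g *guardedSync) run(ctx context.Context, do func(context.Context) (int, error)) (int, error) {
	g.mu.Lock()
	defer g.mu.Unlock()
	return do(ctx)
}
```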

Author

Is the concern that it'd produce an overlapping block? Or that there's a potential race between two shippers calling Sync at the same time?

if err != nil {
return nil, nil, &api.ApiError{Typ: api.ErrorInternal, Err: fmt.Errorf("failed to upload head block: %w", err)}, func() {}
}
return &flushResponse{BlocksUploaded: uploaded}, nil, nil, func() {}
Member

Shouldn't we immediately shut down Prometheus after this is done? Otherwise, the same risk of overlapping blocks appears.

Author
@Nashluffy Nashluffy May 15, 2024

That's a good point - if the endpoint is only meant to be called once before shutting down, does it even need to be an endpoint? Should it just be the default behavior of the sidecar shutting down? That (I think) should eliminate the concern of both overlapping blocks and two shippers racing against one another.

That said, the endpoint does alleviate concerns around shutdown ordering: prometheus-operator could hit the endpoint and then just delete the StatefulSet.

Member
@GiedriusS GiedriusS May 20, 2024

I would love it if all of this were handled automatically by Sidecar when it is shutting down, but that would entail deeper integration between Sidecar and Prometheus. In Sidecar, we should only care about the head block and avoid overlaps. If only there were a way to tell Prometheus to trim data from the head block that has already been uploaded by Sidecar. :(

Maybe an alternative would be to shut down Prometheus and then read the HEAD block ourselves from Sidecar? We could produce a block from it, upload it, and then trim the head. What do you think about such an idea?

Author

I quite like that idea - I played around with it locally and it does work!
https://gist.github.com/Nashluffy/097e7df7d0a90b0cdefd2b87fb3129c8
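Roughly, the approach in the gist boils down to the sketch below: with Prometheus stopped, open its data directory read-only and persist the head as a regular block via FlushWAL, which a shipper could then upload like any other block. Note that the exact OpenDBReadOnly signature varies across Prometheus releases (newer ones take an extra sandbox-directory argument), so treat this as an outline rather than copy-paste code:

```go
package flushsketch

import (
	"os"

	"github.com/go-kit/log"
	"github.com/prometheus/prometheus/tsdb"
)

// flushHeadToBlock opens the (stopped) Prometheus data directory read-only
// and writes the WAL/head out as a normal block under dataDir.
func flushHeadToBlock(dataDir string) error {
	db, err := tsdb.OpenDBReadOnly(dataDir, log.NewLogfmtLogger(os.Stderr))
	if err != nil {
		return err
	}
	defer db.Close()

	// FlushWAL replays the WAL and persists the in-memory head as a block.
	return db.FlushWAL(dataDir)
}
```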

But there are a few issues with it, mostly that opening a ReadOnlyDB reads in the WAL before it can be persisted to disk, which, depending on the size, can be very time-consuming and memory-intensive. If we did want to take this approach, I think there'd have to be a finalizer on the Prometheus-owned StatefulSets, and a controller that would remove the finalizer. The controller would create a pod that mounts the volume, opens a ReadOnlyDB, and then persists to disk and uploads. But this is quite a lot of orchestration.

The more I think about this, the more I think persist-head-to-block needs to be an endpoint on prometheus/tsdb that we consume. There's quite an extensive thread about it here: prometheus-junkyard/tsdb#346

Member
@GiedriusS GiedriusS left a comment

If overlapping blocks are not a problem because vertical compaction is enabled, that should be clearly noted in the docs and logs.

Contributor
@MichaHoffmann

If we want a flush API, we should probably add it to the shipper so that Ruler and Receiver also get it, right?
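For illustration, a flush primitive at the shipper level would not need to be much more than the hypothetical shape below (this is not an existing shipper API); sidecar, ruler, and receiver could then each expose an endpoint on top of it:

```go
package flushsketch

import "context"

// Flusher is a hypothetical shipper-level primitive: force-upload whatever
// the component still holds locally and report how many blocks went out.
type Flusher interface {
	Flush(ctx context.Context) (uploaded int, err error)
}
```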
