Deduplication in thanos-compact should not be enabled by default because it is an experimental feature. The example configuration in kube-thanos enables offline deduplication in thanos-compact on Prometheus replicas but does not set the correct deduplication strategy. This leads to data loss as deduplication is irreversible.
The description of the deduplication.replica-label flag reads: "Label to treat as a replica indicator of blocks that can be deduplicated (repeated flag). This will merge multiple replica blocks into one. This process is irreversible. Experimental. When one or more labels are set, compactor will ignore the given labels so that vertical compaction can merge the blocks. Please note that by default this uses a NAIVE algorithm for merging which works well for deduplication of blocks with precisely the same samples like produced by Receiver replication. If you need a different deduplication algorithm (e.g one that works well with Prometheus replicas), please set it via --deduplication.func."
Since #164, offline deduplication in the compactor has been enabled by default on the label prometheus_replica, but the flag --deduplication.func=penalty is not set, so the naive algorithm is applied to Prometheus replica blocks.
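To illustrate the flag combination discussed here, the following is a minimal sketch of a thanos-compact container spec. Only the two deduplication flags and the prometheus_replica label come from this issue; the container name and the surrounding structure are assumptions for illustration and are not copied from the kube-thanos manifest.

```yaml
# Hypothetical excerpt of a thanos-compact container spec (illustration only).
containers:
- name: thanos-compact   # assumed container name
  args:
  - compact
  # Enables offline deduplication across blocks that differ only in this label.
  - --deduplication.replica-label=prometheus_replica
  # Selects the penalty algorithm; per the flag description, the default is a
  # naive merge that is only suited to blocks with precisely the same samples.
  - --deduplication.func=penalty
```

Leaving out the --deduplication.replica-label line altogether would correspond to the other option raised in this issue, namely not enabling offline deduplication by default.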
It might be better to deactivate offline deduplication by default because it is an experimental feature.
jonasmatthias changed the title from "thanos-compact deduplication on prometheus_replica requires deduplication.func=penalty" to "data loss: thanos-compact deduplication is experimental and should not be enabled by default" on Feb 13, 2023
jonasmatthias changed the title from "data loss: thanos-compact deduplication is experimental and should not be enabled by default" to "DATA LOSS: thanos-compact deduplication is experimental and should not be enabled by default" on Feb 13, 2023
The documentation explains
The description of the deduplication.replica-label flag in the code (quoted above) also clarifies that the default deduplication algorithm should NOT be used on HA Prometheus replicas.
I learned about this via
Referenced manifest: kube-thanos/examples/all/manifests/thanos-compact-statefulSet.yaml, lines 40 to 41 in 6533e7c