-
Notifications
You must be signed in to change notification settings - Fork 103
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Added Mimir Integration * updated meta-monitoring example * Update charts/k8s-monitoring/charts/feature-integrations/integrations/mimir-values.yaml Co-authored-by: Pete Wall <[email protected]> * Update charts/k8s-monitoring/charts/feature-integrations/integrations/mimir-values.yaml Co-authored-by: Pete Wall <[email protected]> * Updated docs * Doc improvements to link to other charts and reference resource requests and limits (#1131) * Doc improvements to link to other charts and reference resource requests and limits Signed-off-by: Pete Wall <[email protected]> * Fix yaml lint issues Signed-off-by: Pete Wall <[email protected]> --------- Signed-off-by: Pete Wall <[email protected]> * Update Update dependency "beyla" for Helm chart "feature-auto-instrumentation" to 1.6.3 (#1134) Signed-off-by: Pete Wall <[email protected]> Co-authored-by: petewall <[email protected]> * Update Update dependency "beyla" for Helm chart "k8s-monitoring-v1" to 1.6.3 (#1135) Signed-off-by: Pete Wall <[email protected]> Co-authored-by: petewall <[email protected]> * Add permissions for the internal directory Signed-off-by: Pete Wall <[email protected]> * Actually use certFile and keyFile settings (#1143) Signed-off-by: Pete Wall <[email protected]> * Bump versions to 1.6.21 and 2.0.4 Signed-off-by: Pete Wall <[email protected]> * Rebuilt --------- Signed-off-by: Pete Wall <[email protected]> Co-authored-by: Pete Wall <[email protected]> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: petewall <[email protected]>
- Loading branch information
1 parent
628ab11
commit 22a1e7d
Showing
24 changed files
with
4,089 additions
and
28 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
286 changes: 286 additions & 0 deletions
286
charts/k8s-monitoring/charts/feature-integrations/default-allow-lists/mimir.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,286 @@ | ||
--- | ||
# The set of metrics from Grafana Loki required for the Grafana Loki integration | ||
- cortex_alertmanager_alerts | ||
- cortex_alertmanager_alerts_invalid_total | ||
- cortex_alertmanager_alerts_received_total | ||
- cortex_alertmanager_dispatcher_aggregation_groups | ||
- cortex_alertmanager_notification_latency_seconds_bucket | ||
- cortex_alertmanager_notification_latency_seconds_count | ||
- cortex_alertmanager_notification_latency_seconds_sum | ||
- cortex_alertmanager_notifications_failed_total | ||
- cortex_alertmanager_notifications_total | ||
- cortex_alertmanager_partial_state_merges_failed_total | ||
- cortex_alertmanager_partial_state_merges_total | ||
- cortex_alertmanager_ring_check_errors_total | ||
- cortex_alertmanager_silences | ||
- cortex_alertmanager_state_fetch_replica_state_failed_total | ||
- cortex_alertmanager_state_fetch_replica_state_total | ||
- cortex_alertmanager_state_initial_sync_completed_total | ||
- cortex_alertmanager_state_initial_sync_duration_seconds_bucket | ||
- cortex_alertmanager_state_initial_sync_duration_seconds_count | ||
- cortex_alertmanager_state_initial_sync_duration_seconds_sum | ||
- cortex_alertmanager_state_persist_failed_total | ||
- cortex_alertmanager_state_persist_total | ||
- cortex_alertmanager_state_replication_failed_total | ||
- cortex_alertmanager_state_replication_total | ||
- cortex_alertmanager_sync_configs_failed_total | ||
- cortex_alertmanager_sync_configs_total | ||
- cortex_alertmanager_tenants_discovered | ||
- cortex_alertmanager_tenants_owned | ||
- cortex_blockbuilder_consume_cycle_duration_seconds | ||
- cortex_blockbuilder_consumer_lag_records | ||
- cortex_blockbuilder_tsdb_compact_and_upload_failed_total | ||
- cortex_bucket_blocks_count | ||
- cortex_bucket_index_estimated_compaction_jobs | ||
- cortex_bucket_index_estimated_compaction_jobs_errors_total | ||
- cortex_bucket_index_last_successful_update_timestamp_seconds | ||
- cortex_bucket_store_block_drop_failures_total | ||
- cortex_bucket_store_block_drops_total | ||
- cortex_bucket_store_block_load_failures_total | ||
- cortex_bucket_store_block_loads_total | ||
- cortex_bucket_store_blocks_loaded | ||
- cortex_bucket_store_indexheader_lazy_load_duration_seconds_bucket | ||
- cortex_bucket_store_indexheader_lazy_load_duration_seconds_count | ||
- cortex_bucket_store_indexheader_lazy_load_duration_seconds_sum | ||
- cortex_bucket_store_indexheader_lazy_load_total | ||
- cortex_bucket_store_indexheader_lazy_unload_total | ||
- cortex_bucket_store_series_batch_preloading_load_duration_seconds_sum | ||
- cortex_bucket_store_series_batch_preloading_wait_duration_seconds_sum | ||
- cortex_bucket_store_series_blocks_queried_sum | ||
- cortex_bucket_store_series_data_size_fetched_bytes_sum | ||
- cortex_bucket_store_series_data_size_touched_bytes_sum | ||
- cortex_bucket_store_series_hash_cache_hits_total | ||
- cortex_bucket_store_series_hash_cache_requests_total | ||
- cortex_bucket_store_series_request_stage_duration_seconds_bucket | ||
- cortex_bucket_store_series_request_stage_duration_seconds_count | ||
- cortex_bucket_store_series_request_stage_duration_seconds_sum | ||
- cortex_bucket_stores_blocks_last_successful_sync_timestamp_seconds | ||
- cortex_bucket_stores_gate_duration_seconds_bucket | ||
- cortex_bucket_stores_gate_duration_seconds_count | ||
- cortex_bucket_stores_gate_duration_seconds_sum | ||
- cortex_bucket_stores_tenants_synced | ||
- cortex_build_info | ||
- cortex_cache_memory_hits_total | ||
- cortex_cache_memory_requests_total | ||
- cortex_compactor_block_cleanup_failures_total | ||
- cortex_compactor_block_cleanup_last_successful_run_timestamp_seconds | ||
- cortex_compactor_block_max_time_delta_seconds_bucket | ||
- cortex_compactor_block_max_time_delta_seconds_count | ||
- cortex_compactor_block_max_time_delta_seconds_sum | ||
- cortex_compactor_blocks_cleaned_total | ||
- cortex_compactor_blocks_marked_for_deletion_total | ||
- cortex_compactor_blocks_marked_for_no_compaction_total | ||
- cortex_compactor_disk_out_of_space_errors_total | ||
- cortex_compactor_group_compaction_runs_started_total | ||
- cortex_compactor_last_successful_run_timestamp_seconds | ||
- cortex_compactor_meta_sync_duration_seconds_bucket | ||
- cortex_compactor_meta_sync_duration_seconds_count | ||
- cortex_compactor_meta_sync_duration_seconds_sum | ||
- cortex_compactor_meta_sync_failures_total | ||
- cortex_compactor_meta_syncs_total | ||
- cortex_compactor_runs_completed_total | ||
- cortex_compactor_runs_failed_total | ||
- cortex_compactor_runs_started_total | ||
- cortex_compactor_tenants_discovered | ||
- cortex_compactor_tenants_processing_failed | ||
- cortex_compactor_tenants_processing_succeeded | ||
- cortex_compactor_tenants_skipped | ||
- cortex_config_hash | ||
- cortex_discarded_exemplars_total | ||
- cortex_discarded_requests_total | ||
- cortex_discarded_samples_total | ||
- cortex_distributor_deduped_samples_total | ||
- cortex_distributor_exemplars_in_total | ||
- cortex_distributor_inflight_push_requests | ||
- cortex_distributor_instance_limits | ||
- cortex_distributor_instance_rejected_requests_total | ||
- cortex_distributor_latest_seen_sample_timestamp_seconds | ||
- cortex_distributor_non_ha_samples_received_total | ||
- cortex_distributor_received_exemplars_total | ||
- cortex_distributor_received_requests_total | ||
- cortex_distributor_received_samples_total | ||
- cortex_distributor_replication_factor | ||
- cortex_distributor_requests_in_total | ||
- cortex_distributor_samples_in_total | ||
- cortex_inflight_requests | ||
- cortex_ingest_storage_reader_buffered_fetched_records | ||
- cortex_ingest_storage_reader_fetch_errors_total | ||
- cortex_ingest_storage_reader_fetches_total | ||
- cortex_ingest_storage_reader_missed_records_total | ||
- cortex_ingest_storage_reader_offset_commit_failures_total | ||
- cortex_ingest_storage_reader_offset_commit_requests_total | ||
- cortex_ingest_storage_reader_read_errors_total | ||
- cortex_ingest_storage_reader_receive_delay_seconds_count | ||
- cortex_ingest_storage_reader_receive_delay_seconds_sum | ||
- cortex_ingest_storage_reader_records_failed_total | ||
- cortex_ingest_storage_reader_records_total | ||
- cortex_ingest_storage_reader_requests_failed_total | ||
- cortex_ingest_storage_reader_requests_total | ||
- cortex_ingest_storage_strong_consistency_failures_total | ||
- cortex_ingest_storage_strong_consistency_requests_total | ||
- cortex_ingest_storage_writer_buffered_produce_bytes | ||
- cortex_ingest_storage_writer_buffered_produce_bytes_limit | ||
- cortex_ingester_active_native_histogram_buckets | ||
- cortex_ingester_active_native_histogram_buckets_custom_tracker | ||
- cortex_ingester_active_native_histogram_series | ||
- cortex_ingester_active_native_histogram_series_custom_tracker | ||
- cortex_ingester_active_series | ||
- cortex_ingester_active_series_custom_tracker | ||
- cortex_ingester_client_request_duration_seconds_bucket | ||
- cortex_ingester_client_request_duration_seconds_count | ||
- cortex_ingester_client_request_duration_seconds_sum | ||
- cortex_ingester_ingested_exemplars_total | ||
- cortex_ingester_ingested_samples_total | ||
- cortex_ingester_instance_limits | ||
- cortex_ingester_instance_rejected_requests_total | ||
- cortex_ingester_local_limits | ||
- cortex_ingester_memory_series | ||
- cortex_ingester_memory_series_created_total | ||
- cortex_ingester_memory_series_removed_total | ||
- cortex_ingester_memory_users | ||
- cortex_ingester_oldest_unshipped_block_timestamp_seconds | ||
- cortex_ingester_owned_series | ||
- cortex_ingester_queried_exemplars_bucket | ||
- cortex_ingester_queried_exemplars_count | ||
- cortex_ingester_queried_exemplars_sum | ||
- cortex_ingester_queried_samples_bucket | ||
- cortex_ingester_queried_samples_count | ||
- cortex_ingester_queried_samples_sum | ||
- cortex_ingester_queried_series_bucket | ||
- cortex_ingester_queried_series_count | ||
- cortex_ingester_queried_series_sum | ||
- cortex_ingester_shipper_last_successful_upload_timestamp_seconds | ||
- cortex_ingester_shipper_upload_failures_total | ||
- cortex_ingester_shipper_uploads_total | ||
- cortex_ingester_tsdb_checkpoint_creations_failed_total | ||
- cortex_ingester_tsdb_checkpoint_creations_total | ||
- cortex_ingester_tsdb_checkpoint_deletions_failed_total | ||
- cortex_ingester_tsdb_compaction_duration_seconds_bucket | ||
- cortex_ingester_tsdb_compaction_duration_seconds_count | ||
- cortex_ingester_tsdb_compaction_duration_seconds_sum | ||
- cortex_ingester_tsdb_compactions_failed_total | ||
- cortex_ingester_tsdb_compactions_total | ||
- cortex_ingester_tsdb_exemplar_exemplars_appended_total | ||
- cortex_ingester_tsdb_exemplar_exemplars_in_storage | ||
- cortex_ingester_tsdb_exemplar_last_exemplars_timestamp_seconds | ||
- cortex_ingester_tsdb_exemplar_series_with_exemplars_in_storage | ||
- cortex_ingester_tsdb_head_max_timestamp_seconds | ||
- cortex_ingester_tsdb_head_truncations_failed_total | ||
- cortex_ingester_tsdb_mmap_chunk_corruptions_total | ||
- cortex_ingester_tsdb_out_of_order_samples_appended_total | ||
- cortex_ingester_tsdb_storage_blocks_bytes | ||
- cortex_ingester_tsdb_symbol_table_size_bytes | ||
- cortex_ingester_tsdb_wal_corruptions_total | ||
- cortex_ingester_tsdb_wal_truncate_duration_seconds_count | ||
- cortex_ingester_tsdb_wal_truncate_duration_seconds_sum | ||
- cortex_ingester_tsdb_wal_truncations_failed_total | ||
- cortex_ingester_tsdb_wal_truncations_total | ||
- cortex_ingester_tsdb_wal_writes_failed_total | ||
- cortex_kv_request_duration_seconds_bucket | ||
- cortex_kv_request_duration_seconds_count | ||
- cortex_kv_request_duration_seconds_sum | ||
- cortex_lifecycler_read_only | ||
- cortex_limits_defaults | ||
- cortex_limits_overrides | ||
- cortex_partition_ring_partitions | ||
- cortex_prometheus_notifications_dropped_total | ||
- cortex_prometheus_notifications_errors_total | ||
- cortex_prometheus_notifications_queue_capacity | ||
- cortex_prometheus_notifications_queue_length | ||
- cortex_prometheus_notifications_sent_total | ||
- cortex_prometheus_rule_evaluation_duration_seconds_count | ||
- cortex_prometheus_rule_evaluation_duration_seconds_sum | ||
- cortex_prometheus_rule_evaluation_failures_total | ||
- cortex_prometheus_rule_evaluations_total | ||
- cortex_prometheus_rule_group_duration_seconds_count | ||
- cortex_prometheus_rule_group_duration_seconds_sum | ||
- cortex_prometheus_rule_group_iterations_missed_total | ||
- cortex_prometheus_rule_group_iterations_total | ||
- cortex_prometheus_rule_group_rules | ||
- cortex_querier_blocks_consistency_checks_failed_total | ||
- cortex_querier_blocks_consistency_checks_total | ||
- cortex_querier_request_duration_seconds_bucket | ||
- cortex_querier_request_duration_seconds_count | ||
- cortex_querier_request_duration_seconds_sum | ||
- cortex_querier_storegateway_instances_hit_per_query_bucket | ||
- cortex_querier_storegateway_instances_hit_per_query_count | ||
- cortex_querier_storegateway_instances_hit_per_query_sum | ||
- cortex_querier_storegateway_refetches_per_query_bucket | ||
- cortex_querier_storegateway_refetches_per_query_count | ||
- cortex_querier_storegateway_refetches_per_query_sum | ||
- cortex_query_frontend_queries_total | ||
- cortex_query_frontend_queue_duration_seconds_bucket | ||
- cortex_query_frontend_queue_duration_seconds_count | ||
- cortex_query_frontend_queue_duration_seconds_sum | ||
- cortex_query_frontend_queue_length | ||
- cortex_query_frontend_retries_bucket | ||
- cortex_query_frontend_retries_count | ||
- cortex_query_frontend_retries_sum | ||
- cortex_query_scheduler_connected_querier_clients | ||
- cortex_query_scheduler_querier_inflight_requests | ||
- cortex_query_scheduler_queue_duration_seconds_bucket | ||
- cortex_query_scheduler_queue_duration_seconds_count | ||
- cortex_query_scheduler_queue_duration_seconds_sum | ||
- cortex_query_scheduler_queue_length | ||
- cortex_request_duration_seconds | ||
- cortex_request_duration_seconds_bucket | ||
- cortex_request_duration_seconds_count | ||
- cortex_request_duration_seconds_sum | ||
- cortex_ring_members | ||
- cortex_ruler_managers_total | ||
- cortex_ruler_queries_failed_total | ||
- cortex_ruler_queries_total | ||
- cortex_ruler_ring_check_errors_total | ||
- cortex_ruler_write_requests_failed_total | ||
- cortex_ruler_write_requests_total | ||
- cortex_runtime_config_hash | ||
- cortex_runtime_config_last_reload_successful | ||
- cortex_tcp_connections | ||
- cortex_tcp_connections_limit | ||
- go_memstats_heap_inuse_bytes | ||
- keda_scaler_errors | ||
- keda_scaler_metrics_value | ||
- kube_deployment_spec_replicas | ||
- kube_deployment_status_replicas_unavailable | ||
- kube_deployment_status_replicas_updated | ||
- kube_endpoint_address | ||
- kube_horizontalpodautoscaler_spec_target_metric | ||
- kube_horizontalpodautoscaler_status_condition | ||
- kube_pod_info | ||
- kube_statefulset_replicas | ||
- kube_statefulset_status_current_revision | ||
- kube_statefulset_status_replicas_current | ||
- kube_statefulset_status_replicas_ready | ||
- kube_statefulset_status_replicas_updated | ||
- kube_statefulset_status_update_revision | ||
- kubelet_volume_stats_capacity_bytes | ||
- kubelet_volume_stats_used_bytes | ||
- memberlist_client_cluster_members_count | ||
- memcached_limit_bytes | ||
- mimir_continuous_test_queries_failed_total | ||
- mimir_continuous_test_query_result_checks_failed_total | ||
- mimir_continuous_test_writes_failed_total | ||
- node_disk_read_bytes_total | ||
- node_disk_written_bytes_total | ||
- process_memory_map_areas | ||
- process_memory_map_areas_limit | ||
- prometheus_tsdb_compaction_duration_seconds_bucket | ||
- prometheus_tsdb_compaction_duration_seconds_count | ||
- prometheus_tsdb_compaction_duration_seconds_sum | ||
- prometheus_tsdb_compactions_total | ||
- rollout_operator_last_successful_group_reconcile_timestamp_seconds | ||
- thanos_cache_hits_total | ||
- thanos_cache_operation_duration_seconds_bucket | ||
- thanos_cache_operation_duration_seconds_count | ||
- thanos_cache_operation_duration_seconds_sum | ||
- thanos_cache_operation_failures_total | ||
- thanos_cache_operations_total | ||
- thanos_cache_requests_total | ||
- thanos_objstore_bucket_last_successful_upload_time | ||
- thanos_objstore_bucket_operation_duration_seconds_bucket | ||
- thanos_objstore_bucket_operation_duration_seconds_count | ||
- thanos_objstore_bucket_operation_duration_seconds_sum | ||
- thanos_objstore_bucket_operation_failures_total | ||
- thanos_objstore_bucket_operations_total | ||
- thanos_store_index_cache_hits_total | ||
- thanos_store_index_cache_requests_total |
45 changes: 45 additions & 0 deletions
45
charts/k8s-monitoring/charts/feature-integrations/docs/integrations/mimir.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,45 @@ | ||
# mimir | ||
|
||
## Values | ||
|
||
### Discovery Settings | ||
|
||
| Key | Type | Default | Description | | ||
|-----|------|---------|-------------| | ||
| fieldSelectors | list | `[]` | Discover Mimir instances based on field selectors. | | ||
| labelSelectors | object | `{"app.kubernetes.io/name":"mimir"}` | Discover Mimir instances based on label selectors. | | ||
| namespaces | list | `[]` | Namespaces to look for Mimir instances in. Will automatically look for Mimir instances in all namespaces unless specified here | | ||
|
||
### Logs Settings | ||
|
||
| Key | Type | Default | Description | | ||
|-----|------|---------|-------------| | ||
| logs.enabled | bool | `true` | Whether to enable special processing of Mimir pod logs. | | ||
| logs.tuning.dropLogLevels | list | `[]` | The log levels to drop. Will automatically keep all log levels unless specified here. | | ||
| logs.tuning.excludeLines | list | `[]` | Line patterns (valid RE2 regular expression)to exclude from the logs. | | ||
| logs.tuning.scrubTimestamp | bool | `true` | Whether the timestamp should be scrubbed from the log line | | ||
| logs.tuning.structuredMetadata | object | `{}` | The structured metadata mappings to set. To not set any structured metadata, set this to an empty object (e.g. `{}`) | | ||
| logs.tuning.timestampFormat | string | `"RFC3339Nano"` | The timestamp format to use for the log line, if not set the default timestamp which is the collection will be used for the log line | | ||
|
||
### Metrics Settings | ||
|
||
| Key | Type | Default | Description | | ||
|-----|------|---------|-------------| | ||
| metrics.enabled | bool | `true` | Whether to enable metrics collection from Mimir. | | ||
| metrics.portName | string | `"http-metrics"` | Name of the port to scrape metrics from. | | ||
| metrics.scrapeInterval | string | `60s` | How frequently to scrape metrics from Mimir. | | ||
|
||
### Metric Processing Settings | ||
|
||
| Key | Type | Default | Description | | ||
|-----|------|---------|-------------| | ||
| metrics.maxCacheSize | string | `100000` | Sets the max_cache_size for prometheus.relabel component. This should be at least 2x-5x your largest scrape target or samples appended rate. ([docs](https://grafana.com/docs/alloy/latest/reference/components/prometheus.relabel/#arguments)) Overrides global.maxCacheSize | | ||
| metrics.tuning.excludeMetrics | list | `[]` | Metrics to drop. Can use regular expressions. | | ||
| metrics.tuning.includeMetrics | list | `[]` | Metrics to keep. Can use regular expressions. | | ||
| metrics.tuning.useDefaultAllowList | bool | `true` | Filter the list of metrics from Grafana Mimir to the minimal set required for the Grafana Mimir integration. | | ||
|
||
### General Settings | ||
|
||
| Key | Type | Default | Description | | ||
|-----|------|---------|-------------| | ||
| name | string | `""` | Name for this Mimir instance. | |
Oops, something went wrong.