Agent
Prelude
Release on: 2024-05-29
- Please refer to the 7.54.0 tag on integrations-core for the list of changes on the Core Checks
Upgrade Notes
- Upgraded JMXFetch to 0.49.1 which adds support for ZGC Major and Minor Cycles and ZGC Major and Minor Pauses beans.
- Add a configuration option
admission_controller.inject_config.inject_container_name
to retrieve the container tags for dogstatsd metrics and apm traces in setups where origin detection is not supported (windows, nested virtualization with cgroupv2...). This option will inject the container name along with the pod uid inDD_ENTITY_ID
if it was not set manually. Note that this option is disabled by default and is incompatible with prior versions of the agent . Thus, we recommend enabling it only if you are using the latest version of the agent. In case a rollback to an incompatible version is needed, it will be necessary to delete the pods in whichDD_ENTITY_ID
was injected by the latest agent to preserve pod tags in dogstatsd metrics and traces.
New Features
- APM: Add a new Probabilistic Sampler sampling mechanism, which is compatible with OpenTelemetry's Probabilistic Sampling Processor.
- CWS: Add support for
write
events on Windows. - CWS: Add support for
delete
event on Windows. - CWS: Add
chdir
,mount
andumount
inptracer
mode. - CWS: Add APM spans in
ptracer
mode. - CWS: Add support for file hashing in
ptracer
mode. - CWS: Allow writing rules for rename events on Windows.
- Set kubelet core check to be enabled by default.
- Add
dogstatsd_pipeline_autoadjust_strategy
setting withmax_throughput
andper_origin
strategies.max_throughput
is the already existing strategy (whendogstatsd_pipeline_autoadjust
istrue
).per_origin
will let you create an arbitrary number of pipelines (based ondogstatsd_pipeline_count
), and will try to isolate containers based on their advertised origin. This will improve compression. - [NDM] Add Cisco SD-WAN integration.
- In the Logs Agent, add support for the Sensitive Data Scanner library to process logs.
- USM now captures TLS traffic from NodeJS applications.
Enhancement Notes
- This change allows DBM Autodiscovery users to optionally set the region for where their aurora clusters are running. This can be used in lieu of relying on IMDS to discover the region through instance metadata. This is a nicer experience for users running in Docker, who would be required to complete extra steps in their instance metadata configuration to allow the Docker container access to the instance metadata.
- Activate the autodiscovery automatically if at least one configuration is given in network_devices.autodiscovery. Remove the listeners: -snmp requirement.
- Updates the Agent status output to show if a check is in a cancelling state. Once canceled, the check is removed from the status output.
- Report EKS Fargate configuration to the Agent metadata payload.
- Adding
dirname
tag for full filepath log configurations. This tag was only added previously if using a wildcard filepath log configuration. - Agents are now built with Go
1.21.9
. - Collect clusters' extended resources capacity and allocatable.
- Collect all node resource capacities and allocatable metrics for cluster management.
- Collect pod limits and requests.
- CWS: Always snapshot memory mapped files in activity dumps. This allows easier mapping of workload to loaded shared libraries.
- CWS: Normalize Windows filenames before going through glob matching.
- CWS: Support non seccomp mode in
ptracer
mode. - Add
agent status [name]
subcommand to fetch only a given section status (for example,datadog-agent status forwarder
` to get forwarder status). - dogstatsd: Implement new config option origin_detection_unified. This new option will allow users to configure the origin detection behavior for DogStatsD. When enabled, the DogStatsD server will use the default Origin Detection logic.
- snmp_listener now lives under network_devices with the name network_devices.autodiscovery. While the old configuration will continue to work, if both configurations are present, the new one will take priority.
- [oracle] Add
oracle.user_sessions
metric. - The OTLP ingestion endpoint now supports the same settings and protocol as the OpenTelemetry Collector OTLP receiver v0.97.0.
- Supports Podman newer versions (4.8+) using SQLite instead of BoltDB for the containers database backend. Setting
podman_db_path
to the path with thedb.sql
file (e.g./var/lib/containers/storage/db.sql
) will make the Datadog Agent use the SQLite format. Note: Ifpodman_db_path
is not set (default), the Datadog Agent attempts to use the default filelibpod/bolt_state.db
anddb.sql
from/var/lib/containers/storage
. - Allow certain Process Agent checks to be run from the core agent using the processchecks subcommand.
- check output from the Process Agent component are added to the flare when used in the core agent.
- expvars from the Process Agent component are added to the flare when used in the core Agent.
- Status of the Process Agent component will be shown when used in the core Agent.
- NDM: SNMP devices are now tagged by device_ip and device_id.
Deprecation Notes
- The
datadog-agent status component [name]
syntax will be replaced bydatadog-agent status [name]
in Datadog Agent 7.55 - Removed log that was noisy when Process Agent was running on ECS Fargate.
- The flare_stripped_keys configuration is now deprecated. Use scrubber.additional_keys instead.
Bug Fixes
- APM: Fix potential connection issues by ensuring connection semaphore release during errors.
- APM: Removed unsupported configuration parameter
apm_config.log_throttling
from code and documentation. - Disables the creation of build-id files in RPM packages. These are provided for debugging tools, but can lead to conflicts when multiple packages have the same build-id files. This conflict prevents the second package from being installed.
- Fix a race condition that could prevent JMX checks from running.
- OTLP ingest for traces now supports stable (v1.23.0+) semantic conventions for HTTP Spans. Old (v1.20.0 and older) semantic conventions are still supported. When both are reported, the new semantic conventions take precedence and old semantic conventions are ignored.
- Don't log when failing to fetch config of other Agents
- The User-Agent header is now set to datadog-agent/<version> for logs forwarding requests to /api/v2/logs. Previously it was set to Go-http-client/1.1.
- APM: Fixes issue where match-all replace tags rules could inadvertently affect required datadog tags. It is still possible to redact specific Datadog tags by targeting them explicitly.
- Fixes containers file-based log collection when using the
k8s-file
podman log driver and thelogs_config.use_podman_logs
parameter. - The Process Agent no longer crashes when pidMode is misconfigured on ECS Fargate. A warning is logged instead.
- Fix panic when running process checks in the core Agent with telemetry enabled.
- The Agent health check will now continue running even if the API key validation endpoint returns a 403 response code. Because this can occur due to transient issues, retrying will allow the Agent health indicator to recover in some cases.
- Fix type conversion error while generating the trace-agent status.
- APM: fix a bug where the Trace Agent would ignore proxy.no_proxy configuration values
- Create missing default configuration files during the upgrade, change, and repair actions of the Windows Installer.
Other Notes
- APM: While adding another sampler, we have modified the Rare sampler so that it no longer has a separate TTL for priority traces, meaning it no longer distinguishes between traces with priority > 0 and priority <= 0. It is necessary to detangle the various samplers so they can work independently from each other, in this case, so the Rare sampler can work with the Probabilistic Sampler. This should not have a noticeable impact on users.
- CWS: Allow
fim_enabled
to explicitly be set to false on Windows to enable process monitoring only.
Datadog Cluster Agent
Prelude
Released on: 2024-05-29 Pinned to datadog-agent v7.54.0: CHANGELOG.
New Features
- Add LimitRange and StorageClass collection in the orchestrator check.
Enhancement Notes
- Added retry mechanism to language detection patcher in order to retry failed patching operations.
Bug Fixes
- Fix collection of numeric rolling update options in Kubernetes deployments and daemonsets.
- Fixed initialization of language expiration time for detected languages.