Releases: DataDog/datadog-agent
7.77.1
Agent
Prelude
Released on: 2026-03-24
- Please refer to the 7.77.1 tag on integrations-core for the list of changes on the Core Checks
Enhancement Notes
- Agents are now built with Go
1.25.8.
Bug Fixes
- Fixed a bug introduced in 7.77.0 that prevents system-probe from starting on Fargate environments when Workload Protection is enabled
- Fixed a command injection vulnerability in the Private Action Runner's inline PowerShell script execution. Parameter values are now assigned as PowerShell single-quoted string literals in a preamble instead of being substituted directly into the script body, preventing arbitrary code execution via crafted parameter inputs.
Datadog Cluster Agent
Prelude
Released on: 2026-03-24 Pinned to datadog-agent v7.77.1: CHANGELOG.
7.77.0
Agent
Known Issues
- A bug introduced in this release prevents system-probe from starting on Fargate environments when Workload Protection is enabled. There is currently no workaround and the recommendation at this time is to downgrade to Agent v7.76.3 or upgrade to v7.77.1 when it becomes available.
Prelude
Released on: 2026-03-18
- Please refer to the 7.77.0 tag on integrations-core for the list of changes on the Core Checks
Upgrade Notes
-
APM OTLP: The
datadog.*namespaced span attributes are no longer used to construct Datadog span fields. Previously, attributes likedatadog.service,datadog.env, anddatadog.container_idwere used to directly set corresponding Datadog span fields. This functionality has been removed and the Agent now relies solely on standard OpenTelemetry semantic conventions.Exceptions:
- The
datadog.host.nameattribute continues to be respected for hostname resolution as documented at https://docs.datadoghq.com/opentelemetry/mapping/hostname/. - The
datadog.container.tag.*attributes continue to be supported for custom container tags.
The configuration option
otlp_config.traces.ignore_missing_datadog_fields(and corresponding environment variableDD_OTLP_CONFIG_IGNORE_MISSING_DATADOG_FIELDS) is deprecated and no longer has any effect. The Agent now always uses standard OTel semantic conventions.Migration: If you were using
datadog.*attributes, switch to the standard OpenTelemetry semantic conventions:datadog.service→service.namedatadog.env→deployment.environment.name(OTel 1.27+) ordeployment.environmentdatadog.version→service.versiondatadog.container_id→container.id
Who is affected: Users who explicitly set
datadog.*attributes (other thandatadog.host.nameanddatadog.container.tag.*) in their OpenTelemetry instrumentation to override default field mappings. Users relying solely on standard OpenTelemetry semantic conventions are not affected. - The
New Features
- Add
dd-procmgrd, a minimal Rust daemon for the Datadog process manager. The daemon starts, logs, and waits for a shutdown signal. It does not provide user-facing functionality. - Add a new listener based on all Custom Resource Definitions (CRDs) found on the cluster.
- Logs pipeline failover: Added automatic failover capability to prevent log loss when compression blocks pipelines. When a pipeline becomes blocked during compression, log messages are automatically routed to healthy pipelines. N router channels (one per pipeline) distribute tailers via round-robin, each with its own forwarder goroutine that handles failover independently across all pipelines. Enable with
logs_config.pipeline_failover.enabled: true(default: false). When all pipelines are blocked, backpressure is applied to prevent data loss. - The system memory check on Linux can now collect memory pressure metrics from /proc/vmstat to help detect memory pressure before OOM events occur. To enable, set
collect_memory_pressure: truein the memory check configuration. New metrics:system.mem.allocstall(withzonetag),system.mem.pgscan_direct,system.mem.pgsteal_direct,system.mem.pgscan_kswapd,system.mem.pgsteal_kswapd. - APM: Add support for span-derived primary tags in APM stats aggregation. This allows configuring tag keys via
apm_config.span_derived_primary_tagsthat will be extracted from span tags and used as additional aggregation dimensions for APM statistics. - APM: Add initial support for converting trace payload formats to the new "v1.0" format. This feature is disabled by default but can be enabled by adding the feature flag "convert-traces" to apm_config.features. It is not recommended to use this flag without direction from Datadog Support.
- Integrate the Private Action Runner into the Datadog Cluster Agent.
- The Private Action Runner (PAR) now runs in the Datadog Cluster Agent with improved identity management for Kubernetes environments. PAR identity (URN and private key) is now stored in a Kubernetes secret and shared across all DCA replicas using leader election. The leader replica handles enrollment and secret creation, while follower replicas wait for and read the shared identity. This enables multiple DCA replicas to execute PAR tasks using a single cluster identity, eliminating the need for per-replica enrollment.
- Add a Windows PowerShell example config for private action runner scripts.
- APM: Add image_volume-based library injection as an alternative to init containers and csi driver (experimental). Available only for Kubernetes 1.33+. This provides faster pod startup.
- Autodiscovery template variables are now supported in
ad.datadoghq.com/tagsandad.datadoghq.com/<container>.tagsKubernetes pod annotations. Template variables are resolved at runtime, enabling dynamic tagging based on pod and container metadata. This allows centralized tag configuration that applies to all checks, logs, and traces without hardcoding pod-specific values. - Start the Windows Private Action Runner service alongside the Agent when
private_action_runner.enabledis set indatadog.yaml. - On Windows, the private action runner binary is now included in the MSI installer and registered as the
datadog-agent-actionWindows service. The service is installed as demand-start with a dependency on the main Agent service, and its credentials and ACLs are managed alongside the other Agent services during install, upgrade, and repair. - Add
runPredefinedPowershellScriptaction to the Private Action Runner on Windows. This action allows running predefined PowerShell scripts (inline or file-based) with optional parameter templating, JSON schema parameter validation, environment variable allowlisting, configurable timeouts, and a 10 MB output limit. - On Windows, the Agent stops the private action runner service during MSI upgrades and fleet-driven stop-all operations so it is shut down alongside the Agent.
Enhancement Notes
-
The Agent's embedded Python has been upgraded from 3.13.11 to 3.13.12.
-
Add
ntp.offsetmetric withsource:intaketag to monitor clock drift using Datadog intake server timestamps. Originalntp.offsetmetric calculated from an NTP server is now taggedsource:ntp. -
As of Kubernetes version 1.33, the
EndpointAPI object has been deprecated in favor ofEndpointSlice. Autodiscovery now supports the use of anEndpointSlicelistener and provider to collect endpoint checks. To enable this feature, setkubernetes_use_endpoint_slicesto true in your Datadog Agent configuration. -
Add
bucketlabel toimage_resolution_attemptstelemetry to track gradual rollout progress. -
Added a private action runner bundle that exposes the Network Path traceroute functionality through the
getNetworkPathaction. -
Sends telemetry for synthetics tests run on the agent, including checks received, checks processed, and error counts for test configuration, traceroute, and event platform result submission.
-
Added support for two new configurations for tag-based gradual rollout in Kubernetes SSI deployments. The gradual rollout can be configured using the following parameters:
-
DD_ADMISSION_CONTROLLER_AUTO_INSTRUMENTATION_GRADUAL_ROLLOUT_ENABLED: Whether to enable gradual rollout (default: true) -
DD_ADMISSION_CONTROLLER_AUTO_INSTRUMENTATION_GRADUAL_ROLLOUT_CACHE_TTL: The cache TTL duration for the gradual rollout image cache (default: 1h)- This cache is used to store the mapping of mutable tags to image digest for the gradual rollout, and setting this TTL helps prevent the image resolution from becoming stale.
-
-
Agent metrics now include a
connection_typetag with a value oftcp,uds, orpipefor lib-to-agent communications. -
Automatically collect the team tag when a Kubernetes resource has a
teamlabel or annotation and explicit team tag extraction is not configured. -
Enables the agent to support built-in credentials like IRSA for AWS cloud environments.
-
Bump
go-sqllexerto v0.1.13, improving SQL obfuscation performance and fixing incorrect tokenization of multi-byte UTF-8 characters (e.g., CJK characters, full-width punctuation). -
Agents are now built with Go
1.25.7. -
NDM: Cisco SD-WAN interface metadata now includes the
is_physicalfield to distinguish physical from virtual interfaces (loopback, tunnel). cEdge interfaces also include thetypefield with the IANA interface type number. -
In the Cluster Autoscaling controller, use Kubernetes client
updateinstead ofpatch. -
On ECS Managed Instances, detect hostname from IMDS when the agent runs in daemon mode.
-
On ECS Managed Instances with daemon scheduling, the agent uses
ECS_CONTAINER_METADATA_URI_V4environment variable as a fallback signal for v4 availability. -
Expose a new metric
kube_apiserver.api_resourcethat holds thename,kind,group, andversionof all known cluster-wide (non namespaced) resources on the cluster. -
Add new DDOT feature gate 'exporter.datadogexporter.DisableAllMetricRemapping' to disable all client-side metric remapping.
-
Increases the reliability of
namespaceLabelsAsTagsandnamespaceAnnotationsAsTagsfor new pods by caching the last seen namespace metadata. -
Added a new, optional configuration setting for journald logs:
default_application_name. If set to a non-empty string, the value will replace "docker" as the default application name for contained based journald logs. If set to an empty string, the application name will be determined by the systemd journal fields, like all non-container based journ...
7.76.3
Agent
Prelude
Released on: 2026-03-09
- Please refer to the 7.76.3 tag on integrations-core for the list of changes on the Core Checks
Security Notes
- Bump github.com/cloudflare/circl to fix v1.6.3 to fix CVE-2026-1229.
- Fixed a limited out-of-bounds memory read and DoS vulnerability in Windows kernel driver while handling TLS traffic. The host must have the ddnpm kernel driver service running, by having system_probe_config and network_config enabled, to be affected. This configuration is not enabled by default. Query with PowerShell: Get-Service ddnpm Query with command prompt: sc query ddnpm
Bug Fixes
- Fixed IPv6 address matching logic that caused network traffic to be tracked incorrectly. Fixed failed classification of HTTP DELETE requests. Added additional memory handling and overflow safety checks.
Datadog Cluster Agent
Prelude
Released on: 2026-03-09 Pinned to datadog-agent v7.76.3: CHANGELOG.
7.76.2
Agent
Prelude
Released on: 2026-03-05
- Please refer to the 7.76.2 tag on integrations-core for the list of changes on the Core Checks
Bug Fixes
- The
infra_modetag is now correctly added tosystem.cpu.useron Windows wheninfrastructure_modeis not set to"full", matching the behavior of the Linux cpu check.
Datadog Cluster Agent
Prelude
Released on: 2026-03-05 Pinned to datadog-agent v7.76.2: CHANGELOG.
7.76.1
Agent
Prelude
Released on: 2026-02-26
- Refer to the 7.76.1 tag on integrations-core for the list of changes on the Core Checks
Security Notes
- APM: On span tags, add obfuscation for
ACLcommand.
Bug Fixes
- Fixes a rare crash in the system-probe process caused by concurrent access to an internal LRU cache.
- Fix a Windows file-permission issue that prevented workload selection policy files from being updated after the initial write.
- Fixed a bug in the disk Go check (diskv2) where custom tags from one check instance would leak into metrics from other instances. Tags are now correctly isolated per instance.
- GPU: ensure
gpu.nvlink.speedmetric is emitted in Blackwell or newer devices.
Datadog Cluster Agent
Prelude
Released on: 2026-02-26 Pinned to datadog-agent v7.76.1: CHANGELOG.
7.76.0
Agent
Prelude
Released on: 2026-02-23
- Please refer to the 7.76.0 tag on integrations-core for the list of changes on the Core Checks
Upgrade Notes
- DDOT now submits Fleet Automation metadata through the upstream datadogextension, which is enabled by default. As a result, your DDOT configuration will now appear under the OTel Collector tab. If you configured
otelcollector.converter.features, you may need to add thedatadogfeature to enable Fleet Automation, as DDOT Fleet Automation metadata is no longer submitted through theddflareextension.
New Features
-
Allow users to filter agent check instances using a new --instance-id parameter, which filters by the instance hash found in the agent status.
-
Add
privateactionrunnerbinary in Agent artifacts to allow running actions using the Agent, and enable running it on Linux. The binary is disabled by default. To enable it, setprivateactionrunner.enabled: truein your configuration file. -
Integration check failures are now automatically reported to the Agent Health Platform component when enabled via
health_platform.enabled: true. This provides structured health issue tracking with:- Detailed error context including check name, error message, and configuration source
- Actionable remediation steps for debugging check failures
- Automatic issue resolution when checks recover
- Integration with the health platform telemetry and reporting system
This feature helps users proactively identify and troubleshoot integration issues across their fleet.
-
The Agent Profiling check now supports automatic Agent termination after flare generation when memory or CPU thresholds are exceeded. This feature is useful in resource-constrained environments where the Agent needs to be restarted after generating diagnostic information.
Enable this feature by setting
terminate_agent_on_threshold: truein the Agent Profiling check configuration. When enabled, the Agent uses its established shutdown mechanism to trigger graceful shutdown after successfully generating a flare, ensuring proper cleanup before exit.Warning: This feature will cause the Agent to exit. This feature is disabled by default and should be used with caution.
-
Experimental support the ConfigSync HTTP endpoints over unix sockets with
agent_ipc.use_socket: true(defaults to false). -
Implements the
flarecommand for the otel-agent binary. Now you can runotel-agent flaredirectly in the otel-agent container to get OTel flares. -
Adds system info metadata collection for macOS end-user devices.
-
Adds system info metadata collection for Windows end-user devices.
-
Added GPU runtime discovery support for ECS EC2 environments. The Datadog Agent can now detect GPU device UUIDs assigned to containers by extracting the
NVIDIA_VISIBLE_DEVICESenvironment variable from the Docker container configuration. This enables GPU-to-container mapping for GPU metrics without requiring the Kubernetes PodResources API, which is not available in ECS environments. -
After falling back to TCP, the Logs Agent periodically retries to establish HTTP and upgrades the connection once HTTP connectivity is available.
-
Container logs now include a
LogSourcetag indicating whether each log message originated from stdout or stderr. This applies to logs parsed via Docker and Kubernetes CRI runtimes. -
Added paging file metrics to the Windows memory check for
pagefile.sysusage.
Enhancement Notes
-
Add a new
global_view_dbvariable to AWS Autodisovery templates. By default this is the value of thedatadoghq.com/global_view_dbtag on the instance or cluster. -
Add NotReady endpoint processing to be on par with EndpointSlices processing.
-
The agentprofiling check now retries flare generation 2 times with exponential backoff (1 minute after first failure, 5 minutes after second failure) when flare creation or sending fails. This improves reliability when encountering transient failures during flare generation.
-
Adds a
kubernetes_kube_service_new_behaviorflag (default false) to alterkube_servicetag behavior. If the flag is set to true,kube_servicetag is attached unconditionally. Previously, the tag was only attached when the Kubernetes service has the statusReady. -
APM: Add custom protobuf encoder for trace writer v1 with string compaction to reduce payload size.
-
Extended the autodiscovery secret resolver to support refreshing secrets.
-
Agents are now built with Go
1.25.7. -
The datadog-installer
setupcommand now prints human-readable errors instead of mixing JSON and text. -
Added
GPUDeviceIDsfield to the workloadmeta Container entity to store GPU device UUIDs. This field is populated by the Docker collector in ECS environments from theNVIDIA_VISIBLE_DEVICESenvironment variable (e.g.,GPU-xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx). -
The GPU collector now uses
GPUDeviceIDsfrom workloadmeta as the primary source for GPU-to-container mapping in ECS, with fallback to procfs for regular Docker environments and PodResources API for Kubernetes. -
GPU: add new tag
gpu_typeto the GPU metrics to identify the type of GPU (e.g.,a100,h100). -
Improve eBPF conntracker support by using alternate probes when the primary probe is unavailable, enabling compatibility with GKE Autopilot and other environments running Google COS.
-
The
logs.droppedmetric now tracks dropped logs for both TCP and HTTP log transports. Previously, this metric was only available when using TCP transport. Customers can now monitor dropped logs with a single unified metric regardless of which transport protocol is configured, making it easier to detect and troubleshoot log delivery issues. -
The logs agent now supports using
start_position: beginningandstart_position: forceBeginningwith wildcard file paths. Previously, configurations likepath: /var/log/*.logwithstart_position: beginningwould fail validation. The agent's fingerprinting system when enabled prevents duplicate log reads during file rotation, making this combination safe to use. -
Site config URLs are now lowercased for consistent handling.
-
APM: Add tags
databricks_job_id,databricks_job_run_id,databricks_task_run_id,config.spark_app_startTime,config.spark_databricks_job_parentRunIdto the default list of tags that are known to not be credit card numbers so they are skipped by the credit card obfuscator. -
Add option to switch on/off Infra-Attribute-Processor for traces in the OTLP ingest pipeline.
otlp_config:
traces:
infra_attributes:
enabled: falseThese settings can be configured in the Agent config file or by using the environment variables.
-
The Datadog Agent now collects AWS Spot preemption events (requires IMDS access) as Datadog events.
-
Added
network_config.dns_monitoring_ports, which is a list of DNS ports Cloud Network Monitoring will use to monitor DNS traffic on. -
Automatically tag, but don't aggregate, multiline logs. Logs are tagged with the number of other logs they could potentially be aggregated with.
-
Update the histogram helpers API in the
pkg/opentelemetry-mapping-go/otlp/metricspackage. The API now accepts accept pointers to the OTLP data points, and returns blank DDSketches when the pointer is nil. -
Update image resolution attempt telemetry to include the
tagspecified in the configuration, and remove theregistryanddigest_resolutiontags. -
Windows: Add a new flare artifact
agent_loaded_modules.jsonlisting loaded DLLs with metadata (full path, timestamp, size, perms) and version info (CompanyName, ProductName, OriginalFilename, FileVersion, ProductVersion, InternalName). Keeps<flavor>_open_files.txtfor compatibility.
Deprecation Notes
- The command
agent diagnose show-metadata inventory-otelhas been removed. To display DDOT metadata, you can query the datadog extension endpoint:http://localhost:9875/metadata.
Bug Fixes
- Properly scrub sensitive information from Kubernetes pod specifications in agent flares. Environment variables with sensitive names are now redacted.
- Fixed a bug where long Kubernetes event bundles were being truncated by dogweb.
- APM: Fix a bug where the Agent would log a warning when the
DD_APM_MODEenvironment variable was unset. - Properly parse the
image_tagtag when defining a container spec that uses both an image tag and a digest likenginx:1.23@sha256:xxx. - Updates tag enrichment logic to retry on failed tag resolution attempts. This regression was introduced in #41587 on Agent v7.73+. Impacts origin detection on cgroup v2 runtimes with DogStatsD, which led to tags not being enriched, even if origin detection was possible by using other methods like container ID from socket or ExternalData.
- Fixed a regression in the Go-native disk check (diskv2) where a failure in IO counter collection (e.g.
ERROR_INVALID_FUNCTIONfromDeviceIoControlon Windows Server 2016) caused all disk metrics to be discarded, including successfully collected partition/usage metrics such assystem.disk.total,system.disk.used, andsystem.disk.free. IO counter collection is now best-effort: known errors such asERROR_INVALID_FUNCTIONare logged at debug level, while unexpected errors are logged as warnings. Neither prevent partition metrics from being reported. - Fleet installer: ensure the
DD_LOGS_ENABLEDenvironment variable is honored again when running setup scripts, so Windows installs using the new installer flow properly. Setslogs_enabledindatadog.yaml. - Fixes a bug introduced in 7.73.0 that can cause a remote Agent update through Fleet Automation to fail to restore the previous version if the MSI fails a...
7.75.4
Agent
Prelude
Released on: 2026-02-17
- Please refer to the 7.75.4 tag on integrations-core for the list of changes on the Core Checks
Enhancement Notes
- Agents are now built with Go
1.25.7.
Security Notes
- APM: On span tags, add obfuscation for
HELLOandMIGRATERedis commands.
Similar toAUTH, all arguments passed to these commands will be obfuscated and replaced with?.
Datadog Cluster Agent
Prelude
Released on: 2026-02-17 Pinned to datadog-agent v7.75.4: CHANGELOG.
7.75.3
Agent
Prelude
Released on: 2026-02-11
- Please refer to the 7.75.3 tag on integrations-core for the list of changes on the Core Checks
Security Notes
- Bump the version of envoyproxy/gateway to 1.5.7
Datadog Cluster Agent
Prelude
Released on: 2026-02-11 Pinned to datadog-agent v7.75.3: CHANGELOG.
7.75.2
Agent
Prelude
Released on: 2026-02-04
- Please refer to the 7.75.2 tag on integrations-core for the list of changes on the Core Checks
Upgrade Notes
- Update OpenJDK to 11.0.30. This release includes changes that may negatively affect JMX integrations that use TLS. Refer to OpenJDK release notes for more information.
Bug Fixes
- Disable the SNMP device scan by default.
- Fixes a regression introduced in version 7.75 that caused Workload Protection File Integrity Monitoring to be disabled by default when installing the Datadog Agent via the Helm chart.
- Fixes a bug introduced in Agent v7.74 where unresolved SSH sessions could cause Workload Protection events to be delayed for several minutes, potentially blocking the delivery of other Workload Protection events.
- GPU: fix metric type for gpu.nvlink.*, gpu.pci.replay_counter and gpu.remapped_rows.* metric that were reported as counters instead of gauges
Datadog Cluster Agent
Prelude
Released on: 2026-02-04 Pinned to datadog-agent v7.75.2: CHANGELOG.
7.75.1
Agent
Prelude
Release on: 2026-01-28
- Please refer to the 7.75.1 tag on integrations-core for the list of changes on the Core Checks
Enhancement Notes
- Agents are now built with Go
1.25.6.
Bug Fixes
- GPU: fix an issue where containerd image creation could be blocked sporadically when advanced eBPF metrics are enabled
- Change the Log Agent default TCP port for datadoghq.eu from the incorrect value of 10516 to the correct 443.
- Resolves an issue where NetFlow metrics are submitted every 10 seconds, instead of aggregating for the full interval per Source/Destination pair.
Datadog Cluster Agent
Prelude
Released on: 2026-01-28 Pinned to datadog-agent v7.75.1: CHANGELOG.