Skip to content

Remove log-injection diagnostic and workaround code (fixed by #11509)#11548

Merged
gh-worker-dd-mergequeue-cf854d[bot] merged 2 commits into
masterfrom
brian.marks/remove-log-injection-diagnostics
Jun 3, 2026
Merged

Remove log-injection diagnostic and workaround code (fixed by #11509)#11548
gh-worker-dd-mergequeue-cf854d[bot] merged 2 commits into
masterfrom
brian.marks/remove-log-injection-diagnostics

Conversation

@bm1549
Copy link
Copy Markdown
Contributor

@bm1549 bm1549 commented Jun 3, 2026

What Does This Do

Removes the diagnostic instrumentation added to LogInjectionSmokeTest while investigating the check raw file injection flakiness. Since #11509 fixed the root cause (a DDTraceId/DD64bTraceId class-initialization deadlock), the diagnostics are no longer needed.

Only LogInjectionSmokeTest.groovy is changed. The actual fixes from the earlier PRs (#10999, #11375) are kept.

Removed (from #11075 and #11400):

  • waitForTraceCountAlive() and all helper methods: dumpThreadStacks, filterThreadDump, isBoilerplateThread, isHighPriorityThread, getTestedProcessPid, runJstack, tailProcessLog, INLINE_DUMP_CAP
  • import datadog.environment.OperatingSystem (was only used by dumpThreadStacks)

Kept (actual fixes, not diagnostics):

Motivation

The traceCount=0 failures in check raw file injection were caused by a class-initialization deadlock between DDTraceId and DD64bTraceId, diagnosed using the thread dumps added in #11400 and fixed in #11509. With the deadlock fixed, the ~260-line diagnostic block in LogInjectionSmokeTest is dead weight.

Additional Notes

N/A — only the diagnostic instrumentation is removed.

Contributor Checklist

  • Title formatted per the contribution guidelines
  • type: and comp: labels assigned
  • No issue-linking keywords used
  • CODEOWNERS — n/a (no file additions/deletions)
  • Public documentation — n/a (test-only change)

Jira ticket: N/A

@bm1549 bm1549 added tag: ai generated Largely based on code generated by an AI or LLM tag: no release notes Changes to exclude from release notes labels Jun 3, 2026
…1509

The DDTraceId/DD64bTraceId class-initialization deadlock was the root
cause of the traceCount=0 flakiness that prompted these diagnostics.
With the deadlock fixed, the diagnostic instrumentation is no longer
needed.

Removes from #11075 and #11400:
- waitForTraceCountAlive() and all helpers (dumpThreadStacks,
  filterThreadDump, isBoilerplateThread, isHighPriorityThread,
  getTestedProcessPid, runJstack, tailProcessLog, INLINE_DUMP_CAP)
- The OperatingSystem import that was only used by dumpThreadStacks

Keeps the actual fixes from #10999, #11375:
- BaseApplication TIMEOUT_IN_NANOS 10s→30s (prevents app-side timeout
  racing the test-side timeout under CI load)
- @flaky(isZulu8()) and JavaVirtualMachine.isZulu8() (covers a separate
  JUL race on Zulu 8 that is not fixed by #11509)

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@bm1549 bm1549 force-pushed the brian.marks/remove-log-injection-diagnostics branch from 9f96ca9 to 4c0fb1f Compare June 3, 2026 13:37
@bm1549 bm1549 marked this pull request as ready for review June 3, 2026 13:42
@bm1549 bm1549 requested a review from a team as a code owner June 3, 2026 13:42
@bm1549 bm1549 requested review from mhlidd and removed request for a team June 3, 2026 13:42
@datadog-datadog-prod-us1

This comment has been minimized.

@dd-octo-sts
Copy link
Copy Markdown
Contributor

dd-octo-sts Bot commented Jun 3, 2026

🟢 Java Benchmark SLOs — All performance SLOs passed

Suite Status
Startup 🟢 pass

SLO thresholds are defined here based on automatically generated metrics. A warning is raised when results are within 5% of the threshold.

PR vs. master results
Scenario Candidate master Δ (95% CI of mean)
startup:insecure-bank:iast:Agent 14.02 s 13.98 s [-0.8%; +1.4%] (no difference)
startup:insecure-bank:tracing:Agent 12.86 s 13.00 s [-2.2%; +0.0%] (no difference)
startup:petclinic:appsec:Agent 16.41 s 15.48 s [-3.0%; +15.0%] (unstable)
startup:petclinic:iast:Agent 16.53 s 16.64 s [-1.9%; +0.5%] (no difference)
startup:petclinic:profiling:Agent 16.37 s 16.41 s [-1.5%; +1.1%] (no difference)
startup:petclinic:tracing:Agent 15.77 s 15.83 s [-1.4%; +0.7%] (no difference)

Commit: eab7aa50 · CI Pipeline · Benchmarking Platform UI


Load and DaCapo benchmarks can be triggered manually in the GitLab pipeline. Results will appear in the Benchmarking Platform UI after completion.

@bm1549 bm1549 added this pull request to the merge queue Jun 3, 2026
@dd-octo-sts
Copy link
Copy Markdown
Contributor

dd-octo-sts Bot commented Jun 3, 2026

/merge

@gh-worker-devflow-routing-ef8351
Copy link
Copy Markdown

gh-worker-devflow-routing-ef8351 Bot commented Jun 3, 2026

View all feedbacks in Devflow UI.

2026-06-03 14:49:10 UTC ℹ️ Start processing command /merge


2026-06-03 14:49:15 UTC ℹ️ MergeQueue: pull request added to the queue

The expected merge time in master is approximately 1h (p90).


2026-06-03 16:04:48 UTC ℹ️ MergeQueue: This merge request was merged

@github-merge-queue github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Jun 3, 2026
@gh-worker-dd-mergequeue-cf854d gh-worker-dd-mergequeue-cf854d Bot merged commit 1b95bfa into master Jun 3, 2026
572 checks passed
@gh-worker-dd-mergequeue-cf854d gh-worker-dd-mergequeue-cf854d Bot deleted the brian.marks/remove-log-injection-diagnostics branch June 3, 2026 16:04
@github-actions github-actions Bot added this to the 1.64.0 milestone Jun 3, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

tag: ai generated Largely based on code generated by an AI or LLM tag: no release notes Changes to exclude from release notes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants