07 enablement · Governance

Engineering Manager

People and delivery health.

Updated: 2026-04-24 14 sections Download .zip

The Engineering Manager is the persona accountable for the health of the people and the health of the delivery flow. In an AI-native SDLC, the Engineering Manager operates a governance loop over DORA signals, 1:1 agendas, and team-level retrospectives, not a spreadsheet of tickets.

Executive summary

The Engineering Manager ensures that engineers grow, that delivery flow is predictable, and that organizational risk is surfaced before it becomes incident. In an AI-native SDLC, the Engineering Manager works inside the Governance phase with a fixed set of primitives: one delivery-health agent, four slash prompts, scoped instructions, schema-validated hooks, and a curated list of validated MCPs. The primary outputs are DORA scorecards, 1:1 agendas, retrospective briefs, and staffing recommendations tied to measured flow.

Role and responsibilities

Think of the Engineering Manager like the conductor of an orchestra during rehearsal season. The conductor does not play an instrument on stage, but the orchestra falls apart without their sense of tempo, dynamics, and rehearsal discipline. In an AI-native SDLC, the code and the architecture are owned by other personas. The Engineering Manager is accountable for tempo: the cadence at which the team learns, ships, and recovers.

Primary responsibilities:

  • Track DORA metrics (lead time, deployment frequency, change failure rate, mean time to restore) via GitHub Actions and Azure Monitor workbooks
  • Run weekly 1:1s with engineers using a rolling agenda surfaced by the M365 Agents SDK on Microsoft Teams
  • Facilitate sprint retrospectives with data pulled from Azure Boards and GitHub Projects
  • Own the delivery-health dashboard in Azure Workbooks and keep it linked from the repository README.md
  • Detect burnout, conflict, and attrition risk using weekly SPACE signals
  • Partner with the Technical Lead on staffing, career ladders, and skill gaps
  • Operate the Delivery Pulse agent and the /pulse, /one-on-one, /retro, /staff-ops prompts

Jobs to be done

  1. As an Engineering Manager, I want a weekly DORA scorecard auto-generated from GitHub Actions and Azure Monitor, so that I discuss facts with my skip-level instead of anecdotes.
  2. As an Engineering Manager, I want each 1:1 agenda prefilled with the engineer’s recent PRs, incidents, and learning goals, so that thirty minutes feel coached, not improvised.
  3. As an Engineering Manager, I want retro inputs aggregated from Azure Boards, GitHub Projects, and incident logs, so that the team debates patterns instead of re-listing events.
  4. As an Engineering Manager, I want attrition and burnout signals surfaced on a weekly cadence, so that I intervene before a resignation, not after.
  5. As an Engineering Manager, I want a staffing model that maps roadmap cost to current capacity, so that delivery commitments are honest.
  6. As an Engineering Manager, I want an audit trail of every people-facing AI suggestion, so that the team trusts the loop.

Pain points before AI-native

  1. Metrics theatre. Leadership asks for DORA, the team pastes screenshots from three different tools, and nobody trusts the number. Without a signed aggregation pipeline, each report is a new negotiation.
  2. 1:1s that drift into status. Without a prefilled agenda tied to real artifacts, the half-hour becomes a micro-standup and the growth conversation is postponed forever.
  3. Retros that blame individuals. Without data, the loudest voice wins the narrative. Systemic causes are invisible.
  4. Invisible burnout. On-call rotations, late PR reviews, and growing rework rates are three different dashboards. The manager only connects them after someone resigns.
  5. Staffing by intuition. Roadmap commitments are made on gut feel for team velocity, then re-negotiated every quarter in a budget meeting.

AI-native daily workflow

The Engineering Manager operates a weekly and daily loop. The loop uses GitHub Copilot primitives inside Visual Studio Code, Claude Code at the terminal for report generation, and the M365 Agents SDK on Microsoft Teams for 1:1 surfacing.

Morning setup

  1. Open Visual Studio Code on the eng-ops repository. GitHub Copilot Chat loads the scoped .github/instructions/management.instructions.md.
  2. Invoke /pulse to refresh the DORA and SPACE dashboards. The Delivery Pulse agent calls the GitHub MCP for Actions runs and the Azure MCP for Application Insights and Azure Monitor queries.
  3. Read the Teams channel where the M365 Agents SDK posts the overnight anomalies (flaky test spikes, change failure rate regressions, stalled PRs).

Midday execution

  1. Run two or three 1:1s. Each 1:1 opens with an agenda prefilled by /one-on-one <engineer>, which pulls the engineer’s merged PRs, incident involvement, and ticket participation from the GitHub MCP and Azure DevOps MCP.
  2. Review the Azure Boards risk register with the Project Manager. Any item tagged delivery-risk gets a linked workbook view from Azure Monitor.
  3. Run /staff-ops to evaluate capacity versus the next two sprints of committed work. The agent returns a gap analysis with named risks, never promises.

Afternoon review

  1. Lead a retro for one of the teams using /retro. The agent ingests sprint data from Azure Boards and GitHub Projects and produces a structured brief: what worked, what stalled, systemic causes, proposed experiments.
  2. Update the delivery-health dashboard in Azure Workbooks. Commit the query changes. GitHub Actions publishes the updated workbook.
  3. Close the day by posting the daily pulse summary to the leadership Teams channel via the M365 Agents SDK.

Agent

AgentFilePurpose
delivery-pulse.github/agents/delivery-pulse.agent.mdAggregate DORA and SPACE signals, draft 1:1 agendas, facilitate retros, produce staffing recommendations

The Delivery Pulse agent uses claude-sonnet-4-6 by default, with tools read, search, grep, bash, and access to GitHub, Azure, Azure DevOps, and Microsoft 365 Agents SDK MCPs. Extended thinking is enabled for staffing analyses where multi-hop reasoning over capacity and skill data is required.

Slash prompts

CommandFilePurpose
/pulse.github/prompts/pulse.prompt.mdRefresh DORA and SPACE dashboards, flag anomalies
/one-on-one.github/prompts/one-on-one.prompt.mdGenerate a 1:1 agenda from recent artifacts and rolling goals
/retro.github/prompts/retro.prompt.mdProduce a retrospective brief with systemic-cause hypotheses
/staff-ops.github/prompts/staff-ops.prompt.mdCapacity and skill-gap analysis for upcoming sprints

Instructions scoped

Scoped applyTo reduces token cost and keeps people-facing content distinct from code review guidance.

Scope (applyTo)FilePurpose
eng-ops/dora/**.github/instructions/dora.instructions.mdDORA aggregation rules, anomaly thresholds
eng-ops/one-on-ones/**.github/instructions/one-on-ones.instructions.md1:1 tone, confidentiality boundaries, never suggest performance verdicts
eng-ops/retros/**.github/instructions/retros.instructions.mdRetro structure, systemic-cause over individual blame

Hooks

Hooks are zero-token governance for management artifacts.

  • pre-commit: redact engineer names from retro drafts before they are committed to a shared branch
  • post-commit: regenerate the delivery-health dashboard JSON when DORA queries change
  • pre-push: validate that every staffing recommendation cites a capacity query, never a hunch

Validated MCPs

Every MCP below is registered in the MCP catalog. Do not reference any MCP that is not in the catalog.

MCPStatusUse in this persona
GitHub MCP ServerOfficialRead PRs, Actions runs, Projects boards; draft 1:1 agendas from recent contributions
Azure MCP ServerOfficial (Microsoft)Query Azure Monitor and Application Insights for deployment and incident metrics
Azure DevOps MCP ServerOfficial (Microsoft)Read Azure Boards work items, iterations, and risk register entries
Microsoft Learn Docs MCPOfficialGround management guidance in current Microsoft Learn and GitHub Docs
Microsoft 365 Agents SDK MCPOfficial (Microsoft)Post pulse summaries and 1:1 agendas into Microsoft Teams channels

Real examples

Example 1: weekly DORA pulse

Input: The Monday pulse is due at 09:00. The team merged 42 PRs last week, with two rollbacks.

Invocation: /pulse.

Expected output:

  1. A generated eng-ops/dora/2026-W17.md with the four DORA metrics, trend arrows, and a link to each underlying query in Azure Monitor.
  2. A Teams post via the Microsoft 365 Agents SDK summarizing the scorecard and flagging the change failure rate regression.
  3. An Azure Boards work item opened automatically for the regression, assigned to the on-call Technical Lead for investigation.

Example 2: 1:1 agenda for a senior engineer

Input: A 1:1 with a senior engineer is scheduled for 14:00. The engineer shipped three PRs last week and was paged twice on-call.

Invocation: /one-on-one priya.nair.

Expected output:

  1. A rolling agenda in eng-ops/one-on-ones/priya.nair/2026-04-24.md with three sections: career goals, recent work highlights (linked PRs and incidents), blockers.
  2. A private draft surfaced only to the Engineering Manager in Teams, never auto-shared.
  3. A follow-up todo list that carries forward any unresolved item from the previous 1:1.

Anti-patterns

  1. Turning 1:1 data into performance evidence. The 1:1 agenda is a conversation scaffold, not an audit trail. Mitigation: the one-on-ones.instructions.md forbids verdict phrasing and requires opt-in sharing.
  2. Running DORA without the engineers seeing it first. Metrics weaponized in leadership decks before the team sees them kill trust. Mitigation: the pulse posts to the team channel before the leadership channel.
  3. Retros that name individuals. Blaming people is a management failure. Mitigation: the Delivery Pulse agent rewrites any individual-blame phrasing into systemic-cause phrasing.
  4. Staffing models that hide assumptions. Capacity divided by story points is a lie. Mitigation: /staff-ops returns explicit assumptions and flags each one.
  5. Dashboards that never drive action. A workbook nobody reads is noise. Mitigation: every anomaly on the pulse opens an Azure Boards work item with an owner.

KPIs and impact metrics

MetricBaseline (manual)Target (agentic)Measurement
Weekly DORA scorecard deliveryAd hocEvery Monday 09:00GitHub Actions schedule
1:1 prep time per engineer20 minutesUnder 5 minutesTime-to-agenda log
Retro systemic-cause rateUnder 30 percentOver 70 percentRetro label audit
Attrition early warning lead time0 daysOver 30 daysPulse anomaly history
Staffing commitment accuracyPlus or minus 40 percentPlus or minus 10 percentRoadmap versus actual delivery
Manager token efficiencyN/AUnder 200k tokens per weekCopilot usage report

Maturity in four levels

LevelNameMarkers
L1ManualSpreadsheet DORA, improvised 1:1s, retros run from memory
L2AssistedGitHub Copilot Chat for drafting, no agent, dashboards in one tool only
L3AugmentedDelivery Pulse agent, four slash prompts, scoped instructions, DORA dashboard in Azure Workbooks
L4AgenticFull primitives kit, hooks enforced, M365 Agents SDK surfacing in Teams, attrition and burnout anomaly detection on a weekly cadence, maturity scorecard above 80 percent

Integration with other personas

  • With Technical Lead: shared staffing model, paired review of architecture-health indicators
  • With Scrum Master: retro facilitation, sprint flow diagnostics from Azure Boards
  • With Project Manager: risk register reconciliation, stakeholder reporting cadence
  • With SRE: on-call load and incident toil feed into burnout signals
  • With Product Owner: roadmap feasibility review against capacity
  • With InfoSec Officer: people-risk signals (access review, separation-of-duties) surfaced in the pulse
  • With DevRel: external contribution trends influence hiring signals

Glossary

  • DORA metrics: the four key delivery metrics defined by the DORA research program: lead time, deployment frequency, change failure rate, mean time to restore.
  • SPACE framework: a productivity model covering Satisfaction, Performance, Activity, Communication, Efficiency.
  • Pulse: the weekly rollup artifact that combines DORA, SPACE, and anomaly signals.
  • Delivery health dashboard: the Azure Workbooks view linked from the repo README.md that makes the team’s flow public.
  • Attrition early warning: a composite signal derived from on-call load, rework rate, and PR latency that indicates elevated resignation risk.
  • Staffing model: a capacity-versus-commitment projection that makes assumptions explicit.

References