Overview
A digital banking product team wrote Product Requirements Documents (PRDs) without consistent accessibility requirements, so color contrast, focus order, and keyboard behavior were clarified late and reworked during QA. Intelligex implemented an AI copilot inside Confluence that checked PRD drafts against Web Content Accessibility Guidelines (WCAG) criteria, suggested improvements, and triggered a Design Ops review in Jira when gaps appeared. Accessibility issues were flagged earlier, design sign-offs were clearer, and QA cycles saw fewer rejectionswithout changing the teams core tools or delivery rituals.
Client Profile
- Industry: Digital banking and payments
- Company size (range): Multi-squad product and engineering organization
- Stage: Mature Confluence/Jira workflows; accessibility reviews ad hoc and late
- Department owner: Product Management & R&D
- Other stakeholders: Design/UX, Design Ops, Web/Mobile Engineering, QA, Compliance/Legal, Security/Privacy, Customer Support, Accessibility SMEs
The Challenge
PRDs were authored as free-text pages with variable acceptance criteria. Some documents called out semantic headings and keyboard navigation; others focused only on visual behavior. Designers captured accessible states in Figma inconsistently, and engineers received stories that did not specify focus handling, ARIA roles, label associations, or error announcements. QA raised issues after builds shipped to test, and Compliance flagged risk when customer-facing changes did not align with internal accessibility standards derived from the WCAG.
Cycle pressure made things worse. Releases bundled features across squads; each team interpreted accessibility needs differently; and sign-offs depended on a handful of experts. When guidance arrived late, sprint work reset and backlogs grew. Product managers wanted accessibility to be part of requirements, not a separate checklist at the end.
Why It Was Happening
Root causes were fragmentation and unstructured specifications. Confluence templates offered no standard fields for accessibility. Acceptance criteria talked about happy paths but skipped screen reader behavior, reduced motion preferences, or focus management. Design tokens covered color and spacing but not contrast or state cues. Subject matter expertise sat with a few people, so teams relied on memory and past examples rather than a governed review.
Ownership was diffuse. Product authored PRDs, Design documented patterns, Engineering built stories, QA validated outcomes, and Compliance reviewed risk. Without a shared workflow that linked requirements to recognized criteria and enforced a light gate before work started, the organization corrected issues after code was written.
The Solution
Intelligex added a PRD-era governance layer in Confluence with an AI copilot that checked drafts against WCAG and pattern guidelines, suggested acceptance criteria, and created review tasks in Jira when gaps or high-risk changes appeared. Standardized PRD templates included required fields for semantics, interaction, and feedback states. The copilot highlighted missing contrast notes, unclear focus behavior, or absent announcements for errors and dynamic content, and linked to relevant criteria. Humans approved suggestions before they landed in stories, and Design Ops owned final sign-off for pattern changes.
- Integrations: Confluence for PRD authoring (with macros and the AI copilot); Jira for design review tasks, issue linking, and status badges; optional read-only references to design assets; criteria aligned to WCAG and internal pattern libraries.
- PRD templates: Structured sections for semantic markup, keyboard interaction, focus order, announcements and live regions, motion and timeouts, contrast and state variations, and accessible names/labels. Each section included example acceptance criteria and test hooks.
- AI checks and suggestions: Natural language analysis of PRD text and linked assets to flag gaps and propose acceptance criteria tied to relevant WCAG success criteria and internal patterns.
- Design Ops review gates: Automatic creation of Jira tasks for pattern changes or high-risk components; required sign-off before stories moved to development. Status badges surfaced approval state back in Confluence.
- Regression guidance: When PRDs modified established patterns, the copilot suggested regression checks for related areas (for example, modal behavior and focus trapping), which QA could import into test suites.
- Dashboards and traceability: Views showing open accessibility findings by epic and squad, with links from PRDs to Jira decisions and back to criteria references.
- Permissions and audit: Role-based access to PRD checks and approvals; immutable logs of suggestions, edits, and sign-offs.
Implementation
- Discovery: Reviewed recent PRDs, QA findings, and support escalations tied to accessibility. Cataloged common components and patterns (navigation, forms, tables, modals) and current design tokens. Mapped existing sign-off steps and where accessibility entered the flow.
- Design: Defined PRD template fields and example acceptance criteria; configured the copilots rule set against internal standards and WCAG topics; specified Jira workflows for Design Ops review and approval; agreed on status badges, permissions, and audit fields.
- Build: Deployed Confluence macros, the AI copilot service, and Jira integration for task creation and status sync. Seeded guidance per component and linked criteria references. Built dashboards for open findings and review throughput.
- Testing/QA: Ran in shadow mode on active PRDs while teams kept existing reviews. Compared copilot findings to QA issues and SME guidance; tuned prompts, templates, and triggers; validated that Jira tasks reflected risk appropriately. Included a human-in-the-loop review board with Design Ops and QA.
- Rollout: Turned on the templates and copilot for selected epics; kept manual checklists as a controlled fallback. Expanded to all squads after stable cycles and positive SME reviews.
- Training/hand-off: Delivered short sessions for PMs, Designers, Engineers, and QA on templates, interpreting findings, and approvals. Updated SOPs for PRD authoring and design sign-off. Transferred ownership of rule sets and templates to Design Ops and Product Ops under change control.
Results
Accessibility requirements moved into the PRD phase. Authors received immediate feedback on missing acceptance criteria, Design Ops reviewed pattern-level changes before development, and engineers picked up stories with clear expectations for semantics, focus, and announcements. QA saw fewer accessibility rejections because criteria and tests aligned from the start.
Reviews were more predictable. PRDs referenced the same criteria and patterns across squads, and Jira badges showed whether design sign-off was complete. Compliance discussions shifted from finding gaps late to confirming that requirements matched recognized guidance. The team kept Confluence and Jira; the difference was a lightweight governance layer that made accessibility part of normal planning.
What Changed for the Team
- Before: PRDs varied in how they captured accessibility. After: Templates required consistent sections and acceptance criteria tied to patterns.
- Before: Issues surfaced during QA. After: The copilot flagged gaps in Confluence, and design reviews were triggered in Jira early.
- Before: Sign-offs depended on a few experts. After: A defined Design Ops gate and dashboards made ownership and status visible.
- Before: Engineers inferred focus and announcements. After: Stories carried clear criteria for keyboard navigation, focus order, and live regions.
- Before: Regression risk was implicit. After: Pattern changes included suggested regression checks that QA pulled into suites.
- Before: Compliance escalations debated intent. After: PRDs linked to relevant WCAG topics, aligning decisions with recognized guidance.
Key Takeaways
- Write accessibility into requirements; structured PRD fields turn guidance into actionable acceptance criteria.
- Automate prompts, not judgment; an AI copilot can highlight gaps while humans approve what gets built.
- Gate pattern changes; a light Design Ops review prevents late rework across shared components.
- Tie to recognized standards; linking to WCAG creates a common language for Product, Design, Engineering, and Compliance.
- Expose status where teams plan; Confluence pages and Jira badges keep sign-offs visible without new tools.
- Integrate, dont replace; add checks to existing authoring and planning workflows.
FAQ
What tools did this integrate with? The copilot ran inside Confluence for PRD authoring and created tasks and approvals in Jira. Criteria aligned to the WCAG and internal pattern libraries. Optional links referenced design assets where helpful; delivery and test tools remained unchanged.
How did you handle quality control and governance? PRD templates enforced required fields. The copilots suggestions entered a review queue where Product or Design approved or edited them before use. Pattern-level changes triggered Jira reviews with designated approvers in Design Ops. All suggestions, edits, and approvals were logged, and rule sets were versioned under change control.
How did you roll this out without disruption? The copilot and templates ran in shadow mode first, producing findings while teams continued existing reviews. After tuning accuracy and thresholds with Accessibility SMEs, the review gate was enabled for select epics and then expanded. Manual checklists remained as a controlled fallback during early cycles.
How were WCAG criteria operationalized in PRDs? Templates mapped common requirements to PRD fields and example acceptance criteriafor example, focus order and trapping for modals, ARIA roles and labels for forms, announcements for validation errors, reduced motion preferences, and color contrast for tokens and states. The copilot linked findings to the relevant WCAG topics and internal patterns.
How did you ensure the AI didnt over-flag or miss context? The copilot used rule- and prompt-based checks tuned with SME feedback. Low-confidence findings required human confirmation, and repeated declines adjusted thresholds over time. Teams could suppress non-applicable checks per component with rationale, which improved future suggestions.
Department/Function: IT & InfrastructureLegal & ComplianceProduct Management & R&D
Capability: AI AgentsCopilots & Intelligent Automation
Get a FREE
Proof of Concept
& Consultation
No Cost, No Commitment!


