Skip to main content

How a 14-Day Sprint Unlocked $5,000 AWS Credits for MontyCloud MSPs

MontyCloud


My RoleSenior Product Designer
Team1 Product Manager2 Frontend Developers2 Backend Developers
ToolsFigma
Timeline14-day sprint
Description

0 MSPs had successfully claimed $5,000 AWS credits in two years. Six months after this redesign shipped, 16 had. The thesis: augmentation over automation. Identification time dropped from 8–12 minutes to under 1, completion rate climbed from 34% to 87%.

Context

MontyCloud's WAFR platform produced findings but left cloud architects to manually correlate questions, AWS risks, and severities across disconnected views, making live customer assessments cognitively exhausting. MSPs were tracking 57 questions in spreadsheets. Zero customers had ever successfully claimed credits through the platform.

Inline risk · Updates instantly as you answer · 14 sec

MontyCloud's MSP customers ran AWS Well-Architected Reviews to claim $5,000 AWS credits per assessment, but the platform had no question-level interface for it. Architects were tracking 57 questions in spreadsheets, scanning rows manually for 8–12 minutes to find a single high-risk issue. In two years, zero MSPs had successfully claimed credits through the platform.

I led the redesign of the WAFR Review workflow, first as a 14-day sprint shipping question-level tracking, then by reframing the whole live-assessment experience around inline risk intelligence. The thesis: in cybersecurity workflows, augmenting expert decisions beats automating them.

Impact

· 6 months after launch

Before · 0 in 2 years

16

MSPs successfully claimed AWS credits

Before · 8–12 min

<1min

Time to identify high-risk issues

Before · 34%

87%

Assessment completion rate

Before · 23 / mo

3/ mo

Support tickets tagged 'WAFR confusion'


The broken workflow

MontyCloud's AWS Well-Architected Review platform was used by Managed Service Providers (MSPs) and cloud-operations teams to assess customer environments against AWS best practices. The platform produced findings, but the workflow looked like this:

  1. Architect answers a 57-question assessment.
  2. System generates findings in a separate view, disconnected from the questions.
  3. Architect manually cross-references each question against AWS best practice, severity, failed resource, and remediation guidance.
  4. Architect explains findings live to the customer, under time pressure, with 57 questions and dozens of findings open across screens.

AWS PLATFORM • DISCONNECTED FINDINGS, NO RISK FILTERING

AWS WAFR platform before redesign: disconnected views and manual workflows

The result, observed across seven facilitation sessions:

  • Users didn't know the impact of their answers until later
  • Findings were disconnected from questionnaire decisions
  • Analysts manually mapped AWS risks to controls
  • Large finding volumes made prioritization difficult
  • Workshops became cognitively exhausting

Zero MSPs had ever successfully claimed AWS credits through the platform. Architects were maintaining shadow spreadsheets just to keep track of what they'd answered and what mattered.


Who it was for

Cloud Security / Solutions Architect

Primary
  • Runs customer WAFR assessments end-to-end
  • Needs to identify risk quickly during live calls
  • Must explain findings to customers in real time
  • Works under time pressure with stakeholders watching
  • Technical, but not deeply familiar with every AWS control

Sales Teams

Secondary
  • Need understandable risk explanations for customer-facing decks
  • Need prioritization clarity to position remediation services
  • Need faster onboarding and lower training burden
  • Rely on the tool to be a sales-enablement asset, not a black box

I relied on workflow observation, stakeholder interviews, and review-session walkthroughs with cloud architects, the enterprise environment didn't permit broad usability testing, so deep observation of a small number of architects became the primary research method.


The design question

How might we help cloud architects understand risk implications faster during live assessment workflows: without overwhelming them with security complexity?

The path I explored and rejected

Before settling on a workflow redesign, I pushed hard on full automation, partly because it was the obvious "AI moment" answer, and partly because the PM was open to it. Here's why I cut it.

Rejected option

Option A: Auto-Evaluate & Prepopulate Recommendations

The platform would auto-scan cloud environments, prepopulate questionnaire answers, generate HRI/MRI classifications automatically, and recommend remediation paths upfront. The questionnaire would become a "review and confirm" workflow instead of a manual assessment.

Why I cut it

  • In cybersecurity, pure automation struggled with contextual correctness, auto-generated severity classifications often missed customer-specific architectural nuance, eroding trust the first time they were wrong.
  • High business cost and engineering effort to build, validate, and maintain, the platform team didn't have the infrastructure or model-evaluation pipeline to ship it responsibly inside a quarter.
  • It removed the architect from the loop in exactly the conversations where architects added the most customer value, which would have shrunk the platform's strategic position with MSPs.

The thesis: augmentation, not automation

In cybersecurity workflows, full automation can reduce trust and explainability. We focused on augmenting expert decision-making instead of replacing it.

Design thesis · WAFR Review Workflow

That reframe, augmentation over automation: became the prioritization filter for every feature decision that followed. If a feature took judgment away from the architect, it was out. If it surfaced the right information at the right moment so the architect could decide faster, it was in.


What I designed

The redesign added an Inline Risk Intelligence layer across the existing WAFR workflow:

  1. Risk Exposure preview on every best-practice check, architects could see whether a specific answer would generate an HRI (High Risk Issue) or MRI (Medium Risk Issue) before submitting, closing the feedback loop that previously took hours.
  2. Linked findings for each best practice, eliminated the manual correlation step between findings view and questionnaire view.
  3. Unique findings grouping: collapsed dozens of repeating findings into clean, actionable issue groups.
  4. Risk-exposure filtering by pillar: filter the question list by Operational Excellence, Security, Reliability, Performance, Cost, or Sustainability.
  5. Live Coaching Module: surfaced talking points and remediation guidance inline during customer workshops, so the architect's explanation matched the platform's evidence.
  6. Question-level tracking (the 14-day sprint piece), first interface that let MSPs actually demonstrate progress to AWS and claim the $5,000 credit.

Feature Overview

WAFR Checks key features: inline risk intelligence, filtering, and status indicators

Live Coaching Module

Live Coaching Module: contextual talking points and remediation guidance

The solution. What shipped, and the three decisions inside it

The redesigned WAFR Checks interface, and the three design decisions that made it work.

Decision 1 · Risk filtering

I chose

Clickable summary counts ("6 High Risk", "12 Medium Risk") that filter the question list directly.

I rejected

A separate filter dropdown panel, the pattern the PM initially proposed because it matched nOps.

Why

The existing summary counts were dead text. Architects were scanning the full list while trying to hold the counts in working memory, classic cognitive-load failure. Connecting the summary directly to the list removed the scanning step entirely. It also avoided adding a new UI control to a screen already crowded with information.

Outcome

Time-to-identify-high-risk dropped from 8–12 minutes to under 1 minute in post-launch observation.

Decision 2 · Status indicators

I chose

Semantic shape-based icons: clock for unanswered, x-mark for high risk, exclamation for medium, checkmark for none, dash for not applicable.

I rejected

The competitor pattern: red / yellow / green colored dots. nOps used it. The PM initially proposed it.

Why

Red-green color blindness affects roughly 1 in 12 men. A red dot and a green dot look identical to them, and our MSP audience skewed heavily male. Shape communicates status without relying on color, with color as a redundant cue rather than the primary one. Dual-coding theory (Paivio, 1971), two encoding channels lower cognitive ambiguity.

Outcome

PM agreed in five minutes once the accessibility argument was on the table. Shipped as the platform default.

Decision 3 · Scope for the 14-day sprint

I chose

Ship question-level tracking, filtering, and pillar progress. Cut historical timeline tracking.

I rejected

The "complete vision", including a historical timeline showing WAFR scores improving over time, which architects had asked for.

Why

Cross-run history needed data infrastructure that didn't exist yet, at least a quarter of backend work. The 14-day window was real (an MSP partner was waiting on credit claims) so every feature had to answer one question: does this help MSPs claim credits faster? Historical tracking didn't pass that filter.

Outcome

Shipped on time. MSPs started claiming credits in week three. Historical tracking landed in a later release once the data layer was in place.

Decision 4 · Live Coaching Module in the All-Findings table

I chose

A contextual coaching panel inside the All-Findings table: talking points, remediation guidance, and customer-language explanations surface inline as the architect clicks each finding.

I rejected

Separate documentation pages and pre-recorded video tutorials linked from the table, the pattern most enterprise platforms ship.

Why

Architects don't open new tabs during live customer workshops. Knowledge that lives one click away might as well live on another planet, under facilitation pressure, the architect needs the right sentence in the right place. Coaching had to be in the table, not around it. The same logic that put filtering inside the summary counts put guidance inside the row that triggered the question.

Outcome

Customer-facing explanations stayed consistent with the platform's own evidence, which closed the "architect says one thing, the dashboard shows another" credibility gap that had been the loudest sales-team complaint.

Live Coaching · In Action

Info Panel · Detailed Context


A note on attribution

The metrics at the top are real, but they're not all UX wins.

The jump from 0 to 16 credit claims isn't a design victory, the question-tracking interface made the workflow possible at all. Before this, MSPs literally could not claim credits through MontyCloud. They used spreadsheets or AWS's native tool. That number is capability impact.

The clearer UX signal is the completion rate going from ~34% to ~87%: that's people finishing a process they'd previously abandoned. That is design impact.

I want to be precise about that because business outcomes are easy to overattribute to design work, and the two stories are worth telling separately.


How I prioritized in 14 days

Every feature had to answer one question:

"Does this help MSPs claim credits faster?"

If no, it was out. That filter killed historical tracking (architects asked for it but it needed backend infrastructure), advanced sorting (nice-to-have, not load-bearing), and three other ideas the team was attached to. Saved roughly a quarter of engineering effort and let us ship the workflow that actually unlocked the revenue.

The daily PM syncs were short, 15-20 minutes, and mostly about scope management. "Can we add X?" → "What do we cut?" The constraint was real and nobody pretended otherwise.


How I validated

  • Stakeholder walkthroughs with PM, architects, and a customer-facing engineer
  • Live workflow reviews: sat through seven facilitation sessions before designing anything
  • Iterative feedback with two senior architects mid-sprint
  • Post-launch observation of three users to measure time-to-identify-high-risk

Honest admission: this is what was possible inside the enterprise constraint, not what was ideal. Even 2–3 structured MSP usability sessions would have caught a few edge cases earlier.


What I'd do differently

Push for analytics instrumentation from day one. The "under 1 minute" metric came from observing three users, not from proper Mixpanel events. I should have scoped tracking into the initial sprint so the impact story was data-backed, not observational.

Run two MSP sessions even at the cost of one shipped feature. The PM relayed customer needs accurately, but PM-filtered research is still PM-filtered. The tradeoff of one less feature for direct user contact would have been worth it.

Document the design system tokens earlier. I shipped the icon set and the filter component without naming them in a shared token library, which meant the next designer working on WAFR rebuilt 30% of it before finding mine.


What I learned

Augmentation beats automation in trust-critical workflows. Cybersecurity, healthcare, finance, anywhere the cost of being wrong is high, removing the expert from the loop destroys the product's positioning even when the automation works. The whole redesign hinged on holding that line.

Competitive research isn't admitting defeat. Looking at nOps and AWS's native WAFR tool gave us validated patterns and saved days of exploration. The value wasn't inventing a new paradigm, it was executing well within proven patterns, then differentiating on the things that actually mattered (accessibility, cognitive load, filtering directness).

Constraints force honesty. 14 days meant we couldn't hide behind complexity. Every decision had to justify itself immediately. The prioritization filter, does this help MSPs claim credits faster?, was the cleanest design tool I've ever used.

The PM relationship matters more than the PM disagreement. The status icons conversation resolved in five minutes because we had a working relationship, not because I "won" the argument. Bring the strongest argument, but bring it to someone who's already on your team.


The room this happened in

MontyCloud · 2024

Working session at MontyCloud: collaborating with PM and engineering during the WAFR sprint.

The work didn't happen in isolation. The PM relationship I keep coming back to in this case study, that's what made the icons conversation resolve in five minutes instead of five weeks. The engineering pair sat next to me and would tell me "we can't ship that in 14 days, but here's what we can ship" before I'd even finished sketching. The constraint felt collaborative, not adversarial. That's the part you can't put in a results strip.


Want a faster read?

I also built a slide-based version of this story, designed to skim in about 30 seconds, one decision per slide.

→ Skim it in 30 seconds