Skip to main content

Search...

Pains and Hypes of Software Testing

Testers keep losing the value argument not because their work is wrong, but because they pitch defects instead of business outcomes.

9 min read
Cover for Pains and Hypes of Software Testing

Software testing value refers to the ability to communicate quality work in business terms rather than technical metrics. Testers who frame their work around revenue targets and user satisfaction gain more organizational acceptance than those who cite code coverage numbers. Domain knowledge, prompt literacy, and understanding of system architecture are the skills that matter most as AI-generated code becomes standard.

Key Takeaways

  • Testing’s core problem is not AI disruption but a decades-long failure to communicate testing value in business terms, such as revenue impact and user satisfaction, rather than technical metrics like code coverage.
  • Prompt reviews are an emerging shift-left practice: because small changes in wording produce significantly different AI outputs, reviewing the prompts used to generate code becomes a quality gate in its own right.
  • Domain knowledge is becoming more valuable than deep technical skills, because testers increasingly need to judge AI-generated outcomes rather than write or maintain code themselves.
  • Non-functional requirements, including load, performance, and accessibility, are routinely overlooked, and ignoring architecture early makes them expensive to fix once a product ships.
  • Cutting testers in a downturn is a short-term saving that creates a long-term shortage of people who can understand and verify system architectures, especially as AI-generated codebases grow.

The hardest part of testing is selling its value

Testers are good at communicating problems. They are bad at communicating what their craft is worth to a business. That gap, not any single technology, has shadowed software testing for close to two decades.

The pattern repeats with every hype cycle. Around 2010 and 2011, frameworks like Selenium turned automation into a trending topic, and the promise was the same one heard today: automate everything, drop the testers. AI now arrives with the identical pitch. The technology changes, the misunderstanding does not.

The cost shows up on social media. Skilled people wear “open to work” badges, and the question lands every time: why is someone this good looking for a job? Part of the answer is that the industry never learned to advertise its own craft in terms decision-makers care about.

Why managers cut quality first in a downturn

When the economy stalls, quality work is the easy thing to cut. Developers write the code, designers draw the interface, the product ships anyway. Testing looks optional from that vantage point, so it goes first.

Pushing testing onto developers can hold up in the short term. The same companies often pile on more: write the code, test it, maintain the pipelines, run the product in production. Replacing testers and junior developers with whatever AI tool is on the market may keep velocity up for a while.

The long run is where this breaks. Daniel Knott expects a sharp demand in five to ten years for people who can actually understand system architectures. Building an app, a web product, almost anything publishable now takes minutes. Moving that fast without anyone who grasps what was generated stores up trouble.

Speed without architecture is a debt you pay later

Generated code with no real architecture behind it is where non-functional problems take root. Performance and security issues rarely sit on the surface. They live in the architecture, in a bad design or a gap in the structure.

Architecture is the most expensive thing to change once a product is released. That made non-functional testing matter before AI, and it matters more when code arrives faster than anyone can reason about it.

Non-functional requirements are quietly slipping off the radar. Asked when they last thought about them, some product managers had to ask what the term even meant. Accessibility has pulled a little attention back, partly through regulation. Load and performance testing get far less. Security testing at least often has outside experts attached, but the topic as a whole is easy to forget.

Talk in the language the listener already speaks

There is no silver bullet for communicating the value of testing. The right move depends on the person in front of you and where they come from.

Find their background first. Someone from economics or marketing responds to business value. Tie testing to the outcome that person owns. If a feature carries revenue, frame it there: the KPI is a million in revenue next month, and skipping holistic coverage in specific areas makes that target unlikely.

Someone technical needs a different entry. Open the system architecture map, point at what is already automated and where the weak spots are, then sell activities against that picture. More API testing, contract testing, the non-functional work, whatever fits that person’s toolbox.

Business and user-facing numbers land. Revenue and user satisfaction get testing accepted. Walking in with “we have 20 percent code coverage and need 80” does not. To most managers that is just a number, and “code coverage” sounds too complicated to act on.

The honest difficulty is the gap between daily testing work and a high-level KPI like user satisfaction. There is no clean formula for closing it. What the business sells, what competitors add to their products, what customers ask for in feedback: those data points have to be connected before the next step is obvious.

Domain knowledge may matter more than deep technical skill

For years the advice to a business-focused tester was blunt: get technical, learn a programming language, study architecture patterns and clean code. That knowledge still helps. It is no longer the thing that decides whether the role survives.

AI is not leaving in the next years or decades. With vibe coders and tools generating features on demand, the work shifts toward judging the output of a model rather than writing every line of test code by hand. Judging that output well depends on knowing the business domain deeply.

You still need the foundations of software testing, and you still need the basics of how a system works. What a request is, what a response is, how client-server communication runs, what a modern tech stack looks like. That base knowledge feeds new testing ideas and lets you ask the right questions, including which parts of a feature were generated and which a human developer still owns.

Thinking past your own business box becomes more valuable, not less. When AI produces the code and it appears done, the edges and the cases outside the obvious path are where a tester earns their keep.

Prompt reviews could be the next shift-left practice

Reviewing the prompts that generate code is an underused entry point for quality. The prompt is where defects can enter before a single line is written.

Context decides everything when you prompt an AI. Change one small part of a sentence and the output can come back completely different. A developer prompting a model is making decisions that shape the resulting code, and right now almost no one inspects that step.

This is, for me, a new way of shift left testing, you know, to do like review the prompting that go into the system. — Daniel Knott

As long as people write their own prompts, prompt reviews have room to grow into a real practice. It is a quality check moved as far upstream as it goes.

Good testing tools earn their place by being usable

A testing tool is an extension of your work, so it has to be easy to install and friendly to use. Anything that needs days of configuration, dependency checks, and gigabytes of libraries before it runs slows down the testing it was meant to speed up. Vendors should treat that as a requirement, not a nicety.

The AI features worth having already exist in shipping products. A few categories stand out from the marketing:

CapabilityWhat the tool does
Test case generationA plugin reads requirements and designs in the ticketing system and derives test cases from them
Usage-based test suggestionsA hidden library tracks what users click, maps the real user journey, and points to the cases worth automating
Prompt-driven automationA prompt interface turns a written instruction into running test automation or configuration

None of this replaces the tester’s judgment. You still pick the tool that fits your tech stack and your situation. The tool that is right for one team is wrong for another, which is why the homework cannot be skipped.

Try the tools yourself instead of trusting the pitch

Trial the tools firsthand rather than believing the white papers. Vendors will always present their product in the best light, the same way you would in their position, so the marketing tells you what could be there, not what works for you.

Install trial versions, run them, see what holds up and what falls apart. Doing this regularly gives you a feel for where the industry is heading and what actually matters in your current setup. There is no universal pick anymore. Years ago everyone reached for Selenium, and on mobile for Appium, and that was the end of the discussion. Now the right tool is specific to you.

The way vendors reach testers is shifting too. Search engines lose ground as people move to chat-style interfaces, so vendors lean harder on social media and on conferences where they can demo to the right audience. A test automation lab at a conference, with workstations set up to try tools away from the booth, beats the usual hurdle of a salesperson catching you the moment you slow down to read.

Don’t discount the skills you already built

The fundamental testing skills learned over the past ten, fifteen, twenty years are still valid, and they still help with AI. Ignore anyone claiming testing is no longer needed or that your knowledge is irrelevant.

The one thing to act on now is to get into AI seriously. Learn prompting, learn how large language models work internally, including the vector databases and the underlying principles. Little of that connects to testing directly, yet it sharpens how you test.

The pace is the reason to start today. The mobile revolution around 2010 took some companies more than fifteen years to catch up with, and a few still ship a poor app or none at all. AI is moving an order of magnitude faster than that. Sitting it out is not a safe position.

Share this page