How to Prepare for a Threat-Led Pentest: What the Report Won't Tell You If You Get It Wrong
By
Mike Rotondo
·
16 minute read
Preparing for a threat-led penetration test requires six things your testing firm cannot do for you: mapping your crown jewel assets before scope is defined, conducting a complete asset inventory that includes shadow IT and undocumented APIs, building a threat profile from sector-specific intelligence tied to real adversary TTPs, assembling documentation that gives testers context instead of making them discover it, aligning internal stakeholders with formal rules of engagement, and configuring your detection environment to measure what your SOC actually catches. Organizations that skip these steps get a report. Organizations that complete them get an accurate picture of their breach risk.
The Validation Gap Nobody Talks About in the Debrief
Forty-five percent of enterprises increased their security stack to an average of 75 tools in 2024 and 2025. Sixty-seven percent of those same organizations still experienced a data breach within the prior two years.
More tools. More breaches. That paradox has a name: the validation gap. Organizations invest heavily in security controls and almost nothing in verifying whether those controls actually work against the specific adversaries targeting them.
Penetration testing was supposed to close that gap. And for organizations that approach it correctly, it does. But here is the number that does not appear in most pentest sales conversations: pentesters successfully breached internal network perimeters in 93% of companies they tested in 2025. Ninety-three percent. Not because the organizations had no defenses. Because their defenses were not tested against realistic attack paths by testers with real adversary context.
The problem in almost every case is not the quality of the testing. It is the quality of the preparation.
A threat-led penetration test pointed at the wrong assets, built on generic scenarios with no sector-specific threat intelligence, run against a scope that omits shadow IT, and delivered to a stakeholder who has no plan for remediation ownership produces a report. A well-prepared threat-led penetration test produces an accurate, actionable picture of what a real adversary targeting your organization could actually do.
This article is the preparation guide for the second kind.
Why Most Penetration Tests Fail Before They Start
Compliance Testing Is Not Threat-Led Testing
There are two entirely different exercises that share the name penetration testing, and organizations that conflate them consistently overpay for the wrong one.
A compliance-based penetration test is scoped to satisfy an auditor. It checks whether the controls required by PCI DSS, SOC 2, HIPAA, or ISO 27001 are present and functioning. The scope is drawn from a documented asset list. The methodology follows a defined framework checklist. The output tells you whether you can pass the audit.
A threat-led penetration test is scoped to answer a different question entirely: can the specific adversaries that target organizations in your sector, your geography, and your size breach your specific defenses to reach your specific most valuable data?
That question requires threat intelligence, adversary-specific scenarios, and organizational context that no compliance framework provides. DORA, the EU's Digital Operational Resilience Act that came into force in January 2025, recognized this gap formally enough to mandate threat-led penetration testing as a separate and more demanding requirement for significant financial entities, precisely because compliance-based testing was demonstrably failing to capture real-world resilience.
The 2025 State of Penetration Testing report found that Security Misconfiguration was the top finding category at 20.67% of all results, followed by Identification and Authentication Failures at 9.96% and Insecure Design at 9.62%. These are not exotic vulnerabilities that require advanced tooling to find. They are fundamental organizational and architectural failures that persist in production environments because the testing that was supposed to find them was not designed to look for them in context.
The Four Ways Poor Preparation Destroys an Engagement
Organizations consistently fail their penetration test engagements in four specific ways, all of which occur before testing begins.
Wrong scope. When scope is defined by convenience rather than risk, testers spend their time in systems that are unlikely to lead to critical findings while the actual attack paths to crown jewel assets remain untested. One of the most common mistakes observed across TLPT engagements is inadequate scope definition: either the net is cast too wide, diluting focus, or it is too narrow, leaving critical assets untested and creating a false sense of coverage.
Generic scenarios. A retail bank faces different attack patterns than a healthcare system. A professional services firm faces different adversaries than a defense contractor. Scenarios built without sector-specific threat intelligence simulate a generic attacker, not the real ones. The MITRE ATT&CK 2025 Enterprise Evaluations demonstrated this clearly: testing against real threat actor behavior, in that case Scattered Spider and Mustang Panda, produces categorically different results than testing against generic attack patterns.
Incomplete asset inventory. Shadow IT exists in at least 53% of organizations, and the assets it represents are the ones most likely to carry the access control failures, default credentials, and undocumented API endpoints that real attackers find first. A test scoped against a documented asset list misses everything that is not on the list, which is frequently the most vulnerable part of the actual environment.
No remediation plan. The 2026 State of Pentesting report identified a structural failure that occurs after delivery: pentest results are delivered as static reports, often disconnected from vulnerability management systems, ticketing workflows, and remediation accountability structures. Findings without owners do not get fixed. A report delivered into an organizational vacuum is a document, not a security improvement.
All four failures are preparation problems. None of them are testing problems. The testing firm cannot fix them during the engagement. They have to be fixed before it starts.
The Six Preparation Steps That Determine What You Get Back
Step 1: Map Your Crown Jewels Before Anyone Defines Scope
The most important conversation in a threat-led penetration test is not between your CISO and the testing firm. It is the internal conversation that happens first: what would hurt most if it were compromised?
Crown jewels are the assets, systems, and data whose compromise would carry the highest organizational consequence. Not the highest technical severity score. The highest business consequence. In healthcare, that means patient records, clinical systems, and research databases tied to regulatory exposure. In financial services, that means transaction processing infrastructure, customer account data, and the systems whose downtime would trigger operational resilience obligations. In any organization, it includes the data and access pathways that connect peripheral systems to the things that matter most.
Identifying crown jewels requires cross-functional input that security teams alone cannot provide. Legal identifies data whose compromise carries the highest regulatory consequence. Operations identifies systems whose unavailability directly impacts revenue or service delivery. Finance identifies the specific transaction and authorization workflows that attackers targeting your sector most commonly pursue. Security maps the technical pathways that connect the rest of the organization to those identified assets.
This process consistently produces two revelations. First, the most valuable data is frequently housed in systems that have not been treated as high-priority security targets because no one had explicitly mapped their regulatory or operational consequences before. Second, the attack path to crown jewel assets runs through systems that nobody initially flagged as relevant to the engagement.
The crown jewel map becomes the compass that orients every subsequent preparation decision. Scope that is not aimed at the crown jewels is scope aimed at the compliance report.
Step 2: Build the Real Asset Inventory, Not the Documented One
Every organization has two asset inventories. The documented one, which lives in the CMDB, the network diagram, and the official API register. And the real one, which includes everything that exists in the environment but was never formally catalogued.
The gap between them is where breaches happen and where threat-led tests find their most critical results.
Shadow IT is the primary source of that gap. SaaS applications procured by business units outside IT approval, cloud instances spun up by development teams and never decommissioned, former employee accounts that were never disabled, third-party integrations from vendors whose contracts expired, development environments carrying production data: all of these exist in most organizations and none of them appear in a standard CMDB-based scope definition.
APIs are the second major source. Most organizations cannot accurately enumerate their own API surface. Endpoints left behind by previous development cycles, APIs exposed by third-party integrations, undocumented endpoints discoverable through JavaScript bundle analysis and mobile application decompilation: these are the attack surface that threat actors map before they ever touch a documented asset.
Building the real inventory requires active discovery: network scanning to identify live hosts not in the CMDB, API discovery against external-facing systems, cloud environment enumeration to find services deployed outside the managed infrastructure, and credential exposure monitoring to identify accounts with active organizational domain credentials appearing in infostealer and breach data.
The inventory that results from this process is what the testing team needs to scope realistically. Everything else is documented fiction.
Step 3: Build a Threat Profile From Real Adversary Intelligence
This is the step that separates a threat-led penetration test from a penetration test with a different name on the report. And it is the step most organizations either skip entirely or hand off to the testing firm without doing their own preparatory work first.
A threat profile is a documented analysis of which adversaries actually target organizations in your sector, what initial access techniques they use, how they move laterally once inside, what data they are trying to reach, and what their operational signature looks like at each stage of an attack chain. It is built from sources that exist specifically to answer these questions.
CISA's Known Exploited Vulnerabilities catalog identifies the specific vulnerabilities being actively exploited against organizations in your sector right now. CISA and FBI joint advisories, issued regularly for the most active threat actor groups, specify the exact TTPs those groups use, mapped to MITRE ATT&CK framework IDs that your testing team can translate directly into test scenarios. Health-ISAC, FS-ISAC, and sector-specific Information Sharing and Analysis Centers publish intelligence that is specific to your vertical and often specific to your geography.
The MITRE ATT&CK framework itself is the structural backbone of this work. It provides a standardized taxonomy for describing adversary behavior from initial access through impact, which allows your preparation to translate threat intelligence into specific test scenarios and allows your testing team to execute those scenarios in a way that produces comparable results across engagements.
A healthcare organization building a threat profile in 2026 should draw from documented North Korean Andariel unit techniques in Maui and Medusa ransomware campaigns targeting hospitals, Volt Typhoon's living-off-the-land persistence techniques documented in CISA advisories for critical infrastructure targeting, and the specific EHR and clinical system attack paths documented in Health-ISAC reporting. A financial services organization should draw from Scattered Spider's cloud infrastructure pivot techniques documented in the 2025 MITRE ATT&CK evaluations, APT29's credential abuse and OAuth exploitation TTPs, and the business logic abuse patterns documented in DORA's regulatory technical standards for TLPT.
The threat profile you build before the engagement becomes the brief that turns generic test scenarios into adversary-specific ones. Without it, the testing team is simulating an attacker. With it, they are simulating your attacker.
Step 4: Assemble the Documentation Package That Saves Engagement Days
Every hour a penetration test team spends discovering information about your environment is an hour they are not spending finding vulnerabilities in it. A comprehensive pre-engagement documentation package converts discovery time into testing time.
The documentation package should include network and architecture diagrams that show how systems connect and how data flows between them, including the integrations to third-party services. User role and permission matrixes that describe what each user type can access in each in-scope application, which is essential for any tester evaluating authorization flaws, IDOR vulnerabilities, and privilege escalation paths. API documentation for all in-scope applications, including endpoint lists, authentication mechanisms, and data models, because testers who understand intended API behavior can immediately begin probing for deviations from it.
The package should also include all previous penetration test reports, so the team can see what has been found before, verify whether prior findings were remediated, and direct their attention toward the areas and techniques that have not yet been tested. Any threat intelligence the organization has already collected about recent suspicious activity, indicators of compromise, or active targeting should be shared. And compliance and regulatory context should be provided so testers understand which findings carry the highest consequence in terms the board and audit committee will recognize.
For gray-box and white-box engagements, which provide testers with partial or full knowledge of the environment, the package additionally includes cloud configuration exports, application source code access, and test account credentials at each privilege level in the application hierarchy.
The documentation investment is returned in finding depth. A testing team that arrives with this context finds significantly more in the same time window than a team that arrives with an IP range and a start date.
Step 5: Align Stakeholders and Lock Down Rules of Engagement
A threat-led penetration test is a simulated attack against live systems. Without explicit alignment across every relevant internal team and a formally executed rules of engagement document, three things go wrong consistently: the security operations team responds to red team activity as an active incident and burns resources on a false response, critical operational systems are disrupted by testing activity that should have been coordinated with operational owners, and the organization accumulates legal exposure because no one formally authorized the testing to proceed against specific systems.
The stakeholder alignment process has three layers.
The white team is the internal group that knows the test is happening. In a mature threat-led engagement, the white team is kept deliberately small: the CISO, the legal representative, and the engagement coordinator. The security operations team is frequently left uninformed in the initial testing phases specifically to measure real detection capability, which is only visible when the SOC is operating in its normal state rather than on elevated alert.
Operational notifications go to the teams whose work could be disrupted by testing activity. IT operations, help desk, network operations, and any managed service providers whose monitoring covers the in-scope environment all need to know that a test is occurring, from which source addresses or with which identifiers, and during which specific windows. A managed security service provider that is not informed of a penetration test will respond to red team traffic as an active attack, potentially blocking testing activity and escalating to the client as a live incident.
The rules of engagement document is the legal foundation of the entire engagement. It specifies exactly which systems are authorized for testing, the exact testing window, whether social engineering and phishing simulations are in scope, whether physical access testing is authorized, what constitutes an out-of-scope action requiring testing to stop, how the testing team communicates a critical finding that requires immediate remediation before the engagement concludes, and what the testing team does if they inadvertently access data outside the authorized scope. This document requires legal sign-off before testing begins, and no legitimate firm will start without it.
Step 6: Configure Your Detection Environment to Measure What Matters
The final preparation step is also the most frequently overlooked, because most organizations think of it as a testing question rather than a preparation question. It is not. The configuration decision has to be made before the engagement starts because it fundamentally shapes what the engagement measures.
A threat-led penetration test measures two distinct things simultaneously: what attack paths exist in your environment, and whether your detection and response capability would catch an attacker using them. The second measurement requires your SOC to be operating in its normal state during the engagement. If the team knows they are being tested, they will be on elevated alert, which inflates apparent detection performance and makes the measurement useless.
The standard approach in mature TLPT engagements is a structured purple team debrief after the main testing phase concludes. The red team and blue team sit down with the complete engagement timeline. The red team walks through every technique they executed, every tool they used, and every access they established. The blue team checks their logs and alerting systems against that timeline and identifies every event that did and did not generate a detection. The gap between what happened and what was detected is the most operationally valuable output of the entire engagement, and it feeds directly into SIEM tuning, EDR configuration, and detection engineering priorities.
Preparing for this debrief means ensuring, before testing begins, that logging and alerting is fully configured across all in-scope systems, that log retention is sufficient to allow retrospective timeline analysis, and that SIEM alerting thresholds are set to their normal operational state rather than elevated for the engagement period. These baseline conditions are what make the detection measurement meaningful rather than performative.
What to Expect Once Testing Begins
Understanding the phases of the engagement helps organizations support it rather than inadvertently disrupt it.
Reconnaissance comes first and is entirely passive from the organization's perspective. The red team uses open-source intelligence to map the external attack surface: public-facing systems, employee names and roles from LinkedIn and corporate directories, technology stack indicators from job postings and GitHub repositories, and credential exposure in infostealer logs and breach data. The organization takes no action during this phase and should not attempt to block or interfere with it.
Initial access attempts follow. Depending on what the rules of engagement authorize, these may include spear phishing campaigns, exploitation of external-facing vulnerabilities, credential stuffing against authentication endpoints, or social engineering of support staff. This is the phase most likely to generate security alerts. Organizations should resist the urge to respond to those alerts as live incidents unless the white team has confirmed the source is not the authorized testing team.
Lateral movement and privilege escalation occupy the middle phase of the engagement. Once initial access is achieved, testers move through the environment using legitimate credentials and administrative tools, avoiding detection while mapping internal systems and attempting to reach crown jewel assets. This is where living-off-the-land techniques, the kind used by nation-state actors like Volt Typhoon, are most operationally relevant. If the SOC does not detect these techniques during the engagement, that is not a testing problem. It is a detection coverage gap that the purple team debrief will surface and the detection engineering team will need to close.
Crown jewel access simulation concludes the active testing phase. The team attempts to reach and demonstrate access to the specific data and systems identified in Step 1. Data is never actually exfiltrated; the test documents whether the access path existed and whether it generated a detection event.
The purple team debrief is not a formality. It is the session that generates the most immediately actionable output of the engagement: a precise timeline of adversary behavior mapped against security team visibility, with every gap in detection coverage explicitly identified.
After the Report: The Work That Makes the Test Worth Paying For
A penetration test report is not an end state. It is the starting point of a risk reduction process. Organizations that treat the report as a deliverable to file until the next audit cycle extract the least possible value from an engagement that cost them significant time and budget.
The findings should be re-triaged by attacker value against the crown jewel map from Step 1, not just by severity score. A critical CVSS rating on a system with no path to crown jewel data is lower priority than a medium-severity authorization flaw that provides an authenticated attacker with direct access to the patient database or the financial transaction system.
Every finding needs an owner with both the authority and the resources to remediate it before the report is distributed. Findings without owners become findings without fixes. The CISO distributes the technical report with assigned remediation owners and deadlines. The board receives an executive summary that translates technical findings into business risk terms with clear remediation timelines.
Remediation without verification is an assumption. A retest confirms that the specific attack path identified in the engagement is actually closed, not just that a patch was applied or a configuration note was added to a ticket. Most reputable firms include one retest cycle in the engagement scope. Use it, document the results, and attach them to the compliance evidence package.
The detection gap analysis from the purple team debrief feeds directly into detection engineering: every technique the red team used that the blue team did not detect becomes a specific tuning target for the SIEM and EDR. This work converts the engagement from a point-in-time assessment into a durable improvement in detection coverage.
Finally: adjust the next engagement scope based on what this one found. The organizations that extract the most security value from threat-led pentesting use each engagement to build on the prior one, testing deeper into the pathways that previous engagements identified, validating that prior remediations held, and updating the threat profile with current adversary intelligence before each new cycle.
Frequently Asked Questions
Q: How is a threat-led pentest different from a standard annual pentest?
A standard annual pentest typically uses automated tooling against a defined scope to identify known vulnerability classes, then maps findings to a compliance framework. A threat-led pentest starts from current threat intelligence about your specific adversaries, builds scenarios around those adversaries' real TTPs, and evaluates whether your defenses stop those specific attack paths to your specific crown jewels. The deliverable from a compliance pentest is a finding list. The deliverable from a threat-led pentest is an accurate picture of your breach risk against the adversaries you actually face.
Q: Should our SOC know the test is happening?
It depends on what you are measuring. If the primary objective is vulnerability discovery, a disclosed collaborative engagement with the SOC is more efficient. If the primary objective includes measuring real detection capability, the SOC should operate normally during the engagement and receive the full timeline only in the purple team debrief. Many mature organizations run a phased approach: initial undisclosed testing to measure baseline detection, followed by a disclosed purple team phase to work through gaps collaboratively.
Q: How long does preparation realistically take?
For a first engagement, plan three to six weeks of preparation before testing begins. The asset inventory is the most time-consuming step, particularly if shadow IT discovery reveals significant gaps between documented and actual infrastructure. Organizations that have completed prior engagements and maintained their documentation can compress this to one to two weeks.
Q: What does a threat-led pentest actually cost, and how do we justify it internally?
A comprehensive threat-led engagement from a qualified firm costs between $20,000 and $150,000 depending on scope, duration, and the inclusion of red team operations and purple team components. The internal justification is straightforward: the average U.S. data breach cost $10.22 million in 2025. One analysis found that for every dollar spent on penetration testing, up to $10 in breach costs are saved. Against a potential $10 million breach, a $50,000 threat-led engagement represents a rational risk management investment, not a discretionary security expense.
Q: How do we evaluate whether a testing firm is qualified for threat-led work?
Ask specifically how they translate current threat intelligence into test scenarios, and whether those scenarios are mapped to MITRE ATT&CK. Ask for evidence of sector-specific experience. Ask how they conduct the purple team debrief and what format the detection gap analysis takes. Ask whether they have worked in production environments comparable to yours. A compliance-focused firm will answer these questions with references to frameworks and certification acronyms. A threat-led firm will answer them with specific adversary TTPs and concrete examples of what they found in previous sector-specific engagements.
Preparation Is the Test
There is a version of a threat-led penetration test that accurately answers the question every CISO actually needs answered: if the adversary group most likely to target an organization like mine tried to reach my most valuable data today, what would stop them and what would not?
And there is the version that generates a report with a handful of medium-severity findings, a clean bill of health on systems that were never on an attacker's radar, and a set of remediation recommendations that sit in a backlog until the next audit.
The difference between them is almost entirely in what happens before the testing team arrives.
The six preparation steps in this article take organizational investment, internal coordination, and a willingness to look honestly at what the real attack surface is rather than what the documented one says it is. They require mapping crown jewels that some stakeholders would prefer not to explicitly acknowledge. They require an asset discovery process that will surface shadow IT that no one officially wants on record. They require building a threat profile from intelligence about adversaries that are specifically targeting organizations like yours, which is a more uncomfortable exercise than assuming the threat is generic.
None of that work is comfortable. All of it determines whether the test tells you something true.
Critical Insights and Findings
Pentesters successfully breached internal network perimeters in 93% of tested companies in 2025, demonstrating that defenses exist but are not validated against realistic attack paths. 67% of enterprises that increased their security tool stack still experienced a breach within two years, confirming the validation gap. Poor preparation, not poor testing, is the primary cause of engagements that miss real attack paths. Crown jewel mapping before scope definition ensures the test is aimed at what matters most. Complete asset discovery, including shadow IT, undocumented APIs, and cloud services, gives testers the real attack surface rather than the documented fiction. A sector-specific threat profile built from MITRE ATT&CK-mapped TTPs is what separates threat-led scenarios from generic ones. The purple team debrief is the highest-value output of the engagement and requires pre-configured logging and normal SOC operating conditions during testing. For every dollar spent on penetration testing, up to $10 in breach costs are saved, making a well-prepared threat-led engagement one of the highest-ROI investments in a security budget. First-engagement preparation realistically takes three to six weeks; maintained programs compress to one to two weeks per cycle.
Sources: Pentera 2025 State of Pentesting Report, Blaze InfoSec Annual Penetration Testing Review 2025, MITRE ATT&CK Enterprise Evaluations 2025, DORA Regulatory Technical Standards for TLPT (2025), TIBER-EU Framework, IBM Cost of a Data Breach 2025, Deepstrike Penetration Testing Cost and Statistics 2025-2026, CISA Known Exploited Vulnerabilities Catalog, NIST SP 800-115 Technical Guide to Information Security Testing, Astra Penetration Testing Trends 2025, Brightdefense Penetration Testing Statistics 2026, The Hacker News 2026 State of Pentesting Report.
Your Threat-Led Test Is Only as Good as What You Put Into It
Most penetration test briefs contain an IP range, a testing window, and a signature page.
That is not enough to run a threat-led engagement. Not because the testing team is not capable, but because capability without context produces generic results. The team that arrives at your environment knowing your crown jewels, your threat profile, your real asset inventory, and your detection baseline will find things that a team handed an IP range never reaches.
RITC Cybersecurity builds threat-led engagements from the preparation stage forward. We work with your team to map crown jewels, run the asset discovery process, build the sector-specific threat profile, and configure the engagement conditions that make the purple team debrief actually tell you something true about your SOC's real detection coverage.
We do not show up and start testing. We show up ready to simulate the adversary that is actually looking at your organization, against the systems that actually matter, and tell you exactly what they would find.
Book a free pre-engagement consultation with RITC Cybersecurity. In 30 minutes, we will review your current testing posture, walk through what a properly scoped and prepared threat-led engagement would look like for your sector and size, and show you the difference between what a compliance test finds and what your actual adversaries would find if they tested the same environment.
Because the 93% of organizations whose perimeters were breached in testing also thought they had adequate defenses. The ones who found out in a pentest were the fortunate ones.