Hunting Infostealers with AI in Large-Scale Security Operations Centers

At the NorthSec Conference 2026, François Labrèche, Principal Data Scientist at Sophos, unveiled groundbreaking research that focused on combating one of the critical challenges faced by security operations centers (SOCs): alert fatigue. This phenomenon occurs when analysts are inundated with a staggering volume of security alerts, many of which are irrelevant or benign. The research highlighted an advanced multi-layered detection pipeline designed to sift through extensive datasets to effectively identify genuine threats.

The context of Labrèche’s study was built upon two weeks of telemetry from Taegis XDR, a powerful system that processes over 800 billion events each day. From this impressive total, the research analyzed a colossal 11.8 trillion events to demonstrate how effective threat hunting can be realized at an enterprise scale. The findings offer vital implications for organizations struggling with excessive alerting which can obfuscate real threats.

Central to the detection pipeline are four distinct stages, each engineered to progressively diminish the volume of alerts produced. In the first stage, a variety of detectors are applied, ranging from simple indicator matches to more sophisticated machine learning models, including a Long Short-Term Memory (LSTM) network specifically designed to identify domain generation algorithms. Additionally, logistic regression models are utilized for detecting potentially malicious command-line activity. The preliminary filtering process effectively reduced the initial dataset to 2.6 billion alerts, which constitutes a mere 0.02% of the original events.

Following the initial filtering, the second stage of the detection pipeline includes a process of deduplication and correlation. This method groups related alerts, such as those stemming from denial-of-service attacks or scanning activities, thereby diminishing the total count down to 251.4 million alerts. This substantial reduction exemplifies how effective correlation can simplify the alert landscape, bringing analysts closer to managing their workloads.

The third stage adds another layer of sophistication through context-based suppression. Here, alerts are filtered based on customer-specific circumstances, which could include scenarios like authorized vulnerability scanning or known false positives gleaned from threat intelligence feeds. This critical filtering step removed an additional 16% of the remaining alerts, leaving a more manageable 211 million for review.

The pipeline culminates in a final prioritization stage, where a Gradient Boosted Trees Classifier is applied. This model, trained on 1.8 million historical alerts, encompasses both static features, such as tactics, techniques, and procedures (TTPs) from the MITRE ATT&CK framework, as well as dynamic features that track investigation rates for similar alerts. This intelligent model excels in automatically dismissing low-probability threats while elevating alerts deemed high-risk, ultimately condensing the two-week dataset to a mere 81,573 high and critical alerts.

Among the successful identifications made during the study was an infostealer attack targeting one particular customer. The detection process revealed two high-severity alerts indicating password theft. Further contextual examination of surrounding medium and lower-severity alerts revealed anomalous program behavior, followed by malware detections and behavioral indicators of credential theft. Remarkably, this pattern repeated twice on the same user’s machine, providing compelling evidence of compromise. In response to the findings, Sophos promptly contacted the customer and initiated incident response measures to contain the incident effectively.

Labrèche’s research emphasizes a pivotal recommendation for organizations managing extensive SOCs: instead of relying solely on single-layer detection systems, there is a significant advantage in adopting multi-stage filtering approaches. By integrating rule-based detection, machine learning models, and context-aware filtering, organizations can drastically reduce alert volumes while simultaneously preserving their threat detection capabilities.

Continuing forward, the research indicates a future trajectory focused on applying prioritization models to aggregated incidents rather than assessing individual alerts in isolation. This change could potentially enhance detection accuracy significantly through comprehensive alert analysis, presenting a promising avenue for organizations seeking to fine-tune their threat detection capabilities.

In conclusion, the research presented by Sophos at the NorthSec Conference serves as a compelling blueprint for organizations to manage and optimize their threat detection systems. With the ever-evolving landscape of cybersecurity threats, the implementation of advanced, multi-layered detection strategies presents a meaningful solution to tackle alert fatigue and enhance overall security effectiveness. For further insights into these findings, more details are available through the original source provided by Sophos.

Source link

Select a plan

Monthly plan

Yearly plan

All plans include

Search for an article

Hunting Infostealers with AI in Large-Scale Security Operations Centers

Latest articles

Phantom Stealer Campaign Employs JavaScript and PowerShell to Theft Browser Credentials

AegisAI Secures $36M for AI-Driven Email Security

FBI and CISA Alert to Rising Iranian Cyber Attacks

Google Introduces Unified Cryptonym-Based Naming System for Threat Actors

More like this

Phantom Stealer Campaign Employs JavaScript and PowerShell to Theft Browser Credentials

AegisAI Secures $36M for AI-Driven Email Security

FBI and CISA Alert to Rising Iranian Cyber Attacks