Author: Denis Avetisyan
A new approach leverages network analysis and machine learning to identify malicious automated traffic in e-commerce without impacting legitimate users.

This review details a non-intrusive bot detection system utilizing Inductive Graph Neural Networks to analyze user session behavior and web content relationships for enhanced e-commerce security.
Despite increasingly sophisticated evasion techniques, malicious bots continue to pose a significant threat to e-commerce platforms through data scraping, inventory manipulation, and fraudulent activities. This paper introduces a novel, non-intrusive bot detection framework, ‘Non-Intrusive Graph-Based Bot Detection for E-Commerce Using Inductive Graph Neural Networks’, which models user session behavior as a graph and leverages inductive graph neural networks to identify subtle automated activity. Experimental results demonstrate improved accuracy and robustness compared to traditional session-level methods, even under adversarial conditions and with limited data. Could this approach represent a scalable and adaptable solution for maintaining the integrity of online marketplaces?
The Inevitable Rise of Automated Deception
E-commerce platforms are increasingly targeted by automated malicious activity, presenting a significant and evolving threat to both revenue and the user experience. This constant barrage of bots isn’t simply a nuisance; it actively degrades platform performance through activities like account creation fraud, credential stuffing, and scraping of valuable data. These attacks don’t discriminate, impacting businesses of all sizes and across diverse product categories. The resulting financial losses stem not only from direct fraud but also from diminished customer trust and increased operational costs associated with mitigation efforts. Furthermore, legitimate users often bear the brunt of this bot activity, experiencing slower website speeds, inaccurate product pricing, and even stock depletion caused by automated purchasing, creating a frustrating and potentially costly online shopping experience.
Contemporary e-commerce platforms are locked in a continuous arms race against increasingly cunning automated bots. Traditional bot detection, reliant on static signatures or rule-based systems, finds itself consistently outmaneuvered by evolving evasion tactics. These bots now employ techniques like IP rotation, user-agent spoofing, and even behavioral mimicry – subtly imitating human browsing patterns to blend in with legitimate traffic. Consequently, a significant number of malicious bot sessions slip past defenses, resulting in substantial false negatives. This not only leads to direct revenue loss through fraudulent transactions and resource depletion – such as scraping of content or denial-of-service attacks – but also erodes user trust and degrades the overall online experience as legitimate customers compete with automated activity for access to goods and services.
A significant hurdle in automated bot detection lies in the “cold-start problem,” which concerns the accurate classification of new user sessions lacking sufficient historical data for analysis. Traditional bot detection systems often rely on established patterns of behavior, but these systems struggle when confronted with entirely new users or sessions where little to no prior activity exists. This creates a vulnerability, as malicious bots can initially mimic legitimate user behavior, evading detection until enough data accumulates to reveal their true nature. Consequently, solutions must incorporate techniques that go beyond simple pattern matching, leveraging real-time behavioral analysis, device fingerprinting, and even subtle contextual clues to assess the legitimacy of these nascent sessions – a complex undertaking that requires balancing accuracy with minimal disruption to genuine users.
Mitigating malicious bot activity in e-commerce necessitates detection strategies that prioritize a seamless user experience; overly aggressive security measures can inadvertently block legitimate customers, leading to lost sales and damaged brand reputation. Consequently, research focuses on non-intrusive detection methods, employing techniques like behavioral analysis and device fingerprinting to differentiate between human users and automated bots without requiring disruptive challenges or captchas. These approaches analyze patterns in user interactions – mouse movements, scrolling speed, and keystroke dynamics – to build a profile of expected behavior, flagging anomalies indicative of bot activity. The goal isn’t simply to block suspicious sessions, but to dynamically adjust security levels based on perceived risk, ensuring that genuine customers encounter minimal friction while effectively neutralizing automated threats. This delicate balance between robust security and a positive user experience is paramount for sustaining growth within the competitive e-commerce landscape.
Mapping the Landscape of User Interaction
Graph-based modeling represents user interactions by constructing a network where individual user sessions are modeled as Session Nodes and the content or URLs accessed within those sessions are represented as Content/URL Nodes. Edges connect these node types, indicating a user’s access of specific content during a session. This approach moves beyond treating sessions as isolated events, instead capturing relationships and sequences of access. The resulting graph structure allows for the representation of complex behavioral patterns, such as frequently accessed resources, common navigation paths, and the temporal relationships between content consumption, facilitating analysis of user behavior at a more granular level than traditional methods.
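The bipartite structure described above can be sketched in a few lines. The session IDs and URLs below are illustrative placeholders, not data from the paper; a production pipeline would build this from raw access logs.

```python
# Minimal sketch of the session-content bipartite graph: Session Nodes on
# one side, Content/URL Nodes on the other, with an edge for each access.
from collections import defaultdict

def build_session_graph(access_log):
    """Build adjacency sets linking session nodes to content/URL nodes."""
    session_to_urls = defaultdict(set)   # session node -> content nodes
    url_to_sessions = defaultdict(set)   # content node -> session nodes
    for session_id, url in access_log:
        session_to_urls[session_id].add(url)
        url_to_sessions[url].add(session_id)
    return session_to_urls, url_to_sessions

log = [
    ("s1", "/product/42"), ("s1", "/cart"),
    ("s2", "/product/42"), ("s3", "/cart"),
]
s2u, u2s = build_session_graph(log)
# Sessions s1 and s2 are now connected through the shared "/product/42"
# content node - exactly the cross-session relationship that isolated
# session-level analysis cannot see.
shared = s2u["s1"] & s2u["s2"]
```

Because content nodes act as hubs, two sessions that never co-occur in time still become two-hop neighbors when they touch the same resource, which is what later lets a GNN propagate behavioral signal between them.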
Traditional user behavior analysis often treats each user session as an independent event, neglecting the connections between sessions and the resources accessed. By modeling user interactions as a dynamic graph, we establish relationships between individual sessions and the content they involve – URLs, products, or other digital assets. This relational approach allows the system to identify patterns that emerge from sequences of interactions, recognizing, for example, that a user who views product A and then product B is likely to exhibit similar behavior to other users who follow the same path. Analyzing these connections reveals broader behavioral trends and dependencies that would be obscured by isolated session-level analysis, providing a more holistic understanding of user activity and intent.
Session features are numerical or categorical attributes extracted from user sessions to quantify behavioral characteristics. These features are generated through feature engineering, a process that transforms raw session data – such as timestamps, duration, pages visited, and event sequences – into usable inputs for machine learning models. Examples include session length, number of page views, dwell time on specific content types, and the frequency of particular user actions. These engineered features are then directly associated with each session node within the graph structure, providing a detailed characterization of user activity at a granular level and enabling the differentiation of sessions based on behavioral patterns.
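A minimal feature-engineering step for one session might look as follows. The event format (timestamped page visits) and the specific features are assumptions for illustration; the paper's actual feature set is not enumerated here.

```python
# Turn a raw event stream for one session into the numeric features that
# get attached to its session node in the graph.
def session_features(events):
    """events: list of (timestamp_seconds, page) tuples, sorted by time."""
    times = [t for t, _ in events]
    pages = [p for _, p in events]
    duration = times[-1] - times[0] if len(times) > 1 else 0.0
    # Dwell time: gap between consecutive events, averaged over the session.
    gaps = [b - a for a, b in zip(times, times[1:])]
    mean_dwell = sum(gaps) / len(gaps) if gaps else 0.0
    return {
        "duration": duration,
        "page_views": len(events),
        "unique_pages": len(set(pages)),
        "mean_dwell": mean_dwell,
    }

feats = session_features([(0.0, "/home"), (5.0, "/product/42"), (9.0, "/cart")])
```

Each resulting dictionary becomes the feature vector of one session node, so downstream models see both the session's intrinsic behavior and, via the graph, its relational context.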
The graph-based representation of user behavior facilitates the application of graph neural networks (GNNs) for learning effective user representations. GNNs operate directly on the graph structure, enabling the model to aggregate information from a user’s interaction history – represented as connected nodes – and their neighbors. This aggregation process allows the network to capture complex dependencies and patterns within the user’s activity that would be difficult to discern through traditional methods. The resulting learned representations, or embeddings, can then be used for downstream tasks such as user profiling, recommendation systems, and anomaly detection, offering improved performance due to the contextual understanding derived from the graph structure.
GraphSAGE: An Adaptive Response to Evolving Threats
Graph Neural Networks (GNNs), and specifically the GraphSAGE model, are utilized to generate node embeddings representing individual user sessions. These embeddings are created by learning to aggregate feature information from each session’s network neighborhood – that is, the sessions it directly interacts with. The resulting embedding is a vector representation encapsulating the behavioral characteristics of the session, derived from both its intrinsic features and its relationships within the broader network of user interactions. This approach allows the model to move beyond static session features and incorporate contextual information into its representation of each session, enabling more accurate bot detection.
The capacity for inductive learning within GraphSAGE is particularly valuable in bot detection due to the dynamic nature of online sessions. Unlike transductive models requiring retraining with each new session, GraphSAGE can generalize to previously unseen sessions by applying learned aggregation functions to new node feature combinations. This is achieved through sampling and aggregation of features from a node’s local neighborhood, allowing the model to infer embeddings for new sessions without requiring full model updates. Consequently, GraphSAGE effectively addresses the cold-start problem – the difficulty of accurately classifying new sessions with limited historical data – by leveraging the relational information encoded within the graph structure and the learned patterns of interaction.
GraphSAGE learns node embeddings by utilizing an aggregation function to combine feature information from a node’s immediate neighbors within the graph. This process effectively captures the contextual relationships inherent in user interaction data; a session’s embedding isn’t solely determined by its intrinsic features but is informed by the behavior of connected sessions. The aggregation function considers the features of neighboring nodes and combines them to create a representation of the central node, allowing the model to infer patterns and relationships based on network proximity. Multiple aggregation layers can be stacked to capture higher-order neighborhood information, providing a more comprehensive understanding of each session’s context within the broader network of user activity.
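The aggregation step can be sketched in plain Python. This is a toy, single-layer version with mean aggregation and no learned weights; a real GraphSAGE layer multiplies the self and neighborhood vectors by trained weight matrices before the nonlinearity, and stacks multiple such layers.

```python
# One GraphSAGE-style layer: average the neighbors' features, concatenate
# with the node's own features, then apply a nonlinearity. Weight matrices
# are omitted here - in the trained model they are learned parameters.
def mean_aggregate(node, features, neighbors):
    """Element-wise mean of the feature vectors of a node's neighbors."""
    nbrs = neighbors[node]
    dim = len(features[node])
    if not nbrs:
        return [0.0] * dim
    agg = [0.0] * dim
    for n in nbrs:
        for i, v in enumerate(features[n]):
            agg[i] += v
    return [v / len(nbrs) for v in agg]

def sage_layer(node, features, neighbors):
    """Self features concatenated with the neighborhood mean, then ReLU."""
    combined = features[node] + mean_aggregate(node, features, neighbors)
    return [max(0.0, v) for v in combined]

# Two session nodes sharing one content node (u1).
features = {"s1": [1.0, 0.0], "s2": [0.0, 1.0], "u1": [0.5, 0.5]}
neighbors = {"s1": ["u1"], "s2": ["u1"], "u1": ["s1", "s2"]}
emb = sage_layer("s1", features, neighbors)
```

Because the layer is just a function of a node's features and its sampled neighborhood, the same trained aggregator applies unchanged to a brand-new session node, which is the inductive property that addresses the cold-start problem discussed above.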
Evaluation of the GraphSAGE model for bot detection indicates a performance advantage over a traditional Multi-Layer Perceptron (MLP). When trained on session features and assessed using the Area Under the ROC Curve (AUC), GraphSAGE achieved a score of 0.9705. In comparison, the MLP, trained on the same raw session features, yielded an AUC of 0.9102. This represents an absolute gain of 6.03 AUC points (roughly 6.6% relative) in discriminatory power, suggesting that GraphSAGE more effectively distinguishes between legitimate user sessions and bot activity based on the features analyzed.
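For readers unfamiliar with the metric, AUC has a simple probabilistic reading: it is the probability that a randomly chosen positive (bot) session receives a higher score than a randomly chosen negative (human) session, with ties counting half. The scores and labels below are made up for illustration.

```python
# AUC via its rank interpretation: P(score_bot > score_human), ties = 0.5.
def auc(scores, labels):
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

scores = [0.9, 0.8, 0.4, 0.3, 0.2]   # model scores (higher = more bot-like)
labels = [1,   1,   0,   1,   0]     # 1 = bot, 0 = human
value = auc(scores, labels)          # 5 wins out of 6 pos/neg pairs
```

An AUC of 0.9705 therefore means that in about 97 of 100 random bot/human pairs, the bot session is scored as more suspicious than the human one.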
The Inevitable Imperfection of Security
Evaluation of the proposed bot mitigation system reveals a substantial advantage in resilience against adversarial perturbation, as quantified by the Area Under the ROC Curve (AUC). This metric, a standard for assessing classification performance, demonstrates the graph-based approach consistently distinguishes between legitimate users and malicious bots even when subjected to subtle, intentionally misleading alterations in bot behavior. The system’s capacity to maintain accuracy under such conditions highlights a key improvement over traditional methods, which often struggle with even minor variations in attack patterns. This robustness stems from the model’s ability to consider relationships between sessions, rather than treating each one in isolation, enabling it to identify malicious activity based on broader network behavior and maintain high performance despite adversarial efforts.
The efficacy of this novel bot mitigation technique is demonstrably high, as evidenced by a comprehensive performance evaluation. Achieving an Area Under the ROC Curve (AUC) of 0.9705 indicates exceptional discriminatory power between legitimate users and malicious bots. This translates to a 90% recall rate – meaning the system successfully identifies 90% of bot activity – while maintaining a low false positive rate of approximately 1%. Further solidifying its advantage, the system achieves an F1-score of 0.85, a harmonic mean of precision and recall, notably surpassing the 0.75 F1-score attained by a traditional Multilayer Perceptron (MLP) approach. These metrics collectively highlight a significant improvement in both the accuracy and reliability of bot detection, minimizing disruption for genuine users.
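The reported figures are internally consistent: since F1 is the harmonic mean of precision and recall, the precision implied by a recall of 0.90 and an F1 of 0.85 can be back-computed by rearranging F1 = 2PR/(P+R). The source does not state precision directly; this is arithmetic on the numbers it does report.

```python
# Solve F1 = 2*P*R / (P + R) for P, given the reported recall and F1:
#   F1*(P + R) = 2*P*R  =>  P = F1*R / (2*R - F1)
recall, f1 = 0.90, 0.85
precision = f1 * recall / (2 * recall - f1)   # implied precision, ~0.805
# Sanity check: recomputing F1 from the derived precision.
f1_check = 2 * precision * recall / (precision + recall)
```

So the reported recall and F1 imply a precision of roughly 0.81, i.e. about four out of five sessions flagged as bots really are bots under this evaluation.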
The efficacy of this bot detection framework extends beyond trained datasets, as evidenced by strong performance in a cold-start evaluation utilizing entirely unseen user sessions. Despite lacking prior exposure to these new interactions, the model maintained a high Area Under the ROC Curve (AUC) of 0.963, indicating a robust capacity for generalization. This resilience suggests the system doesn’t simply memorize patterns from the training data, but rather learns underlying characteristics indicative of bot-like behavior, enabling accurate identification even with novel and previously unobserved bot strategies. The ability to perform effectively in a cold-start scenario is particularly valuable in dynamic online environments where bot tactics are constantly evolving, and relying on established profiles would quickly prove ineffective.
Ongoing development aims to transform this bot mitigation framework from a reactive system into a proactive, self-learning defense. Researchers are integrating real-time feedback loops, allowing the model to continuously refine its understanding of malicious behavior as new threats emerge. This dynamic adaptation will involve techniques like continual learning and reinforcement learning, enabling the system to autonomously adjust its detection criteria and respond effectively to evolving bot tactics. The ultimate goal is a resilient and scalable solution capable of anticipating and neutralizing bot attacks before they can significantly impact online platforms, ensuring a safer and more reliable user experience.
The pursuit of identifying malicious actors within the complex ecosystems of e-commerce mirrors a fundamental challenge in all systems – discerning signal from noise. This work, focused on graph-based bot detection, acknowledges the inherent limitations of static definitions. It doesn’t build a barrier against bots so much as cultivate a system capable of growing its understanding of anomalous behavior. As Carl Friedrich Gauss observed, “If others would think as hard as I do, they would not have so many criticisms.” The inductive nature of the Graph Neural Networks allows the system to adapt and refine its perception of threats, mirroring a constant state of learning. The system doesn’t presume complete knowledge, instead recognizing that every architectural choice – every feature selected for analysis – is a prophecy of future misclassifications, requiring continual refinement. It’s a humble approach, acknowledging that the silence of a system doesn’t imply security, but merely a temporary lack of observed anomalies.
What Lies Ahead?
The presented work, while demonstrating efficacy in identifying automated malicious traffic, merely maps a local minimum onto a far more complex energy surface. The assumption of stationarity – that patterns of bot behavior will remain consistent – is a comfortable fiction. Any architecture designed to detect current bots implicitly selects for bots that learn to evade it. The inevitable arms race necessitates a shift from signature-based detection to systems that model the process of adaptation itself. This demands a move beyond feature engineering, and towards truly generative models capable of anticipating novel adversarial strategies.
The reliance on graph structure, while intuitively appealing, introduces a fragility. Graphs are, after all, brittle abstractions of fluid relationships. The system’s performance will degrade as the underlying web ecosystem evolves, as content relationships become increasingly ephemeral, and as user behavior becomes more fragmented. A guarantee of continued accuracy is, naturally, a contract with probability, not a statement of fact.
Ultimately, the pursuit of “bot detection” is a symptom, not a solution. The true problem lies in the incentives that create bots – the economic asymmetries that make automation profitable. Stability is merely an illusion that caches well; the long-term solution demands a fundamental re-evaluation of the underlying economic architecture of e-commerce itself. Chaos isn’t failure – it’s nature’s syntax.
Original article: https://arxiv.org/pdf/2601.22579.pdf
Contact the author: https://www.linkedin.com/in/avetisyan/